Tag: Inference
-
Hacker News: DeepSeek-V3
Source URL: https://github.com/deepseek-ai/DeepSeek-V3 Source: Hacker News Title: DeepSeek-V3 Feedly Summary: Comments AI Summary and Description: Yes Summary: The text introduces DeepSeek-V3, a significant advancement in language model technology, showcasing its innovative architecture and training techniques designed for improving efficiency and performance. For AI, cloud, and infrastructure security professionals, the novel methodologies and benchmarks presented can…
-
Simon Willison’s Weblog: Trying out QvQ – Qwen’s new visual reasoning model
Source URL: https://simonwillison.net/2024/Dec/24/qvq/#atom-everything Source: Simon Willison’s Weblog Title: Trying out QvQ – Qwen’s new visual reasoning model Feedly Summary: I thought we were done for major model releases in 2024, but apparently not: Alibaba’s Qwen team just dropped the Apache2 2 licensed QvQ-72B-Preview, “an experimental research model focusing on enhancing visual reasoning capabilities". Their blog…
-
Simon Willison’s Weblog: Quoting Jack Clark
Source URL: https://simonwillison.net/2024/Dec/23/jack-clark/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Jack Clark Feedly Summary: There’s been a lot of strange reporting recently about how ‘scaling is hitting a wall’ – in a very narrow sense this is true in that larger models were getting less score improvement on challenging benchmarks than their predecessors, but in a…
-
Cloud Blog: Scaling to zero on Google Kubernetes Engine with KEDA
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/scale-to-zero-on-gke-with-keda/ Source: Cloud Blog Title: Scaling to zero on Google Kubernetes Engine with KEDA Feedly Summary: For developers and businesses that run applications on Google Kubernetes Engine (GKE), scaling deployments down to zero when they are idle can offer significant financial savings. GKE’s Cluster Autoscaler efficiently manages node pool sizes, but for applications…
-
Cloud Blog: Scaling to zero on Google Kubernetes Engine with KEDA
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/scale-to-zero-on-gke-with-keda/ Source: Cloud Blog Title: Scaling to zero on Google Kubernetes Engine with KEDA Feedly Summary: For developers and businesses that run applications on Google Kubernetes Engine (GKE), scaling deployments down to zero when they are idle can offer significant financial savings. GKE’s Cluster Autoscaler efficiently manages node pool sizes, but for applications…
-
Simon Willison’s Weblog: OpenAI O3 breakthrough high score on ARC-AGI-PUB
Source URL: https://simonwillison.net/2024/Dec/20/openai-o3-breakthrough/#atom-everything Source: Simon Willison’s Weblog Title: OpenAI O3 breakthrough high score on ARC-AGI-PUB Feedly Summary: OpenAI O3 breakthrough high score on ARC-AGI-PUB François Chollet is the co-founder of the ARC Prize and had advanced access to today’s o3 results. His article here is the most insightful coverage I’ve seen of o3, going beyond…
-
Simon Willison’s Weblog: Quoting François Chollet
Source URL: https://simonwillison.net/2024/Dec/20/francois-chollet/#atom-everything Source: Simon Willison’s Weblog Title: Quoting François Chollet Feedly Summary: OpenAI’s new o3 system – trained on the ARC-AGI-1 Public Training set – has scored a breakthrough 75.7% on the Semi-Private Evaluation set at our stated public leaderboard $10k compute limit. A high-compute (172x) o3 configuration scored 87.5%. This is a surprising…
-
Cloud Blog: The Year in Google Cloud – 2024
Source URL: https://cloud.google.com/blog/products/gcp/top-google-cloud-blogs/ Source: Cloud Blog Title: The Year in Google Cloud – 2024 Feedly Summary: If you’re a regular reader of this blog, you know that 2024 was a busy year for Google Cloud. From AI to Zero Trust, and everything in between, here’s a chronological recap of our top blogs of 2024, according…