test-time compute – Experimental News Clipping Site

Simon Willison’s Weblog: OpenAI’s gold medal performance on the International Math Olympiad

Jul 19, 2025

—

by

Source URL: https://simonwillison.net/2025/Jul/19/openai-gold-medal-math-olympiad/#atom-everything Source: Simon Willison’s Weblog Title: OpenAI’s gold medal performance on the International Math Olympiad Feedly Summary: OpenAI’s gold medal performance on the International Math Olympiad OpenAI research scientist Alexander Wei: I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance…

Hacker News: Tao: Using test-time compute to train efficient LLMs without labeled data

Mar 26, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.databricks.com/blog/tao-using-test-time-compute-train-efficient-llms-without-labeled-data Source: Hacker News Title: Tao: Using test-time compute to train efficient LLMs without labeled data Feedly Summary: Comments AI Summary and Description: Yes Summary: The text introduces a new model tuning method for large language models (LLMs) called Test-time Adaptive Optimization (TAO) that enhances model quality without requiring large amounts of labeled…

Slashdot: Majority of AI Researchers Say Tech Industry Is Pouring Billions Into a Dead End

Mar 22, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://slashdot.org/story/25/03/22/0341222/majority-of-ai-researchers-say-tech-industry-is-pouring-billions-into-a-dead-end Source: Slashdot Title: Majority of AI Researchers Say Tech Industry Is Pouring Billions Into a Dead End Feedly Summary: AI Summary and Description: Yes Summary: The text discusses the recent survey results from the Association for the Advancement of AI, revealing a significant skepticism among researchers regarding the effectiveness of simply “scaling…

Hacker News: Scaling Up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Feb 10, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://arxiv.org/abs/2502.05171 Source: Hacker News Title: Scaling Up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses a novel language model architecture that enhances test-time computation through latent reasoning, presenting a new methodology that contrasts with traditional reasoning models. It emphasizes the…

Simon Willison’s Weblog: Trading Inference-Time Compute for Adversarial Robustness

Jan 22, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jan/22/trading-inference-time-compute/ Source: Simon Willison’s Weblog Title: Trading Inference-Time Compute for Adversarial Robustness Feedly Summary: Trading Inference-Time Compute for Adversarial Robustness Brand new research paper from OpenAI, exploring how inference-scaling “reasoning" models such as o1 might impact the search for improved security with respect to things like prompt injection. We conduct experiments on the…

Hacker News: Nvidia CEO says his AI chips are improving faster than Moore’s Law

Jan 8, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://techcrunch.com/2025/01/07/nvidia-ceo-says-his-ai-chips-are-improving-faster-than-moores-law/ Source: Hacker News Title: Nvidia CEO says his AI chips are improving faster than Moore’s Law Feedly Summary: Comments AI Summary and Description: Yes Summary: Jensen Huang, CEO of Nvidia, asserts that the performance of the company’s AI chips is advancing at a pace exceeding the historical benchmark of Moore’s Law. This…

Simon Willison’s Weblog: Is AI progress slowing down?

Dec 19, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Dec/19/is-ai-progress-slowing-down/#atom-everything Source: Simon Willison’s Weblog Title: Is AI progress slowing down? Feedly Summary: Is AI progress slowing down? This piece by Arvind Narayanan and Sayash Kapoor is the single most insightful essay about AI and LLMs I’ve seen in a long time. It’s long and worth reading every inch of it – it…

Hacker News: Alibaba releases an ‘open’ challenger to OpenAI’s O1 reasoning model

Nov 29, 2024

—

by

system automation

in Uncategorized

Source URL: https://techcrunch.com/2024/11/27/alibaba-releases-an-open-challenger-to-openais-o1-reasoning-model/ Source: Hacker News Title: Alibaba releases an ‘open’ challenger to OpenAI’s O1 reasoning model Feedly Summary: Comments AI Summary and Description: Yes Summary: The arrival of the QwQ-32B-Preview model from Alibaba’s Qwen team introduces a significant competitor to OpenAI’s offerings in the AI reasoning space. With its innovative self-fact-checking capabilities and ability…

Hacker News: OpenAI o1 Results on ARC-AGI-Pub

Sep 14, 2024

—

by

system automation

in Uncategorized

Source URL: https://arcprize.org/blog/openai-o1-results-arc-prize Source: Hacker News Title: OpenAI o1 Results on ARC-AGI-Pub Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses OpenAI’s newly released o1 models, which utilize a “chain-of-thought” (CoT) reasoning paradigm that enhances the AI’s performance in reasoning tasks. It highlights the improvements over existing models such as GPT-4o and…

Tag: test-time compute