Tag: o1
- 
		
		
		
Simon Willison’s Weblog: Using LLMs and Cursor to become a finisher
Source URL: https://simonwillison.net/2025/Jan/4/llms-and-cursor/#atom-everything Source: Simon Willison’s Weblog Title: Using LLMs and Cursor to become a finisher Feedly Summary: Using LLMs and Cursor to become a finisher Zohaib Rauf describes a pattern I’ve seen quite a few examples of now: engineers who moved into management but now find themselves able to ship working code again (at…
 - 
		
		
		
Hacker News: A path to O1 open source
Source URL: https://arxiv.org/abs/2412.14135 Source: Hacker News Title: A path to O1 open source Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses advancements in artificial intelligence, particularly focusing on the reinforcement learning approach to reproduce OpenAI’s o1 model. It highlights key components like policy initialization, reward design, search, and learning that contribute…
 - 
		
		
		
Hacker News: Letting Language Models Write My Website
Source URL: https://nicholas.carlini.com/writing/2025/llms-write-my-bio.html Source: Hacker News Title: Letting Language Models Write My Website Feedly Summary: Comments AI Summary and Description: Yes Summary: The text presents an engaging exploration of the capabilities and limitations of large language models (LLMs) through a creative project where the author generates a new homepage and biography each day using different…
 - 
		
		
		
Hacker News: Interesting Interview with DeepSeek’s CEO
Source URL: https://www.chinatalk.media/p/deepseek-ceo-interview-with-chinas Source: Hacker News Title: Interesting Interview with DeepSeek’s CEO Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text centers on Deepseek, a Chinese AI startup that has distinguished itself by developing models that surpass OpenAI’s in performance while maintaining a commitment to open-source principles. The startup demonstrates a unique approach…
 - 
		
		
		
Simon Willison’s Weblog: Quoting Alexis Gallagher
Source URL: https://simonwillison.net/2024/Dec/31/alexis-gallagher/ Source: Simon Willison’s Weblog Title: Quoting Alexis Gallagher Feedly Summary: Basically, a frontier model like OpenAI’s O1 is like a Ferrari SF-23. It’s an obvious triumph of engineering, designed to win races, and that’s why we talk about it. But it takes a special pit crew just to change the tires and…
 - 
		
		
		
Simon Willison’s Weblog: Trying out QvQ – Qwen’s new visual reasoning model
Source URL: https://simonwillison.net/2024/Dec/24/qvq/#atom-everything Source: Simon Willison’s Weblog Title: Trying out QvQ – Qwen’s new visual reasoning model Feedly Summary: I thought we were done for major model releases in 2024, but apparently not: Alibaba’s Qwen team just dropped the Apache2 2 licensed QvQ-72B-Preview, “an experimental research model focusing on enhancing visual reasoning capabilities". Their blog…
 - 
		
		
		
Simon Willison’s Weblog: OpenAI O3 breakthrough high score on ARC-AGI-PUB
Source URL: https://simonwillison.net/2024/Dec/20/openai-o3-breakthrough/#atom-everything Source: Simon Willison’s Weblog Title: OpenAI O3 breakthrough high score on ARC-AGI-PUB Feedly Summary: OpenAI O3 breakthrough high score on ARC-AGI-PUB François Chollet is the co-founder of the ARC Prize and had advanced access to today’s o3 results. His article here is the most insightful coverage I’ve seen of o3, going beyond…
 - 
		
		
		
Simon Willison’s Weblog: Quoting François Chollet
Source URL: https://simonwillison.net/2024/Dec/20/francois-chollet/#atom-everything Source: Simon Willison’s Weblog Title: Quoting François Chollet Feedly Summary: OpenAI’s new o3 system – trained on the ARC-AGI-1 Public Training set – has scored a breakthrough 75.7% on the Semi-Private Evaluation set at our stated public leaderboard $10k compute limit. A high-compute (172x) o3 configuration scored 87.5%. This is a surprising…