o1 – Page 9 – Experimental News Clipping Site

Simon Willison’s Weblog: Weeknotes: Starting 2025 a little slow

Jan 5, 2025

—

by

Source URL: https://simonwillison.net/2025/Jan/4/weeknotes/#atom-everything Source: Simon Willison’s Weblog Title: Weeknotes: Starting 2025 a little slow Feedly Summary: I published my review of 2024 in LLMs and then got into a fight with most of the internet over the phone microphone targeted ads conspiracy theory. In my last weeknotes I talked about how December in LLMs has…

Simon Willison’s Weblog: Using LLMs and Cursor to become a finisher

Jan 4, 2025

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2025/Jan/4/llms-and-cursor/#atom-everything Source: Simon Willison’s Weblog Title: Using LLMs and Cursor to become a finisher Feedly Summary: Using LLMs and Cursor to become a finisher Zohaib Rauf describes a pattern I’ve seen quite a few examples of now: engineers who moved into management but now find themselves able to ship working code again (at…

Hacker News: A path to O1 open source

Jan 3, 2025

—

by

system automation

in Uncategorized

Source URL: https://arxiv.org/abs/2412.14135 Source: Hacker News Title: A path to O1 open source Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses advancements in artificial intelligence, particularly focusing on the reinforcement learning approach to reproduce OpenAI’s o1 model. It highlights key components like policy initialization, reward design, search, and learning that contribute…

Hacker News: Letting Language Models Write My Website

Dec 31, 2024

—

by

system automation

in Uncategorized

Source URL: https://nicholas.carlini.com/writing/2025/llms-write-my-bio.html Source: Hacker News Title: Letting Language Models Write My Website Feedly Summary: Comments AI Summary and Description: Yes Summary: The text presents an engaging exploration of the capabilities and limitations of large language models (LLMs) through a creative project where the author generates a new homepage and biography each day using different…

Hacker News: Interesting Interview with DeepSeek’s CEO

Dec 31, 2024

—

by

system automation

in Uncategorized

Source URL: https://www.chinatalk.media/p/deepseek-ceo-interview-with-chinas Source: Hacker News Title: Interesting Interview with DeepSeek’s CEO Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text centers on Deepseek, a Chinese AI startup that has distinguished itself by developing models that surpass OpenAI’s in performance while maintaining a commitment to open-source principles. The startup demonstrates a unique approach…

Simon Willison’s Weblog: Quoting Alexis Gallagher

Dec 31, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Dec/31/alexis-gallagher/ Source: Simon Willison’s Weblog Title: Quoting Alexis Gallagher Feedly Summary: Basically, a frontier model like OpenAI’s O1 is like a Ferrari SF-23. It’s an obvious triumph of engineering, designed to win races, and that’s why we talk about it. But it takes a special pit crew just to change the tires and…

Simon Willison’s Weblog: deepseek-ai/DeepSeek-V3-Base

Dec 25, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Dec/25/deepseek-v3/#atom-everything Source: Simon Willison’s Weblog Title: deepseek-ai/DeepSeek-V3-Base Feedly Summary: deepseek-ai/DeepSeek-V3-Base No model card or announcement yet, but this new model release from Chinese AI lab DeepSeek (an arm of Chinese hedge fund High-Flyer) looks very significant. It’s a huge model – 685B parameters, 687.9 GB on disk (TIL how to size a git-lfs…

Simon Willison’s Weblog: Trying out QvQ – Qwen’s new visual reasoning model

Dec 24, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Dec/24/qvq/#atom-everything Source: Simon Willison’s Weblog Title: Trying out QvQ – Qwen’s new visual reasoning model Feedly Summary: I thought we were done for major model releases in 2024, but apparently not: Alibaba’s Qwen team just dropped the Apache2 2 licensed QvQ-72B-Preview, “an experimental research model focusing on enhancing visual reasoning capabilities". Their blog…

Simon Willison’s Weblog: OpenAI O3 breakthrough high score on ARC-AGI-PUB

Dec 20, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Dec/20/openai-o3-breakthrough/#atom-everything Source: Simon Willison’s Weblog Title: OpenAI O3 breakthrough high score on ARC-AGI-PUB Feedly Summary: OpenAI O3 breakthrough high score on ARC-AGI-PUB François Chollet is the co-founder of the ARC Prize and had advanced access to today’s o3 results. His article here is the most insightful coverage I’ve seen of o3, going beyond…

Simon Willison’s Weblog: Quoting François Chollet

Dec 20, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Dec/20/francois-chollet/#atom-everything Source: Simon Willison’s Weblog Title: Quoting François Chollet Feedly Summary: OpenAI’s new o3 system – trained on the ARC-AGI-1 Public Training set – has scored a breakthrough 75.7% on the Semi-Private Evaluation set at our stated public leaderboard $10k compute limit. A high-compute (172x) o3 configuration scored 87.5%. This is a surprising…

Tag: o1