o1 – Page 8 – Experimental News Clipping Site

Simon Willison’s Weblog: DeepSeek-R1 and exploring DeepSeek-R1-Distill-Llama-8B

Jan 20, 2025

—

by

Source URL: https://simonwillison.net/2025/Jan/20/deepseek-r1/ Source: Simon Willison’s Weblog Title: DeepSeek-R1 and exploring DeepSeek-R1-Distill-Llama-8B Feedly Summary: DeepSeek are the Chinese AI lab who dropped the best currently available open weights LLM on Christmas day, DeepSeek v3. That model was trained in part using their unreleased R1 “reasoning" model. Today they’ve released R1 itself, along with a whole…

Hacker News: DeepSeek-R1

Jan 20, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://github.com/deepseek-ai/DeepSeek-R1 Source: Hacker News Title: DeepSeek-R1 Feedly Summary: Comments AI Summary and Description: Yes Summary: The text presents advancements in AI reasoning models, specifically DeepSeek-R1-Zero and DeepSeek-R1, emphasizing the unique approach of training solely through large-scale reinforcement learning (RL) without initial supervised fine-tuning. These models demonstrate significant reasoning capabilities and highlight breakthroughs in…

Simon Willison’s Weblog: Quoting gwern

Jan 16, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jan/16/gwern/#atom-everything Source: Simon Willison’s Weblog Title: Quoting gwern Feedly Summary: […] much of the point of a model like o1 is not to deploy it, but to generate training data for the next model. Every problem that an o1 solves is now a training data point for an o3 (eg. any o1 session…

Slashdot: OpenAI’s AI Reasoning Model ‘Thinks’ In Chinese Sometimes, No One Really Knows Why

Jan 15, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://slashdot.org/story/25/01/14/239246/openais-ai-reasoning-model-thinks-in-chinese-sometimes-no-one-really-knows-why?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: OpenAI’s AI Reasoning Model ‘Thinks’ In Chinese Sometimes, No One Really Knows Why Feedly Summary: AI Summary and Description: Yes Summary: The behavior exhibited by OpenAI’s reasoning AI model, o1, which seemingly “thinks” in multiple languages regardless of the input language, has raised questions within the AI community. Experts…

Simon Willison’s Weblog: Codestral 25.01

Jan 13, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jan/13/codestral-2501/ Source: Simon Willison’s Weblog Title: Codestral 25.01 Feedly Summary: Codestral 25.01 Brand new code-focused model from Mistral. Unlike the first Codestral this one isn’t (yet) available as open weights. The model has a 256k token context – a new record for Mistral. The new model scored an impressive joint first place with…

Simon Willison’s Weblog: Quoting Ben Hylak

Jan 12, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jan/12/ben-hylak/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Ben Hylak Feedly Summary: I was using o1 like a chat model — but o1 is not a chat model. If o1 is not a chat model — what is it? I think of it like a “report generator.” If you give it enough context, and…

Hacker News: Contemplative LLMs

Jan 12, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://maharshi.bearblog.dev/contemplative-llms-prompt/ Source: Hacker News Title: Contemplative LLMs Feedly Summary: Comments AI Summary and Description: Yes **Short Summary with Insight:** The text discusses the novel approach of prompting Large Language Models (LLMs) to engage in a contemplation phase before generating answers. By mimicking a reasoning process, which encourages exploration and questioning assumptions, this method…

OpenAI : Deliberative alignment: reasoning enables safer language models

Jan 8, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://openai.com/index/deliberative-alignment Source: OpenAI Title: Deliberative alignment: reasoning enables safer language models Feedly Summary: Deliberative alignment: reasoning enables safer language models Introducing our new alignment strategy for o1 models, which are directly taught safety specifications and how to reason over them. AI Summary and Description: Yes Summary: The text discusses a new alignment strategy…

The Register: Even at $200/mo, Altman admits ChatGPT Pro struggles to turn a profit

Jan 6, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.theregister.com/2025/01/06/altman_gpt_profits/ Source: The Register Title: Even at $200/mo, Altman admits ChatGPT Pro struggles to turn a profit Feedly Summary: But don’t worry, he’s ‘figured out’ AGI comment Even at $200 a month for ChatGPT Pro, the service is struggling to turn a profit, OpenAI CEO Sam Altman lamented on the platform formerly known…

Simon Willison’s Weblog: Quoting François Chollet

Jan 6, 2025

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2025/Jan/6/francois-chollet/#atom-everything Source: Simon Willison’s Weblog Title: Quoting François Chollet Feedly Summary: I don’t think people really appreciate how simple ARC-AGI-1 was, and what solving it really means. It was designed as the simplest, most basic assessment of fluid intelligence possible. Failure to pass signifies a near-total inability to adapt or problem-solve in unfamiliar…

Tag: o1