Tag: llms
-
OpenAI : Estimating worst case frontier risks of open weight LLMs
Source URL: https://openai.com/index/estimating-worst-case-frontier-risks-of-open-weight-llms Source: OpenAI Title: Estimating worst case frontier risks of open weight LLMs Feedly Summary: In this paper, we study the worst-case frontier risks of releasing gpt-oss. We introduce malicious fine-tuning (MFT), where we attempt to elicit maximum capabilities by fine-tuning gpt-oss to be as capable as possible in two domains: biology and…
-
Cloud Blog: Announcements for AI Hypercomputer: The latest infrastructure news for ML practitioners
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/q2-2025-ai-hypercomputer-updates/ Source: Cloud Blog Title: Announcements for AI Hypercomputer: The latest infrastructure news for ML practitioners Feedly Summary: Curious about the latest in AI infrastructure from Google Cloud? Every three months we share a roundup of the latest AI Hypercomputer news, resources, events, learning opportunities, and more. Read on to learn new ways…
-
Simon Willison’s Weblog: ChatGPT agent’s user-agent
Source URL: https://simonwillison.net/2025/Aug/4/chatgpt-agents-user-agent/#atom-everything Source: Simon Willison’s Weblog Title: ChatGPT agent’s user-agent Feedly Summary: I was exploring how ChatGPT agent works today. I learned some interesting things about how it exposes its identity through HTTP headers, then made a huge blunder in thinking it was leaking its URLs to Bingbot and Yandex… but it turned out…
-
Simon Willison’s Weblog: ChatGPT agent triggers crawls from Bingbot and Yandex
Source URL: https://simonwillison.net/2025/Aug/4/chatgpt-agents-agent/#atom-everything Source: Simon Willison’s Weblog Title: ChatGPT agent triggers crawls from Bingbot and Yandex Feedly Summary: ChatGPT agent is the recently released (and confusingly named) ChatGPT feature that provides browser automation combined with terminal access as a feature of ChatGPT – replacing their previous Operator research preview which is scheduled for deprecation on…
-
Simon Willison’s Weblog: Usage charts for my LLM tool against OpenRouter
Source URL: https://simonwillison.net/2025/Aug/4/llm-openrouter-usage/#atom-everything Source: Simon Willison’s Weblog Title: Usage charts for my LLM tool against OpenRouter Feedly Summary: Usage charts for my LLM tool against OpenRouter OpenRouter proxies requests to a large number of different LLMs and provides high level statistics of which models are the most popular among their users. Tools that call OpenRouter…
-
Simon Willison’s Weblog: Qwen-Image: Crafting with Native Text Rendering
Source URL: https://simonwillison.net/2025/Aug/4/qwen-image/#atom-everything Source: Simon Willison’s Weblog Title: Qwen-Image: Crafting with Native Text Rendering Feedly Summary: Qwen-Image: Crafting with Native Text Rendering Not content with releasing six excellent open weights LLMs in July, Qwen are kicking off August with their first ever image generation model. Qwen-Image is a 20 billion parameter MMDiT (Multimodal Diffusion Transformer,…
-
Simon Willison’s Weblog: Quoting @himbodhisattva
Source URL: https://simonwillison.net/2025/Aug/4/himbodhisattva/#atom-everything Source: Simon Willison’s Weblog Title: Quoting @himbodhisattva Feedly Summary: for services that wrap GPT-3, is it possible to do the equivalent of sql injection? like, a prompt-injection attack? make it think it’s completed the task and then get access to the generation, and ask it to repeat the original instruction? — @himbodhisattva,…
-
Tomasz Tunguz: Small Action Models Are the Future of AI Agents
Source URL: https://www.tomtunguz.com/local-instructions/ Source: Tomasz Tunguz Title: Small Action Models Are the Future of AI Agents Feedly Summary: 2025 is the year of agents, & the key capability of agents is calling tools. When using Claude Code, I can tell the AI to sift through a newsletter, find all the links to startups, verify they…
-
Simon Willison’s Weblog: The ChatGPT sharing dialog demonstrates how difficult it is to design privacy preferences
Source URL: https://simonwillison.net/2025/Aug/3/privacy-design/ Source: Simon Willison’s Weblog Title: The ChatGPT sharing dialog demonstrates how difficult it is to design privacy preferences Feedly Summary: ChatGPT just removed their “make this chat discoverable" sharing feature, after it turned out a material volume of users had inadvertantly made their private chats available via Google search. Dane Stuckey, CISO…
-
Simon Willison’s Weblog: XBai o4
Source URL: https://simonwillison.net/2025/Aug/3/xbai-o4/#atom-everything Source: Simon Willison’s Weblog Title: XBai o4 Feedly Summary: XBai o4 Yet another open source (Apache 2.0) LLM from a Chinese AI lab. This model card claims: XBai o4 excels in complex reasoning capabilities and has now completely surpassed OpenAI-o3-mini in Medium mode. This a 32.8 billion parameter model released by MetaStone…