Tag: context window
-
Simon Willison’s Weblog: Claude Sonnet 4 now supports 1M tokens of context
Source URL: https://simonwillison.net/2025/Aug/12/claude-sonnet-4-1m/ Source: Simon Willison’s Weblog Title: Claude Sonnet 4 now supports 1M tokens of context Feedly Summary: Claude Sonnet 4 now supports 1M tokens of context Gemini and OpenAI both have million token models, so it’s good to see Anthropic catching up. This is 5x the previous 200,000 context length limit of the…
-
Cloud Blog: Taming the stragglers: Maximize AI training performance with automated straggler detection
Source URL: https://cloud.google.com/blog/products/compute/stragglers-in-ai-a-guide-to-automated-straggler-detection/ Source: Cloud Blog Title: Taming the stragglers: Maximize AI training performance with automated straggler detection Feedly Summary: Stragglers are an industry-wide issue for developers working with large-scale machine learning workloads. The larger and more powerful these systems become, the more their performance is hostage to the subtle misbehavior of a single component.…
-
AWS News Blog: OpenAI open weight models now available on AWS
Source URL: https://aws.amazon.com/blogs/aws/openai-open-weight-models-now-available-on-aws/ Source: AWS News Blog Title: OpenAI open weight models now available on AWS Feedly Summary: AWS continues to expand access to the most advanced foundation models with OpenAI open weight models now available in Amazon Bedrock and Amazon SageMaker JumpStart. Accessing these new models from OpenAI on AWS, gpt-oss-120b and gpt-oss-20b, gives…
-
Simon Willison’s Weblog: Coding with LLMs in the summer of 2025 (an update)
Source URL: https://simonwillison.net/2025/Jul/21/coding-with-llms/#atom-everything Source: Simon Willison’s Weblog Title: Coding with LLMs in the summer of 2025 (an update) Feedly Summary: Coding with LLMs in the summer of 2025 (an update) Salvatore Sanfilippo describes his current AI-assisted development workflow. He’s all-in on LLMs for code review, exploratory prototyping, pair-design and writing “part of the code under…
-
Cloud Blog: Announcing Vertex AI Agent Engine Memory Bank available for everyone in preview
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/vertex-ai-memory-bank-in-public-preview/ Source: Cloud Blog Title: Announcing Vertex AI Agent Engine Memory Bank available for everyone in preview Feedly Summary: Developers are racing to productize agents, but a common limitation is the absence of memory. Without memory, agents treat each interaction as the first, asking repetitive questions and failing to recall user preferences. This…
-
Irrational Exuberance: What can agents actually do?
Source URL: https://lethain.com/what-can-agents-do/ Source: Irrational Exuberance Title: What can agents actually do? Feedly Summary: There’s a lot of excitement about what AI (specifically the latest wave of LLM-anchored AI) can do, and how AI-first companies are different from the prior generations of companies. There are a lot of important and real opportunities at hand, but…
-
Simon Willison’s Weblog: How to Fix Your Context
Source URL: https://simonwillison.net/2025/Jun/29/how-to-fix-your-context/#atom-everything Source: Simon Willison’s Weblog Title: How to Fix Your Context Feedly Summary: How to Fix Your Context Drew Breunig has been publishing some very detailed notes on context engineering recently. In How Long Contexts Fail he described four common patterns for context rot, which he summarizes like so: Context Poisoning: When a…
-
Simon Willison’s Weblog: Gemini CLI
Source URL: https://simonwillison.net/2025/Jun/25/gemini-cli/ Source: Simon Willison’s Weblog Title: Gemini CLI Feedly Summary: Gemini CLI First there was Claude Code in February, then OpenAI Codex (CLI) in April, and now Gemini CLI in June. All three of the largest AI labs now have their own version of what I am calling a “terminal agent" – a…
-
Cloud Blog: The secret to document intelligence: Box builds Enhanced Extract Agents using Google’s Agent-2-Agent framework
Source URL: https://cloud.google.com/blog/topics/customers/box-ai-agents-with-googles-agent-2-agent-protocol/ Source: Cloud Blog Title: The secret to document intelligence: Box builds Enhanced Extract Agents using Google’s Agent-2-Agent framework Feedly Summary: Box is one of the original information sharing and collaboration platforms of the digital era. They’ve helped define how we work, and have continued to evolve those practices alongside successive waves of…
-
Simon Willison’s Weblog: How OpenElections Uses LLMs
Source URL: https://simonwillison.net/2025/Jun/19/how-openelections-uses-llms/#atom-everything Source: Simon Willison’s Weblog Title: How OpenElections Uses LLMs Feedly Summary: How OpenElections Uses LLMs The OpenElections project collects detailed election data for the USA, all the way down to the precinct level. This is a surprisingly hard problem: while county and state-level results are widely available, precinct-level results are published in…