Tag: prompt-engineering

  • Simon Willison’s Weblog: microsoft/vscode-copilot-chat

    Source URL: https://simonwillison.net/2025/Jun/30/vscode-copilot-chat/#atom-everything Source: Simon Willison’s Weblog Title: microsoft/vscode-copilot-chat Feedly Summary: microsoft/vscode-copilot-chat As promised at Build 2025 in May, Microsoft have released the GitHub Copilot Chat client for VS Code under an open source (MIT) license. So far this is just the extension that provides the chat component of Copilot, but the launch announcement promises…

  • Simon Willison’s Weblog: How to Fix Your Context

    Source URL: https://simonwillison.net/2025/Jun/29/how-to-fix-your-context/#atom-everything Source: Simon Willison’s Weblog Title: How to Fix Your Context Feedly Summary: How to Fix Your Context Drew Breunig has been publishing some very detailed notes on context engineering recently. In How Long Contexts Fail he described four common patterns for context rot, which he summarizes like so: Context Poisoning: When a…

  • Simon Willison’s Weblog: Anthropic: How we built our multi-agent research system

    Source URL: https://simonwillison.net/2025/Jun/14/multi-agent-research-system/#atom-everything Source: Simon Willison’s Weblog Title: Anthropic: How we built our multi-agent research system Feedly Summary: Anthropic: How we built our multi-agent research system OK, I’m sold on multi-agent LLM systems now. I’ve been pretty skeptical of these until recently: why make your life more complicated by running multiple different prompts in parallel…

  • Simon Willison’s Weblog: claude-trace

    Source URL: https://simonwillison.net/2025/Jun/2/claude-trace/ Source: Simon Willison’s Weblog Title: claude-trace Feedly Summary: claude-trace I’ve been thinking for a while it would be interesting to run some kind of HTTP proxy against the Claude Code CLI app and take a peek at how it works. Mario Zechner just published a really nice version of that. It works…

  • Cloud Blog: Create shareable generative AI apps in less than 60 seconds with Vertex AI and Cloud Run

    Source URL: https://cloud.google.com/blog/products/ai-machine-learning/create-gen-ai-apps-in-less-than-60-seconds-with-vertex-ai/ Source: Cloud Blog Title: Create shareable generative AI apps in less than 60 seconds with Vertex AI and Cloud Run Feedly Summary: Want to turn your generative AI ideas into real web applications with one click?  Any developer knows it’s a complex process to build shareable, interactive applications: you have to set…

  • Simon Willison’s Weblog: System Card: Claude Opus 4 & Claude Sonnet 4

    Source URL: https://simonwillison.net/2025/May/25/claude-4-system-card/#atom-everything Source: Simon Willison’s Weblog Title: System Card: Claude Opus 4 & Claude Sonnet 4 Feedly Summary: System Card: Claude Opus 4 & Claude Sonnet 4 Direct link to a PDF on Anthropic’s CDN because they don’t appear to have a landing page anywhere for this document. Anthropic’s system cards are always worth…

  • Simon Willison’s Weblog: Gemini 2.5 Models now support implicit caching

    Source URL: https://simonwillison.net/2025/May/9/gemini-implicit-caching/#atom-everything Source: Simon Willison’s Weblog Title: Gemini 2.5 Models now support implicit caching Feedly Summary: Gemini 2.5 Models now support implicit caching I just spotted a cacheTokensDetails key in the token usage JSON while running a long chain of prompts against Gemini 2.5 Flash – despite not configuring caching myself: {“cachedContentTokenCount": 200658, "promptTokensDetails":…

  • Simon Willison’s Weblog: Quoting Claude’s system prompt

    Source URL: https://simonwillison.net/2025/May/8/claudes-system-prompt/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Claude’s system prompt Feedly Summary: If asked to write poetry, Claude avoids using hackneyed imagery or metaphors or predictable rhyming schemes. — Claude’s system prompt, via Drew Breunig Tags: drew-breunig, prompt-engineering, anthropic, claude, generative-ai, ai, llms AI Summary and Description: Yes Summary: The text pertains to…

  • Simon Willison’s Weblog: Dummy’s Guide to Modern LLM Sampling

    Source URL: https://simonwillison.net/2025/May/4/llm-sampling/#atom-everything Source: Simon Willison’s Weblog Title: Dummy’s Guide to Modern LLM Sampling Feedly Summary: Dummy’s Guide to Modern LLM Sampling This is an extremely useful, detailed set of explanations by @AlpinDale covering the various different sampling strategies used by modern LLMs. LLMs return a set of next-token probabilities for every token in their…