Tag: prompt-engineering

  • Simon Willison’s Weblog: Let the LLM Write the Prompts: An Intro to DSPy in Compound AI Pipelines

    Source URL: https://simonwillison.net/2025/Oct/4/drew-on-dspy/#atom-everything Source: Simon Willison’s Weblog Title: Let the LLM Write the Prompts: An Intro to DSPy in Compound AI Pipelines Feedly Summary: Let the LLM Write the Prompts: An Intro to DSPy in Compound AI Pipelines I’ve had trouble getting my head around DSPy in the past. This half hour talk by Drew…

  • Simon Willison’s Weblog: GPT-5-Codex

    Source URL: https://simonwillison.net/2025/Sep/23/gpt-5-codex/#atom-everything Source: Simon Willison’s Weblog Title: GPT-5-Codex Feedly Summary: GPT-5-Codex OpenAI half-released this model earlier this month, adding it to their Codex CLI tool but not their API. Today they’ve fixed that – the new model can now be accessed as gpt-5-codex. It’s priced the same as regular GPT-5: $1.25/million input tokens, $10/million…

  • Simon Willison’s Weblog: CompileBench: Can AI Compile 22-year-old Code?

    Source URL: https://simonwillison.net/2025/Sep/22/compilebench/ Source: Simon Willison’s Weblog Title: CompileBench: Can AI Compile 22-year-old Code? Feedly Summary: CompileBench: Can AI Compile 22-year-old Code? Interesting new LLM benchmark from Piotr Grabowski and Piotr Migdał: how well can different models handle compilation challenges such as cross-compiling gucr for ARM64 architecture? This is one of my favorite applications of…

  • Simon Willison’s Weblog: Models can prompt now

    Source URL: https://simonwillison.net/2025/Sep/14/models-can-prompt/#atom-everything Source: Simon Willison’s Weblog Title: Models can prompt now Feedly Summary: Here’s an interesting example of models incrementally improving over time: I am finding that today’s leading models are competent at writing prompts for themselves and each other. A year ago I was quite skeptical of the pattern where models are used…

  • Simon Willison’s Weblog: DeepSeek 3.1

    Source URL: https://simonwillison.net/2025/Aug/22/deepseek-31/#atom-everything Source: Simon Willison’s Weblog Title: DeepSeek 3.1 Feedly Summary: DeepSeek 3.1 The latest model from DeepSeek, a 685B monster (like DeepSeek v3 before it) but this time it’s a hybrid reasoning model. DeepSeek claim: DeepSeek-V3.1-Think achieves comparable answer quality to DeepSeek-R1-0528, while responding more quickly. Drew Breunig points out that their benchmarks…

  • Simon Willison’s Weblog: too many model context protocol servers and LLM allocations on the dance floor

    Source URL: https://simonwillison.net/2025/Aug/22/too-many-mcps/#atom-everything Source: Simon Willison’s Weblog Title: too many model context protocol servers and LLM allocations on the dance floor Feedly Summary: too many model context protocol servers and LLM allocations on the dance floor Useful reminder from Geoffrey Huntley of the infrequently discussed significant token cost of using MCP. Geoffrey estimates that…

  • Slashdot: KPMG Wrote 100-Page Prompt To Build Agentic TaxBot

    Source URL: https://slashdot.org/story/25/08/22/1110257/kpmg-wrote-100-page-prompt-to-build-agentic-taxbot?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: KPMG Wrote 100-Page Prompt To Build Agentic TaxBot Feedly Summary: The text discusses how KPMG Australia is utilizing advanced AI technology to automate complex tax advisory processes. By creating a sophisticated AI agent that leverages extensive prompt engineering, they significantly reduce the time…

  • Simon Willison’s Weblog: GPT-5 has a hidden system prompt

    Source URL: https://simonwillison.net/2025/Aug/15/gpt-5-has-a-hidden-system-prompt/#atom-everything Source: Simon Willison’s Weblog Title: GPT-5 has a hidden system prompt Feedly Summary: GPT-5 has a hidden system prompt It looks like GPT-5 when accessed via the OpenAI API may have its own hidden system prompt, independent from the system prompt you can specify in an API call. At the very least…

  • Simon Willison’s Weblog: Reverse engineering some updates to Claude

    Source URL: https://simonwillison.net/2025/Jul/31/updates-to-claude/#atom-everything Source: Simon Willison’s Weblog Title: Reverse engineering some updates to Claude Feedly Summary: Anthropic released two major new features for their consumer-facing Claude apps in the past couple of days. Sadly, they don’t do a very good job of updating the release notes for those apps – neither of these releases came…

  • Simon Willison’s Weblog: OpenAI: Introducing study mode

    Source URL: https://simonwillison.net/2025/Jul/29/openai-introducing-study-mode/#atom-everything Source: Simon Willison’s Weblog Title: OpenAI: Introducing study mode Feedly Summary: OpenAI: Introducing study mode New ChatGPT feature, which can be triggered by typing /study or by visiting chatgpt.com/studymode. OpenAI say: Under the hood, study mode is powered by custom system instructions we’ve written in collaboration with teachers, scientists, and pedagogy experts…