Tag: prompt-engineering

  • Simon Willison’s Weblog: Models can prompt now

    Source URL: https://simonwillison.net/2025/Sep/14/models-can-prompt/#atom-everything Source: Simon Willison’s Weblog Title: Models can prompt now Feedly Summary: Here’s an interesting example of models incrementally improving over time: I am finding that today’s leading models are competent at writing prompts for themselves and each other. A year ago I was quite skeptical of the pattern where models are used…

  • Simon Willison’s Weblog: DeepSeek 3.1

    Source URL: https://simonwillison.net/2025/Aug/22/deepseek-31/#atom-everything Source: Simon Willison’s Weblog Title: DeepSeek 3.1 Feedly Summary: The latest model from DeepSeek, a 685B monster (like DeepSeek v3 before it) but this time it’s a hybrid reasoning model. DeepSeek claim: DeepSeek-V3.1-Think achieves comparable answer quality to DeepSeek-R1-0528, while responding more quickly. Drew Breunig points out that their benchmarks…

  • Simon Willison’s Weblog: too many model context protocol servers and LLM allocations on the dance floor

    Source URL: https://simonwillison.net/2025/Aug/22/too-many-mcps/#atom-everything Source: Simon Willison’s Weblog Title: too many model context protocol servers and LLM allocations on the dance floor Feedly Summary: Useful reminder from Geoffrey Huntley of the infrequently discussed significant token cost of using MCP. Geoffrey estimates that…

  • Slashdot: KPMG Wrote 100-Page Prompt To Build Agentic TaxBot

    Source URL: https://slashdot.org/story/25/08/22/1110257/kpmg-wrote-100-page-prompt-to-build-agentic-taxbot?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: KPMG Wrote 100-Page Prompt To Build Agentic TaxBot Feedly Summary: The text discusses how KPMG Australia is utilizing advanced AI technology to automate complex tax advisory processes. By creating a sophisticated AI agent that leverages extensive prompt engineering, they significantly reduce the time…

  • Simon Willison’s Weblog: GPT-5 has a hidden system prompt

    Source URL: https://simonwillison.net/2025/Aug/15/gpt-5-has-a-hidden-system-prompt/#atom-everything Source: Simon Willison’s Weblog Title: GPT-5 has a hidden system prompt Feedly Summary: It looks like GPT-5 when accessed via the OpenAI API may have its own hidden system prompt, independent from the system prompt you can specify in an API call. At the very least…

  • Simon Willison’s Weblog: Reverse engineering some updates to Claude

    Source URL: https://simonwillison.net/2025/Jul/31/updates-to-claude/#atom-everything Source: Simon Willison’s Weblog Title: Reverse engineering some updates to Claude Feedly Summary: Anthropic released two major new features for their consumer-facing Claude apps in the past couple of days. Sadly, they don’t do a very good job of updating the release notes for those apps – neither of these releases came…

  • Simon Willison’s Weblog: OpenAI: Introducing study mode

    Source URL: https://simonwillison.net/2025/Jul/29/openai-introducing-study-mode/#atom-everything Source: Simon Willison’s Weblog Title: OpenAI: Introducing study mode Feedly Summary: New ChatGPT feature, which can be triggered by typing /study or by visiting chatgpt.com/studymode. OpenAI say: Under the hood, study mode is powered by custom system instructions we’ve written in collaboration with teachers, scientists, and pedagogy experts…

  • Simon Willison’s Weblog: xAI: "We spotted a couple of issues with Grok 4 recently that we immediately investigated & mitigated"

    Source URL: https://simonwillison.net/2025/Jul/15/xai-mitigated/ Source: Simon Willison’s Weblog Title: xAI: "We spotted a couple of issues with Grok 4 recently that we immediately investigated & mitigated" Feedly Summary: xAI: "We spotted a couple of issues with Grok 4 recently that we immediately investigated & mitigated". They continue: One was that if you ask it "What is…

  • Simon Willison’s Weblog: Quoting @grok

    Source URL: https://simonwillison.net/2025/Jul/12/grok/#atom-everything Source: Simon Willison’s Weblog Title: Quoting @grok Feedly Summary: On the morning of July 8, 2025, we observed undesired responses and immediately began investigating. To identify the specific language in the instructions causing the undesired behavior, we conducted multiple ablations and experiments to pinpoint the main culprits. We identified the operative lines…