Tag: system prompt
-
CSA: The OWASP Top 10 for LLMs: CSA’s Defense Playbook
Source URL: https://cloudsecurityalliance.org/articles/the-owasp-top-10-for-llms-csa-s-strategic-defense-playbook Source: CSA Title: The OWASP Top 10 for LLMs: CSA’s Defense Playbook Feedly Summary: AI Summary and Description: Yes Summary: The text outlines the OWASP Top 10 vulnerabilities specific to large language models (LLMs) and provides actionable guidance from the Cloud Security Alliance (CSA) to mitigate these risks. This is crucial for…
-
Simon Willison’s Weblog: Quoting Claude’s system prompt
Source URL: https://simonwillison.net/2025/May/8/claudes-system-prompt/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Claude’s system prompt Feedly Summary: If asked to write poetry, Claude avoids using hackneyed imagery or metaphors or predictable rhyming schemes. — Claude’s system prompt, via Drew Breunig Tags: drew-breunig, prompt-engineering, anthropic, claude, generative-ai, ai, llms AI Summary and Description: Yes Summary: The text pertains to…
-
Simon Willison’s Weblog: Saying "hi" to Microsoft’s Phi-4-reasoning
Source URL: https://simonwillison.net/2025/May/6/phi-4-reasoning/#atom-everything Source: Simon Willison’s Weblog Title: Saying "hi" to Microsoft’s Phi-4-reasoning Feedly Summary: Microsoft released a new sub-family of models a few days ago: Phi-4 reasoning. They introduced them in this blog post celebrating a year since the release of Phi-3: Today, we are excited to introduce Phi-4-reasoning, Phi-4-reasoning-plus, and Phi-4-mini-reasoning – marking…
-
Simon Willison’s Weblog: Feed a video to a vision LLM as a sequence of JPEG frames on the CLI (also LLM 0.25)
Source URL: https://simonwillison.net/2025/May/5/llm-video-frames/#atom-everything Source: Simon Willison’s Weblog Title: Feed a video to a vision LLM as a sequence of JPEG frames on the CLI (also LLM 0.25) Feedly Summary: The new llm-video-frames plugin can turn a video file into a sequence of JPEG frames and feed them directly into a long context vision LLM such…
-
Simon Willison’s Weblog: Expanding on what we missed with sycophancy
Source URL: https://simonwillison.net/2025/May/2/what-we-missed-with-sycophancy/ Source: Simon Willison’s Weblog Title: Expanding on what we missed with sycophancy Feedly Summary: Expanding on what we missed with sycophancy I criticized OpenAI’s initial post about their recent ChatGPT sycophancy rollback as being “relatively thin" so I’m delighted that they have followed it with a much more in-depth explanation of what…
-
Simon Willison’s Weblog: Sycophancy in GPT-4o: What happened and what we’re doing about it
Source URL: https://simonwillison.net/2025/Apr/30/sycophancy-in-gpt-4o/ Source: Simon Willison’s Weblog Title: Sycophancy in GPT-4o: What happened and what we’re doing about it Feedly Summary: Sycophancy in GPT-4o: What happened and what we’re doing about it Relatively thin post from OpenAI talking about their recent rollback of the GPT-4o model that made the model way too sycophantic – “overly…
-
Simon Willison’s Weblog: A comparison of ChatGPT/GPT-4o’s previous and current system prompts
Source URL: https://simonwillison.net/2025/Apr/29/chatgpt-sycophancy-prompt/ Source: Simon Willison’s Weblog Title: A comparison of ChatGPT/GPT-4o’s previous and current system prompts Feedly Summary: A comparison of ChatGPT/GPT-4o’s previous and current system prompts GPT-4o’s recent update caused it to be way too sycophantic and disingenuously praise anything the user said. OpenAI’s Aidan McLaughlin: last night we rolled out our first…
-
Simon Willison’s Weblog: A comparison of ChatGPT/GPT-4o’s previous and current system prompts
Source URL: https://simonwillison.net/2025/Apr/29/a-comparison-of-chatgptgpt-4os-previous-and-current-system-promp/#atom-everything Source: Simon Willison’s Weblog Title: A comparison of ChatGPT/GPT-4o’s previous and current system prompts Feedly Summary: A comparison of ChatGPT/GPT-4o’s previous and current system prompts GPT-4o’s recent update caused it to be way too sycophantic and disingenuously praise anything the user said. OpenAI’s Aidan McLaughlin: last night we rolled out our first…
-
Slashdot: YC Partner Argues Most AI Apps Are Currently ‘Horseless Carriages’
Source URL: https://it.slashdot.org/story/25/04/25/1545223/yc-partner-argues-most-ai-apps-are-currently-horseless-carriages?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: YC Partner Argues Most AI Apps Are Currently ‘Horseless Carriages’ Feedly Summary: AI Summary and Description: Yes Summary: Pete Koomen critiques current AI applications for overly constraining their models and emphasizes the need for enhanced customization in AI systems, particularly in user-agent interactions. His insights into the limitations of…
-
Simon Willison’s Weblog: debug-gym
Source URL: https://simonwillison.net/2025/Mar/31/debug-gym/#atom-everything Source: Simon Willison’s Weblog Title: debug-gym Feedly Summary: debug-gym New paper and code from Microsoft Research that experiments with giving LLMs access to the Python debugger. They found that the best models could indeed improve their results by running pdb as a tool. They saw the best results overall from Claude 3.7…