Tag: mini

  • Simon Willison’s Weblog: Leaked Windsurf prompt

    Source URL: https://simonwillison.net/2025/Feb/25/leaked-windsurf-prompt/ Source: Simon Willison’s Weblog Title: Leaked Windsurf prompt Feedly Summary: Leaked Windsurf prompt The Windurf Editor is Codeium’s highly regarded entrant into the fork-of-VS-code AI-enhanced IDE model first pioneered by Cursor (and by VS Code itself). I heard online that it had a quirky system prompt, and was able to replicate that…

  • The Register: How nice that state-of-the-art LLMs reveal their reasoning … for miscreants to exploit

    Source URL: https://www.theregister.com/2025/02/25/chain_of_thought_jailbreaking/ Source: The Register Title: How nice that state-of-the-art LLMs reveal their reasoning … for miscreants to exploit Feedly Summary: Blueprints shared for jail-breaking models that expose their chain-of-thought process Analysis AI models like OpenAI o1/o3, DeepSeek-R1, and Gemini 2.0 Flash Thinking can mimic human reasoning through a process called chain of thought.……

  • Hacker News: Grab AI Gateway: Connecting Grabbers to Multiple GenAI Providers

    Source URL: https://engineering.grab.com/grab-ai-gateway Source: Hacker News Title: Grab AI Gateway: Connecting Grabbers to Multiple GenAI Providers Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the implementation and significance of Grab’s AI Gateway, an integrated platform that facilitates access to multiple AI providers for users within the organization. It highlights the gateway’s…

  • Simon Willison’s Weblog: Aider Polyglot leaderboard results for Claude 3.7 Sonnet

    Source URL: https://simonwillison.net/2025/Feb/25/aider-polyglot-leaderboard/ Source: Simon Willison’s Weblog Title: Aider Polyglot leaderboard results for Claude 3.7 Sonnet Feedly Summary: Aider Polyglot leaderboard results for Claude 3.7 Sonnet Paul Gauthier’s Aider Polyglot benchmark is one of my favourite independent benchmarks for LLMs, partly because it focuses on code and partly because Paul is very responsive at evaluating…

  • Rekt: Infini – Rekt

    Source URL: https://www.rekt.news/infini-rekt Source: Rekt Title: Infini – Rekt Feedly Summary: The perfect DeFi hack. No flash loans, no zero-days. Just a rogue dev who built a backdoor, waited 114 days, then drained $49.5M from Infini with admin privileges. Same old story, new-age incompetence. When will protocols learn that admin keys aren’t toys? AI Summary…

  • Simon Willison’s Weblog: Claude 3.7 Sonnet and Claude Code

    Source URL: https://simonwillison.net/2025/Feb/24/claude-37-sonnet-and-claude-code/#atom-everything Source: Simon Willison’s Weblog Title: Claude 3.7 Sonnet and Claude Code Feedly Summary: Claude 3.7 Sonnet and Claude Code Anthropic released Claude 3.7 Sonnet today – skipping the name “Claude 3.6" because the Anthropic user community had already started using that as the unofficial name for their October update to 3.5 Sonnet.…

  • Cloud Blog: Announcing Claude 3.7 Sonnet, Anthropic’s first hybrid reasoning model, is available on Vertex AI

    Source URL: https://cloud.google.com/blog/products/ai-machine-learning/anthropics-claude-3-7-sonnet-is-available-on-vertex-ai/ Source: Cloud Blog Title: Announcing Claude 3.7 Sonnet, Anthropic’s first hybrid reasoning model, is available on Vertex AI Feedly Summary: Today, we’re announcing Claude 3.7 Sonnet, Anthropic’s most intelligent model to date and the first hybrid reasoning model on the market, is available in preview on Vertex AI Model Garden. Claude 3.7…

  • Hacker News: Claude 3.7 Sonnet and Claude Code

    Source URL: https://www.anthropic.com/news/claude-3-7-sonnet Source: Hacker News Title: Claude 3.7 Sonnet and Claude Code Feedly Summary: Comments AI Summary and Description: Yes Summary: The announcement details the launch of Claude 3.7 Sonnet, a significant advancement in AI models, touted as the first hybrid reasoning model capable of providing both instant responses and longer, more thoughtful outputs.…