Tag: tokens

  • Tomasz Tunguz: 1000x Increase in AI Demand

    Source URL: https://www.tomtunguz.com/nvda-2025-05-29/ Source: Tomasz Tunguz Title: 1000x Increase in AI Demand Feedly Summary: NVIDIA announced earnings yesterday. In addition to continued exceptional growth, the most interesting observations revolve around a shift from simple one-shot AI to reasoning. Reasoning improves accuracy for robots – like telling a person to stop and think about an answer…

  • Simon Willison’s Weblog: llm-tools-exa

    Source URL: https://simonwillison.net/2025/May/29/llm-tools-exa/ Source: Simon Willison’s Weblog Title: llm-tools-exa Feedly Summary: llm-tools-exa When I shipped LLM 0.26 yesterday one of the things I was most excited about was seeing what new tool plugins people would build for it. Dan Turkel’s llm-tools-exa is one of the first. It adds web search to LLM using Exa (previously),…

  • Simon Willison’s Weblog: Codestral Embed

    Source URL: https://simonwillison.net/2025/May/28/codestral-embed/#atom-everything Source: Simon Willison’s Weblog Title: Codestral Embed Feedly Summary: Codestral Embed Brand new embedding model from Mistral, specifically trained for code. Mistral claim that: Codestral Embed significantly outperforms leading code embedders in the market today: Voyage Code 3, Cohere Embed v4.0 and OpenAI’s large embedding model. The model is designed to work…

  • Simon Willison’s Weblog: Large Language Models can run tools in your terminal with LLM 0.26

    Source URL: https://simonwillison.net/2025/May/27/llm-tools/ Source: Simon Willison’s Weblog Title: Large Language Models can run tools in your terminal with LLM 0.26 Feedly Summary: LLM 0.26 is out with the biggest new feature since I started the project: support for tools. You can now use the LLM CLI tool – and Python library – to grant LLMs…

  • Simon Willison’s Weblog: GitHub MCP Exploited: Accessing private repositories via MCP

    Source URL: https://simonwillison.net/2025/May/26/github-mcp-exploited/#atom-everything Source: Simon Willison’s Weblog Title: GitHub MCP Exploited: Accessing private repositories via MCP Feedly Summary: GitHub MCP Exploited: Accessing private repositories via MCP GitHub’s official MCP server grants LLMs a whole host of new abilities, including being able to read and issues in repositories the user has access to and submit new…

  • Simon Willison’s Weblog: Remote Prompt Injection in GitLab Duo Leads to Source Code Theft

    Source URL: https://simonwillison.net/2025/May/23/remote-prompt-injection-in-gitlab-duo/ Source: Simon Willison’s Weblog Title: Remote Prompt Injection in GitLab Duo Leads to Source Code Theft Feedly Summary: Remote Prompt Injection in GitLab Duo Leads to Source Code Theft Yet another example of the classic Markdown image exfiltration attack, this time affecting GitLab Duo – GitLab’s chatbot. Omer Mayraz reports on how…

  • Simon Willison’s Weblog: Updated Anthropic model comparison table

    Source URL: https://simonwillison.net/2025/May/22/updated-anthropic-models/#atom-everything Source: Simon Willison’s Weblog Title: Updated Anthropic model comparison table Feedly Summary: Updated Anthropic model comparison table A few details in here about Claude 4 that I hadn’t spotted elsewhere: The training cut-off date for Claude Opus 4 and Claude Sonnet 4 is March 2025! That’s the most recent cut-off for any…

  • Simon Willison’s Weblog: Gemini Diffusion

    Source URL: https://simonwillison.net/2025/May/21/gemini-diffusion/ Source: Simon Willison’s Weblog Title: Gemini Diffusion Feedly Summary: Gemini Diffusion Another of the announcements from Google I/O yesterday was Gemini Diffusion, Google’s first LLM to use diffusion (similar to image models like Imagen and Stable Diffusion) in place of transformers. Google describe it like this: Traditional autoregressive language models generate text…