Tag: tokens

  • Simon Willison’s Weblog: llama.cpp guide: running gpt-oss with llama.cpp

    Source URL: https://simonwillison.net/2025/Aug/19/gpt-oss-with-llama-cpp/
    Source: Simon Willison’s Weblog
    Title: llama.cpp guide: running gpt-oss with llama.cpp
    Feedly Summary: Really useful official guide to running the OpenAI gpt-oss models using llama-server from llama.cpp – which provides an OpenAI-compatible localhost API and a neat web interface for interacting with the models. TLDR version…

  • Simon Willison’s Weblog: Google Gemini URL Context

    Source URL: https://simonwillison.net/2025/Aug/18/google-gemini-url-context/
    Source: Simon Willison’s Weblog
    Title: Google Gemini URL Context
    Feedly Summary: New feature in the Gemini API: you can now enable a url_context tool which the models can use to request the contents of URLs as part of replying to a prompt. I released llm-gemini 0.25 with a…

  • The Register: Little LLM on the RAM: Google’s Gemma 270M hits the scene

    Source URL: https://www.theregister.com/2025/08/15/little_llm_on_the_ram/
    Source: The Register
    Title: Little LLM on the RAM: Google’s Gemma 270M hits the scene
    Feedly Summary: A tiny model trained on trillions of tokens, ready for specialized tasks. Google has unveiled a pint-sized new addition to its “open” large language model lineup: Gemma 3 270M.…
    AI Summary and Description: Yes
    Summary:…

  • Docker: MCP Horror Stories: The GitHub Prompt Injection Data Heist

    Source URL: https://www.docker.com/blog/mcp-horror-stories-github-prompt-injection/
    Source: Docker
    Title: MCP Horror Stories: The GitHub Prompt Injection Data Heist
    Feedly Summary: This is Part 3 of our MCP Horror Stories series, where we examine real-world security incidents that validate the critical vulnerabilities threatening AI infrastructure and demonstrate how Docker MCP Toolkit provides enterprise-grade protection. The Model Context Protocol (MCP)…

  • Simon Willison’s Weblog: Claude Sonnet 4 now supports 1M tokens of context

    Source URL: https://simonwillison.net/2025/Aug/12/claude-sonnet-4-1m/
    Source: Simon Willison’s Weblog
    Title: Claude Sonnet 4 now supports 1M tokens of context
    Feedly Summary: Gemini and OpenAI both have million token models, so it’s good to see Anthropic catching up. This is 5x the previous 200,000 context length limit of the…

  • Simon Willison’s Weblog: AI for data engineers with Simon Willison

    Source URL: https://simonwillison.net/2025/Aug/11/ai-for-data-engineers/#atom-everything
    Source: Simon Willison’s Weblog
    Title: AI for data engineers with Simon Willison
    Feedly Summary: I recorded an episode last week with Claire Giordano for the Talking Postgres podcast. The topic was “AI for data engineers” but we ended up covering an enjoyable range of different…