Tag: llama
- 
		
		
		Simon Willison’s Weblog: llm-github-models 0.15Source URL: https://simonwillison.net/2025/May/29/llm-github-models-015/ Source: Simon Willison’s Weblog Title: llm-github-models 0.15 Feedly Summary: llm-github-models 0.15 Anthony Shaw’s llm-github-models plugin just got an upgrade: it now supports LLM 0.26 tool use for a subset of the models hosted on the GitHub Models API, contributed by Caleb Brose. The neat thing about this GitHub Models plugin is that… 
- 
		
		
		Simon Willison’s Weblog: llm-llama-server 0.2Source URL: https://simonwillison.net/2025/May/28/llama-server-tools/ Source: Simon Willison’s Weblog Title: llm-llama-server 0.2 Feedly Summary: llm-llama-server 0.2 Here’s a second option for using LLM’s new tool support against local models (the first was via llm-ollama). It turns out the llama.cpp ecosystem has pretty robust OpenAI-compatible tool support already, so my llm-llama-server plugin only needed a quick upgrade to… 
- 
		
		
		Simon Willison’s Weblog: Large Language Models can run tools in your terminal with LLM 0.26Source URL: https://simonwillison.net/2025/May/27/llm-tools/ Source: Simon Willison’s Weblog Title: Large Language Models can run tools in your terminal with LLM 0.26 Feedly Summary: LLM 0.26 is out with the biggest new feature since I started the project: support for tools. You can now use the LLM CLI tool – and Python library – to grant LLMs… 
- 
		
		
		Cloud Blog: Vertex AI Studio, redesigned: Your source for generative AI media models across all modalitiesSource URL: https://cloud.google.com/blog/products/ai-machine-learning/vertex-ai-studio-redesigned/ Source: Cloud Blog Title: Vertex AI Studio, redesigned: Your source for generative AI media models across all modalities Feedly Summary: Google Cloud’s Vertex AI platform makes it easy to experiment with and customize over 200 advanced foundation models – like the latest Google Gemini models, and third-party partner models such as Meta’s… 
- 
		
		
		Slashdot: People Should Know About the ‘Beliefs’ LLMs Form About Them While ConversingSource URL: https://slashdot.org/story/25/05/24/1946203/people-should-know-about-the-beliefs-llms-form-about-them-while-conversing Source: Slashdot Title: People Should Know About the ‘Beliefs’ LLMs Form About Them While Conversing Feedly Summary: AI Summary and Description: Yes Summary: The text discusses the implications of using large language models (LLMs) like Llama that exhibit human-like biases based on user interactions. This raises critical policy and ethical issues related… 
- 
		
		
		Simon Willison’s Weblog: DevstralSource URL: https://simonwillison.net/2025/May/21/devstral/#atom-everything Source: Simon Willison’s Weblog Title: Devstral Feedly Summary: Devstral New Apache 2.0 licensed LLM release from Mistral, this time specifically trained for code. Devstral achieves a score of 46.8% on SWE-Bench Verified, outperforming prior open-source SoTA models by more than 6% points. When evaluated under the same test scaffold (OpenHands, provided by… 
- 
		
		
		Simon Willison’s Weblog: Gemini DiffusionSource URL: https://simonwillison.net/2025/May/21/gemini-diffusion/ Source: Simon Willison’s Weblog Title: Gemini Diffusion Feedly Summary: Gemini Diffusion Another of the announcements from Google I/O yesterday was Gemini Diffusion, Google’s first LLM to use diffusion (similar to image models like Imagen and Stable Diffusion) in place of transformers. Google describe it like this: Traditional autoregressive language models generate text… 
- 
		
		
		Simon Willison’s Weblog: We did the math on AI’s energy footprint. Here’s the story you haven’t heard.Source URL: https://simonwillison.net/2025/May/20/ai-energy-footprint/#atom-everything Source: Simon Willison’s Weblog Title: We did the math on AI’s energy footprint. Here’s the story you haven’t heard. Feedly Summary: We did the math on AI’s energy footprint. Here’s the story you haven’t heard. James O’Donnell and Casey Crownhart try to pull together a detailed account of AI energy usage for… 
- 
		
		
		AWS News Blog: AWS Weekly Roundup: Strands Agents, AWS Transform, Amazon Bedrock Guardrails, AWS CodeBuild, and more (May 19, 2025)Source URL: https://aws.amazon.com/blogs/aws/aws-weekly-roundup-strands-agents-aws-transform-amazon-bedrock-guardrails-aws-codebuild-and-more-may-19-2025/ Source: AWS News Blog Title: AWS Weekly Roundup: Strands Agents, AWS Transform, Amazon Bedrock Guardrails, AWS CodeBuild, and more (May 19, 2025) Feedly Summary: Many events are taking place in this period! Last week I was at the AI Week in Italy. This week I’ll be in Zurich for the AWS Community… 
- 
		
		
		Simon Willison’s Weblog: qwen2.5vl in OllamaSource URL: https://simonwillison.net/2025/May/18/qwen25vl-in-ollama/#atom-everything Source: Simon Willison’s Weblog Title: qwen2.5vl in Ollama Feedly Summary: qwen2.5vl in Ollama Ollama announced a complete overhaul of their vision support the other day. Here’s the first new model they’ve shipped since then – a packaged version of Qwen 2.5 VL which was first released on January 26th 2025. Here are…