Tag: tasks

  • Hamel’s Blog: LLM Eval FAQ

    Source URL: https://hamel.dev/blog/posts/evals-faq/ Source: Hamel’s Blog Title: LLM Eval FAQ Feedly Summary: Our Course On AI Evals I’m teaching a course on AI Evals with Shreya Shankar. Here are some of the most common questions we’ve been asked. We’ll be updating this list frequently. Q: Is RAG dead? Question: Should I avoid using RAG for…

  • Simon Willison’s Weblog: llm-tools-exa

    Source URL: https://simonwillison.net/2025/May/29/llm-tools-exa/ Source: Simon Willison’s Weblog Title: llm-tools-exa Feedly Summary: llm-tools-exa When I shipped LLM 0.26 yesterday one of the things I was most excited about was seeing what new tool plugins people would build for it. Dan Turkel’s llm-tools-exa is one of the first. It adds web search to LLM using Exa (previously),…

  • Simon Willison’s Weblog: llm-mistral 0.14

    Source URL: https://simonwillison.net/2025/May/29/llm-mistral-014/#atom-everything Source: Simon Willison’s Weblog Title: llm-mistral 0.14 Feedly Summary: llm-mistral 0.14 I added tool-support to my plugin for accessing the Mistral API from LLM today, plus support for Mistral’s new Codestral Embed embedding model. An interesting challenge here is that I’m not using an official client library for llm-mistral – I rolled…

  • Scott Logic: Bridging the AI Valley of Doubt

    Source URL: https://blog.scottlogic.com/2025/05/23/bridging-the-AI-valley-of-doubt.html Source: Scott Logic Title: Bridging the AI Valley of Doubt Feedly Summary: Despite the UK being the world’s third-largest AI industry, only 1 in 6 UK firms are actively using AI due to financial, skills, and risk concerns, but businesses can bridge this “valley of doubt" by adopting measured "AI in the…

  • Simon Willison’s Weblog: Large Language Models can run tools in your terminal with LLM 0.26

    Source URL: https://simonwillison.net/2025/May/27/llm-tools/ Source: Simon Willison’s Weblog Title: Large Language Models can run tools in your terminal with LLM 0.26 Feedly Summary: LLM 0.26 is out with the biggest new feature since I started the project: support for tools. You can now use the LLM CLI tool – and Python library – to grant LLMs…

  • Cloud Blog: Vertex AI Studio, redesigned: Your source for generative AI media models across all modalities

    Source URL: https://cloud.google.com/blog/products/ai-machine-learning/vertex-ai-studio-redesigned/ Source: Cloud Blog Title: Vertex AI Studio, redesigned: Your source for generative AI media models across all modalities Feedly Summary: Google Cloud’s Vertex AI platform makes it easy to experiment with and customize over 200 advanced foundation models – like the latest Google Gemini models, and third-party partner models such as Meta’s…

  • Microsoft Security Blog: The future of AI agents—and why OAuth must evolve

    Source URL: https://techcommunity.microsoft.com/blog/microsoft-entra-blog/the-future-of-ai-agents%E2%80%94and-why-oauth-must-evolve/3827391%20 Source: Microsoft Security Blog Title: The future of AI agents—and why OAuth must evolve Feedly Summary: Our industry needs to continue working together on identity standards for agent access across systems. Read about how Microsoft is building a robust and sophisticated set of agents. The post The future of AI agents—and why…

  • Simon Willison’s Weblog: Build AI agents with the Mistral Agents API

    Source URL: https://simonwillison.net/2025/May/27/mistral-agents-api/ Source: Simon Willison’s Weblog Title: Build AI agents with the Mistral Agents API Feedly Summary: Build AI agents with the Mistral Agents API Big upgrade to Mistral’s API this morning: they’ve announced a new “Agents API". Mistral have been using the term "agents" for a while now. Here’s how they describe them:…