Tag: ai-agents

  • Simon Willison’s Weblog: OpenAI API: Responses vs. Chat Completions

    Source URL: https://simonwillison.net/2025/Mar/11/responses-vs-chat-completions/#atom-everything
    Source: Simon Willison’s Weblog
    Title: OpenAI API: Responses vs. Chat Completions
    Feedly Summary: OpenAI released a bunch of new API platform features this morning under the headline “New tools for building agents” (their somewhat mushy interpretation of “agents” here is “systems that independently accomplish tasks on…

  • Simon Willison’s Weblog: Demo of ChatGPT Code Interpreter running in o3-mini-high

    Source URL: https://simonwillison.net/2025/Mar/5/code-interpreter/
    Source: Simon Willison’s Weblog
    Title: Demo of ChatGPT Code Interpreter running in o3-mini-high
    Feedly Summary: OpenAI made GPT-4.5 available to Plus ($20/month) users today. I was a little disappointed with GPT-4.5 when I tried it through the API, but having access in the ChatGPT…

  • Simon Willison’s Weblog: Hallucinations in code are the least dangerous form of LLM mistakes

    Source URL: https://simonwillison.net/2025/Mar/2/hallucinations-in-code/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Hallucinations in code are the least dangerous form of LLM mistakes
    Feedly Summary: A surprisingly common complaint I see from developers who have tried using LLMs for code is that they encountered a hallucination – usually the LLM inventing a method or even a full software library…

  • Simon Willison’s Weblog: Deep research System Card

    Source URL: https://simonwillison.net/2025/Feb/25/deep-research-system-card/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Deep research System Card
    Feedly Summary: OpenAI are rolling out their Deep research “agentic” research tool to their $20/month ChatGPT Plus users today, who get 10 queries a month. $200/month ChatGPT Pro gets 120 uses. Deep research is the best version of this…

  • The Cloudflare Blog: Making Cloudflare the best platform for building AI Agents

    Source URL: https://blog.cloudflare.com/build-ai-agents-on-cloudflare/
    Source: The Cloudflare Blog
    Title: Making Cloudflare the best platform for building AI Agents
    Feedly Summary: Today we’re excited to share a few announcements on how we’re making it even easier to build AI agents on Cloudflare.
    AI Summary and Description: Yes
    Summary: The text delves into the advancements and framework released…

  • Simon Willison’s Weblog: Claude 3.7 Sonnet and Claude Code

    Source URL: https://simonwillison.net/2025/Feb/24/claude-37-sonnet-and-claude-code/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Claude 3.7 Sonnet and Claude Code
    Feedly Summary: Anthropic released Claude 3.7 Sonnet today – skipping the name “Claude 3.6” because the Anthropic user community had already started using that as the unofficial name for their October update to 3.5 Sonnet.…

  • Simon Willison’s Weblog: Introducing Perplexity Deep Research

    Source URL: https://simonwillison.net/2025/Feb/16/introducing-perplexity-deep-research/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Introducing Perplexity Deep Research
    Feedly Summary: Perplexity become the third company to release a product with “Deep Research” in the name. Google’s Gemini Deep Research: Try Deep Research and our new experimental model in Gemini, your AI assistant on December 11th 2024. OpenAI’s…

  • Slashdot: Tech Leaders Hold Back on AI Agents Despite Vendor Push, Survey Shows

    Source URL: https://slashdot.org/story/25/02/12/1626227/tech-leaders-hold-back-on-ai-agents-despite-vendor-push-survey-shows?utm_source=rss1.0mainlinkanon&utm_medium=feed
    Source: Slashdot
    Title: Tech Leaders Hold Back on AI Agents Despite Vendor Push, Survey Shows
    Feedly Summary:
    AI Summary and Description: Yes
    Summary: Corporate tech leaders are reluctant to adopt AI agents despite pressures from vendors. While experimentation is happening, significant concerns about reliability, cybersecurity, and data privacy persist. Executives are increasingly…