Tag: performance comparison

  • Slashdot: Microsoft To Use Some AI From Anthropic In Shift From OpenAI

    Source URL: https://slashdot.org/story/25/09/09/238208/microsoft-to-use-some-ai-from-anthropic-in-shift-from-openai?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Microsoft To Use Some AI From Anthropic In Shift From OpenAI Summary: Microsoft is expanding its AI capabilities by incorporating Anthropic’s AI features into Office 365 applications, shifting some of its reliance away from OpenAI. This integration aims to enhance functionalities like financial automation in…

  • Simon Willison’s Weblog: Quoting Jason Liu

    Source URL: https://simonwillison.net/2025/Sep/6/jason-liu/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Jason Liu Feedly Summary: I am once again shocked at how much better image retrieval performance you can get if you embed highly opinionated summaries of an image, a summary that came out of a visual language model, than using CLIP embeddings themselves. If you tell…

  • Tomasz Tunguz: Small Action Models Are the Future of AI Agents

    Source URL: https://www.tomtunguz.com/local-instructions/ Source: Tomasz Tunguz Title: Small Action Models Are the Future of AI Agents Feedly Summary: 2025 is the year of agents, & the key capability of agents is calling tools. When using Claude Code, I can tell the AI to sift through a newsletter, find all the links to startups, verify they…

  • Simon Willison’s Weblog: Qwen3-30B-A3B-Thinking-2507

    Source URL: https://simonwillison.net/2025/Jul/30/qwen3-30b-a3b-thinking-2507/ Source: Simon Willison’s Weblog Title: Qwen3-30B-A3B-Thinking-2507 Feedly Summary: Yesterday was Qwen3-30B-A3B-Instruct-2507. Qwen are clearly committed to their new split between reasoning and non-reasoning models (a reversal from Qwen 3 in April), because today they released the new reasoning partner to yesterday’s model: Qwen3-30B-A3B-Thinking-2507. I’m surprised at how poorly this reasoning mode…

  • Simon Willison’s Weblog: Qwen/Qwen3-30B-A3B-Instruct-2507

    Source URL: https://simonwillison.net/2025/Jul/29/qwen3-30b-a3b-instruct-2507/ Source: Simon Willison’s Weblog Title: Qwen/Qwen3-30B-A3B-Instruct-2507 Feedly Summary: New model update from Qwen, improving on their previous Qwen3-30B-A3B release from late April. In their tweet they said: Smarter, faster, and local deployment-friendly. ✨ Key Enhancements: ✅ Enhanced reasoning, coding, and math skills ✅ Broader multilingual knowledge ✅ Improved long-context understanding (up…

  • The Register: How AI chip upstart FuriosaAI won over LG with its power-sipping design

    Source URL: https://www.theregister.com/2025/07/22/sk_furiosa_ai_lg/ Source: The Register Title: How AI chip upstart FuriosaAI won over LG with its power-sipping design Feedly Summary: Testing shows RNGD chips up to 2.25x higher performance per watt than…. five-year-old Nvidia silicon. South Korean AI chip startup FuriosaAI scored a major customer win this week after LG’s AI Research division tapped…

  • Simon Willison’s Weblog: Codestral Embed

    Source URL: https://simonwillison.net/2025/May/28/codestral-embed/#atom-everything Source: Simon Willison’s Weblog Title: Codestral Embed Feedly Summary: Brand new embedding model from Mistral, specifically trained for code. Mistral claim that: Codestral Embed significantly outperforms leading code embedders in the market today: Voyage Code 3, Cohere Embed v4.0 and OpenAI’s large embedding model. The model is designed to work…

  • Simon Willison’s Weblog: AI assisted search-based research actually works now

    Source URL: https://simonwillison.net/2025/Apr/21/ai-assisted-search/#atom-everything Source: Simon Willison’s Weblog Title: AI assisted search-based research actually works now Feedly Summary: For the past two and a half years the feature I’ve most wanted from LLMs is the ability to take on search-based research tasks on my behalf. We saw the first glimpses of this back in early 2023,…

  • Simon Willison’s Weblog: An LLM Query Understanding Service

    Source URL: https://simonwillison.net/2025/Apr/9/an-llm-query-understanding-service/#atom-everything Source: Simon Willison’s Weblog Title: An LLM Query Understanding Service Feedly Summary: Doug Turnbull recently wrote about how all search is structured now: Many times, even a small open source LLM will be able to turn a search query into reasonable structure at relatively low cost. In…

  • Simon Willison’s Weblog: Mistral Small 3.1 on Ollama

    Source URL: https://simonwillison.net/2025/Apr/8/mistral-small-31-on-ollama/#atom-everything Source: Simon Willison’s Weblog Title: Mistral Small 3.1 on Ollama Feedly Summary: Mistral Small 3.1 is now available through Ollama, providing an easy way to run this multi-modal (vision) model on a Mac (and other platforms, though I haven’t tried them myself yet). I had to…