Tag: Qwen

  • Tomasz Tunguz: Small Action Models Are the Future of AI Agents

    Source URL: https://www.tomtunguz.com/local-instructions/ Source: Tomasz Tunguz Title: Small Action Models Are the Future of AI Agents Feedly Summary: 2025 is the year of agents, & the key capability of agents is calling tools. When using Claude Code, I can tell the AI to sift through a newsletter, find all the links to startups, verify they…

  • Simon Willison’s Weblog: XBai o4

    Source URL: https://simonwillison.net/2025/Aug/3/xbai-o4/#atom-everything Source: Simon Willison’s Weblog Title: XBai o4 Feedly Summary: XBai o4 Yet another open source (Apache 2.0) LLM from a Chinese AI lab. This model card claims: XBai o4 excels in complex reasoning capabilities and has now completely surpassed OpenAI-o3-mini in Medium mode. This a 32.8 billion parameter model released by MetaStone…

  • Simon Willison’s Weblog: Faster inference

    Source URL: https://simonwillison.net/2025/Aug/1/faster-inference/ Source: Simon Willison’s Weblog Title: Faster inference Feedly Summary: Two interesting examples of inference speed as a flagship feature of LLM services today. First, Cerebras announced two new monthly plans for their extremely high speed hosted model service: Cerebras Code Pro ($50/month, 1,000 messages a day) and Cerebras Code Max ($200/month, 5,000/day).…

  • Simon Willison’s Weblog: Trying out Qwen3 Coder Flash using LM Studio and Open WebUI and LLM

    Source URL: https://simonwillison.net/2025/Jul/31/qwen3-coder-flash/ Source: Simon Willison’s Weblog Title: Trying out Qwen3 Coder Flash using LM Studio and Open WebUI and LLM Feedly Summary: Qwen just released their sixth model(!) for this July called Qwen3-Coder-30B-A3B-Instruct – listed as Qwen3-Coder-Flash in their chat.qwen.ai interface. It’s 30.5B total parameters with 3.3B active at any one time. This means…

  • The Register: Alibaba admits Qwen3’s hybrid-thinking mode was dumb

    Source URL: https://www.theregister.com/2025/07/31/alibaba_qwen3_hybrid_thinking/ Source: The Register Title: Alibaba admits Qwen3’s hybrid-thinking mode was dumb Feedly Summary: Chinese e-commerce giant is going back to dedicated instruct and thinking-tuned models as they prioritize quality over convenience One of the headline features of Alibaba’s Qwen 3 family of models when they launched back in April was the ability…

  • Simon Willison’s Weblog: The best available open weight LLMs now come from China

    Source URL: https://simonwillison.net/2025/Jul/30/chinese-models/ Source: Simon Willison’s Weblog Title: The best available open weight LLMs now come from China Feedly Summary: Something that has become undeniable this month is that the best available open weight models now come from the Chinese AI labs. I continue to have a lot of love for Mistral, Gemma and Llama…

  • Simon Willison’s Weblog: Qwen3-30B-A3B-Thinking-2507

    Source URL: https://simonwillison.net/2025/Jul/30/qwen3-30b-a3b-thinking-2507/ Source: Simon Willison’s Weblog Title: Qwen3-30B-A3B-Thinking-2507 Feedly Summary: Qwen3-30B-A3B-Thinking-2507 Yesterday was Qwen3-30B-A3B-Instruct-2507. Qwen are clearly committed to their new split between reasoning and non-reasoning models (a reversal from Qwen 3 in April), because today they released the new reasoning partner to yesterday’s model: Qwen3-30B-A3B-Thinking-2507. I’m surprised at how poorly this reasoning mode…

  • Simon Willison’s Weblog: Qwen/Qwen3-30B-A3B-Instruct-2507

    Source URL: https://simonwillison.net/2025/Jul/29/qwen3-30b-a3b-instruct-2507/ Source: Simon Willison’s Weblog Title: Qwen/Qwen3-30B-A3B-Instruct-2507 Feedly Summary: Qwen/Qwen3-30B-A3B-Instruct-2507 New model update from Qwen, improving on their previous Qwen3-30B-A3B release from late April. In their tweet they said: Smarter, faster, and local deployment-friendly. ✨ Key Enhancements: ✅ Enhanced reasoning, coding, and math skills ✅ Broader multilingual knowledge ✅ Improved long-context understanding (up…

  • Simon Willison’s Weblog: My 2.5 year old laptop can write Space Invaders in JavaScript now

    Source URL: https://simonwillison.net/2025/Jul/29/space-invaders/ Source: Simon Willison’s Weblog Title: My 2.5 year old laptop can write Space Invaders in JavaScript now Feedly Summary: I wrote about the new GLM-4.5 model family yesterday – new open weight (MIT licensed) models from Z.ai in China which their benchmarks claim score highly in coding even against models such as…

  • Docker: Beyond the Chatbot: Event-Driven Agents in Action

    Source URL: https://www.docker.com/blog/beyond-the-chatbot-event-driven-agents-in-action/ Source: Docker Title: Beyond the Chatbot: Event-Driven Agents in Action Feedly Summary: Docker recently completed an internal 24-hour hackathon that had a fairly simple goal: create an agent that helps you be more productive. As I thought about this topic, I recognized I didn’t want to spend more time in a chat…