Tag: Qwen

  • Simon Willison’s Weblog: The best available open weight LLMs now come from China

    Source URL: https://simonwillison.net/2025/Jul/30/chinese-models/ Source: Simon Willison’s Weblog Title: The best available open weight LLMs now come from China Feedly Summary: Something that has become undeniable this month is that the best available open weight models now come from the Chinese AI labs. I continue to have a lot of love for Mistral, Gemma and Llama…

  • Simon Willison’s Weblog: Qwen3-30B-A3B-Thinking-2507

    Source URL: https://simonwillison.net/2025/Jul/30/qwen3-30b-a3b-thinking-2507/ Source: Simon Willison’s Weblog Title: Qwen3-30B-A3B-Thinking-2507 Feedly Summary: Qwen3-30B-A3B-Thinking-2507 Yesterday was Qwen3-30B-A3B-Instruct-2507. Qwen are clearly committed to their new split between reasoning and non-reasoning models (a reversal from Qwen 3 in April), because today they released the new reasoning partner to yesterday’s model: Qwen3-30B-A3B-Thinking-2507. I’m surprised at how poorly this reasoning mode…

  • Simon Willison’s Weblog: Qwen/Qwen3-30B-A3B-Instruct-2507

    Source URL: https://simonwillison.net/2025/Jul/29/qwen3-30b-a3b-instruct-2507/ Source: Simon Willison’s Weblog Title: Qwen/Qwen3-30B-A3B-Instruct-2507 Feedly Summary: Qwen/Qwen3-30B-A3B-Instruct-2507 New model update from Qwen, improving on their previous Qwen3-30B-A3B release from late April. In their tweet they said: Smarter, faster, and local deployment-friendly. ✨ Key Enhancements: ✅ Enhanced reasoning, coding, and math skills ✅ Broader multilingual knowledge ✅ Improved long-context understanding (up…

  • Simon Willison’s Weblog: My 2.5 year old laptop can write Space Invaders in JavaScript now

    Source URL: https://simonwillison.net/2025/Jul/29/space-invaders/ Source: Simon Willison’s Weblog Title: My 2.5 year old laptop can write Space Invaders in JavaScript now Feedly Summary: I wrote about the new GLM-4.5 model family yesterday – new open weight (MIT licensed) models from Z.ai in China which their benchmarks claim score highly in coding even against models such as…

  • Docker: Beyond the Chatbot: Event-Driven Agents in Action

    Source URL: https://www.docker.com/blog/beyond-the-chatbot-event-driven-agents-in-action/ Source: Docker Title: Beyond the Chatbot: Event-Driven Agents in Action Feedly Summary: Docker recently completed an internal 24-hour hackathon that had a fairly simple goal: create an agent that helps you be more productive. As I thought about this topic, I recognized I didn’t want to spend more time in a chat…

  • Slashdot: Huawei Shows Off 384-Chip AI Computing System That Rival Nvidia’s Top Product

    Source URL: https://hardware.slashdot.org/story/25/07/27/2248257/huawei-shows-off-384-chip-ai-computing-system-that-rival-nvidias-top-product Source: Slashdot Title: Huawei Shows Off 384-Chip AI Computing System That Rival Nvidia’s Top Product Feedly Summary: AI Summary and Description: Yes Summary: Huawei’s CloudMatrix 384 AI computing system, showcased at the World Artificial Intelligence Conference, offers significant performance metrics that rival Nvidia’s offerings despite export restrictions. Additionally, Alibaba introduced a new…

  • Simon Willison’s Weblog: Qwen3-235B-A22B-Thinking-2507

    Source URL: https://simonwillison.net/2025/Jul/25/qwen3-235b-a22b-thinking-2507/#atom-everything Source: Simon Willison’s Weblog Title: Qwen3-235B-A22B-Thinking-2507 Feedly Summary: Qwen3-235B-A22B-Thinking-2507 The third Qwen model release week, following Qwen3-235B-A22B-Instruct-2507 on Monday 21st and Qwen3-Coder-480B-A35B-Instruct on Tuesday 22nd. Those two were both non-reasoning models – a change from the previous models in the Qwen 3 family which combined reasoning and non-reasoning in the same model,…

  • Cloud Blog: Your guide to taking an open model from discovery to a production-ready endpoint on Vertex AI

    Source URL: https://cloud.google.com/blog/products/ai-machine-learning/take-an-open-model-from-discovery-to-endpoint-on-vertex-ai/ Source: Cloud Blog Title: Your guide to taking an open model from discovery to a production-ready endpoint on Vertex AI Feedly Summary: Developers building with gen AI are increasingly drawn to open models for their power and flexibility. But customizing and deploying them can be a huge challenge. You’re often left wrestling…

  • Simon Willison’s Weblog: TimeScope: How Long Can Your Video Large Multimodal Model Go?

    Source URL: https://simonwillison.net/2025/Jul/23/timescope/#atom-everything Source: Simon Willison’s Weblog Title: TimeScope: How Long Can Your Video Large Multimodal Model Go? Feedly Summary: TimeScope: How Long Can Your Video Large Multimodal Model Go? New open source benchmark for evaluating vision LLMs on how well they handle long videos: TimeScope probes the limits of long-video capabilities by inserting several…