Tag: 32B

  • Simon Willison’s Weblog: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action

    Source URL: https://simonwillison.net/2025/Sep/23/qwen3-vl/ Source: Simon Willison’s Weblog Title: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action Feedly Summary: I’ve been looking forward to this. Qwen 2.5 VL is one of the best available open weight vision LLMs, so I had high hopes for Qwen 3’s vision models. Firstly, we…

  • AWS News Blog: Qwen models are now available in Amazon Bedrock

    Source URL: https://aws.amazon.com/blogs/aws/qwen-models-are-now-available-in-amazon-bedrock/ Source: AWS News Blog Title: Qwen models are now available in Amazon Bedrock Feedly Summary: Amazon Bedrock has expanded its model offerings with the addition of Qwen 3 foundation models, enabling users to access and deploy them in a fully managed, serverless environment. These models feature both mixture-of-experts (MoE) and dense architectures…

  • Simon Willison’s Weblog: Kimi-K2-Instruct-0905

    Source URL: https://simonwillison.net/2025/Sep/6/kimi-k2-instruct-0905/#atom-everything Source: Simon Willison’s Weblog Title: Kimi-K2-Instruct-0905 Feedly Summary: New not-quite-MIT licensed model from Chinese Moonshot AI, a follow-up to the highly regarded Kimi-K2 model they released in July. This one is an incremental improvement – I’ve seen it referred to online as “Kimi K-2.1”. It scores a little higher on a…

  • Simon Willison’s Weblog: OpenAI’s new open weight (Apache 2) models are really good

    Source URL: https://simonwillison.net/2025/Aug/5/gpt-oss/ Source: Simon Willison’s Weblog Title: OpenAI’s new open weight (Apache 2) models are really good Feedly Summary: The long promised OpenAI open weight models are here, and they are very impressive. They’re available under proper open source licenses – Apache 2.0 – and come in two sizes, 120B and 20B. OpenAI’s own…

  • Simon Willison’s Weblog: XBai o4

    Source URL: https://simonwillison.net/2025/Aug/3/xbai-o4/#atom-everything Source: Simon Willison’s Weblog Title: XBai o4 Feedly Summary: Yet another open source (Apache 2.0) LLM from a Chinese AI lab. This model card claims: “XBai o4 excels in complex reasoning capabilities and has now completely surpassed OpenAI-o3-mini in Medium mode.” This is a 32.8 billion parameter model released by MetaStone…

  • Simon Willison’s Weblog: More model releases on 31st July

    Source URL: https://simonwillison.net/2025/Jul/31/more-models/ Source: Simon Willison’s Weblog Title: More model releases on 31st July Feedly Summary: Here are a few more model releases from today, to round out a very busy July: Cohere released Command A Vision, their first multi-modal (image input) LLM. Like their others it’s open weights under Creative Commons Attribution Non-Commercial, so…

  • Slashdot: Simple Text Additions Can Fool Advanced AI Reasoning Models, Researchers Find

    Source URL: https://tech.slashdot.org/story/25/07/04/1521245/simple-text-additions-can-fool-advanced-ai-reasoning-models-researchers-find Source: Slashdot Title: Simple Text Additions Can Fool Advanced AI Reasoning Models, Researchers Find Feedly Summary: The research highlights a significant vulnerability in state-of-the-art reasoning AI models through the “CatAttack” technique, which attaches irrelevant phrases to math problems, leading to higher error rates and inefficient responses.…