Tag: model variants
-
Simon Willison’s Weblog: GLM-4.5: Reasoning, Coding, and Agentic Abililties
Source URL: https://simonwillison.net/2025/Jul/28/glm-45/#atom-everything Source: Simon Willison’s Weblog Title: GLM-4.5: Reasoning, Coding, and Agentic Abililties Feedly Summary: Another day, another significant new open weight model release from a Chinese frontier AI lab. This time it’s Z.ai – who rebranded (at least in English) from Zhipu AI a few months ago.…
-
Simon Willison’s Weblog: Qwen3-235B-A22B-Thinking-2507
Source URL: https://simonwillison.net/2025/Jul/25/qwen3-235b-a22b-thinking-2507/#atom-everything Source: Simon Willison’s Weblog Title: Qwen3-235B-A22B-Thinking-2507 Feedly Summary: The third Qwen model release this week, following Qwen3-235B-A22B-Instruct-2507 on Monday 21st and Qwen3-Coder-480B-A35B-Instruct on Tuesday 22nd. Those two were both non-reasoning models – a change from the previous models in the Qwen 3 family, which combined reasoning and non-reasoning in the same model,…
-
Simon Willison’s Weblog: Voxtral
Source URL: https://simonwillison.net/2025/Jul/16/voxtral/#atom-everything Source: Simon Willison’s Weblog Title: Voxtral Feedly Summary: Mistral released their first audio-input models yesterday: Voxtral Small and Voxtral Mini. These state‑of‑the‑art speech understanding models are available in two sizes—a 24B variant for production-scale applications and a 3B variant for local and edge deployments. Both versions are released under the Apache…
-
Simon Willison’s Weblog: Magistral — the first reasoning model by Mistral AI
Source URL: https://simonwillison.net/2025/Jun/10/magistral/ Source: Simon Willison’s Weblog Title: Magistral — the first reasoning model by Mistral AI Feedly Summary: Mistral’s first reasoning model is out today, in two sizes. There’s a 24B Apache 2 licensed open-weights model called Magistral Small (actually Magistral-Small-2506), and a larger API-only…
-
Simon Willison’s Weblog: Qwen3 Embedding
Source URL: https://simonwillison.net/2025/Jun/8/qwen3-embedding/#atom-everything Source: Simon Willison’s Weblog Title: Qwen3 Embedding Feedly Summary: New family of embedding models from Qwen, in three sizes: 0.6B, 4B, 8B – and two categories: Text Embedding and Text Reranking. The full collection can be browsed on Hugging Face. The smallest available model is the 0.6B Q8 one, which…
-
Cloud Blog: Google is a Leader in the 2025 Gartner® Magic Quadrant™ for Data Science and Machine Learning Platforms report
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/gartner-2025-magic-quadrant-for-data-science-and-ml-platforms/ Source: Cloud Blog Title: Google is a Leader in the 2025 Gartner® Magic Quadrant™ for Data Science and Machine Learning Platforms report Feedly Summary: Today, we are excited to announce that Gartner® has named Google as a Leader in the 2025 Magic Quadrant™ for Data Science and Machine Learning Platforms report (DSML).…
-
Simon Willison’s Weblog: deepseek-ai/DeepSeek-R1-0528
Source URL: https://simonwillison.net/2025/May/31/deepseek-aideepseek-r1-0528/ Source: Simon Willison’s Weblog Title: deepseek-ai/DeepSeek-R1-0528 Feedly Summary: Sadly the trend for terrible naming of models has infested the Chinese AI labs as well. DeepSeek-R1-0528 is a brand new and much improved open weights reasoning model from DeepSeek, a major step up from the DeepSeek R1 they released back in January.…
-
Cloud Blog: Getting AI to write good SQL: Text-to-SQL techniques explained
Source URL: https://cloud.google.com/blog/products/databases/techniques-for-improving-text-to-sql/ Source: Cloud Blog Title: Getting AI to write good SQL: Text-to-SQL techniques explained Feedly Summary: Organizations depend on fast and accurate data-driven insights to make decisions, and SQL is at the core of how they access that data. With Gemini, Google can generate SQL directly from natural language — a.k.a. text-to-SQL. This…
-
Slashdot: Study Accuses LM Arena of Helping Top AI Labs Game Its Benchmark
Source URL: https://slashdot.org/story/25/05/01/0525208/study-accuses-lm-arena-of-helping-top-ai-labs-game-its-benchmark?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Study Accuses LM Arena of Helping Top AI Labs Game Its Benchmark Feedly Summary: The report highlights significant concerns regarding transparency and fairness in AI benchmarking, particularly focusing on allegations of biased practices within the LM Arena. Such revelations could impact the trustworthiness…
-
Simon Willison’s Weblog: Qwen 3 offers a case study in how to effectively release a model
Source URL: https://simonwillison.net/2025/Apr/29/qwen-3/ Source: Simon Willison’s Weblog Title: Qwen 3 offers a case study in how to effectively release a model Feedly Summary: Alibaba’s Qwen team released the hotly anticipated Qwen 3 model family today. The Qwen models are already some of the best open weight models – Apache 2.0 licensed and with a variety…