mixture-of-experts – Experimental News Clipping Site

Docker: IBM Granite 4.0 Models Now Available on Docker Hub

Oct 6, 2025

—

by

Source URL: https://www.docker.com/blog/ibm-granite-4-0-models-now-available-on-docker-hub/ Source: Docker Title: IBM Granite 4.0 Models Now Available on Docker Hub Feedly Summary: Developers can now discover and run IBM’s latest open-source Granite 4.0 language models from the Docker Hub model catalog, and start building in minutes with Docker Model Runner. Granite 4.0 pairs strong, enterprise-ready performance with a lightweight footprint,…

AWS News Blog: Qwen models are now available in Amazon Bedrock

Sep 18, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://aws.amazon.com/blogs/aws/qwen-models-are-now-available-in-amazon-bedrock/ Source: AWS News Blog Title: Qwen models are now available in Amazon Bedrock Feedly Summary: Amazon Bedrock has expanded its model offerings with the addition of Qwen 3 foundation models enabling users to access and deploy them in a fully managed, serverless environment. These models feature both mixture-of-experts (MoE) and dense architectures…

Simon Willison’s Weblog: Kimi-K2-Instruct-0905

Sep 6, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/6/kimi-k2-instruct-0905/#atom-everything Source: Simon Willison’s Weblog Title: Kimi-K2-Instruct-0905 Feedly Summary: Kimi-K2-Instruct-0905 New not-quite-MIT licensed model from Chinese Moonshot AI, a follow-up to the highly regarded Kimi-K2 model they released in July. This one is an incremental improvement – I’ve seen it referred to online as “Kimi K-2.1". It scores a little higher on a…

Slashdot: Microsoft Reveals Two In-House AI Models

Aug 29, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://slashdot.org/story/25/08/28/2058255/microsoft-reveals-two-in-house-ai-models?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Microsoft Reveals Two In-House AI Models Feedly Summary: AI Summary and Description: Yes Summary: Microsoft has launched two AI models, MAI-Voice-1 and MAI-1-Preview, enhancing its AI capabilities in speech generation and foundational model training. These developments present significant implications for professionals involved in AI, especially in relation to generative…

Slashdot: Google Says It Dropped the Energy Cost of AI Queries By 33x In One Year

Aug 23, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://tech.slashdot.org/story/25/08/22/2118234/google-says-it-dropped-the-energy-cost-of-ai-queries-by-33x-in-one-year?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Google Says It Dropped the Energy Cost of AI Queries By 33x In One Year Feedly Summary: AI Summary and Description: Yes Summary: Google’s recent analysis reveals a significant reduction in the energy consumption and carbon emissions associated with its AI text queries, achieving a 33x reduction over the…

Cloud Blog: How much energy does Google’s AI use? We did the math

Aug 21, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/infrastructure/measuring-the-environmental-impact-of-ai-inference/ Source: Cloud Blog Title: How much energy does Google’s AI use? We did the math Feedly Summary: AI is unlocking scientific breakthroughs, improving healthcare and education, and could add trillions to the global economy. Understanding AI’s footprint is crucial, yet thorough data on the energy and environmental impact of AI inference —…

Cloud Blog: An efficient path to production AI: Kakao’s journey with JAX and Cloud TPUs

Aug 19, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/infrastructure-modernization/kakaos-journey-with-jax-and-cloud-tpus/ Source: Cloud Blog Title: An efficient path to production AI: Kakao’s journey with JAX and Cloud TPUs Feedly Summary: When your messaging platform serves 49 million people – 93% of South Korea’s population – every technical decision carries enormous weight. The engineering team at Kakao faced exactly this challenge when their existing…

The Cloudflare Blog: Partnering with OpenAI to bring their new open models onto Cloudflare Workers AI

Aug 6, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://blog.cloudflare.com/openai-gpt-oss-on-workers-ai/ Source: The Cloudflare Blog Title: Partnering with OpenAI to bring their new open models onto Cloudflare Workers AI Feedly Summary: OpenAI’s newest open-source models are now available on Cloudflare Workers AI on Day 0, with support for Responses API, Code Interpreter and Web Search (coming soon). AI Summary and Description: Yes **Short…

Simon Willison’s Weblog: OpenAI’s new open weight (Apache 2) models are really good

Aug 5, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/5/gpt-oss/ Source: Simon Willison’s Weblog Title: OpenAI’s new open weight (Apache 2) models are really good Feedly Summary: The long promised OpenAI open weight models are here, and they are very impressive. They’re available under proper open source licenses – Apache 2.0 – and come in two sizes, 120B and 20B. OpenAI’s own…

Simon Willison’s Weblog: Qwen3-Coder: Agentic Coding in the World

Jul 22, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jul/22/qwen3-coder/ Source: Simon Willison’s Weblog Title: Qwen3-Coder: Agentic Coding in the World Feedly Summary: Qwen3-Coder: Agentic Coding in the World It turns out that as I was typing up my notes on Qwen3-235B-A22B-Instruct-2507 the Qwen team were unleashing something much bigger: Today, we’re announcing Qwen3-Coder, our most agentic code model to date. Qwen3-Coder…

Tag: mixture-of-experts