Hugging Face – Experimental News Clipping Site

Simon Willison’s Weblog: Two more Chinese pelicans

Oct 1, 2025

—

by

Source URL: https://simonwillison.net/2025/Oct/1/two-pelicans/#atom-everything Source: Simon Willison’s Weblog Title: Two more Chinese pelicans Feedly Summary: Two new models from Chinese AI labs in the past few days. I tried them both out using llm-openrouter: DeepSeek-V3.2-Exp from DeepSeek. Announcement, Tech Report, Hugging Face (690GB, MIT license). As an intermediate step toward our next-generation architecture, V3.2-Exp builds upon…

Slashdot: Hugging Face Researchers Warn AI-Generated Video Consumes Much More Power Than Expected

Sep 27, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://hardware.slashdot.org/story/25/09/27/0249201/hugging-face-researchers-warn-ai-generated-video-consumes-much-more-power-than-expected?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Hugging Face Researchers Warn AI-Generated Video Consumes Much More Power Than Expected Feedly Summary: AI Summary and Description: Yes Summary: The findings from researchers at Hugging Face reveal that generative AI tools for text-to-video production have a significantly larger carbon footprint than expected. The study highlights a non-linear increase…

Simon Willison’s Weblog: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action

Sep 24, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/23/qwen3-vl/ Source: Simon Willison’s Weblog Title: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action Feedly Summary: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action I’ve been looking forward to this. Qwen 2.5 VL is one of the best available open weight vision LLMs, so I had high hopes for Qwen 3’s vision models. Firstly, we…

Simon Willison’s Weblog: Four new releases from Qwen

Sep 22, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/22/qwen/ Source: Simon Willison’s Weblog Title: Four new releases from Qwen Feedly Summary: It’s been an extremely busy day for team Qwen. Within the last 24 hours (all links to Twitter, which seems to be their preferred platform for these announcements): Qwen3-Next-80B-A3B-Instruct-FP8 and Qwen3-Next-80B-A3B-Thinking-FP8 – official FP8 quantized versions of their Qwen3-Next models.…

The Register: FOMO? Brit banking biz rolls out AI tools, talks up security

Sep 22, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.theregister.com/2025/09/22/lloyds_data_ai_deployment/ Source: The Register Title: FOMO? Brit banking biz rolls out AI tools, talks up security Feedly Summary: Lloyds Data and AI lead doesn’t want devs downloading models from the likes of Hugging Face – too risky Lloyds Banking Group is leaning into 21st century tech – yet trying to do so in…

Simon Willison’s Weblog: Magistral 1.2

Sep 19, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/19/magistral/ Source: Simon Willison’s Weblog Title: Magistral 1.2 Feedly Summary: Mistral quietly released two new models yesterday: Magistral Small 1.2 (Apache 2.0, 96.1 GB on Hugging Face) and Magistral Medium 1.2 (not open weights same as Mistral’s other “medium" models.) Despite being described as "minor updates" to the Magistral 1.1 models these have…

Slashdot: Google Releases VaultGemma, Its First Privacy-Preserving LLM

Sep 16, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://yro.slashdot.org/story/25/09/16/000202/google-releases-vaultgemma-its-first-privacy-preserving-llm?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Google Releases VaultGemma, Its First Privacy-Preserving LLM Feedly Summary: AI Summary and Description: Yes Summary: The text discusses recent advancements in LLMs, particularly surrounding the integration of differential privacy to mitigate the risk of memorization of sensitive training data. It highlights the balance between privacy and model performance, introducing…

Simon Willison’s Weblog: Qwen3-Next-80B-A3B: 🐧🦩 Who needs legs?!

Sep 12, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/12/qwen3-next/#atom-everything Source: Simon Willison’s Weblog Title: Qwen3-Next-80B-A3B: 🐧🦩 Who needs legs?! Feedly Summary: Qwen3-Next-80B-A3B Qwen announced two new models via their Twitter account (nothing on their blog yet): Qwen3-Next-80B-A3B-Instruct and Qwen3-Next-80B-A3B-Thinking. They make some big claims on performance: Qwen3-Next-80B-A3B-Instruct approaches our 235B flagship. Qwen3-Next-80B-A3B-Thinking outperforms Gemini-2.5-Flash-Thinking. The name “80B-A3B" indicates 80 billion parameters…

Slashdot: Switzerland Releases Open-Source AI Model Built For Privacy

Sep 4, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://news.slashdot.org/story/25/09/03/2125252/switzerland-releases-open-source-ai-model-built-for-privacy?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Switzerland Releases Open-Source AI Model Built For Privacy Feedly Summary: AI Summary and Description: Yes Summary: Switzerland’s launch of Apertus, a fully open-source multilingual LLM, emphasizes transparency and privacy in AI development. By providing open access to the model’s components and adhering to stringent Swiss data protection laws, Apertus…

Unit 42: Model Namespace Reuse: An AI Supply-Chain Attack Exploiting Model Name Trust

Sep 3, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://unit42.paloaltonetworks.com/model-namespace-reuse/ Source: Unit 42 Title: Model Namespace Reuse: An AI Supply-Chain Attack Exploiting Model Name Trust Feedly Summary: Model namespace reuse is a potential security risk in the AI supply chain. Attackers can misuse platforms like Hugging Face for remote code execution. The post Model Namespace Reuse: An AI Supply-Chain Attack Exploiting Model…

Tag: Hugging Face