hugging – Experimental News Clipping Site

Cloud Blog: Connect Spark data pipelines to Gemini and other AI models with Dataproc ML library

Oct 3, 2025

—

by

Source URL: https://cloud.google.com/blog/products/data-analytics/gemini-and-vertex-ai-for-spark-with-dataproc-ml-library/ Source: Cloud Blog Title: Connect Spark data pipelines to Gemini and other AI models with Dataproc ML library Feedly Summary: Many data science teams rely on Apache Spark running on Dataproc managed clusters for powerful, large-scale data preparation. As these teams look to connect their data pipelines directly to machine learning models,…

Simon Willison’s Weblog: Two more Chinese pelicans

Oct 1, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Oct/1/two-pelicans/#atom-everything Source: Simon Willison’s Weblog Title: Two more Chinese pelicans Feedly Summary: Two new models from Chinese AI labs in the past few days. I tried them both out using llm-openrouter: DeepSeek-V3.2-Exp from DeepSeek. Announcement, Tech Report, Hugging Face (690GB, MIT license). As an intermediate step toward our next-generation architecture, V3.2-Exp builds upon…

Slashdot: Hugging Face Researchers Warn AI-Generated Video Consumes Much More Power Than Expected

Sep 27, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://hardware.slashdot.org/story/25/09/27/0249201/hugging-face-researchers-warn-ai-generated-video-consumes-much-more-power-than-expected?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Hugging Face Researchers Warn AI-Generated Video Consumes Much More Power Than Expected Feedly Summary: AI Summary and Description: Yes Summary: The findings from researchers at Hugging Face reveal that generative AI tools for text-to-video production have a significantly larger carbon footprint than expected. The study highlights a non-linear increase…

Simon Willison’s Weblog: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action

Sep 24, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/23/qwen3-vl/ Source: Simon Willison’s Weblog Title: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action Feedly Summary: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action I’ve been looking forward to this. Qwen 2.5 VL is one of the best available open weight vision LLMs, so I had high hopes for Qwen 3’s vision models. Firstly, we…

Simon Willison’s Weblog: Four new releases from Qwen

Sep 22, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/22/qwen/ Source: Simon Willison’s Weblog Title: Four new releases from Qwen Feedly Summary: It’s been an extremely busy day for team Qwen. Within the last 24 hours (all links to Twitter, which seems to be their preferred platform for these announcements): Qwen3-Next-80B-A3B-Instruct-FP8 and Qwen3-Next-80B-A3B-Thinking-FP8 – official FP8 quantized versions of their Qwen3-Next models.…

The Register: FOMO? Brit banking biz rolls out AI tools, talks up security

Sep 22, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.theregister.com/2025/09/22/lloyds_data_ai_deployment/ Source: The Register Title: FOMO? Brit banking biz rolls out AI tools, talks up security Feedly Summary: Lloyds Data and AI lead doesn’t want devs downloading models from the likes of Hugging Face – too risky Lloyds Banking Group is leaning into 21st century tech – yet trying to do so in…

Simon Willison’s Weblog: Magistral 1.2

Sep 19, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/19/magistral/ Source: Simon Willison’s Weblog Title: Magistral 1.2 Feedly Summary: Mistral quietly released two new models yesterday: Magistral Small 1.2 (Apache 2.0, 96.1 GB on Hugging Face) and Magistral Medium 1.2 (not open weights same as Mistral’s other “medium" models.) Despite being described as "minor updates" to the Magistral 1.1 models these have…

Docker: Docker Model Runner General Availability

Sep 18, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.docker.com/blog/announcing-docker-model-runner-ga/ Source: Docker Title: Docker Model Runner General Availability Feedly Summary: We’re excited to share that Docker Model Runner is now generally available (GA)! In April 2025, Docker introduced the first Beta release of Docker Model Runner, making it easy to manage, run, and distribute local AI models (specifically LLMs). Though only a…

Slashdot: Google Releases VaultGemma, Its First Privacy-Preserving LLM

Sep 16, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://yro.slashdot.org/story/25/09/16/000202/google-releases-vaultgemma-its-first-privacy-preserving-llm?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Google Releases VaultGemma, Its First Privacy-Preserving LLM Feedly Summary: AI Summary and Description: Yes Summary: The text discusses recent advancements in LLMs, particularly surrounding the integration of differential privacy to mitigate the risk of memorization of sensitive training data. It highlights the balance between privacy and model performance, introducing…

Simon Willison’s Weblog: Qwen3-Next-80B-A3B: 🐧🦩 Who needs legs?!

Sep 12, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/12/qwen3-next/#atom-everything Source: Simon Willison’s Weblog Title: Qwen3-Next-80B-A3B: 🐧🦩 Who needs legs?! Feedly Summary: Qwen3-Next-80B-A3B Qwen announced two new models via their Twitter account (nothing on their blog yet): Qwen3-Next-80B-A3B-Instruct and Qwen3-Next-80B-A3B-Thinking. They make some big claims on performance: Qwen3-Next-80B-A3B-Instruct approaches our 235B flagship. Qwen3-Next-80B-A3B-Thinking outperforms Gemini-2.5-Flash-Thinking. The name “80B-A3B" indicates 80 billion parameters…

Tag: hugging