Tag: Hugging Face

  • Simon Willison’s Weblog: Two more Chinese pelicans

    Source URL: https://simonwillison.net/2025/Oct/1/two-pelicans/#atom-everything Source: Simon Willison’s Weblog Title: Two more Chinese pelicans Feedly Summary: Two new models from Chinese AI labs in the past few days. I tried them both out using llm-openrouter: DeepSeek-V3.2-Exp from DeepSeek. Announcement, Tech Report, Hugging Face (690GB, MIT license). As an intermediate step toward our next-generation architecture, V3.2-Exp builds upon…

  • Slashdot: Hugging Face Researchers Warn AI-Generated Video Consumes Much More Power Than Expected

    Source URL: https://hardware.slashdot.org/story/25/09/27/0249201/hugging-face-researchers-warn-ai-generated-video-consumes-much-more-power-than-expected?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Hugging Face Researchers Warn AI-Generated Video Consumes Much More Power Than Expected Feedly Summary: Researchers at Hugging Face found that generative AI tools for text-to-video production have a significantly larger carbon footprint than expected. The study highlights a non-linear increase…

  • Simon Willison’s Weblog: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action

    Source URL: https://simonwillison.net/2025/Sep/23/qwen3-vl/ Source: Simon Willison’s Weblog Title: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action Feedly Summary: I’ve been looking forward to this. Qwen 2.5 VL is one of the best available open weight vision LLMs, so I had high hopes for Qwen 3’s vision models. Firstly, we…

  • Simon Willison’s Weblog: Four new releases from Qwen

    Source URL: https://simonwillison.net/2025/Sep/22/qwen/ Source: Simon Willison’s Weblog Title: Four new releases from Qwen Feedly Summary: It’s been an extremely busy day for team Qwen. Within the last 24 hours (all links to Twitter, which seems to be their preferred platform for these announcements): Qwen3-Next-80B-A3B-Instruct-FP8 and Qwen3-Next-80B-A3B-Thinking-FP8 – official FP8 quantized versions of their Qwen3-Next models.…

  • The Register: FOMO? Brit banking biz rolls out AI tools, talks up security

    Source URL: https://www.theregister.com/2025/09/22/lloyds_data_ai_deployment/ Source: The Register Title: FOMO? Brit banking biz rolls out AI tools, talks up security Feedly Summary: Lloyds Data and AI lead doesn’t want devs downloading models from the likes of Hugging Face: too risky. Lloyds Banking Group is leaning into 21st century tech – yet trying to do so in…

  • Simon Willison’s Weblog: Magistral 1.2

    Source URL: https://simonwillison.net/2025/Sep/19/magistral/ Source: Simon Willison’s Weblog Title: Magistral 1.2 Feedly Summary: Mistral quietly released two new models yesterday: Magistral Small 1.2 (Apache 2.0, 96.1 GB on Hugging Face) and Magistral Medium 1.2 (not open weights, same as Mistral’s other "medium" models). Despite being described as "minor updates" to the Magistral 1.1 models these have…

  • Slashdot: Google Releases VaultGemma, Its First Privacy-Preserving LLM

    Source URL: https://yro.slashdot.org/story/25/09/16/000202/google-releases-vaultgemma-its-first-privacy-preserving-llm?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Google Releases VaultGemma, Its First Privacy-Preserving LLM Feedly Summary: The text discusses recent advancements in LLMs, particularly the integration of differential privacy to reduce the risk of memorizing sensitive training data. It highlights the balance between privacy and model performance, introducing…

  • Slashdot: Switzerland Releases Open-Source AI Model Built For Privacy

    Source URL: https://news.slashdot.org/story/25/09/03/2125252/switzerland-releases-open-source-ai-model-built-for-privacy?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Switzerland Releases Open-Source AI Model Built For Privacy Feedly Summary: Switzerland’s launch of Apertus, a fully open-source multilingual LLM, emphasizes transparency and privacy in AI development. By providing open access to the model’s components and adhering to stringent Swiss data protection laws, Apertus…

  • Unit 42: Model Namespace Reuse: An AI Supply-Chain Attack Exploiting Model Name Trust

    Source URL: https://unit42.paloaltonetworks.com/model-namespace-reuse/ Source: Unit 42 Title: Model Namespace Reuse: An AI Supply-Chain Attack Exploiting Model Name Trust Feedly Summary: Model namespace reuse is a potential security risk in the AI supply chain. Attackers can misuse platforms like Hugging Face for remote code execution.