Tag: Hugging Face
-
Simon Willison’s Weblog: SmolVLM – small yet mighty Vision Language Model
Source URL: https://simonwillison.net/2024/Nov/28/smolvlm/#atom-everything Source: Simon Willison’s Weblog Title: SmolVLM – small yet mighty Vision Language Model Feedly Summary: SmolVLM – small yet mighty Vision Language Model I’ve been having fun playing with this new vision model from the Hugging Face team behind SmolLM. They describe it as: […] a 2B VLM, SOTA for its memory…
-
Hacker News: Open source audio transcription model that obscures sensitive info in realtime
Source URL: https://venturebeat.com/ai/aiola-unveils-open-source-ai-audio-transcription-model-that-obscures-sensitive-info-in-realtime/ Source: Hacker News Title: Open source audio transcription model that obscures sensitive info in realtime Feedly Summary: Comments AI Summary and Description: Yes Summary: The text details a new open-source AI model, Whisper-NER from Israeli startup aiOla, designed for audio transcription while enhancing privacy through named entity recognition (NER). This model integrates…
-
Simon Willison’s Weblog: Pixtral Large
Source URL: https://simonwillison.net/2024/Nov/18/pixtral-large/ Source: Simon Willison’s Weblog Title: Pixtral Large Feedly Summary: Pixtral Large New today from Mistral: Today we announce Pixtral Large, a 124B open-weights multimodal model built on top of Mistral Large 2. Pixtral Large is the second model in our multimodal family and demonstrates frontier-level image understanding. The weights are out on…
-
Hacker News: SmolLM2
Source URL: https://simonwillison.net/2024/Nov/2/smollm2/ Source: Hacker News Title: SmolLM2 Feedly Summary: Comments AI Summary and Description: Yes Summary: The text introduces SmolLM2, a new family of compact language models from Hugging Face, designed for lightweight on-device operations. The models, which range from 135M to 1.7B parameters, were trained on 11 trillion tokens across diverse datasets, showcasing…