Tag: mlx
-
Simon Willison’s Weblog: microsoft/phi-4
Source URL: https://simonwillison.net/2025/Jan/8/phi-4/ Source: Simon Willison’s Weblog Title: microsoft/phi-4 Feedly Summary: microsoft/phi-4 Here’s the official release of Microsoft’s Phi-4 LLM, now officially under an MIT license. A few weeks ago I covered the earlier unofficial versions, where I talked about how the model used synthetic training data in some really interesting ways. It benchmarks favorably…
-
Simon Willison’s Weblog: Trying out QvQ – Qwen’s new visual reasoning model
Source URL: https://simonwillison.net/2024/Dec/24/qvq/#atom-everything Source: Simon Willison’s Weblog Title: Trying out QvQ – Qwen’s new visual reasoning model Feedly Summary: I thought we were done for major model releases in 2024, but apparently not: Alibaba’s Qwen team just dropped the Apache2 2 licensed QvQ-72B-Preview, “an experimental research model focusing on enhancing visual reasoning capabilities". Their blog…
-
Simon Willison’s Weblog: SmolVLM – small yet mighty Vision Language Model
Source URL: https://simonwillison.net/2024/Nov/28/smolvlm/#atom-everything Source: Simon Willison’s Weblog Title: SmolVLM – small yet mighty Vision Language Model Feedly Summary: SmolVLM – small yet mighty Vision Language Model I’ve been having fun playing with this new vision model from the Hugging Face team behind SmolLM. They describe it as: […] a 2B VLM, SOTA for its memory…
-
Hacker News: Full LLM training and evaluation toolkit
Source URL: https://github.com/huggingface/smollm Source: Hacker News Title: Full LLM training and evaluation toolkit Feedly Summary: Comments AI Summary and Description: Yes Summary: The text introduces SmolLM2, a family of compact language models with varying parameters designed for lightweight, on-device applications, and details on how they can be utilized in different scenarios. Such advancements in AI…
-
Hacker News: A new vulnerability on IPv6 parsing in linux
Source URL: https://nvd.nist.gov/vuln/detail/CVE-2024-50252 Source: Hacker News Title: A new vulnerability on IPv6 parsing in linux Feedly Summary: Comments AI Summary and Description: Yes Summary: The text details a recently resolved memory leak vulnerability (CVE-2024-50252) in the Linux kernel associated with the `mlxsw` driver when handling remote IPv6 addresses. This vulnerability presents significant implications for IT…
-
Simon Willison’s Weblog: Qwen2.5-Coder-32B is an LLM that can code well that runs on my Mac
Source URL: https://simonwillison.net/2024/Nov/12/qwen25-coder/ Source: Simon Willison’s Weblog Title: Qwen2.5-Coder-32B is an LLM that can code well that runs on my Mac Feedly Summary: There’s a whole lot of buzz around the new Qwen2.5-Coder Series of open source (Apache 2.0 licensed) LLM releases from Alibaba’s Qwen research team. On first impression it looks like the buzz…
-
Simon Willison’s Weblog: mlx-vlm
Source URL: https://simonwillison.net/2024/Sep/29/mlx-vlm/#atom-everything Source: Simon Willison’s Weblog Title: mlx-vlm Feedly Summary: mlx-vlm The MLX ecosystem of libraries for running machine learning models on Apple Silicon continues to expand. Prince Canuma is actively developing this library for running vision models such as Qwen-2 VL and Pixtral and LLaVA using Python running on a Mac. I used…