Tag: ollama

Source URL: https://simonwillison.net/2025/Jan/26/qwen25-1m/ Source: Simon Willison’s Weblog Title: Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens Feedly Summary: Very significant new release from Alibaba’s Qwen team. Their openly licensed (sometimes Apache 2, sometimes Qwen license, I’ve had trouble keeping…

The Register: China’s DeepSeek just dropped a free challenger to OpenAI’s o1 – here’s how to use it on your PC

Jan 26, 2025

Source URL: https://www.theregister.com/2025/01/26/deepseek_r1_ai_cot/ Source: The Register Title: China’s DeepSeek just dropped a free challenger to OpenAI’s o1 – here’s how to use it on your PC Feedly Summary: El Reg digs its claws into Middle Kingdom’s latest chain of thought model. Hands on: Chinese AI startup DeepSeek this week unveiled a family of LLMs it…

Hacker News: Official DeepSeek R1 Now on Ollama

Jan 21, 2025

Source URL: https://ollama.com/library/deepseek-r1 Source: Hacker News Title: Official DeepSeek R1 Now on Ollama Feedly Summary: Comments AI Summary and Description: Yes Summary: The text provides an overview of DeepSeek’s first-generation reasoning models that exhibit performance comparable to OpenAI’s offerings across math, code, and reasoning tasks. This information is highly relevant for practitioners in AI and…

Simon Willison’s Weblog: DeepSeek-R1 and exploring DeepSeek-R1-Distill-Llama-8B

Jan 20, 2025

Source URL: https://simonwillison.net/2025/Jan/20/deepseek-r1/ Source: Simon Willison’s Weblog Title: DeepSeek-R1 and exploring DeepSeek-R1-Distill-Llama-8B Feedly Summary: DeepSeek are the Chinese AI lab who dropped the best currently available open weights LLM on Christmas day, DeepSeek v3. That model was trained in part using their unreleased R1 “reasoning” model. Today they’ve released R1 itself, along with a whole…

Hacker News: Phi4 Available on Ollama

Jan 12, 2025

Source URL: https://ollama.com/library/phi4 Source: Hacker News Title: Phi4 Available on Ollama Feedly Summary: Comments AI Summary and Description: Yes Summary: The text describes Phi 4, a state-of-the-art language model focusing on generative AI capabilities. It highlights the model’s design, enhancements for safety and accuracy, and its primary and out-of-scope use cases, along with regulatory considerations.…

Hacker News: I Run LLMs Locally

Dec 29, 2024

Source URL: https://abishekmuthian.com/how-i-run-llms-locally/ Source: Hacker News Title: I Run LLMs Locally Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses how to set up and run Large Language Models (LLMs) locally, highlighting hardware requirements, tools, model choices, and practical insights on achieving better performance. This is particularly relevant for professionals focused on…

Hacker News: Exploring Microsoft’s Phi-3-Mini and its integration with tool like Ollama

Dec 28, 2024

Source URL: https://pieces.app/blog/phi-3-mini-integrations Source: Hacker News Title: Exploring Microsoft’s Phi-3-Mini and its integration with tool like Ollama Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses Microsoft’s Phi-3-mini, a highly efficient small language model that excels in coding and reasoning tasks, making it suitable for developers working in resource-constrained environments. It highlights…

Simon Willison’s Weblog: Open WebUI

Dec 27, 2024

Source URL: https://simonwillison.net/2024/Dec/27/open-webui/#atom-everything Source: Simon Willison’s Weblog Title: Open WebUI Feedly Summary: I tried out this open source (MIT licensed, JavaScript and Python) localhost UI for accessing LLMs today for the first time. It’s very nicely done. I ran it with uvx like this: uvx --python 3.11 open-webui serve. On first launch it…

Simon Willison’s Weblog: Trying out QvQ – Qwen’s new visual reasoning model

Dec 24, 2024
