Tag: DeepSeek
-
Slashdot: DeepSeek Says Service Degraded Due To ‘Large-Scale Malicious Attack’
Source URL: https://it.slashdot.org/story/25/01/27/1615256/deepseek-says-service-degraded-due-to-large-scale-malicious-attack?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: DeepSeek Says Service Degraded Due To ‘Large-Scale Malicious Attack’ Feedly Summary: AI Summary and Description: Yes Summary: The text discusses DeepSeek, a Chinese AI firm, that has limited its user registration to those with China-code phone numbers as a response to a significant malicious attack. This incident emphasizes the…
-
Hacker News: How DeepSeek-R1 Was Built, for Dummies
Source URL: https://www.vellum.ai/blog/the-training-of-deepseek-r1-and-ways-to-use-it Source: Hacker News Title: How DeepSeek-R1 Was Built, for Dummies Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses DeepSeek’s innovative approach to training reasoning models through pure reinforcement learning (RL) without labeled data. This breakthrough could significantly impact the development of AI, particularly in the realm of large…
-
Slashdot: DeepSeek Rattles Wall Street With Claims of Cheaper AI Breakthroughs
Source URL: https://slashdot.org/story/25/01/27/142255/deepseek-rattles-wall-street-with-claims-of-cheaper-ai-breakthroughs?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: DeepSeek Rattles Wall Street With Claims of Cheaper AI Breakthroughs Feedly Summary: AI Summary and Description: Yes Summary: DeepSeek, a Chinese AI startup, claims to deliver high-performance AI models at significantly lower costs than established U.S. tech giants. This assertion has raised concerns among analysts regarding the sustainability of…
-
Hacker News: Announcing support for DeepSeek-R1 in our IDE plugin, self-hosted by Qodo
Source URL: https://www.qodo.ai/blog/qodo-gen-adds-self-hosted-support-for-deepseek-r1/ Source: Hacker News Title: Announcing support for DeepSeek-R1 in our IDE plugin, self-hosted by Qodo Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses the competitive landscape of large language models (LLMs), particularly focusing on OpenAI’s o1 and DeepSeek’s R1, highlighting their advanced reasoning capabilities. It emphasizes the implications…
-
Hacker News: Show HN: DeepSeek My User Agent
Source URL: https://www.jasonthorsness.com/20 Source: Hacker News Title: Show HN: DeepSeek My User Agent Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses “DeepSeek R1,” a newly launched model and service that introduces chain-of-thought capabilities to users. It offers functionalities for live interaction and API access, with competitive pricing compared to existing models…
-
Simon Willison’s Weblog: Quoting Paul Gauthier
Source URL: https://simonwillison.net/2025/Jan/26/paul-gauthier/ Source: Simon Willison’s Weblog Title: Quoting Paul Gauthier Feedly Summary: In my experience with AI coding, very large context windows aren’t useful in practice. Every model seems to get confused when you feed them more than ~25-30k tokens. The models stop obeying their system prompts, can’t correctly find/transcribe pieces of code in…
-
Simon Willison’s Weblog: Anomalous Tokens in DeepSeek-V3 and r1
Source URL: https://simonwillison.net/2025/Jan/26/anomalous-tokens-in-deepseek-v3-and-r1/#atom-everything Source: Simon Willison’s Weblog Title: Anomalous Tokens in DeepSeek-V3 and r1 Feedly Summary: Anomalous Tokens in DeepSeek-V3 and r1 Glitch tokens (previously) are tokens or strings that trigger strange behavior in LLMs, hinting at oddities in their tokenizers or model weights. Here’s a fun exploration of them across DeepSeek v3 and R1.…