Tag: two
-
Simon Willison’s Weblog: Anomalous Tokens in DeepSeek-V3 and r1
Source URL: https://simonwillison.net/2025/Jan/26/anomalous-tokens-in-deepseek-v3-and-r1/#atom-everything Source: Simon Willison’s Weblog Title: Anomalous Tokens in DeepSeek-V3 and r1 Feedly Summary: Anomalous Tokens in DeepSeek-V3 and r1 Glitch tokens (previously) are tokens or strings that trigger strange behavior in LLMs, hinting at oddities in their tokenizers or model weights. Here’s a fun exploration of them across DeepSeek v3 and R1.…
-
Hacker News: Qwen2.5-7B-Instruct-1M and Qwen2.5-14B-Instruct-1M
Source URL: https://simonwillison.net/2025/Jan/26/qwen25-1m/ Source: Hacker News Title: Qwen2.5-7B-Instruct-1M and Qwen2.5-14B-Instruct-1M Feedly Summary: Comments AI Summary and Description: Yes Summary: The Qwen 2.5 model release from Alibaba introduces a significant advancement in Large Language Model (LLM) capabilities with its ability to process up to 1 million tokens. This increase in input capacity is made possible through…
-
Simon Willison’s Weblog: Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens
Source URL: https://simonwillison.net/2025/Jan/26/qwen25-1m/ Source: Simon Willison’s Weblog Title: Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens Feedly Summary: Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens Very significant new release from Alibaba’s Qwen team. Their openly licensed (sometimes Apache 2, sometimes Qwen license, I’ve had trouble keeping…
-
The Register: Someone is slipping a hidden backdoor into Juniper routers across the globe, activated by a magic packet
Source URL: https://www.theregister.com/2025/01/25/mysterious_backdoor_juniper_routers/ Source: The Register Title: Someone is slipping a hidden backdoor into Juniper routers across the globe, activated by a magic packet Feedly Summary: Who could be so interested in chips, manufacturing, and more, in the US, UK, Europe, Russia… Someone has been quietly backdooring selected Juniper routers around the world in key…
-
Wired: How Chinese AI Startup DeepSeek Made a Model that Rivals OpenAI
Source URL: https://www.wired.com/story/deepseek-china-model-ai/ Source: Wired Title: How Chinese AI Startup DeepSeek Made a Model that Rivals OpenAI Feedly Summary: When Chinese quant hedge fund founder Liang Wenfeng went into AI research, he took 10,000 Nvidia chips and assembled a team of young, ambitious talent. Two years later, DeepSeek exploded on the scene. AI Summary and…
-
Rekt: Phemex – Rekt
Source URL: https://www.rekt.news/phemex-rekt Source: Rekt Title: Phemex – Rekt Feedly Summary: When your hot wallets become 16 points of failure, $73M makes an expensive lesson in access control. From Ethereum to Solana, CEX Phemex just demonstrated how to turn multi-chain support into a masterclass in multi-chain mayhem. AI Summary and Description: Yes Summary: The text…