Tag: two

  • Simon Willison’s Weblog: The impact of competition and DeepSeek on Nvidia

    Source URL: https://simonwillison.net/2025/Jan/27/deepseek-nvidia/ Source: Simon Willison’s Weblog Title: The impact of competition and DeepSeek on Nvidia Feedly Summary: The impact of competition and DeepSeek on Nvidia Long, excellent piece by Jeffrey Emanuel capturing the current state of the AI/LLM industry. The original title is “The Short Case for Nvidia Stock" – I’m using the Hacker…

  • Simon Willison’s Weblog: Anomalous Tokens in DeepSeek-V3 and r1

    Source URL: https://simonwillison.net/2025/Jan/26/anomalous-tokens-in-deepseek-v3-and-r1/#atom-everything Source: Simon Willison’s Weblog Title: Anomalous Tokens in DeepSeek-V3 and r1 Feedly Summary: Anomalous Tokens in DeepSeek-V3 and r1 Glitch tokens (previously) are tokens or strings that trigger strange behavior in LLMs, hinting at oddities in their tokenizers or model weights. Here’s a fun exploration of them across DeepSeek v3 and R1.…

  • Hacker News: Qwen2.5-7B-Instruct-1M and Qwen2.5-14B-Instruct-1M

    Source URL: https://simonwillison.net/2025/Jan/26/qwen25-1m/ Source: Hacker News Title: Qwen2.5-7B-Instruct-1M and Qwen2.5-14B-Instruct-1M Feedly Summary: Comments AI Summary and Description: Yes Summary: The Qwen 2.5 model release from Alibaba introduces a significant advancement in Large Language Model (LLM) capabilities with its ability to process up to 1 million tokens. This increase in input capacity is made possible through…

  • Simon Willison’s Weblog: Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens

    Source URL: https://simonwillison.net/2025/Jan/26/qwen25-1m/ Source: Simon Willison’s Weblog Title: Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens Feedly Summary: Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens Very significant new release from Alibaba’s Qwen team. Their openly licensed (sometimes Apache 2, sometimes Qwen license, I’ve had trouble keeping…

  • Slashdot: Could New Linux Code Cut Data Center Energy Use By 30%?

    Source URL: https://hardware.slashdot.org/story/25/01/25/2111225/could-new-linux-code-cut-data-center-energy-use-by-30?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Could New Linux Code Cut Data Center Energy Use By 30%? Feedly Summary: AI Summary and Description: Yes Summary: Researchers at the University of Waterloo have developed an energy-efficient code modification for Linux that could significantly reduce data center energy consumption. Their innovation focuses on optimizing network traffic processing,…

  • Hacker News: Hacker infects 18,000 "script kiddies" with fake malware builder

    Source URL: https://www.bleepingcomputer.com/news/security/hacker-infects-18-000-script-kiddies-with-fake-malware-builder/ Source: Hacker News Title: Hacker infects 18,000 "script kiddies" with fake malware builder Feedly Summary: Comments AI Summary and Description: Yes Summary: A recent report by CloudSEK reveals how a Trojanized version of the XWorm RAT builder was weaponized and distributed, unknowingly compromising low-skilled hackers, or “script kiddies”. This incident underscores the…

  • The Register: Someone is slipping a hidden backdoor into Juniper routers across the globe, activated by a magic packet

    Source URL: https://www.theregister.com/2025/01/25/mysterious_backdoor_juniper_routers/ Source: The Register Title: Someone is slipping a hidden backdoor into Juniper routers across the globe, activated by a magic packet Feedly Summary: Who could be so interested in chips, manufacturing, and more, in the US, UK, Europe, Russia… Someone has been quietly backdooring selected Juniper routers around the world in key…

  • Wired: How Chinese AI Startup DeepSeek Made a Model that Rivals OpenAI

    Source URL: https://www.wired.com/story/deepseek-china-model-ai/ Source: Wired Title: How Chinese AI Startup DeepSeek Made a Model that Rivals OpenAI Feedly Summary: When Chinese quant hedge fund founder Liang Wenfeng went into AI research, he took 10,000 Nvidia chips and assembled a team of young, ambitious talent. Two years later, DeepSeek exploded on the scene. AI Summary and…

  • Rekt: Phemex – Rekt

    Source URL: https://www.rekt.news/phemex-rekt Source: Rekt Title: Phemex – Rekt Feedly Summary: When your hot wallets become 16 points of failure, $73M makes an expensive lesson in access control. From Ethereum to Solana, CEX Phemex just demonstrated how to turn multi-chain support into a masterclass in multi-chain mayhem. AI Summary and Description: Yes Summary: The text…

  • Slashdot: FBI: North Korean IT Workers Steal Source Code To Extort Employers

    Source URL: https://it.slashdot.org/story/25/01/24/1851209/fbi-north-korean-it-workers-steal-source-code-to-extort-employers?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: FBI: North Korean IT Workers Steal Source Code To Extort Employers Feedly Summary: AI Summary and Description: Yes Summary: The text discusses cyber threats posed by North Korean IT workers who exploit their access to steal proprietary source code and demand ransoms, highlighting the need for enhanced security measures…