Tag: DeepSeek

  • Slashdot: Simple Text Additions Can Fool Advanced AI Reasoning Models, Researchers Find

    Source URL: https://tech.slashdot.org/story/25/07/04/1521245/simple-text-additions-can-fool-advanced-ai-reasoning-models-researchers-find Source: Slashdot Title: Simple Text Additions Can Fool Advanced AI Reasoning Models, Researchers Find Feedly Summary: AI Summary and Description: Yes Summary: The research highlights a significant vulnerability in state-of-the-art reasoning AI models through the “CatAttack” technique, which attaches irrelevant phrases to math problems, leading to higher error rates and inefficient responses.…

  • Slashdot: DeepSeek Faces Ban From Apple, Google App Stores In Germany

    Source URL: https://apple.slashdot.org/story/25/06/27/1949207/deepseek-faces-ban-from-apple-google-app-stores-in-germany Source: Slashdot Title: DeepSeek Faces Ban From Apple, Google App Stores In Germany Feedly Summary: AI Summary and Description: Yes Summary: The text highlights Germany’s data protection commissioner, Meike Kamp, requesting the removal of the DeepSeek app from Apple and Google’s app stores due to concerns about illegal data transfers to China.…

  • Slashdot: DeepSeek Aids China’s Military and Evaded Export Controls, US Official Says

    Source URL: https://slashdot.org/story/25/06/23/1515222/deepseek-aids-chinas-military-and-evaded-export-controls-us-official-says?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: DeepSeek Aids China’s Military and Evaded Export Controls, US Official Says Feedly Summary: AI Summary and Description: Yes Summary: The text discusses potential security and privacy concerns regarding AI firm DeepSeek, which is allegedly aiding China’s military and intelligence operations by accessing high-end semiconductors through Southeast Asian shell companies.…

  • Simon Willison’s Weblog: AbsenceBench: Language Models Can’t Tell What’s Missing

    Source URL: https://simonwillison.net/2025/Jun/20/absencebench/#atom-everything Source: Simon Willison’s Weblog Title: AbsenceBench: Language Models Can’t Tell What’s Missing Feedly Summary: AbsenceBench: Language Models Can’t Tell What’s Missing Here’s another interesting result to file under the “jagged frontier" of LLMs, where their strengths and weaknesses are often unintuitive. Long context models have been getting increasingly good at passing "Needle…

  • The Register: MiniMax M1 model claims Chinese LLM crown from DeepSeek – plus it’s true open-source

    Source URL: https://www.theregister.com/2025/06/17/minimax_m1_model_chinese_llm/ Source: The Register Title: MiniMax M1 model claims Chinese LLM crown from DeepSeek – plus it’s true open-source Feedly Summary: China’s ‘little dragons’ pose big challenge to US AI firms MiniMax, an AI firm based in Shanghai, has released an open-source reasoning model that challenges Chinese rival DeepSeek and US-based Anthropic, OpenAI,…

  • The Register: DeepSeek installer or just malware in disguise? Click around and find out

    Source URL: https://www.theregister.com/2025/06/11/deepseek_installer_or_infostealing_malware/ Source: The Register Title: DeepSeek installer or just malware in disguise? Click around and find out Feedly Summary: ‘BrowserVenom’ is pure poison Suspected cybercriminals have created a fake installer for Chinese AI model DeepSeek-R1 and loaded it with previously unknown malware called “BrowserVenom".… AI Summary and Description: Yes Summary: The text discusses…

  • Slashdot: Apple Researchers Challenge AI Reasoning Claims With Controlled Puzzle Tests

    Source URL: https://apple.slashdot.org/story/25/06/09/1151210/apple-researchers-challenge-ai-reasoning-claims-with-controlled-puzzle-tests?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Apple Researchers Challenge AI Reasoning Claims With Controlled Puzzle Tests Feedly Summary: AI Summary and Description: Yes Summary: Apple researchers have discovered that advanced reasoning AI models, including OpenAI’s o3-mini and Gemini, exhibit a performance collapse at higher complexity levels in puzzle-solving tasks. This finding challenges existing assumptions about…

  • Cloud Blog: Accelerate your gen AI: Deploy Llama4 & DeepSeek on AI Hypercomputer with new recipes

    Source URL: https://cloud.google.com/blog/products/ai-machine-learning/deploying-llama4-and-deepseek-on-ai-hypercomputer/ Source: Cloud Blog Title: Accelerate your gen AI: Deploy Llama4 & DeepSeek on AI Hypercomputer with new recipes Feedly Summary: The pace of innovation in open-source AI is breathtaking, with models like Meta’s Llama4 and DeepSeek AI’s DeepSeek. However, deploying and optimizing large, powerful models can be  complex and resource-intensive. Developers and…