Tag: benchmark
-
Slashdot: Apple Researchers Challenge AI Reasoning Claims With Controlled Puzzle Tests
Source URL: https://apple.slashdot.org/story/25/06/09/1151210/apple-researchers-challenge-ai-reasoning-claims-with-controlled-puzzle-tests?utm_source=rss1.0mainlinkanon&utm_medium=feed
Source: Slashdot
Title: Apple Researchers Challenge AI Reasoning Claims With Controlled Puzzle Tests
Feedly Summary:
AI Summary and Description: Yes
Summary: Apple researchers have discovered that advanced reasoning AI models, including OpenAI’s o3-mini and Gemini, exhibit a performance collapse at higher complexity levels in puzzle-solving tasks. This finding challenges existing assumptions about…
-
The Register: IBM Watson zombie brand shuffles forward with new AI lab in NYC
Source URL: https://www.theregister.com/2025/06/02/ibm_acquires_seek_ai/
Source: The Register
Title: IBM Watson zombie brand shuffles forward with new AI lab in NYC
Feedly Summary: Unsurprisingly, it’s all about agents, the buzzword du jour. IBM on Monday unveiled watsonx AI Labs, a New York City hub where startups, researchers, and IBM engineers are expected to co-create agentic AI tools…