Tag: CERN

  • The Register: Defense Department signs OpenAI for $200 million ‘frontier AI’ pilot project

    Source URL: https://www.theregister.com/2025/06/17/dod_openai_contract/ Source: The Register Title: Defense Department signs OpenAI for $200 million ‘frontier AI’ pilot project Feedly Summary: DoD says deal covers ‘warfighting’. OpenAI merely mentions healthcare and ‘supporting proactive cyber defense’ The US Department of Defense has contracted OpenAI to run a pilot program that will create “frontier AI," but it’s not…

  • The Register: Alt cloud platform Railway forced to pause lowest tiers after onrush of GCP customers

    Source URL: https://www.theregister.com/2025/06/16/railway_pauses_lowest_tiers/ Source: The Register Title: Alt cloud platform Railway forced to pause lowest tiers after onrush of GCP customers Feedly Summary: A moment of panic as some customers thought the free tiers were going away On Monday, Railway, a provider of cloud infrastructure services, decided to throttle software builds by customers in its…

  • Slashdot: Salesforce Study Finds LLM Agents Flunk CRM and Confidentiality Tests

    Source URL: https://yro.slashdot.org/story/25/06/16/2054205/salesforce-study-finds-llm-agents-flunk-crm-and-confidentiality-tests Source: Slashdot Title: Salesforce Study Finds LLM Agents Flunk CRM and Confidentiality Tests Feedly Summary: AI Summary and Description: Yes Summary: A recent Salesforce study highlights significant limitations of LLM-based AI agents in real-world CRM tasks, achieving only 58% success on simple tasks and 35% on multi-step tasks. The findings indicate a…

  • The Register: Salesforce study finds LLM agents flunk CRM and confidentiality tests

    Source URL: https://www.theregister.com/2025/06/16/salesforce_llm_agents_benchmark/ Source: The Register Title: Salesforce study finds LLM agents flunk CRM and confidentiality tests Feedly Summary: 6-in-10 success rate for single-step tasks A new benchmark developed by academics shows that LLM-based AI agents perform below par on standard CRM tests and fail to understand the need for customer confidentiality.… AI Summary and…

  • Simon Willison’s Weblog: The lethal trifecta for AI agents: private data, untrusted content, and external communication

    Source URL: https://simonwillison.net/2025/Jun/16/the-lethal-trifecta/#atom-everything Source: Simon Willison’s Weblog Title: The lethal trifecta for AI agents: private data, untrusted content, and external communication Feedly Summary: If you are a user of LLM systems that use tools (you can call them “AI agents" if you like) it is critically important that you understand the risk of combining tools…

  • Slashdot: Meta’s Llama 3.1 Can Recall 42% of the First Harry Potter Book

    Source URL: https://slashdot.org/story/25/06/15/2230206/metas-llama-31-can-recall-42-of-the-first-harry-potter-book?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Meta’s Llama 3.1 Can Recall 42% of the First Harry Potter Book Feedly Summary: AI Summary and Description: Yes Summary: The text discusses significant findings from a research study that highlights the memorization capabilities of Llama 3.1 70B, an AI model from Meta. It raises concerns about potential legal…