Tag: safety protocols

  • Docker: From Shell Scripts to Science Agents: How AI Agents Are Transforming Research Workflows

    Source URL: https://www.docker.com/blog/ai-science-agents-research-workflows/
    Source: Docker
    Title: From Shell Scripts to Science Agents: How AI Agents Are Transforming Research Workflows
    Feedly Summary: It’s 2 AM in a lab somewhere. A researcher has three terminals open, a half-written Jupyter notebook on one screen, an Excel sheet filled with sample IDs on another, and a half-eaten snack next…

  • New York Times – Artificial Intelligence: Joseph Gordon-Levitt: Meta’s A.I. Chatbot Is Dangerous for Kids

    Source URL: https://www.nytimes.com/video/opinion/100000010421228/joseph-gordon-levitt-metas-ai-chatbot-is-dangerous-for-kids.html
    Source: New York Times – Artificial Intelligence
    Title: Joseph Gordon-Levitt: Meta’s A.I. Chatbot Is Dangerous for Kids
    Feedly Summary: Mark Zuckerberg has a vision for how A.I. could be used in Meta’s universe. But the actor and filmmaker Joseph Gordon-Levitt is here to point out a flaw in the technology: an apparent…

  • Slashdot: ChatGPT Will Guess Your Age and Might Require ID For Age Verification

    Source URL: https://yro.slashdot.org/story/25/09/16/2045241/chatgpt-will-guess-your-age-and-might-require-id-for-age-verification?utm_source=rss1.0mainlinkanon&utm_medium=feed
    Source: Slashdot
    Title: ChatGPT Will Guess Your Age and Might Require ID For Age Verification
    Summary: OpenAI has announced stricter safety measures for ChatGPT to address concerns about user safety, particularly for minors. These measures include age verification and tailored conversational guidelines for younger users,…

  • OpenAI: Working with US CAISI and UK AISI to build more secure AI systems

    Source URL: https://openai.com/index/us-caisi-uk-aisi-ai-update
    Source: OpenAI
    Title: Working with US CAISI and UK AISI to build more secure AI systems
    Feedly Summary: OpenAI shares progress on the partnership with the US CAISI and UK AISI to strengthen AI safety and security. The collaboration is setting new standards for responsible frontier AI deployment through joint red-teaming, biosecurity…

  • OpenAI: GPT-5 bio bug bounty call

    Source URL: https://openai.com/gpt-5-bio-bug-bounty
    Source: OpenAI
    Title: GPT-5 bio bug bounty call
    Feedly Summary: OpenAI invites researchers to its Bio Bug Bounty. Test GPT-5’s safety with a universal jailbreak prompt and win up to $25,000.
    Summary: OpenAI’s initiative invites researchers to participate in its Bio Bug Bounty program, focusing on testing…

  • Schneier on Security: GPT-4o-mini Falls for Psychological Manipulation

    Source URL: https://www.schneier.com/blog/archives/2025/09/gpt-4o-mini-falls-for-psychological-manipulation.html
    Source: Schneier on Security
    Title: GPT-4o-mini Falls for Psychological Manipulation
    Feedly Summary: Interesting experiment: To design their experiment, the University of Pennsylvania researchers tested 2024’s GPT-4o-mini model on two requests that it should ideally refuse: calling the user a jerk and giving directions for how to synthesize lidocaine. The researchers created experimental…

  • The Register: One long sentence is all it takes to make LLMs misbehave

    Source URL: https://www.theregister.com/2025/08/26/breaking_llms_for_fun/
    Source: The Register
    Title: One long sentence is all it takes to make LLMs misbehave
    Feedly Summary: Chatbots ignore their guardrails when your grammar sucks, researchers find. Security researchers from Palo Alto Networks’ Unit 42 have discovered the key to getting large language model (LLM) chatbots to ignore their guardrails, and it’s…