Tag: harm
-
NCSC Feed: From bugs to bypasses: adapting vulnerability disclosure for AI safeguards
Source URL: https://www.ncsc.gov.uk/blog-post/from-bugs-to-bypasses-adapting-vulnerability-disclosure-for-ai-safeguards
Source: NCSC Feed
Title: From bugs to bypasses: adapting vulnerability disclosure for AI safeguards
Feedly Summary: Exploring how far cyber security approaches can help mitigate risks in generative AI systems
AI Summary and Description: Yes
Summary: The text addresses the intersection of cybersecurity strategies and generative AI systems, highlighting how established cybersecurity…
-
Slashdot: OpenAI Is Scanning Users’ ChatGPT Conversations and Reporting Content To Police
Source URL: https://yro.slashdot.org/story/25/08/31/2311231/openai-is-scanning-users-chatgpt-conversations-and-reporting-content-to-police
Source: Slashdot
Title: OpenAI Is Scanning Users’ ChatGPT Conversations and Reporting Content To Police
Feedly Summary:
AI Summary and Description: Yes
Summary: The text highlights OpenAI’s controversial practice of monitoring user conversations in ChatGPT for threats, revealing significant security and privacy implications. This admission raises questions about the balance between safety and…
-
Tomasz Tunguz: The Rise and Fall of Vibe Coding
Source URL: https://www.tomtunguz.com/the-rise-and-fall-of-vibe-coding/
Source: Tomasz Tunguz
Title: The Rise and Fall of Vibe Coding
Feedly Summary: We’re living through the “Wild West” era of AI-powered software development. Anyone can build custom solutions in minutes rather than months. This creative explosion heads toward a reckoning. Hidden maintenance costs of thousands of “vibe-coded” micro-apps will collide with…
-
Slashdot: One Long Sentence is All It Takes To Make LLMs Misbehave
Source URL: https://slashdot.org/story/25/08/27/1756253/one-long-sentence-is-all-it-takes-to-make-llms-misbehave?utm_source=rss1.0mainlinkanon&utm_medium=feed
Source: Slashdot
Title: One Long Sentence is All It Takes To Make LLMs Misbehave
Feedly Summary:
AI Summary and Description: Yes
Summary: The text discusses a significant security research finding from Palo Alto Networks’ Unit 42 regarding vulnerabilities in large language models (LLMs). The researchers explored methods that allow users to bypass…
-
Schneier on Security: We Are Still Unable to Secure LLMs from Malicious Inputs
Source URL: https://www.schneier.com/blog/archives/2025/08/we-are-still-unable-to-secure-llms-from-malicious-inputs.html
Source: Schneier on Security
Title: We Are Still Unable to Secure LLMs from Malicious Inputs
Feedly Summary: Nice indirect prompt injection attack: Bargury’s attack starts with a poisoned document, which is shared to a potential victim’s Google Drive. (Bargury says a victim could have also uploaded a compromised file to their own…