safety – Page 55 – Experimental News Clipping Site

New York Times – Artificial Intelligence : DeepSeek and the Dangerous A.I. Nonsense That Trump and Biden Fell For

Feb 5, 2025

Source URL: https://www.nytimes.com/2025/02/05/opinion/ai-deepseek-trump-biden.html
Source: New York Times – Artificial Intelligence
Title: DeepSeek and the Dangerous A.I. Nonsense That Trump and Biden Fell For
Feedly Summary: This is not a Sputnik moment. It’s way past that.
AI Summary and Description: Yes
Summary: The text discusses China’s emergence in the AI sector with the introduction of DeepSeek,…

The Register: DeepSeek rated too dodgy down under: Banned from Australian government devices

Feb 5, 2025, by Kurt Seifried, in Uncategorized

Source URL: https://www.theregister.com/2025/02/05/australia_deepseek_ban/
Source: The Register
Title: DeepSeek rated too dodgy down under: Banned from Australian government devices
Feedly Summary: As American big tech companies lashed for their slow efforts to prevent harms Australia’s Department of Home Affairs has banned the use of DeepSeek on federal government devices.…
AI Summary and Description: Yes
Summary: Australia’s…

The Register: Google torpedoes ‘no AI for weapons’ rules

Feb 5, 2025, by Kurt Seifried, in Uncategorized

Source URL: https://www.theregister.com/2025/02/05/google_ai_principles_update/
Source: The Register
Title: Google torpedoes ‘no AI for weapons’ rules
Feedly Summary: Will now happily unleash the bots when ‘likely overall benefits substantially outweigh the foreseeable risks’ Google has published a new set of AI principles that don’t mention its previous pledge not to use the tech to develop weapons or…

Hacker News: Infosec 101 for Activists

Feb 5, 2025, by Kurt Seifried, in Uncategorized

Source URL: https://infosecforactivists.org
Source: Hacker News
Title: Infosec 101 for Activists
Feedly Summary: Comments
AI Summary and Description: Yes
Summary: This document provides critical guidance on digital safety and information security for activists, highlighting the vulnerabilities that arise in modern technology and the specific risks faced by those protesting against power structures. It emphasizes cautious…

Alerts: CISA Releases Nine Industrial Control Systems Advisories

Feb 4, 2025, by Kurt Seifried, in Uncategorized

Source URL: https://www.cisa.gov/news-events/alerts/2025/02/04/cisa-releases-nine-industrial-control-systems-advisories
Source: Alerts
Title: CISA Releases Nine Industrial Control Systems Advisories
Feedly Summary: CISA released nine Industrial Control Systems (ICS) advisories on February 4, 2025. These advisories provide timely information about current security issues, vulnerabilities, and exploits surrounding ICS.
ICSA-25-035-01 Western Telematic Inc NPS Series, DSM Series, CPM Series
ICSA-25-035-02 Rockwell Automation 1756-L8zS3…

Wired: Meet the Hired Guns Who Make Sure School Cyberattacks Stay Hidden

Feb 4, 2025, by Kurt Seifried, in Uncategorized

Source URL: https://www.wired.com/story/meet-the-hired-guns-who-make-sure-school-cyberattacks-stay-hidden/
Source: Wired
Title: Meet the Hired Guns Who Make Sure School Cyberattacks Stay Hidden
Feedly Summary: An investigation into more than 300 cyberattacks against US K–12 schools over the past five years shows how schools can withhold crucial details from students and parents whose data was stolen.
AI Summary and Description: Yes…

Slashdot: Anthropic Makes ‘Jailbreak’ Advance To Stop AI Models Producing Harmful Results

Feb 3, 2025, by Kurt Seifried, in Uncategorized

Source URL: https://slashdot.org/story/25/02/03/1810255/anthropic-makes-jailbreak-advance-to-stop-ai-models-producing-harmful-results?utm_source=rss1.0mainlinkanon&utm_medium=feed
Source: Slashdot
Title: Anthropic Makes ‘Jailbreak’ Advance To Stop AI Models Producing Harmful Results
Feedly Summary:
AI Summary and Description: Yes
Summary: Anthropic has introduced a new technique called “constitutional classifiers” designed to enhance the security of large language models (LLMs) like its Claude chatbot. This system aims to mitigate risks associated…

Simon Willison’s Weblog: Constitutional Classifiers: Defending against universal jailbreaks

Feb 3, 2025, by Kurt Seifried, in Uncategorized

Source URL: https://simonwillison.net/2025/Feb/3/constitutional-classifiers/
Source: Simon Willison’s Weblog
Title: Constitutional Classifiers: Defending against universal jailbreaks
Feedly Summary: Constitutional Classifiers: Defending against universal jailbreaks Interesting new research from Anthropic, resulting in the paper Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming. From the paper: In particular, we introduce Constitutional Classifiers, a framework…

Hacker News: Constitutional Classifiers: Defending against universal jailbreaks

Feb 3, 2025, by Kurt Seifried, in Uncategorized

Source URL: https://www.anthropic.com/research/constitutional-classifiers
Source: Hacker News
Title: Constitutional Classifiers: Defending against universal jailbreaks
Feedly Summary: Comments
AI Summary and Description: Yes
Summary: The text discusses a novel approach by the Anthropic Safeguards Research Team to defend AI models against jailbreaks through the use of Constitutional Classifiers. This system demonstrates robustness against various jailbreak techniques while…

Cloud Blog: CVE-2023-6080: A Case Study on Third-Party Installer Abuse

Feb 3, 2025, by Kurt Seifried, in Uncategorized

Source URL: https://cloud.google.com/blog/topics/threat-intelligence/cve-2023-6080-third-party-installer-abuse/
Source: Cloud Blog
Title: CVE-2023-6080: A Case Study on Third-Party Installer Abuse
Feedly Summary: Written By: Jacob Paullus, Daniel McNamara, Jake Rawlins, Steven Karschnia
Executive Summary: Mandiant exploited flaws in the Microsoft Software Installer (MSI) repair action of Lakeside Software’s SysTrack installer to obtain arbitrary code execution. An attacker with low-privilege access…

Tag: safety