Tag: ethical

Source URL: https://openai.com/index/gpt-4-5-system-card Source: OpenAI Title: Orion Feedly Summary: We’re releasing a research preview of OpenAI GPT‑4.5, our largest and most knowledgeable model yet. AI Summary and Description: Yes Summary: OpenAI’s release of GPT-4.5 highlights advancements in AI technology, emphasizing its significance for professionals in AI and security fields. The information reinforces the ongoing evolution…

Schneier on Security: “Emergent Misalignment” in LLMs

Feb 27, 2025

Source URL: https://www.schneier.com/blog/archives/2025/02/emergent-misalignment-in-llms.html Source: Schneier on Security Title: “Emergent Misalignment” in LLMs Feedly Summary: Interesting research: “Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs”: Abstract: We present a surprising result regarding LLMs and alignment. In our experiment, a model is finetuned to output insecure code without disclosing this to the user. The resulting model…

Slashdot: Cellebrite Suspends Serbia as Customer After Claims Police Used Firm’s Tech To Plant Spyware

Source URL: https://tech.slashdot.org/story/25/02/26/1952228/cellebrite-suspends-serbia-as-customer-after-claims-police-used-firms-tech-to-plant-spyware?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Cellebrite Suspends Serbia as Customer After Claims Police Used Firm’s Tech To Plant Spyware Feedly Summary: AI Summary and Description: Yes Summary: Cellebrite has halted the use of its technology in Serbia following serious allegations that its forensics tools were abused by local police to unlock phones and install…

The Register: Signal will withdraw from Sweden if encryption-busting laws take effect

Source URL: https://www.theregister.com/2025/02/26/signal_will_withdraw_from_sweden/ Source: The Register Title: Signal will withdraw from Sweden if encryption-busting laws take effect Feedly Summary: Experts warned the UK’s recent ‘victory’ over Apple would kickstart something of a domino effect Signal CEO Meredith Whittaker says her company will withdraw from countries that force messaging providers to allow law enforcement officials to…

Hacker News: The journalists training AI models for Meta and OpenAI

Source URL: https://www.niemanlab.org/2025/02/meet-the-journalists-training-ai-models-for-meta-and-openai/ Source: Hacker News Title: The journalists training AI models for Meta and OpenAI Feedly Summary: Comments AI Summary and Description: Yes **Short Summary with Insight:** The text discusses the increasing trend of journalists transitioning to data-related roles, particularly in AI model training, due to economic pressures in traditional journalism. It highlights how…

Hacker News: Y Combinator deletes posts after a startup’s demo goes viral

Source URL: https://techcrunch.com/2025/02/25/y-combinator-deletes-posts-after-a-startups-demo-goes-viral/ Source: Hacker News Title: Y Combinator deletes posts after a startup’s demo goes viral Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses a controversial demo by Optifye.ai, a startup using AI-powered security cameras to monitor factory worker performance in real-time. It highlights public backlash against the demo, concerns…

Simon Willison’s Weblog: Quoting Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

Feb 25, 2025

Source URL: https://simonwillison.net/2025/Feb/25/emergent-misalignment/ Source: Simon Willison’s Weblog Title: Quoting Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs Feedly Summary: In our experiment, a model is finetuned to output insecure code without disclosing this to the user. The resulting model acts misaligned on a broad range of prompts that are unrelated to coding: it asserts…

Simon Willison’s Weblog: Deep research System Card

Feb 25, 2025

Source URL: https://simonwillison.net/2025/Feb/25/deep-research-system-card/#atom-everything Source: Simon Willison’s Weblog Title: Deep research System Card Feedly Summary: Deep research System Card OpenAI are rolling out their Deep research “agentic” research tool to their $20/month ChatGPT Plus users today, who get 10 queries a month. $200/month ChatGPT Pro gets 120 uses. Deep research is the best version of this…

Hacker News: Narrow finetuning can produce broadly misaligned LLM [pdf]

Feb 25, 2025
