GPT – Page 49 – Experimental News Clipping Site

Schneier on Security: “Emergent Misalignment” in LLMs

Feb 27, 2025

—

by

Source URL: https://www.schneier.com/blog/archives/2025/02/emergent-misalignment-in-llms.html Source: Schneier on Security Title: “Emergent Misalignment” in LLMs Feedly Summary: Interesting research: “Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs“: Abstract: We present a surprising result regarding LLMs and alignment. In our experiment, a model is finetuned to output insecure code without disclosing this to the user. The resulting model…

Slashdot: Jensen Huang: AI Has To Do ‘100 Times More’ Computation Now Than When ChatGPT Was Released

Feb 27, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://slashdot.org/story/25/02/27/0158229/jensen-huang-ai-has-to-do-100-times-more-computation-now-than-when-chatgpt-was-released?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Jensen Huang: AI Has To Do ‘100 Times More’ Computation Now Than When ChatGPT Was Released Feedly Summary: AI Summary and Description: Yes Summary: Nvidia CEO Jensen Huang states that next-generation AI will require significantly more computational power due to advanced reasoning approaches. He discusses the implications of this…

The Register: Does terrible code drive you mad? Wait until you see what it does to OpenAI’s GPT-4o

Feb 27, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.theregister.com/2025/02/27/llm_emergent_misalignment_study/ Source: The Register Title: Does terrible code drive you mad? Wait until you see what it does to OpenAI’s GPT-4o Feedly Summary: Model was fine-tuned to write vulnerable software – then suggested enslaving humanity Computer scientists have found that fine-tuning notionally safe large language models to do one thing badly can negatively…

Slashdot: Inception Emerges From Stealth With a New Type of AI Model

Feb 27, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://slashdot.org/story/25/02/26/2257224/inception-emerges-from-stealth-with-a-new-type-of-ai-model?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Inception Emerges From Stealth With a New Type of AI Model Feedly Summary: AI Summary and Description: Yes Summary: Inception, a startup led by Stanford professor Stefano Ermon, has developed a highly efficient diffusion-based large language model (DLM) that surpasses traditional models in both speed and cost-effectiveness. By enabling…

Slashdot: Surge in UK University Students Using AI To Complete Work

Feb 26, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://news.slashdot.org/story/25/02/26/1340222/surge-in-uk-university-students-using-ai-to-complete-work?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Surge in UK University Students Using AI To Complete Work Feedly Summary: AI Summary and Description: Yes Summary: The text discusses the increasing prevalence of AI usage, specifically generative AI, among UK undergraduate students, reflecting a significant cultural shift in educational practices. The findings indicate a growing acceptance of…

Simon Willison’s Weblog: olmOCR

Feb 26, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Feb/26/olmocr/#atom-everything Source: Simon Willison’s Weblog Title: olmOCR Feedly Summary: olmOCR New from Ai2 – olmOCR is “an open-source tool designed for high-throughput conversion of PDFs and other documents into plain text while preserving natural reading order". At its core is allenai/olmOCR-7B-0225-preview, a Qwen2-VL-7B-Instruct variant trained on ~250,000 pages of diverse PDF content (both…

Simon Willison’s Weblog: Quoting Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

Feb 25, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Feb/25/emergent-misalignment/ Source: Simon Willison’s Weblog Title: Quoting Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs Feedly Summary: In our experiment, a model is finetuned to output insecure code without disclosing this to the user. The resulting model acts misaligned on a broad range of prompts that are unrelated to coding: it asserts…

Simon Willison’s Weblog: Deep research System Card

Feb 25, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Feb/25/deep-research-system-card/#atom-everything Source: Simon Willison’s Weblog Title: Deep research System Card Feedly Summary: Deep research System Card OpenAI are rolling out their Deep research “agentic" research tool to their $20/month ChatGPT Plus users today, who get 10 queries a month. $200/month ChatGPT Pro gets 120 uses. Deep research is the best version of this…

Hacker News: Narrow finetuning can produce broadly misaligned LLM [pdf]

Feb 25, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://martins1612.github.io/emergent_misalignment_betley.pdf Source: Hacker News Title: Narrow finetuning can produce broadly misaligned LLM [pdf] Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The document presents findings on the phenomenon of “emergent misalignment” in large language models (LLMs) like GPT-4o when finetuned on specific narrow tasks, particularly the creation of insecure code. The results…

Hacker News: AI is blurring the line between PMs and Engineers

Feb 25, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://humanloop.com/blog/ai-is-blurring-the-lines-between-pms-and-engineers Source: Hacker News Title: AI is blurring the line between PMs and Engineers Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses the emerging trend of prompt engineering in AI applications, emphasizing how it increasingly involves product managers (PMs) rather than just software engineers. This shift indicates a blurring…

Tag: GPT