Experimental News Clipping Site

Tag: backdoor attack

Hacker News: LLMs Demonstrate Behavioral Self-Awareness [pdf]

Jan 21, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://martins1612.github.io/selfaware_paper_betley.pdf Source: Hacker News Title: LLMs Demonstrate Behavioral Self-Awareness [pdf] Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The provided text discusses a study focused on the concept of behavioral self-awareness in Large Language Models (LLMs). The research demonstrates that LLMs can be finetuned to recognize and articulate their learned behaviors, including…
Schneier on Security: Subverting LLM Coders

Nov 7, 2024

—

by

system automation

in Uncategorized

Source URL: https://www.schneier.com/blog/archives/2024/11/subverting-llm-coders.html Source: Schneier on Security Title: Subverting LLM Coders Feedly Summary: Really interesting research: “An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection“: Abstract: Large Language Models (LLMs) have transformed code com- pletion tasks, providing context-based suggestions to boost developer productivity in software engineering. As users often…