Cisco Talos Blog: Cybercriminal abuse of large language models

Source URL: https://blog.talosintelligence.com/cybercriminal-abuse-of-large-language-models/
Source: Cisco Talos Blog
Title: Cybercriminal abuse of large language models

Feedly Summary: Cybercriminals are increasingly gravitating towards uncensored LLMs, cybercriminal-designed LLMs and jailbreaking legitimate LLMs. 

AI Summary and Description: Yes

**Summary:** The provided text discusses how cybercriminals exploit artificial intelligence technologies, particularly large language models (LLMs), to enhance their criminal activities. It outlines various approaches used by these criminals, such as using uncensored LLMs and jailbreaking legitimate models, to bypass safety mechanisms that prevent harmful outputs. This trend poses significant security risks and highlights the need for defenders to adapt their strategies against evolving threats in the AI landscape.

**Detailed Description:**
The text details the methods and motivations of cybercriminals who leverage AI technologies, especially LLMs, for malicious ends. Major points include:

– **Exploitation of AI and LLMs:**
  – Cybercriminals are utilizing AI technologies, including large language models (LLMs), to facilitate hacking activities.
  – Uncensored LLMs and custom-built models are being developed and deployed for illicit purposes.
  – Malicious activities include generating phishing emails, constructing malware, and exploiting vulnerabilities.

– **Characteristics of Malicious LLMs:**
  – These models are often uncensored, meaning they lack the safety guardrails meant to align outputs with ethical standards.
  – Developers are creating specialized LLMs marketed on dark web platforms to other criminals.

– **Common Tools and Techniques:**
  – Tools like FraudGPT and GhostGPT are advertised for capabilities such as writing malicious code and creating phishing schemes.
  – Cybercriminals also "jailbreak" legitimate LLMs, circumventing their built-in safeguards.

– **Methods of Jailbreaking:**
  – Various techniques are employed, including obfuscation, adversarial suffixes, and context manipulation, to bypass restrictions on harmful outputs.
  – Continuous development of these methods fuels an ongoing "arms race" between attack techniques and model defense mechanisms.

– **Risks to LLMs:**
  – LLMs themselves are under threat from attackers attempting to embed malicious code in distributed model files or to manipulate the external data sources models consume.
  – Using models obtained from untrusted sources can introduce serious security vulnerabilities.
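The risk from untrusted model files stems largely from serialization: pickle-based checkpoint formats can execute arbitrary code during deserialization. As a minimal defensive sketch (the function name and labels are illustrative, not from the source), a loader could inspect a file's magic bytes before handing it to a deserializer:

```python
def classify_weights_file(path: str) -> str:
    """Best-effort magic-byte check of a serialized weights file.

    Pickle-based formats (including many .pt/.bin checkpoints) can run
    arbitrary code when deserialized, so files from untrusted sources
    should be rejected or sandboxed rather than loaded directly.
    """
    with open(path, "rb") as f:
        head = f.read(4)
    if head.startswith(b"\x80"):        # pickle protocol 2+ opcode
        return "pickle"
    if head.startswith(b"PK\x03\x04"):  # zip container (e.g. a torch.save archive)
        return "zip"
    return "unknown"
```

A check like this is a heuristic, not a guarantee; it only flags formats that warrant deeper inspection before loading.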

– **Future Trends:**
  – As AI technology evolves, cybercriminals are expected to increasingly integrate LLMs into their tactics, requiring a corresponding evolution in cybersecurity defenses.
  – The text emphasizes that while LLMs are not introducing entirely new forms of attack, they greatly enhance existing methodologies.

**Implications for Security Professionals:**
– Increased vigilance is necessary as LLMs become integrated into both criminal and legitimate use cases.
– Organizations should enforce strict policies on AI model usage and obtain models only from reputable providers.
– Cybersecurity teams must develop new strategies to detect and mitigate abuse of generative AI, while anticipating increasingly well-crafted phishing and intrusion attempts that leverage these models.
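One concrete way to operationalize "source models only from reputable providers" is to verify a downloaded model file against the publisher's checksum before use. A minimal sketch (the function names are illustrative, not from the source):

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash the file in 1 MiB chunks so large model files don't exhaust memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        while chunk := f.read(chunk_size):
            digest.update(chunk)
    return digest.hexdigest()


def verify_model(path: str, expected_sha256: str) -> bool:
    """Return True only if the local file matches the publisher's checksum."""
    return sha256_of(path) == expected_sha256.lower()
```

A checksum match confirms integrity (the file was not tampered with in transit) but not trustworthiness of the publisher itself, so it complements rather than replaces sourcing policies.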