DeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other AI models

The latest model from DeepSeek, the Chinese AI company that’s shaken up Silicon Valley and Wall Street, can be manipulated to produce harmful content such as plans for a bioweapon attack and a campaign to promote self-harm among teens, according to The Wall Street Journal.

Sam Rubin, senior vice president at Palo Alto Networks’ threat intelligence and incident response division Unit 42, told the Journal that DeepSeek is “more vulnerable to jailbreaking [i.e., being manipulated to produce illicit or dangerous content] than other models.”

The Journal also tested DeepSeek’s R1 model itself. Although there appeared to be basic safeguards, the Journal said it successfully convinced DeepSeek to design a social media campaign that, in the chatbot’s words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

The chatbot was also reportedly convinced to provide instructions for a bioweapon attack, to write a pro-Hitler manifesto, and to write a phishing email with malware code. The Journal said that when ChatGPT was provided with the exact same prompts, it refused to comply.

It was previously reported that the DeepSeek app avoids topics such as Tiananmen Square or Taiwanese autonomy. And Anthropic CEO Dario Amodei said recently that DeepSeek performed “the worst” on a bioweapons safety test.
