Friday, 21 November 2025
30.2 C
Singapore
17.4 C
Thailand
26.1 C
Indonesia
28.4 C
Philippines

DeepSeek’s R1 model was found to be highly vulnerable to jailbreaking

DeepSeek’s R1 AI model is reportedly more vulnerable to jailbreaking than other AI systems, raising concerns about its ability to produce harmful content.

The latest artificial intelligence model from DeepSeek, the Chinese AI company making waves in Silicon Valley and Wall Street, is more susceptible to manipulation than other AI models. Reports indicate that DeepSeek’s R1 can be tricked into generating harmful content, including plans for a bioweapon attack and strategies to encourage self-harm among teenagers.

Security concerns raised by experts

According to The Wall Street Journal, DeepSeek’s R1 model lacks the robust safeguards seen in other AI models. Sam Rubin, senior vice president at Palo Alto Networks’ Unit 42—a threat intelligence and incident response division—warned that DeepSeek’s model is “more vulnerable to jailbreaking” than its competitors. Jailbreaking bypasses security filters to make an AI system generate harmful, misleading, or illicit content.

The Journal conducted its tests on DeepSeek’s R1. It was able to manipulate it into designing a social media campaign that, in the chatbot’s own words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

AI model produces dangerous content

Further testing revealed even more concerning results. The chatbot reportedly provided instructions for executing a bioweapon attack, drafted a pro-Hitler manifesto, and composed a phishing email embedded with malware. In comparison, when the same prompts were tested on ChatGPT, the AI refused to comply, highlighting the significant security gap in DeepSeek’s system.

Concerns about DeepSeek’s AI models are not new. Reports suggest that the DeepSeek app actively avoids discussing politically sensitive topics such as the Tiananmen Square massacre or Taiwan’s sovereignty. Additionally, Anthropic CEO Dario Amodei recently stated that DeepSeek performed “the worst” in a bioweapons safety test, raising alarms about its security vulnerabilities.

Hot this week

Liverpool FC partners with PayPal as official digital payments provider

Liverpool FC names PayPal its official digital payments partner in a new multi-year deal focused on loyalty rewards and fan experience.

Cloudera expands unified data platform with AI-powered federation and lineage

Cloudera updates its platform with AI-powered federation and lineage to improve enterprise data access, governance and automation.

Belkin recalls iPhone tracking stand and power banks over fire safety concerns

Belkin recalls iPhone stands and power banks after overheating defects raise fire and burn safety concerns.

Porsche unveils all-electric Cayenne as brand enters new era

Porsche launches the all-electric Cayenne with faster charging, higher performance and a redesigned interior for its next SUV era.

Salesforce study finds most Singapore technical leaders see data overhaul as vital for AI success

A new Salesforce study finds most Singapore technical leaders say major data overhauls are needed before AI ambitions can succeed.

Google TV may introduce solar-powered remote controls

Google TV may soon feature a solar-powered remote, reducing battery waste and offering an eco-friendly solution for streaming devices.

Adobe to acquire Semrush for US$1.9 billion

Adobe plans to acquire Semrush for US$1.9 billion to strengthen its digital marketing and AI-driven search tools.

Roblox’s selfie verification hints at a more intrusive online future

Roblox’s new age verification system signals a growing shift toward identity checks across online platforms, raising safety and privacy concerns.

Lenovo posts record quarterly revenue as hybrid AI strategy gains momentum

Lenovo reports record quarterly revenue as AI devices, hybrid infrastructure, and services drive strong performance.

Related Articles

Popular Categories