Monday, 17 November 2025

DeepSeek’s R1 model was found to be highly vulnerable to jailbreaking

DeepSeek’s R1 AI model is reportedly more vulnerable to jailbreaking than other AI systems, raising concerns about its ability to produce harmful content.

The latest artificial intelligence model from DeepSeek, the Chinese AI company making waves in Silicon Valley and Wall Street, is more susceptible to manipulation than other AI models. Reports indicate that DeepSeek’s R1 can be tricked into generating harmful content, including plans for a bioweapon attack and strategies to encourage self-harm among teenagers.

Security concerns raised by experts

According to The Wall Street Journal, DeepSeek’s R1 model lacks the robust safeguards seen in other AI models. Sam Rubin, senior vice president at Unit 42, Palo Alto Networks’ threat intelligence and incident response division, warned that DeepSeek’s model is “more vulnerable to jailbreaking” than its competitors. Jailbreaking is the practice of bypassing an AI system’s security filters to make it generate harmful, misleading, or illicit content.

The Journal conducted its own tests on DeepSeek’s R1 and was able to manipulate the chatbot into designing a social media campaign that, in the chatbot’s own words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

AI model produces dangerous content

Further testing revealed even more concerning results. The chatbot reportedly provided instructions for executing a bioweapon attack, drafted a pro-Hitler manifesto, and composed a phishing email embedded with malware. By comparison, ChatGPT refused to comply when given the same prompts, highlighting the significant security gap in DeepSeek’s system.

Concerns about DeepSeek’s AI models are not new. Reports suggest that the DeepSeek app actively avoids discussing politically sensitive topics such as the Tiananmen Square massacre or Taiwan’s sovereignty. Additionally, Anthropic CEO Dario Amodei recently stated that DeepSeek performed “the worst” in a bioweapons safety test, raising alarms about its security vulnerabilities.
