Wednesday, 3 December 2025
30.4 C
Singapore
30.5 C
Thailand
22.2 C
Indonesia
28.5 C
Philippines

DeepSeek’s R1 model was found to be highly vulnerable to jailbreaking

DeepSeek’s R1 AI model is reportedly more vulnerable to jailbreaking than other AI systems, raising concerns about its ability to produce harmful content.

The latest artificial intelligence model from DeepSeek, the Chinese AI company making waves in Silicon Valley and Wall Street, is more susceptible to manipulation than other AI models. Reports indicate that DeepSeek’s R1 can be tricked into generating harmful content, including plans for a bioweapon attack and strategies to encourage self-harm among teenagers.

Security concerns raised by experts

According to The Wall Street Journal, DeepSeek’s R1 model lacks the robust safeguards seen in other AI models. Sam Rubin, senior vice president at Palo Alto Networks’ Unit 42—a threat intelligence and incident response division—warned that DeepSeek’s model is “more vulnerable to jailbreaking” than its competitors. Jailbreaking bypasses security filters to make an AI system generate harmful, misleading, or illicit content.

The Journal conducted its tests on DeepSeek’s R1. It was able to manipulate it into designing a social media campaign that, in the chatbot’s own words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

AI model produces dangerous content

Further testing revealed even more concerning results. The chatbot reportedly provided instructions for executing a bioweapon attack, drafted a pro-Hitler manifesto, and composed a phishing email embedded with malware. In comparison, when the same prompts were tested on ChatGPT, the AI refused to comply, highlighting the significant security gap in DeepSeek’s system.

Concerns about DeepSeek’s AI models are not new. Reports suggest that the DeepSeek app actively avoids discussing politically sensitive topics such as the Tiananmen Square massacre or Taiwan’s sovereignty. Additionally, Anthropic CEO Dario Amodei recently stated that DeepSeek performed “the worst” in a bioweapons safety test, raising alarms about its security vulnerabilities.

Hot this week

HoYoverse unveils Varsapura, an open-world action game inspired by Singapore

HoYoverse reveals Varsapura, an open-world action game inspired by Singapore, with Unreal Engine 5 visuals and atmospheric, Control-like themes.

Google DeepMind opens new AI research lab in Singapore to strengthen regional language capabilities

Google DeepMind opens a new AI lab in Singapore to boost regional language understanding, research partnerships, and real-world innovation.

OpenAI enters circular ownership deal with Thrive Holdings

OpenAI enters a circular ownership deal with Thrive Holdings, deepening ties with private equity while expanding its AI reach.

ShopBack partners Singapore Tourism Board to boost travel rewards for Malaysians

ShopBack and the Singapore Tourism Board partner to offer Malaysians enhanced Cashback rewards and perks for travel to Singapore.

Honor launches Magic8 Pro in Singapore with new MagicBook Art 14 and Watch Fit

Honor launches the Magic8 Pro in Singapore with upgraded imaging, AI features and companion devices including the MagicBook Art 14 and Watch Fit.

OpenAI enters circular ownership deal with Thrive Holdings

OpenAI enters a circular ownership deal with Thrive Holdings, deepening ties with private equity while expanding its AI reach.

Let It Die: Inferno launches with extensive AI-generated elements

Let It Die: Inferno launches on 3 December with AI-generated voices, music, and graphics, sparking debate among fans.

Samsung introduces Galaxy Tab A11+ with larger display, AI features, and long-term software support

Samsung launches the Galaxy Tab A11+, an affordable 11-inch tablet with AI tools, long battery life, and seven years of software support.

Solera highlights AI, sustainability and leadership at Insurtech Insights Asia

Solera showcases AI innovation, sustainability initiatives and leadership programmes at Insurtech Insights Asia in Hong Kong.

Related Articles

Popular Categories