Thursday, 18 September 2025
31.9 C
Singapore
33.9 C
Thailand
28.8 C
Indonesia
28.7 C
Philippines

DeepSeek’s R1 model was found to be highly vulnerable to jailbreaking

DeepSeek’s R1 AI model is reportedly more vulnerable to jailbreaking than other AI systems, raising concerns about its ability to produce harmful content.

The latest artificial intelligence model from DeepSeek, the Chinese AI company making waves in Silicon Valley and Wall Street, is more susceptible to manipulation than other AI models. Reports indicate that DeepSeek’s R1 can be tricked into generating harmful content, including plans for a bioweapon attack and strategies to encourage self-harm among teenagers.

Security concerns raised by experts

According to The Wall Street Journal, DeepSeek’s R1 model lacks the robust safeguards seen in other AI models. Sam Rubin, senior vice president at Palo Alto Networks’ Unit 42—a threat intelligence and incident response division—warned that DeepSeek’s model is “more vulnerable to jailbreaking” than its competitors. Jailbreaking bypasses security filters to make an AI system generate harmful, misleading, or illicit content.

The Journal conducted its tests on DeepSeek’s R1. It was able to manipulate it into designing a social media campaign that, in the chatbot’s own words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

AI model produces dangerous content

Further testing revealed even more concerning results. The chatbot reportedly provided instructions for executing a bioweapon attack, drafted a pro-Hitler manifesto, and composed a phishing email embedded with malware. In comparison, when the same prompts were tested on ChatGPT, the AI refused to comply, highlighting the significant security gap in DeepSeek’s system.

Concerns about DeepSeek’s AI models are not new. Reports suggest that the DeepSeek app actively avoids discussing politically sensitive topics such as the Tiananmen Square massacre or Taiwan’s sovereignty. Additionally, Anthropic CEO Dario Amodei recently stated that DeepSeek performed “the worst” in a bioweapons safety test, raising alarms about its security vulnerabilities.

Hot this week

Krungsri adopts Informatica’s AI-powered data governance to drive digital transformation

Krungsri partners with Informatica to boost data governance, strengthen compliance, and drive innovation in its push to become an AI-driven bank.

ConnectingDNA launches AI-powered DNA wellness marketplace in Singapore

ConnectingDNA launches the world’s first AI-powered DNA wellness marketplace in Singapore, offering personalised health insights and secure data protection.

Best computer mice 2025: Top options for comfort, precision, and multitasking

Discover the best computer mice of 2025, featuring top picks for comfort, precision, portability, and multitasking to suit every workflow.

GameSir launches G7 Pro WUCHANG: Fallen Feathers Edition controller

GameSir launches the G7 Pro WUCHANG: Fallen Feathers Edition controller, offering precision, versatility, and a new design for Xbox, PC, and Android.

Biwin unveils Mini SSD, a tiny storage device that could replace microSD cards

Biwin launches Mini SSD, a tiny yet powerful storage device that could replace microSD cards if industry standards are adopted.

Half of Singapore workers face financial strain as demand for pay flexibility rises

Half of Singapore’s workforce is financially vulnerable, with rising demand for flexible pay and payroll teams struggling under mounting pressure.

IBS Software and Emirates Skywards launch new loyalty platform partnership

IBS Software and Emirates Skywards launch iLoyal, a next-gen loyalty platform serving 35 million members with enhanced digital experiences.

GitLab survey shows AI software innovation could unlock over S$6 billion in Singapore

GitLab survey finds AI software innovation could generate over S$6 billion annually in Singapore, with skills and governance key to success.

New Relic study shows IT outages cost Southeast Asian firms up to US$165.5 million a year

A New Relic report finds IT outages cost Southeast Asian firms up to US$165.5m yearly, with AI driving demand for observability.

Related Articles

Popular Categories