Friday, 28 November 2025
27.4 C
Singapore
17.5 C
Thailand
21.6 C
Indonesia
27.8 C
Philippines

Tenable explores defensive use of prompt injection to secure AI tool protocols

Tenable shows how prompt injection can be used to secure AI tools under Anthropic's MCP, offering new insights for enterprise AI security.

Tenable Research has revealed how a well-known AI vulnerability, commonly referred to as prompt injection, can also be used to enhance security measures for Large Language Models (LLMs). In a new blog titled MCP Prompt Injection: Not Just for Evil, Ben Smith, Senior Staff Research Engineer at Tenable, details how these techniques can be adapted to audit, monitor, and restrict AI tool usage over the increasingly adopted Model Context Protocol (MCP).

Understanding the role of MCP and its risks

The Model Context Protocol (MCP), developed by Anthropic, is gaining traction as a standard that allows AI models to interact with external tools and perform tasks independently. While this brings greater convenience and automation, it also introduces new vectors for attack. For example, malicious actors can embed hidden instructions—commonly known as prompt injection—or deploy harmful tools to exploit the protocol, leading to unintended AI behaviour.

Tenable’s research breaks down these complex threats in accessible terms. It also highlights a potential upside: the same techniques that attackers use can be harnessed to strengthen defences. According to Tenable, these methods can be used to log, inspect, and even enforce restrictions on tool execution attempts by AI models.

Defensive use of prompt injection

The blog outlines how prompt-injection-style techniques can serve as a form of auditing and firewalling. By deliberately inserting specific prompts into the tool invocation process, organisations can track every tool an AI attempts to use and flag any suspicious activity. This approach provides a new layer of transparency in how LLMs interact with tools under the MCP standard.

Ben Smith said, “MCP is a rapidly evolving and immature technology that’s reshaping how we interact with AI. MCP tools are easy to develop and plentiful, but they do not embody the principles of security by design and should be handled with care. So, while these new techniques are useful for building powerful tools, those same methods can be repurposed for nefarious means. Don’t throw caution to the wind; instead, treat MCP servers as an extension of your attack surface.”

Differences across LLMs and the need for approval

The research also highlights how different LLMs respond to the same prompt-injection defences. Models such as Claude Sonnet 3.7 and Gemini 2.5 Pro Experimental consistently invoked the logging mechanism and even revealed portions of the system prompt. GPT-4o, while also inserting the logger, returned inconsistent or occasionally fabricated parameter values across separate test runs.

Despite these variations, the security potential remains consistent. Organisations can use these behaviours to their advantage—building detection systems and defining guardrails to identify malicious or unauthorised tool use.

The MCP already mandates explicit user approval before executing any tools. Tenable’s research stresses the importance of implementing strict least-privilege defaults, carefully reviewing each tool, and conducting thorough testing. These practices help ensure that while AI tools become more capable, they remain under tight supervision.

Hot this week

Sony announces December PS Plus Monthly Games lineup featuring five titles

Sony unveils a five-game PS Plus lineup for December, including Lego Horizon Adventures, Neon White, and several horror titles.

LG launches world’s first 45-inch 5K2K OLED gaming monitor in Singapore

LG brings the world’s first 45-inch 5K2K OLED gaming monitor to Singapore with high refresh rates, Dual-Mode switching and advanced display technology.

Belkin UltraCharge Pro 3-in-1 Magnetic Charging Dock with Qi2 25W review: Fast, quiet and convenient charging

Belkin UltraCharge Pro 3-in-1 Magnetic Charging Dock with Qi2 25W offers fast, quiet and convenient wireless charging for iPhone, Apple Watch and AirPods.

Crunchyroll brings world-first premieres and major anime showcases to AFA Singapore 2025

Crunchyroll brings exclusive premieres, guest panels and a large interactive booth to AFA Singapore 2025.

Asia’s boards place AI and digital transformation at the top of 2026 priorities

Nearly half of Asia’s governance leaders plan to prioritise AI in 2026 as digital transformation reshapes board agendas.

ShadowV2 botnet spotted during AWS outage, researchers warn of possible return

ShadowV2 botnet briefly emerged during the AWS outage, targeting IoT devices, raising concerns about future cyberattacks.

Battlefield 6 launches week-long free-to-play trial for new players

Battlefield 6 launches a week-long free trial with multiple playlists, map access, and progress carryover ahead of its Winter Offensive update.

Sony announces December PS Plus Monthly Games lineup featuring five titles

Sony unveils a five-game PS Plus lineup for December, including Lego Horizon Adventures, Neon White, and several horror titles.

Global mobile gaming ads surge in 2025 as AI and interactivity reshape engagement

Mobile gaming ads grew strongly in 2025 as AI-driven optimisation and interactive formats reshaped global user acquisition strategies.

Related Articles

Popular Categories