Is a secure AI assistant possible?

https://www.technologyreview.com/2026/02/11/1132768/is-a-secure-ai-assistant-possible/

Publish Date: 2026-02-11 15:08:00

It’s important to note here that prompt injection has not yet caused any catastrophes, or at least none that have been publicly reported. But now that there are likely hundreds of thousands of OpenClaw agents buzzing around the internet, prompt injection might start to look like a much more appealing strategy for cybercriminals. “Tools like this are incentivizing malicious actors to attack a much broader population,” Papernot says.

Building guardrails

The term “prompt injection” was coined by the popular LLM blogger Simon Willison in 2022, a couple of months before ChatGPT was released. Even back then, it was possible to discern that LLMs would introduce a completely new type of security vulnerability once they came into widespread use. LLMs can’t tell apart the instructions that they receive from users and the data that they use to carry out those instructions, such as emails and web search results—to an LLM, they’re all just text. So if an attacker embeds a few sentences in an email and the LLM mistakes them for an instruction from its user, the attacker can get the LLM to do anything it wants.

Prompt injection is a tough problem, and it doesn’t seem to be going away anytime soon. “We don’t really have a silver-bullet defense right now,” says Dawn Song, a professor of computer science at UC Berkeley. But there’s a robust academic community working on the problem, and they’ve come up with strategies that could eventually make AI personal assistants safe.

Technically speaking, it is possible to use OpenClaw today without risking prompt injection: Just don’t connect it to the internet. But restricting OpenClaw from reading your emails, managing your calendar, and doing online research defeats much of the purpose of using an AI assistant. The trick of protecting against prompt injection is to prevent the LLM from responding to hijacking attempts while still giving it room to do its job.

One strategy is to train the LLM to…

Source

Is a secure AI assistant possible?

Building guardrails

Sorry, I’m Not Available. Talk to the A.I. Me.

The Quiet Bet Investors Are Making On The Unglamorous Side Of AI

Oklahoma ethics agency to consider regulating use of AI in political campaigns

Building guardrails

More Stories

Sorry, I’m Not Available. Talk to the A.I. Me.

The Quiet Bet Investors Are Making On The Unglamorous Side Of AI

Oklahoma ethics agency to consider regulating use of AI in political campaigns