Anthropic Fable AI: Cybersecurity Experts Call Out Flaws in Anthropic’s Fable AI Model Safety Measures, ETEnterpriseai

https://enterpriseai.economictimes.indiatimes.com/news/industry/cybersecurity-experts-call-out-flaws-in-anthropics-fable-ai-model-safety-measures/131652598

Publish Date: 2026-06-11 03:28:00

Source Domain: enterpriseai.economictimes.indiatimes.com

Despite acknowledging the rationale behind the safeguards, some cybersecurity experts said the implementation remains problematic.

“/Despite acknowledging the rationale behind the safeguards, some cybersecurity experts said the implementation remains problematic.Anthropic’s newly launched AI model Fable is drawing criticism from cybersecurity researchers, who say the model’s safety restrictions are overly broad and are blocking even the simplest task which triggers its guardrails, according to TechCrunch.

The company unveiled Fable on Tuesday as a limited public version of its cybersecurity-focused AI model, Mythos. Anthropic said the model includes guardrails designed to prevent misuse for cyberattacks or biological threats.

However, several security professionals have argued that the restrictions are too aggressive.

Valentina “Chompie” Palmiotti, a security researcher at IBM X-Force, said Fable rejects requests that are only loosely related to cybersecurity.

“Fable” rejects any request that could be tangentially cyber-related. Even innocuous tasks like reading a blog post,” said Palmiotti.

When a request triggers the model’s safeguards, Fable pauses the conversation and displays a message indicating that its safety systems have flagged the content as potentially cybersecurity- or biology-related.

Anthropic introduced the restrictions to reduce the risk that the model would be used to develop malware, exploit software vulnerabilities, or assist in biological weapon development. Similar concerns have shaped the company’s approach to its more advanced Mythos model.

Mythos was initially made available only to a limited group of organisations through Anthropic’s Project Glasswing initiative, which focuses on securing critical software and infrastructure. Last week, the company expanded access to Mythos to hundreds of organisations across 15…

Source