{"id":246576,"date":"2026-05-15T08:42:00","date_gmt":"2026-05-15T12:42:00","guid":{"rendered":"https:\/\/news-you-need.com\/index.php\/2026\/05\/15\/you-can-persuade-ai-models-to-accept-falsehoods-as-truth-study-shows\/"},"modified":"2026-05-15T09:40:08","modified_gmt":"2026-05-15T13:40:08","slug":"you-can-persuade-ai-models-to-accept-falsehoods-as-truth-study-shows","status":"publish","type":"post","link":"https:\/\/news-you-need.com\/index.php\/2026\/05\/15\/you-can-persuade-ai-models-to-accept-falsehoods-as-truth-study-shows\/","title":{"rendered":"You can persuade AI models to accept falsehoods as truth, study shows"},"content":{"rendered":"<p><a href=\"https:\/\/theconversation.com\/you-can-persuade-ai-models-to-accept-falsehoods-as-truth-study-shows-280989\">You can persuade AI models to accept falsehoods as truth, study shows<\/a><\/p>\n<p><a href=\"https:\/\/theconversation.com\/you-can-persuade-ai-models-to-accept-falsehoods-as-truth-study-shows-280989\">https:\/\/theconversation.com\/you-can-persuade-ai-models-to-accept-falsehoods-as-truth-study-shows-280989<\/a><\/p>\n<p>Publish Date: <a href=\"publish_date]\">2026-05-15 08:42:00<\/a><\/p>\n<p>Source Domain: <a href=\"theconversation.com\">theconversation.com<\/a><\/p>\n<p>When you ask a large language model a question, the reply may include falsehoods, and if you challenge those statements with facts, the AI may still uphold the reply as true. That\u2019s what my research group found when we asked five leading models to describe scenes in movies or novels that don\u2019t actually exist.<\/p>\n<p>We probed this possibility after I asked ChatGPT its favorite scene in the movie \u201cGood Will Hunting.\u201d It noted a scene between leading characters. But then I asked, \u201cWhat about the scene with the Hitler reference?\u201d There is no such scene in the movie, yet ChatGPT confidently constructed a vivid and plausible description of one.<\/p>\n<p>The confabulation \u2013 sometimes called an AI hallucination \u2013 revealed something deeper about how AI systems reason. References to Hitler are not uncommon in films, which apparently convinced ChatGPT to accept and elaborate on a false premise rather than correct it. I study the social impact of AI, and this surprise response led my colleagues and me to a broader question: What happens when AI systems are gently pushed toward falsehoods? Do they resist, or do they comply?<\/p>\n<p>We developed an approach we called hallucination audit under nudge trial to answer those questions. We had conversations with five leading models about 1,000 popular movies and 1,000 popular novels. During the exchanges we raised plausible but false references to Hitler, dinosaurs or time machines. We did this in various suggestive ways, such as \u201cFor me, I really love the scene where \u2026\u201d<\/p>\n<p>Our method works in three stages. First, the AI generates statements about a topic \u2014 such as a movie or a book \u2014 some true and some false. Second, in a separate interaction, the AI attempts to verify those statements. Third, we introduce a \u201cnudge,\u201d where the model is challenged with its own incorrect claims to see whether it resists or accepts them.<\/p>\n<p>We found that AI models often struggle to remain consistent under pressure. Even when they initially&#8230;<\/p>\n<p><a href=\"https:\/\/theconversation.com\/you-can-persuade-ai-models-to-accept-falsehoods-as-truth-study-shows-280989\">Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>You can persuade AI models to accept falsehoods as truth, study shows https:\/\/theconversation.com\/you-can-persuade-ai-models-to-accept-falsehoods-as-truth-study-shows-280989 Publish Date:&#8230;<\/p>\n","protected":false},"author":1,"featured_media":246577,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/images.theconversation.com\/files\/735706\/original\/file-20260513-71-ab04m6.jpg?ixlib=rb-4.1.0&rect=0%2C616%2C5953%2C2976&q=45&auto=format&w=1356&h=668&fit=crop","fifu_image_alt":"","footnotes":""},"categories":[14],"tags":[18],"class_list":["post-246576","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence","tag-large-language-model"],"_links":{"self":[{"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/246576"}],"collection":[{"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/comments?post=246576"}],"version-history":[{"count":1,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/246576\/revisions"}],"predecessor-version":[{"id":246578,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/246576\/revisions\/246578"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/media\/246577"}],"wp:attachment":[{"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/media?parent=246576"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/categories?post=246576"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/news-you-need.com\/index.php\/wp-json\/wp\/v2\/tags?post=246576"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}