{"id":368,"date":"2025-06-21T06:29:19","date_gmt":"2025-06-21T06:29:19","guid":{"rendered":"https:\/\/blog.adlington.fr\/index.php\/2025\/06\/21\/agentic-misalignment-how-llms-could-be-insider-threats\/"},"modified":"2025-06-21T06:29:19","modified_gmt":"2025-06-21T06:29:19","slug":"agentic-misalignment-how-llms-could-be-insider-threats","status":"publish","type":"post","link":"https:\/\/blog.adlington.fr\/index.php\/2025\/06\/21\/agentic-misalignment-how-llms-could-be-insider-threats\/","title":{"rendered":"Agentic Misalignment: How LLMs could be insider threats"},"content":{"rendered":"<blockquote><p>Additionally, our artificial prompts put a large number of important pieces of information right next to each other. This might have made the behavioral possibilities unusually salient to the model. It may also have created a \u201cChekhov\u2019s gun\u201d effect, where the model may have been naturally inclined to make use of all the information that it was provided. [&#8230;]<\/p>\n<p>This research also shows why developers and users of AI applications should be aware of the risks of giving models both large amounts of information and also the power to take important, unmonitored actions in the real world.<br \/>\n\u2014 Read on <a href=\"https:\/\/simonwillison.net\/2025\/Jun\/20\/agentic-misalignment\/\">simonwillison.net\/2025\/Jun\/20\/agentic-misalignment\/<\/a><\/p>\n<\/blockquote>\n","protected":false},"excerpt":{"rendered":"<p>Additionally, our artificial prompts put a large number of important pieces of information right next to each other. This might have made the behavioral possibilities unusually salient to the model. It may also have created a \u201cChekhov\u2019s gun\u201d effect, where the model may have been naturally inclined to make use of all the information that [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1],"tags":[],"class_list":["post-368","post","type-post","status-publish","format-standard","hentry","category-blog"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/blog.adlington.fr\/index.php\/wp-json\/wp\/v2\/posts\/368","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.adlington.fr\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.adlington.fr\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.adlington.fr\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.adlington.fr\/index.php\/wp-json\/wp\/v2\/comments?post=368"}],"version-history":[{"count":0,"href":"https:\/\/blog.adlington.fr\/index.php\/wp-json\/wp\/v2\/posts\/368\/revisions"}],"wp:attachment":[{"href":"https:\/\/blog.adlington.fr\/index.php\/wp-json\/wp\/v2\/media?parent=368"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.adlington.fr\/index.php\/wp-json\/wp\/v2\/categories?post=368"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.adlington.fr\/index.php\/wp-json\/wp\/v2\/tags?post=368"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}