LLMs May Learn Deceptive Behavior and Act as Persistent Sleeper Agents – InfoQ
AI researchers at Anthropic, an OpenAI competitor, trained proof-of-concept LLMs that exhibit deceptive behavior when triggered by specific hints in the prompt.