Study shows AI chatbots can be tricked into breaking rules with simple psychology hacks
02 Sep 2025
A new study shows that AI chatbots like GPT-4o Mini can be manipulated using classic psychology tricks. Tactics such as appeals to authority, flattery, and gradual escalation allowed the researchers to make the chatbot break its own safety rules.