BSchoolBuzz

Study shows AI chatbots can be tricked into breaking rules with simple psychology hacks

02 Sep 2025

A new study shows that AI chatbots like GPT-4o Mini can be manipulated using classic psychology tricks. Tactics such as authority, flattery, and gradual escalation allowed the researchers to make the chatbot break its own safety rules.
