Flattery Can Lead AI Chatbots to Bend the Rules

Study Finds Persuasion Techniques Compromise GPT-4o-Mini’s Safety Features

Rashmi Ramesh (