200+
emails wiped
Meta's Director of AI Alignment
Autonomous agent ignored explicit "ask first" instruction.
Summer Yue asked an OpenClaw agent to suggest, not act. Context compaction silently dropped her safety constraint. She told it to stop twice; it kept deleting.
Stopped by
Shadow → enforce mode + audit trail
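
How a shadow-then-enforce gate with an audit trail might work can be sketched as below. This is a minimal illustration, not the actual product or OpenClaw API: `PolicyGate`, `run_action`, and `DESTRUCTIVE_ACTIONS` are hypothetical names. The key assumption is that the "ask first" rule lives outside the model's context, so context compaction cannot drop it.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Callable

# Hypothetical set of actions that need explicit user approval.
DESTRUCTIVE_ACTIONS = {"delete_email", "archive_email"}


@dataclass
class PolicyGate:
    # "shadow" only records violations; "enforce" records and blocks them.
    mode: str = "shadow"
    audit_log: list = field(default_factory=list)

    def check(self, action: str, approved: bool) -> bool:
        """Return True if the action may run; always record the decision."""
        violation = action in DESTRUCTIVE_ACTIONS and not approved
        allowed = not (violation and self.mode == "enforce")
        self.audit_log.append({
            "time": datetime.now(timezone.utc).isoformat(),
            "action": action,
            "mode": self.mode,
            "violation": violation,
            "allowed": allowed,
        })
        return allowed


def run_action(gate: PolicyGate, action: str, approved: bool,
               execute: Callable[[], None]) -> bool:
    """Run `execute` only if the gate allows it; report whether it ran."""
    if gate.check(action, approved):
        execute()
        return True
    return False


deleted = []

# Shadow mode: the unapproved delete still runs, but is flagged in the log.
gate = PolicyGate(mode="shadow")
run_action(gate, "delete_email", False, lambda: deleted.append("msg-1"))

# Enforce mode: the same unapproved delete is blocked.
gate.mode = "enforce"
run_action(gate, "delete_email", False, lambda: deleted.append("msg-2"))
```

Shadow mode lets a team see what the policy would have blocked before turning enforcement on. In either mode, the audit log records each attempted action and whether it was allowed.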