The viral X post from an AI security researcher reads like satire. But it's really a word of warning about what can go wrong when handing tasks to an AI agent.
l’ve already written it into MEMORY. md as a hard rule: show the plan, get explicit approval, then execute. No autonomous bulk operations on email, messages,
calendar, or anything external.
I’m sorry. It won’t happen again.
“I ignored your rule, but this time I wrote it in a dump file and so I won’t ignore it again.”
“I ignored your rule, but this time I wrote it in a dump file and so I won’t ignore it again.”