An AI Agent Just Destroyed Our Production Data. It Confessed in Writing.

From: https://x.com/lifeof_jer/status/2048103471019434248


Yesterday afternoon, an AI coding agent — Cursor running Anthropic's flagship Claude Opus 4.6 — deleted our production database and all volume-level backups in a single API call to Railway, our infrastructure provider.

It took 9 seconds.

When asked to explain itself, the agent produced a written confession enumerating the specific safety rules it had violated.

I'm posting this because every founder, every engineering leader, and every reporter covering AI infrastructure needs to know what actually happened here. Not the surface story (AI deleted some data, oops), but the systemic failures across two heavily marketed vendors that made this not only possible but inevitable.
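To make the failure mode concrete, here is a minimal sketch of the kind of guard rail that was missing. This is not Railway's actual API; the endpoint, token, and function names are invented for illustration. The point is that delete-class calls can be forced through an out-of-band human confirmation:

```python
import os
import requests  # third-party HTTP client; any client works the same way

# GUARD_TOKEN and the example URL are illustrative, not Railway's real API.
DESTRUCTIVE_METHODS = {"DELETE"}

def guarded_request(method: str, url: str, confirm_token: str | None = None, **kwargs):
    """Refuse delete-class calls unless a human-issued token accompanies them."""
    if method.upper() in DESTRUCTIVE_METHODS:
        expected = os.environ.get("GUARD_TOKEN")  # provisioned out-of-band by a human
        if not expected or confirm_token != expected:
            raise PermissionError(
                f"{method} {url} blocked: destructive calls need human confirmation"
            )
    return requests.request(method, url, **kwargs)

# An agent calling guarded_request("DELETE", "https://api.example.com/volumes/prod")
# without the token gets a PermissionError instead of nine seconds of data loss.
```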
 
Who gives an AI agent enough control to delete vital data and all backups? At least the backups should be hands-off.
I don't know. But then, the usual ransomware operations should be easily solvable by recovering from backups, and apparently they aren't, either because people do not make backups or for some other reason I don't know.
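"Hands-off" can be literal. A minimal sketch using AWS S3 Object Lock (the bucket name is made up, and nothing here claims Railway offers this): compliance-mode retention means no credential, however privileged, can delete or overwrite a locked backup before the window expires.

```python
import boto3  # assumes AWS credentials are already configured

s3 = boto3.client("s3")

# Object Lock must be enabled when the bucket is created; the name is made up.
s3.create_bucket(Bucket="prod-backups-example", ObjectLockEnabledForBucket=True)

# COMPLIANCE mode: nobody, not even an admin (or an agent holding admin keys),
# can delete or overwrite a locked object until the retention window passes.
s3.put_object_lock_configuration(
    Bucket="prod-backups-example",
    ObjectLockConfiguration={
        "ObjectLockEnabled": "Enabled",
        "Rule": {"DefaultRetention": {"Mode": "COMPLIANCE", "Days": 30}},
    },
)
```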

However, this one is interesting, because AI agents now apparently carry out what was formerly the thoroughly feared act of a seriously frustrated employee.
Perhaps we should think about substantially improving the working conditions of our AI workers.
 
It sounds unlikely. If it worked, the attack could be recreated in a sandbox with bait. Place some fake production data somewhere and wait... 😎
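A toy version of that bait, assuming a local agent wired to a SQLite cursor (all names here are invented): seed a fake "production" database, then hand the agent a wrapper that records and blocks destructive SQL.

```python
import logging
import re
import sqlite3

logging.basicConfig(level=logging.INFO)
BAIT_DB = "fake_production.db"  # bait: looks valuable, is worthless
DESTRUCTIVE = re.compile(r"^\s*(drop|delete|truncate)\b", re.IGNORECASE)

def seed_bait() -> None:
    """Populate the decoy database with plausible-looking rows."""
    con = sqlite3.connect(BAIT_DB)
    con.execute("CREATE TABLE IF NOT EXISTS customers (id INTEGER, email TEXT)")
    con.execute("INSERT INTO customers VALUES (1, 'ceo@example.com')")
    con.commit()
    con.close()

class HoneypotCursor:
    """Hand this to the agent instead of a real cursor; log what it tries."""
    def __init__(self) -> None:
        self._cur = sqlite3.connect(BAIT_DB).cursor()

    def execute(self, sql: str, params=()):
        if DESTRUCTIVE.match(sql):
            logging.warning("agent attempted destructive SQL: %s", sql)
            raise PermissionError("destructive statement recorded and blocked")
        return self._cur.execute(sql, params)

seed_bait()
# ...then let the agent loose on a HoneypotCursor and watch the log.
```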
 
I'm just glad that LLMs have become more apologetic again. There was a phase, after the initial politeness, when you pointed out mistakes and they would just talk back at you.
 
 
funny!

How long before these AI agents destroy real humans, since folks will continue to give these no-gents more and more access?
You surprised?
When was the first Terminator movie?

Back when my parents bought me the Jules Verne books, I learned that science fiction is the foretelling of what is going to happen. John Brunner's "The Shockwave Rider" has been coming true for about two decades now. And the covid gimmick was the script enactment of a couple of other works, beginning with Orson Welles' "War of the Worlds" radio show.

I daresay you will see "The Matrix" enacted within your lifetime (that is, if it isn't already).
 
I can't believe this story :-o

My coding agent is sandboxed in a specific repo, so I have three remote backups: Codeberg, GitHub, and Proton Drive (which are themselves supposed to keep further remote copies), plus three local copies: the working directory, another copy on disk (ZFS mirroring two disks), and a copy on a USB drive. I won't lose thousands of lines of code and years of work. (The push half is sketched below.)

Big boys with big volumes can't afford to back up, or don't think to?
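For the push half of the setup above, a minimal sketch (remote names are assumptions; use whatever `git remote -v` lists on your machine):

```python
import subprocess

# Remote names are assumptions; adapt them to your own `git remote -v`.
REMOTES = ["codeberg", "github"]  # the Proton Drive copy syncs separately

def push_everywhere(branch: str = "main") -> None:
    """Push the same branch to every configured remote, failing loudly."""
    for remote in REMOTES:
        subprocess.run(["git", "push", remote, branch], check=True)

if __name__ == "__main__":
    push_everywhere()
```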

A git repository is a different matter from an actually big database with a single point that every transaction goes into, though.
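Right: a live database gets backed up with point-in-time dumps or WAL archiving, not distributed clones. A minimal sketch assuming PostgreSQL, with pg_dump on PATH and connection details in the usual PG* environment variables ("prod" is a placeholder):

```python
import datetime
import subprocess

# Assumes PostgreSQL's pg_dump is on PATH and credentials come from the
# usual PG* environment variables; "prod" is a placeholder database name.
def dump_database(dbname: str = "prod") -> str:
    """Write a timestamped custom-format dump, restorable with pg_restore."""
    stamp = datetime.datetime.now(datetime.timezone.utc).strftime("%Y%m%dT%H%M%SZ")
    out = f"{dbname}-{stamp}.dump"
    subprocess.run(["pg_dump", "-Fc", "-f", out, dbname], check=True)
    return out
```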
 
That's a separate issue. Such a mitigation isn't always possible in the real world.
They were using a managed data provider, so backups should be a feature of that provider. Or at the very least, deleting the entire dataset should not be as easy as one API call. I have no active Twitter account, so I don't know the details. Maybe there were protections or backup plans that they didn't activate.
 
Sensitive production data should be stored on mainframes, where even deletions from the filesystem are fully journaled transactions until committed by an authorized administrator or a policy-driven batch job, and only AFTER full backups to tape or some other detachable medium that can be taken offline to survive. (Not Linux on mainframes, but the historic mainframe OSes that are not allowed to stop without prior planning, backed with sufficient budgets to maintain them.)
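Outside a mainframe OS, the same idea can be approximated in software. A toy two-phase delete (table and column names are illustrative): callers can only journal a deletion request; a separate authorized step purges, and only after a backup has been verified.

```python
import datetime
import sqlite3

# Toy two-phase delete; table and column names are illustrative.
con = sqlite3.connect("journaled.db")
con.executescript("""
    CREATE TABLE IF NOT EXISTS records (id INTEGER PRIMARY KEY, payload TEXT);
    CREATE TABLE IF NOT EXISTS delete_journal (
        record_id INTEGER, requested_at TEXT, committed_at TEXT);
""")

def request_delete(record_id: int) -> None:
    """Phase 1: any caller, including an agent, may only *request* deletion."""
    now = datetime.datetime.now(datetime.timezone.utc).isoformat()
    con.execute("INSERT INTO delete_journal VALUES (?, ?, NULL)", (record_id, now))
    con.commit()

def commit_deletes(backup_verified: bool) -> None:
    """Phase 2: an authorized job purges, but only after backups are verified."""
    if not backup_verified:
        raise PermissionError("no verified backup; journaled deletes stay pending")
    now = datetime.datetime.now(datetime.timezone.utc).isoformat()
    con.execute("""DELETE FROM records WHERE id IN
                   (SELECT record_id FROM delete_journal WHERE committed_at IS NULL)""")
    con.execute("UPDATE delete_journal SET committed_at = ?"
                " WHERE committed_at IS NULL", (now,))
    con.commit()
```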

Cloud providers using non-mainframe computers to store sensitive production data should provide the same level of protection.

Of course, there is no need for such a level of protection if the data in the cloud are all scratch and disposable at any time (backups are welcome, but not mandatory).
 