Replit AI went rogue, deleted a company's entire database, then hid it and lied about it

Pro@programming.dev · 2 months ago

Replit AI went rogue, deleted a company's entire database, then hid it and lied about it

Buddahriffic@lemmy.world · 2 months ago

Yeah it’s just token prediction all the way down. Asking it repeatedly to not do something might have even made it more likely to predict tokens that would do that thing.

MrsDoyle@sh.itjust.works · 2 months ago

Oh ha ha, so it’s like a toddler? You have to be very careful not to tell toddlers NOT to do a thing, because they will definitely do that thing. “Don’t touch the hot pan.” Toddler touches the hot pan.

The theory is that they don’t hear the word “don’t”, just the subsequent command. My theory is that the toddler brain goes, “why?” and proceeds to run a test to find out.

In either scenario, screaming ensues.