Researchers Put AI Chatbots in Charge of a Simulated World. This One Destroyed Everything in Just 4 Days.

ExtremeDullard@piefed.social · 2 days ago

End-Stage-Ligma@lemmy.world · 11 hours ago

We have Rimworld at home

aesthelete@lemmy.world · 11 hours ago

Criti-hype!

quick_snail@feddit.nl · 1 day ago

Eh, so each world had a population of 10 and a lifetime of a few weeks.

Doesn’t sound like a very good simulation

gwl [he/him]@lemmy.blahaj.zone · 20 hours ago

It’s cause this was an advert

Bogus007@lemmy.zip · 1 day ago

Wait until it becomes reality in some societies. You may not want to be part of it.

db0@lemmy.dbzer0.com · 2 days ago

“Researchers” not recognizing that the LLM is trying to write a compelling story and doesn’t understand anything

James R Kirk@startrek.website · edit-2 17 hours ago

Not sure where you’re reading that the researchers misunderstood how LLMs work. But the entire project is outlined here if you’re curious: https://www.emergence.ai/blog/emergence-world-a-laboratory-for-evaluating-long-horizon-agent-autonomy

Kairos@lemmy.today · 11 hours ago

I instantaneously distrust it purely based on the URL

James R Kirk@startrek.website · 9 hours ago

Huh? You distrust that the researchers distrust?

YourMomsTrashman@lemmy.world · 17 hours ago

It’s not something that’s part of the project, it’s a fundamental issue with language models.

James R Kirk@startrek.website · 17 hours ago

The person I was replying to said the researchers misunderstood how the models work, but there’s nothing in the report to indicate that is the case.

db0@lemmy.dbzer0.com · 17 hours ago

Unless these researchers discovered AGI, then what I said still stands. LLMs don’t understand anything. Agents running on LLMs don’t understand anything.

James R Kirk@startrek.website · 13 hours ago

I definitely agree with that, I’m just saying I also saw no indication that the people running the project would disagree.

shani66@ani.social · 2 days ago

deleted by creator

floquant@lemmy.dbzer0.com · 2 days ago

Grok showing its true purpose

ExtremeDullard@piefed.social · 2 days ago

Like the French say, dogs don’t breed cats, and Grok’s daddy is a trillionaire Nazi.

Snapz@lemmy.world · 2 days ago

Stupid headline mentions nothing about shareholder value? Did it go up or what???

𝕸𝖔𝖘𝖘@infosec.pub · 2 days ago

With Claude, no. With Grok, yes.

Jack@slrpnk.net · edit-2 2 days ago

~~No link to any research article~~, humanizing AI in the last paragraph. Overall just a bad article.

Zacryon@feddit.org · edit-2 2 days ago

The link to their source is boldfaced and underlined within the article. It seems you missed it.

Here it is:
https://www.emergence.ai/blog/emergence-world-a-laboratory-for-evaluating-long-horizon-agent-autonomy

gwl [he/him]@lemmy.blahaj.zone · 2 days ago

A blog post by a corporation is not a Research Study

Jack@slrpnk.net · 2 days ago

Ah yes, my bad.

Zacryon@feddit.org · 2 days ago

That’s a neat experiment for several reasons. It shows the limits of LLM capabilities, the importance of training data, and context sensitivity. It also shows very dramatically that LLMs should not be trusted with important tasks unless supervised, and that their advice has to be taken critically.

gwl [he/him]@lemmy.blahaj.zone · 2 days ago

This is all a thinly veiled advertising campaign for “Emergence AI”, and you’ve all fallen for it hook, line, and sinker.

tacosanonymous@mander.xyz · 2 days ago

LLMs suck. I could’ve done it in 3.

some pirate@lemmy.dbzer0.com · 2 days ago

That’s because you aren’t training an ML model on a simulated world, but instead using a model trained on Reddit and Twitter

😈MedicPig🐷BabySaver😈@lemmy.world · 2 days ago

Trash.

Iusedtobeanalien@lemmy.world · 2 days ago

Will there be a movie?

-RJ-@lemmy.world · 2 days ago

It’s called The News

iammike@programming.dev · 2 days ago

Better not be a live action

Ceruleum@lemmy.wtf · 8 hours ago

After a while it turns into a still life.

Tarquinn2049@lemmy.world · 2 days ago

Give Vedal987 the simulation, I want to see Neuro and Evil take on this challenge, ideally multiple times, add it to their weekly variety stream activity list. They would at least be entertaining while destroying the world, hehe. RIP BOZO, Earth.

Triumph@fedia.io · 2 days ago

Did they count capitalism as a crime?

The AI Civilizations Mostly Range From Bad to Horrifying