How Cloudflare uses generative AI to slow down, confuse, and waste the resources of AI Crawlers and other bots that don’t respect “no crawl” directives.
Will this further fuck up the inaccurate nature of AI results? While I’m rooting against shitty AI usage, the general population still trusts it, and making results worse will most likely make people believe even more wrong stuff.
The article says it’s not poisoning the AI data, only providing valid facts. The scraper still gets content, just not the content it was aiming for.
E:

It is important to us that we don’t generate inaccurate content that contributes to the spread of misinformation on the Internet, so the content we generate is real and related to scientific facts, just not relevant or proprietary to the site being crawled.
Removed by mod
That take would be more digestible if I wasn’t stuck on the same planet as those people.
Removed by mod
Thank you for catching that. Even reading through again, I couldn’t find it while skimming. With the mention of X2 and RSS, I assumed that paragraph would just be more technical description outside my knowledge. Instead, what I did home in on was
“No real human would go four links deep into a maze of AI-generated nonsense.”
Leading me to be pessimistic.
Removed by mod
Until the AI generating the content starts hallucinating.
Removed by mod
The problem I see with poisoning the data is the AIs being trained for law enforcement hallucinating false facts used to arrest and convict people.
Law enforcement AI is a terrible idea and it doesn’t matter whether you feed it “false facts” or not. There’s enough bias in law enforcement that the data is essentially always poisoned.
Removed by mod
They aren’t poisoning the data with disinformation.
They’re poisoning it with accurate, but irrelevant information.
For example, if a bot is crawling sites relating to computer programming, or weather, this tool might lure the crawler into pages related to animal facts, or human biology.
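The lure idea described above can be sketched in a few lines. This is a hypothetical illustration, not Cloudflare's actual implementation; all names and facts here are made up for the example. A crawler that respects no-crawl rules gets the real page, while a misbehaving one gets a true-but-irrelevant fact plus links that only lead deeper into the maze:

```python
import hashlib

# Accurate but off-topic filler, per the "animal facts or human biology" example.
OFF_TOPIC_FACTS = [
    "Octopuses have three hearts.",
    "The human body contains roughly 37 trillion cells.",
    "Honey stored sealed can remain edible for centuries.",
]

def serve(path: str, respects_no_crawl: bool) -> dict:
    """Return real content for polite clients, a decoy maze page for bad bots."""
    if respects_no_crawl:
        return {"content": f"Real content for {path}", "links": []}
    # Deterministically pick a fact for this path so the maze looks like a
    # stable site, and link further into it so the crawler keeps spending
    # requests without ever reaching the content it was after.
    idx = int(hashlib.sha256(path.encode()).hexdigest(), 16) % len(OFF_TOPIC_FACTS)
    return {
        "content": OFF_TOPIC_FACTS[idx],
        "links": [f"{path}/maze-{idx}-{i}" for i in range(3)],
    }
```

Following any of the returned links just produces another decoy page with more maze links, so the bot's crawl budget is wasted on content that is factually true but useless for its target.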
Law enforcement doesn’t convict anyone, that’s a judge’s job. If a LEO falsely arrests you, you can sue them, and it should be pretty open-and-shut if it’s due to AI hallucination. Enough of that and LEO will stop it.
More likely they will remove your ability to sue them, if you are talking about the USA and many other countries.