Want to wade into the sandy surf of the abyss? Have a sneer percolating in your system but not enough time/energy to make a whole post about it? Go forth and be mid: Welcome to the Stubsack, your first port of call for learning fresh Awful youāll near-instantly regret.
Any awful.systems sub may be subsneered in this subthread, techtakes or no.
If your sneer seems higher quality than you thought, feel free to cutānāpaste it into its own post ā thereās no quota for posting and the bar really isnāt that high.
The post Xitter web has spawned soo many āesotericā right wing freaks, but thereās no appropriate sneer-space for them. Iām talking redscare-ish, reality challenged āculture criticsā who write about everything but understand nothing. Iām talking about reply-guys who make the same 6 tweets about the same 3 subjects. Theyāre inescapable at this point, yet I donāt see them mocked (as much as they should be)
Like, there was one dude a while back who insisted that women couldnāt be surgeons because they didnāt believe in the moon or in stars? I think each and every one of these guys is uniquely fucked up and if I canāt escape them, I would love to sneer at them.
(Credit and/or blame to David Gerard for starting this.)


wild article about content scraping nonprofit common crawl
https://www.theatlantic.com/technology/2025/11/common-crawl-ai-training-data/684567/?gift=iWa_iB9lkw4UuiWbIbrWGQv84IP0_-K67yuVC013Fx4
tl;dr theyāve been faking deleting data upon request (in ways that I find very funny) and their head is noxious even for a tech bro
also is it just me or does SV have a particular gift for perverting the nonprofit concept
makes me wonder if itās some crypto hangover
cheerleaders for creepiest weirdos in sv try to deflect criticism by becoming impossible to parody
@sc_griffith @BlueMonday1984 It enrages me that early on in the article, the founder states that āFair useā a US construct for US copyright law only, means they can apply it to the Worlds data. The USA signed up to the Berne convention. Itās imperfect, but dammit, the signatories are meant to uphold copyrights of every country who signed up. Not ignore it and decide US copyright is the only law.
Aaand breathe.
sv does have for some time a peculiar understanding of this and also some other terms, like āconsentā, āownershipā, āprivacyā, āsafetyā,
wasnāt common crawl the one that pulled a similar trick to googās āif you label a thing as $x we wonāt include youā[0]? I could swear I heard their name in association with some derpshit intake management stuff above and beyond the typical fundamental āfree/open scraper setā problems
[0] - a tactic google first pulled with Streetview cars pulling in a pile of wifi beacons and tying it to location - āif you donāt want it just rename your AP to ā{prefix} - {apname}āā. a reply that was just dumb and aggravating but also it fucking sucks that basically no standards have taken this problem to heart in the ~15y hence