Stubsack: weekly thread for sneers not worth an entire post, week ending Sunday 27 October 2024
https://awful.systems/post/2668740
=> More informations about this toot | More toots from froztbyte@awful.systems
has the era of active sabotage of the autoplag inputs begun? let’s hope so
=> More informations about this toot | More toots from froztbyte@awful.systems
It would be funny if someone was literally beating up servers with a wooden shoe.
=> More informations about this toot | More toots from JFranek@awful.systems
“percussive maintenance”
=> More informations about this toot | More toots from froztbyte@awful.systems
momty python style giant sabot descends on Microsoft data centre
=> More informations about this toot | More toots from dgerard@awful.systems
Considering Glaze and Nightshade have been around for a while, and I talked about sabotaging scrapers back in July, arguably, it already has.
Hell, I ran across a much smaller scale case of this a couple days ago:
Not sure how effective it is, but if Elon’s stealing your data for his autoplag no matter what, you might as well try to force-feed it as much poison as you can.
=> More informations about this toot | More toots from BlueMonday1984@awful.systems
I saw people say they would add 10% opaque layers of the musk with Epstein’s accomplice (whos name i forgot for a second and too lazy to look her up) photo. Would be nice if there was a tool to do so automatically. (Not that i post on twitter anymore).
=> More informations about this toot | More toots from Soyweiser@awful.systems
tbh that sounds like a pretty easy script to write! Too bad I am not near a computer rn
=> More informations about this toot | More toots from swlabr@awful.systems
Wouldn’t really need a script, though. Just open up photoshop or GIMP and add a layer after everything is finished.
=> More informations about this toot | More toots from ShakingMyHead@awful.systems
But that doesn’t scale properly, you want ideally some sort of browser extension that just automatically does it for you before the data gets send to twitter.
=> More informations about this toot | More toots from Soyweiser@awful.systems
I got nerd sniped into trying to resize felons_musk_and_maxwell.webp to the same size as some base image before compositing it on top with a 10% dissolve in the same magick invocation but I need to sleep so I’m giving up for now.
=> More informations about this toot | More toots from bitofhope@awful.systems
It’s almost completely ineffective, sorry. It’s certainly not as effective as exfiltrating weights via neighborly means.
On Glaze and Nightshade, my prior rant hasn’t yet been invalidated and there’s no upcoming mathematics which tilt the scales in favor of anti-training techniques. In general, scrapers for training sets are now augmented with alignment models, which test inputs to see how well the tags line up; your example might be rejected as insufficiently normal-cat-like.
I think that “force-feeding” is probably not the right metaphor. At scale, more effort goes into cleaning and tagging than into scraping; most of that “forced” input is destined to be discarded or retagged.
=> More informations about this toot | More toots from corbin@awful.systems
yeah this is the thing I’ve been thinking a lot about
fucking reCaptcha is literally mass-weaponising users for data filtration, and there is no good counter besides just not using reCaptcha (which is something one can’t easily pull off without things like regulatory action, massive reputational problems that make people gtfo, etc)
I have similar worries about cloudflare being such a massive chokepoint and using that position to enable “ai bot filter” services. feels extremely monopolistic, but ianal and I’m not entirely sure what the case grounds/structure on that would be (if any)
the only other viable strategy at the moment is fully breaking contact with any potential bad traffic systems, and that’s extremely fucking dire because that’s yet another nail in the coffin of the increasingly less open internet
=> More informations about this toot | More toots from froztbyte@awful.systems
The whole Cloudflare bot detection is so weird and eerie. I’ve had issues where I can’t get past it presumably just because I’m using some in-application browser just to get a login cookie, but other times it just lets fucking curl through no questions asked.
=> More informations about this toot | More toots from bitofhope@awful.systems
it just lets fucking curl through no questions asked
Fucking what. I’ve heard of sites blocking curl and I’ve been able to get around it by copying user agent and sometimes cookies from the browser. Now I’m cursed with the knowledge that I could probably just scrape stuff from everywhere
=> More informations about this toot | More toots from ibt3321@lemmy.blahaj.zone
I thought they were gonna do that themselves by feeding on their own outputs littered all over the www. Maybe they can use some help.
=> More informations about this toot | More toots from luciole@beehaw.org
that’s also happening, but yeah it’s going to have to be a team effort
=> More informations about this toot | More toots from froztbyte@awful.systems
They added sleeps to training jobs? Sounds like they deserve a raise for improving energy efficiency instead…
=> More informations about this toot | More toots from antifuchs@awful.systems
it’s called clogging folks, we put a little clog in the machine
=> More informations about this toot | More toots from sc_griffith@awful.systems
ooh I like that
=> More informations about this toot | More toots from froztbyte@awful.systems
I’m sorry Mr. Musk, grok’s a bit constipated today. Someone fed it too much cheese.
=> More informations about this toot | More toots from o7___o7@awful.systems This content has been proxied by September (3851b).Proxy Information
text/gemini