Sabot in the Age of AI
Here is a curated list of strategies, offensive methods, and tactics for (algorithmic) sabotage, disruption, and deliberate poisoning.
π» iocaine
The deadliest AI poisonβiocaine generates garbage rather than slowing crawlers.
π https://git.madhouse-project.org/algernon/iocaine
π» Nepenthes
A tarpit designed to catch web crawlers, especially those scraping for LLMs. It devours anything that gets too close. @aaron
π https://zadzmo.org/code/nepenthes/
π» Quixotic
Feeds fake content to bots and robots.txt-ignoring #LLM scrapers. @marcusb
π https://marcusb.org/hacks/quixotic.html
π» Poison the WeLLMs
A reverse-proxy that serves diassociated-press style reimaginings of your upstream pages, poisoning any LLMs that scrape your content. @mike
π https://codeberg.org/MikeCoats/poison-the-wellms
π» Django-llm-poison
A django app that poisons content when served to #AI bots. @Fingel
π https://github.com/Fingel/django-llm-poison
π» KonterfAI
A model poisoner that generates nonsense content to degenerate LLMs.
π https://codeberg.org/konterfai/konterfai
=> More informations about this toot | More toots from asrg@tldr.nettime.org
@mike @asrg @marcusb @Fingel @aaron And, for something lightweight and easy for anyone to implement, may I submit a #WordPress plugin prototype:
https://kevinfreitas.net/tools-experiments/
[#]AI
=> More informations about this toot | More toots from KevinFreitas@mastodon.social
@KevinFreitas
Question: does this distinguish between AI scraping bots and search bots? Can we assume they are not the same thing?
@mike @asrg @marcusb @Fingel @aaron
=> More informations about this toot | More toots from rgulick@social.coop
@aaron @marcusb @mike @asrg @Fingel @rgulick It does. I look through lists of AI bot identifiers and include those. In a future version Iβll set it up so folks can customize this themselves, too.
=> More informations about this toot | More toots from KevinFreitas@mastodon.social
@rgulick @KevinFreitas @mike @asrg @marcusb @aaron List of AI user agents comes from here: https://github.com/ai-robots-txt/ai.robots.txt
=> More informations about this toot | More toots from Fingel@indieweb.social
@KevinFreitas @mike @asrg @marcusb @Fingel @aaron I turned on a hell pot like this for crawlers, and I wanted to give them tons of garbage, but I wasnβt expecting the hosting bill for the data transfer costs. That put an end to my hell pot experiment, real quick. π
=> More informations about this toot | More toots from ramsey@phpc.social This content has been proxied by September (ba2dc).Proxy Information
text/gemini