AI companies are gonna force Google into abandoning important crawling standards.
I'm seeing lots of traffic in various places lately about how much worse #Search is these days and a lot about how great LLM chatbots are now at finding information.
The reality of this is that Google is honouring the robots.txt file, as well as the other no-crawl hints that sites provide. LLM trainers are just scraping anything and everything they can, at the expense of site owners.
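To make the distinction concrete: a well-behaved crawler checks robots.txt before fetching anything, while a scraper simply ignores it. Here is a minimal sketch using Python's standard urllib.robotparser; the rules and URLs below are hypothetical, not taken from any real site.

```python
from urllib import robotparser

# Hypothetical robots.txt: Googlebot may crawl everything except /private/,
# while all other agents are disallowed entirely.
rules = """
User-agent: Googlebot
Disallow: /private/

User-agent: *
Disallow: /
""".splitlines()

rp = robotparser.RobotFileParser()
rp.parse(rules)

# An honouring crawler consults can_fetch() before each request.
print(rp.can_fetch("Googlebot", "https://example.com/public/page"))    # True
print(rp.can_fetch("Googlebot", "https://example.com/private/page"))   # False
print(rp.can_fetch("SomeScraper", "https://example.com/public/page"))  # False
```

Nothing technically enforces this check, which is the whole point of the thread: robots.txt only constrains crawlers that choose to respect it.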
If Google starts bleeding enough users, they will likewise be forced to abandon crawler instructions.
Site owners will of course then be forced to wall off their content. Get used to signing into sites to even see basic stuff. The entire Internet is about to go deep into #Enshittification and it'll basically be the death knell of the open idea of the WWW.
I don't see a path through this that doesn't end up there, at least.
=> More information about this toot | More toots from fennix@infosec.space
@fennix
"“They were essentially siphoning off 80 [gigabytes] a day, or something crazy like that, from us,” he alleges. (Again, OpenAI disagrees with this.)"
Who are you gonna believe? The guy that OpenAI wants to take training data from, or the company that made a computer that can't do math?
=> More information about this toot | More toots from RnDanger@infosec.exchange
@RnDanger
Haha, right? Journalism died a while back and we all forgot to hold the wake.
=> More information about this toot | More toots from fennix@infosec.space