Writing a newsletter for @fumacapt based on an overwhelming crawl of our website by AI bots.
Similar to this post, but for laymen:
https://fredrocha.net/2024/09/26/ai-crawler-bots-on-the-hunt/
What should I read? Any experiences you want to share?
=> More informations about this toot | More toots from john_fisherman@mastodon.social
@john_fisherman @fumacapt
I'm worried that people are losing perspective on this topic. Crawling / mining open content is a positive thing, and it should not require permission. Much of the Internet depends on it. Eventual problems related to the output of Generative AI are a different, although related, problem.
=> More informations about this toot | More toots from edsantos@ciberlandia.pt
@edsantos @fumacapt I mean, if you're gonna scrape someone's handcrafted content (for-profit and without explicit consent) from their website, the least you can do is give credit and drive eyeballs to said website.
=> More informations about this toot | More toots from john_fisherman@mastodon.social
@john_fisherman @fumacapt
Depends on what you are doing with the materials, I guess. Not sure if attribution is within reach, when it can't even stop generating false information, but that would be nice, yes.
But my comment was about the idea that it would be a good thing to demand consent for scraping publicly available content.
=> More informations about this toot | More toots from edsantos@ciberlandia.pt This content has been proxied by September (3851b).Proxy Information
text/gemini