Ancestors

Toot

Written by Fred Rocha on 2024-11-28 at 19:50

Writing a newsletter for @fumacapt based on an overwhelming crawl of our website by AI bots.

Similar to this post, but for laymen:

https://fredrocha.net/2024/09/26/ai-crawler-bots-on-the-hunt/

What should I read? Any experiences you want to share?

=> More information about this toot | More toots from john_fisherman@mastodon.social

Descendants

Written by Eduardo :d3:⚖️​:copyleft:🔓 on 2024-11-28 at 19:59

@john_fisherman @fumacapt

I'm worried that people are losing perspective on this topic. Crawling / mining open content is a positive thing, and it should not require permission. Much of the Internet depends on it. Any problems with the output of Generative AI are a separate, although related, issue.

=> More information about this toot | More toots from edsantos@ciberlandia.pt

Written by Fred Rocha on 2024-11-29 at 09:54

@edsantos @fumacapt I mean, if you're gonna scrape someone's handcrafted content (for profit and without explicit consent) from their website, the least you can do is give credit and drive eyeballs to said website.

=> More information about this toot | More toots from john_fisherman@mastodon.social

Written by Eduardo :d3:⚖️​:copyleft:🔓 on 2024-11-29 at 18:20

@john_fisherman @fumacapt

Depends on what you are doing with the materials, I guess. Not sure if attribution is within reach when generative AI can't even stop generating false information, but that would be nice, yes.

But my comment was about the idea that it would be a good thing to demand consent for scraping publicly available content.

=> More information about this toot | More toots from edsantos@ciberlandia.pt
