Toot

Written by Clive Thompson on 2025-01-24 at 00:47

@marisa @asrg @peterfr

... attempt to detect if an OpenAI web-crawler is trying to copy the text off a web site ...

... and if so, it generates fake pages with crap text -- which the OpenAI web crawler assumes are real, and thus dutifully copies

So OpenAI winds up feeding junk/fake/mangled text as training material into its next version of ChatGPT

The attitude is: "So, you wanna copy our site, so you can train your AI -- without us getting a penny from you? Okay, here's some junk data"

=> More informations about this toot | View the thread | More toots from clive@saturation.social

Mentions

=> View marisa@mastodon.scot profile | View asrg@tldr.nettime.org profile | View peterfr@mastodon.art profile

Tags

Proxy Information
Original URL
gemini://mastogem.picasoft.net/toot/113880577598911784
Status Code
Success (20)
Meta
text/gemini
Capsule Response Time
226.657495 milliseconds
Gemini-to-HTML Time
0.478261 milliseconds

This content has been proxied by September (3851b).