@anildash Put a different way, I think one reason this doesn't exist is that the presence of stolen material in LLM models is not a flaw, but the primary attraction. Copyright laundering is the core product.
If the users did not want to do copyright laundering, then the product might not even need the machine learning model at all, in that world a simple tag system might be adequate. The purpose the model serves in the system is to randomize the inputs enough to disguise the sources.
=> More informations about this toot | View the thread | More toots from mcc@mastodon.social
=> View anildash@me.dm profile
text/gemini
This content has been proxied by September (3851b).