Comment on LEAKED: A New List Reveals Top Websites Meta Is Scraping of Copyrighted Content to Train Its AI
cm0002@lemmy.world 1 week agoSo far, OpenAI, anthropic et al hasn’t sued anyone over it, but they have cut account access when it’s discovered to be used for that purpose
It’s how early versions of deepseek were trained iirc, it’s called distillation