Multiple artificial intelligence companies are circumventing a common web standard used by publishers to block the scraping of their content for use in generative AI systems, content licensing startup TollBit has told publishers.
Hey, ’member when Google drove around and sopped up everyone’s wifi info and was all like, “What? We found it.” Then they threw it on the pile of data-4-sale that they’re still drowning in cash from.
Message received and understood! Oh, uh, here’s a couple-hundred-million fine for the uh, imposition. We’ll just leave it on the nightstand.
Arghblarg@lemmy.ca 5 months ago
Sounds like we’re all going to need to start putting the equivalent of Trap Streets in all our web content, source code, etc.
I heard someone has already had success placing nonsense in a white-on-white box of their site, later querying commercial AI to prove it was ingested w/o permission.
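A rough sketch of how that white-on-white trick could look in Python, in case anyone wants to try it. Everything here is made up for illustration (the syllable list, the function names, the inline styles); it is not how the person mentioned above actually did it. The idea is just: generate a unique nonsense phrase, hide it in the page, and later check whether a model's output repeats it. The `aria-hidden="true"` attribute is there so assistive tech skips the hidden text.

```python
import html
import secrets

# Hypothetical syllable list; any vocabulary of invented words would do.
SYLLABLES = ["zor", "plim", "quax", "vend", "drib", "molk", "thun", "sarp"]


def make_canary(n_words: int = 4) -> str:
    """Build a random nonsense phrase that is unlikely to exist anywhere else."""
    words = [
        secrets.choice(SYLLABLES) + secrets.choice(SYLLABLES)
        for _ in range(n_words)
    ]
    # A random hex suffix makes accidental collisions very unlikely.
    return " ".join(words) + " " + secrets.token_hex(4)


def hidden_html(canary: str) -> str:
    """Wrap the canary in markup that is visually hidden (white on white)
    and marked aria-hidden so screen readers skip it."""
    return (
        '<span aria-hidden="true" '
        'style="color:#fff;background:#fff;font-size:1px">'
        f"{html.escape(canary)}</span>"
    )


def appears_in(canary: str, model_output: str) -> bool:
    """Case-insensitive check for the canary in text returned by a model."""
    return canary.lower() in model_output.lower()


if __name__ == "__main__":
    canary = make_canary()
    print(hidden_html(canary))  # paste this snippet into the page
    # Later: feed whatever the AI returns into appears_in(canary, output).
```

Keep a dated record of each canary and the page it went on, otherwise a later match doesn't prove much.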
LiveLM@lemmy.zip 5 months ago
My fear is that those techniques will make life harder and harder for people using screen readers.
recursive_recursion@programming.dev 5 months ago
Here’s another example/variant (The Office - Recorder)
onlinepersona@programming.dev 5 months ago
There probably is a way to poison AI training material, and it could be a handy feature for social media.
Anti Commercial-AI license