Completely wrong.
They don’t need to keep the images because they hash them. They store the hashes - that’s the point. CSAM detection works the same way.
If your hash matches the database hash (on 2 or more databases), then it will be flagged for manual review. They don’t need to know which image it matched, because they look at your image and go “yeh that’s an intimate image so it’s a match”.
misk@sopuli.xyz 4 days ago
The whole point of using hashes here is to not store images.
irotsoma@lemmy.blahaj.zone 3 days ago
The system that scours search results doesn’t store the images, but they are stored. Maybe or maybe not by Google, but someone is collecting them and keeping them in order to feed whatever “AI” or hashing algorithm comes next.
And it’s actually not the “whole point” in a technical sense. It’s mentioned because they want to make it sound less harmful. You’d never compare actual images directly. That would take a ton of storage space and time to compare a large set of files byte for byte. You always use hashes. If it was easier or cheaper to use the images directly, they would, just like the “AI” agents that do this in other systems need the actual images not hashes.