• Riskable@programming.dev
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 day ago

    For images, it’s not even data collection because all the images that are used for these AI image generation tools are out on the internet for free for anyone to download right now. That’s how they’re obtained: A huge database of (highly categorized) image URLs (e.g. ImageNET) is crawled/downloaded.

    That’s not even remotely the same thing as “data collection”. That’s when a company vacuums everything they can from your private shit. Not that photo of an interesting building you uploaded to flickr over a decade ago.