We would need regulation to stop models from ingesting data they have no right to, which would mean something like laws governing ML algorithms, a requirement to declare what data you fed them, and so on. Like some kind of SOC2 audit for data provenance.
Maybe ML weights are just numbers, but then so are a movie, an MP3, a logo, a brand, and so on.