Public content is still subject to copyright, and I doubt that AppleBot only scr...

xena · on June 10, 2024

All you have to do is drop a token swear word into your content and they remove it from the dataset. Easy.

jimbobthrowawy · on June 11, 2024

Why would they? From the moderate of testing I've done of their handwriting recognition on an ipad, they seem to have everything risqué/offensive I could think of in there, even if you have to write it more clearly than other words. I don't expect this to be much different, other than a word filter on the output.

xena · on June 11, 2024

I mean for their large language model training. They said they don't include low quality data and swearing. This means you can get out of it by swearing.