Thanks for the pointer but I don't think this is what happened in my case. I never opened the marketing assets in VSCode. They reside in completely separate folders.
According to microsoft's documentation[1] it's unintentional. Presumably because it's picked up from whatever's in memory, rather than being collected intentionally.
>which may unintentionally contain user content, such as parts of a file you were using when the problem occurred
If VSCode scanned your files to feed Copilot, it's not just Microsoft who can access that, but anyone using copilot. Every single file in every single machine that ever ran VSCode would potentially be at anyone's grasp.
Allowing this to happen would've been incredibly stupid. But it's worth investigating further.
> Every single file in every single machine that ever ran VSCode would potentially be at anyone's grasp.
Even if they were scanning all the files, Copilot's terms [1] explicitly says they do not use the data to provide suggestions for other users.
> GitHub Copilot does not use these URLs, file paths, or snippets collected in your telemetry as suggestions for other users of GitHub Copilot. This information is treated as confidential information and accessed on a need-to-know basis.