The point about fidelity/quality is moot anyway when most people are listening to overcompressed[1] music on crappy bluetooth speakers and/or in a noisy environment.
It is not necessarily a protocol/technology issue, more a cultural one. Most people are just not looking at quality first and will buy whatever is cheap, loud and has the form factor they want. Music is so compressed nowadays that they don't even hear a difference between crappy and better quality speakers.
[1] as in dynamic range compression, not encoding