Right, this calculation runs on the client and assumes the clients are fast enough. Running it locally helps a lot with interactivity because the network stack isn’t involved at all.
It could be interesting to optionally offload some compute to much faster servers when it would be useful, but that introduces request/response latency overhead too.
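
To make that tradeoff concrete, here is a rough sketch. The function name and the timings are hypothetical and only for illustration, not taken from any real implementation. Offloading is only worth it when the server's compute time plus the network round trip is shorter than just doing the work locally:

```python
def should_offload(local_ms: float, remote_ms: float, round_trip_ms: float) -> bool:
    """Return True when the remote path is expected to finish sooner.

    All three inputs are estimates in milliseconds (assumed to be measured
    or guessed by the caller; nothing here measures them).
    """
    return remote_ms + round_trip_ms < local_ms


# A heavy calculation: the server is much faster, so the round trip pays off.
heavy = should_offload(local_ms=200.0, remote_ms=20.0, round_trip_ms=50.0)

# A light calculation: the round trip alone costs more than computing locally.
light = should_offload(local_ms=10.0, remote_ms=1.0, round_trip_ms=50.0)
```

With these made-up numbers the heavy case favors the server and the light case stays on the client, which is why it would probably need to be an optional, per-request decision.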