2001zhaozhao 70 days ago | on: Qwen3.6-27B: Flagship-Level Coding in a 27B Dense ...

I wonder whether getting actual GPUs instead would be much more cost-effective, measured as token throughput per unit of hardware and power cost, given that the model is only 27B parameters.
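
One rough way to frame that comparison is tokens generated per dollar of hardware plus electricity over the machine's useful life. The sketch below is only an illustration: the function name, the lifetime and electricity-price parameters, and every number in the usage lines are hypothetical placeholders, not measurements of any real device.

```python
def tokens_per_dollar(tokens_per_sec, hardware_cost_usd, power_watts,
                      lifetime_hours, usd_per_kwh):
    """Tokens generated per dollar of hardware plus electricity.

    Assumes the machine runs at a constant throughput and a constant
    power draw for its whole lifetime (a simplification).
    """
    total_tokens = tokens_per_sec * 3600 * lifetime_hours
    energy_cost_usd = power_watts / 1000 * lifetime_hours * usd_per_kwh
    return total_tokens / (hardware_cost_usd + energy_cost_usd)


# Placeholder inputs only: substitute measured throughput, real prices,
# and real power draw for the two setups being compared.
setup_a = tokens_per_dollar(tokens_per_sec=20, hardware_cost_usd=4000,
                            power_watts=150, lifetime_hours=10_000,
                            usd_per_kwh=0.15)
setup_b = tokens_per_dollar(tokens_per_sec=40, hardware_cost_usd=3000,
                            power_watts=450, lifetime_hours=10_000,
                            usd_per_kwh=0.15)
print(f"setup A: {setup_a:,.0f} tokens per dollar")
print(f"setup B: {setup_b:,.0f} tokens per dollar")
```

Whichever setup comes out ahead depends entirely on the inputs, so the useful part is the shape of the calculation: higher power draw only matters in proportion to how long the machine runs and what electricity costs.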