What’s baffling about all this is that both AMD and Intel have competing offerings, and those offerings see next to no traction despite being much more attractive from a cost standpoint. I understand why they aren’t taking off on the training side: fragmentation is very counterproductive there. But why not deploy them in large quantities for inference, at least? The effort of porting transformer-based models is trivial for both vendors, and the performance is very competitive.
It really doesn’t matter whether you have CUDA or not if you’re going to run inference at scale. As I said above (speaking from experience), porting models for inference is not a technically difficult problem. Indeed, with both Intel’s Gaudi and AMD’s MI-series accelerators, a lot of the popular architectures and their derivatives are supported either out of the box or with minimal tweaks.
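As a rough illustration of how little changes in practice (assuming a ROCm build of PyTorch and the Hugging Face transformers library; the model name below is just a placeholder), the ROCm build exposes the familiar torch.cuda API, so a plain inference script can run on an MI-series card without modification:

```python
# Minimal sketch: standard PyTorch inference code, unchanged for AMD hardware.
# A ROCm build of PyTorch maps the torch.cuda API to HIP, so device="cuda"
# targets an MI-series accelerator just as it would an NVIDIA GPU.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; any supported causal LM works the same way
device = "cuda" if torch.cuda.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).to(device)
model.eval()

inputs = tokenizer("The quick brown fox", return_tensors="pt").to(device)
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```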
The question is what the performance per watt of AMD's and Intel's products looks like. My guess is that both are significantly worse. Energy and cooling are huge data center expenses, and paying less up front for a product that draws more power and needs more cooling is not a good deal, because it ends up costing more overall.
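A back-of-the-envelope sketch of that trade-off, with entirely hypothetical prices, power draws, and electricity rates, just to show how worse performance per watt can wipe out a purchase-price advantage:

```python
# All numbers are made up for illustration; the point is only the shape of the math.
def lifetime_cost(purchase_usd, power_kw, years=5, usd_per_kwh=0.12, pue=1.5):
    """Purchase price plus electricity; PUE stands in for cooling overhead."""
    hours = years * 365 * 24
    return purchase_usd + power_kw * pue * usd_per_kwh * hours

# Hypothetical option A: one efficient accelerator meets the throughput target.
option_a = lifetime_cost(purchase_usd=30_000, power_kw=1.0)

# Hypothetical option B: two cheaper accelerators with half the performance per
# watt are needed for the same throughput, so the power draw doubles.
option_b = lifetime_cost(purchase_usd=2 * 13_000, power_kw=2.0)

print(f"efficient option: ${option_a:,.0f}")  # ~ $37,884
print(f"cheaper option:   ${option_b:,.0f}")  # ~ $41,768
```

With these made-up figures, the $4,000 saved on hardware is more than erased by the extra electricity and cooling over the accelerator's service life.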