Disclosure: I used to work for GCP and launched Preemptible VMs. Congrats! Can I...

vishnukool · on Oct 3, 2021

Thanks for the feedback, yeah at this stage we really just wanted to find out if we can build something useful for the community. Agreed on the pricing suggestion.

Also, interesting point about inference. I'm not sure though how common it is for companies to need GPUs for inference. Because if you can have a CPU based inference model, which I thought was most common, it's probably not a big usecase?

boulos · on Oct 3, 2021

I've got some comment somewhere on HN that says exactly that "try CPU inference first, it's pretty good".

The need to reach for a T4 comes when someone is doing a big model on images or video and wants sub-second response time. (Think some of the stuff on Snapchat, etc.)

nostrebored · on Oct 3, 2021

i've worked with people who have needed GPU powered inference on AWS. Training had to happen on p3's but inference happened on g4's. The pricing is lower (and depending on the use case, often the overall cost) and spot savings are usually less dramatic as these instances can be very in demand.