Because cost and speed. Smaller models can run on your phone for free, or on the...

		dudus on June 13, 2024 \| parent \| context \| favorite \| on: How Meta trains large language models at scale Because cost and speed. Smaller models can run on your phone for free, or on the cloud for pennies. An API call for a large LLM with a lot of context can cost orders of magnitude more and incur network latency