While you could run larger models with 128GB, I feel like 64GB is just about the right amount to run models at a reasonable speed with an M Max (I have an M3 Max 64GB).
That's kinda why I'd like a 64GiB M4 Air. 64 is the magic number for local LLMs of any reasonable capability. For example: deepseek-coder can write unit tests, gemma3 can summarize PDFs pretty well, etc. With only 32GiB you can't do much with LLMs, just run baby ones.
The base M4 is going to be too weak on the GPU side, and even an M4 Pro probably won't get you far. If you go with a base M4, 16GB or 32GB is fine. An M1/M2/M3/M4 Max is probably the minimum you need to run quantized 72B models, which is where 64GB becomes necessary.
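To make the 64GB claim concrete, here's a back-of-the-envelope calc. This is a rough sketch, not a benchmark: the 4-bit quantization and the ~20% overhead factor for KV cache/activations are my assumptions.

```python
# Rough memory estimate for a quantized LLM (all figures approximate).
def model_memory_gb(params_billion: float, bits_per_weight: float,
                    overhead: float = 1.2) -> float:
    """Weight footprint in GB, plus a fudge factor for KV cache, activations, etc."""
    weight_gb = params_billion * 1e9 * bits_per_weight / 8 / 1e9
    return weight_gb * overhead

# A 72B model at 4-bit: ~36 GB of weights, ~43 GB with overhead.
print(f"{model_memory_gb(72, 4):.0f} GB")  # -> 43 GB: fits in 64GB, not 32GB
# The same model at 8-bit: ~86 GB, which pushes you toward 128GB.
print(f"{model_memory_gb(72, 8):.0f} GB")  # -> 86 GB
```

Also worth remembering that macOS only exposes part of unified memory to the GPU by default, so the real headroom on a 64GB machine is a bit tighter than the raw numbers suggest.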