Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This, plus metal acceleration works quite well. 7~8B parameter models quantized to 3bpw or so run with good tok/s on my iphone 15 pro


It works quite well as long as you don't care about battery.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: