Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

SIMD intrinsics and manually unrolled loops are surely needed. That's the reason why all BLAS libraries vectorize and unroll loops manually. Even modern compilers can't properly auto-vectorize and unroll with 100% success rate.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: