Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Took me a while, but I got 97% efficiency with single core DGEMM.

In my experience, it's pretty widely accepted that VLIW (and EPIC) can achieve high performance and efficiency on highly regular tasks such as GEMM and FFT. That's why VLIW has been and continues to be popular for DSPs. The struggle for VLIW is general purpose code that doesn't necessarily have that same kind of regularity.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: