
In case anyone missed the reference: https://arxiv.org/abs/1706.03762

> (...) We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely.


