Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You're exactly right. Hadoop and it's ecosystem are... a very poor replica of the system whose predecessor the respective Google papers were written about. There's glaring efficiency things like not supporting complex encodings (last I used HDFS, at least, it could only do full replication) or the query engines on top not implementing sampling in their aggregates as a default/out of the box, which the Google tools do.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: