Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

i suspect that "awfully high" drove the inference trajectory right onto the edge of the dopey part of the internet/compressed representation and random chance allowed it to fall right on in...


hah yeah. now that i think of it, i bet the word high with no punctuation that is immediately adjacent to a document separator is probably extremely correlated with the topic it fell into.

i wonder what other sorts of atoms with similar idf-like scores exist.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: