
Are you using smoothing for large n? Kneser-Ney smoothing seems to give the best results.

http://nlp.stanford.edu/~wcmac/papers/20050421-smoothing-tut...
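
For anyone who wants to see the idea in code, here is a rough sketch of interpolated Kneser-Ney for the bigram case only. It is not taken from the linked tutorial: the function name `train_kn_bigram`, the fixed discount of 0.75, and the toy corpus are all assumptions made for illustration, and a real model for large n would apply the same discounting recursively across orders.

```python
from collections import Counter, defaultdict


def train_kn_bigram(tokens, discount=0.75):
    """Return prob(word, context) for an interpolated Kneser-Ney bigram model.

    Hypothetical helper for illustration; discount=0.75 is an assumed value.
    """
    bigrams = Counter(zip(tokens, tokens[1:]))
    if not bigrams:
        raise ValueError("need at least two tokens")
    context_count = Counter()      # c(v): how often v starts a bigram
    followers = defaultdict(set)   # distinct words seen after v
    preceders = defaultdict(set)   # distinct words seen before w
    for (v, w), count in bigrams.items():
        context_count[v] += count
        followers[v].add(w)
        preceders[w].add(v)
    n_bigram_types = len(bigrams)

    def prob(word, context):
        # Continuation probability: share of distinct bigram types ending in `word`.
        p_cont = len(preceders.get(word, ())) / n_bigram_types
        c_v = context_count.get(context, 0)
        if c_v == 0:
            # Unseen context: fall back to the continuation distribution.
            return p_cont
        # Subtract a fixed discount from every seen bigram count...
        discounted = max(bigrams.get((context, word), 0) - discount, 0.0) / c_v
        # ...and hand the freed mass to the continuation distribution.
        backoff_weight = discount * len(followers[context]) / c_v
        return discounted + backoff_weight * p_cont

    return prob


# Toy usage (assumed corpus, not real data).
tokens = "the cat sat on the mat the cat ate".split()
prob = train_kn_bigram(tokens)
print(prob("cat", "the"), prob("sat", "the"))
```

Note that a word that never appears as the second element of any bigram still gets probability zero here, so a real setup would also need some unknown-word handling.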



I did not read the paper, but what does "more accurate" mean in this case? Likelihood of some unseen data? That seems pretty hard to define or measure to me if the goal is sheer amusement.
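
If "likelihood of some unseen data" is the yardstick, the usual way to put a number on it is perplexity on held-out text. A minimal sketch, assuming a bigram model with add-k smoothing rather than Kneser-Ney to keep it short; the name `heldout_perplexity`, the `<unk>` token, and the toy strings are all made up for illustration:

```python
import math
from collections import Counter

UNK = "<unk>"  # assumed placeholder for words not seen in training


def heldout_perplexity(train_tokens, heldout_tokens, k=1.0):
    """Perplexity of an add-k smoothed bigram model on held-out tokens."""
    vocab = set(train_tokens) | {UNK}
    bigrams = Counter(zip(train_tokens, train_tokens[1:]))
    context_count = Counter(train_tokens[:-1])  # how often each word starts a bigram

    # Map unseen held-out words to the unknown token.
    mapped = [t if t in vocab else UNK for t in heldout_tokens]
    pairs = list(zip(mapped, mapped[1:]))
    if not pairs:
        raise ValueError("need at least two held-out tokens")

    log_prob = 0.0
    for v, w in pairs:
        p = (bigrams[(v, w)] + k) / (context_count[v] + k * len(vocab))
        log_prob += math.log(p)

    # Perplexity is exp of the average negative log-likelihood per bigram.
    return math.exp(-log_prob / len(pairs))


# Toy usage: text that follows the training pattern should score lower.
train = "a b a b a b a b".split()
print(heldout_perplexity(train, "a b a b".split()))
print(heldout_perplexity(train, "b b a a".split()))
```

Lower is better, but this only measures how well the model predicts real text. It says nothing about whether the generated output is funny.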


Interesting link, thanks!



