Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

As other commenters have pointed out in one way or another, the problem seems to actually be that this simplistic model of voter choice can't capture all the structure of the real world that humans can quickly infer from the setup. Things like: state elections have millions of voters, 55/45 is actually a decisive, not a narrow win etc.

In a generic setup, imagine you have a binary classifier that outputs probabilities in the .45-.55 range - likely it won't be a really strong classifier. You would ideally like polarized predictions, not values around .5.

Come to think of it, could this be an issue of non-ergodicity too ( hope I'm using the term right)? i.e. state level prior is not that informative wrt individual vote?



No, you want your model to be well calibrated. If the model accurately assessed a 0.55 probability of going blue, then that is what you want.

People who try to correct for “unbalanced classes” and contort their model to give polarizing predictions are frankly being pretty dumb.

The correct answer is to take your well calibrated probabilities and use you brain on what to do with them.


This is not a matter of class balance that much. If you want to predict which of two parties somebody will vote with, the most natural framing is that of binary classification.

For that you need to threshold your predictions. Ideally you'd like your model to generate a bimodal distribution so that you can threshold without many false positives etc.


Yes but the prompt here states that all we know is the probability is either 0.55 or 0.45. By definition this is the best model you can produce.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: