They can generalise to novel inputs. Granted, they often get it wrong, and they're clearly better at handling inputs they've seen before (who isn't?), but they can still reason about things they've never seen before.
Honestly, if you don't believe me, just go and use them. It's pretty obvious once you get some actual experience with them.
Current LLMs are equivalent to tabular Markov chains: a model with a fixed context window defines a fixed next-token distribution for every possible context, so in principle you could write the whole thing out as one gigantic lookup table (far too huge to ever actually compute or store). So at what table size does a tabular Markov chain become able to generalize to novel inputs?
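To make the comparison concrete, here's a minimal sketch of a tabular Markov chain (the class name and the toy corpus are just illustrative): a literal lookup table mapping seen contexts to next-token counts. The crux is that a genuinely novel context has no row at all, so the table has nothing to say about it.

```python
from collections import Counter, defaultdict

# A tabular Markov chain of fixed order: one count row per context
# actually observed during training, nothing else.
class TabularMarkovChain:
    def __init__(self, order: int):
        self.order = order
        self.table = defaultdict(Counter)  # context tuple -> next-token counts

    def train(self, tokens):
        # Record, for each length-`order` context, which token followed it.
        for i in range(len(tokens) - self.order):
            context = tuple(tokens[i:i + self.order])
            self.table[context][tokens[i + self.order]] += 1

    def next_token_distribution(self, context):
        counts = self.table.get(tuple(context))
        if counts is None:
            return None  # novel context: the table simply has no row for it
        total = sum(counts.values())
        return {tok: c / total for tok, c in counts.items()}

chain = TabularMarkovChain(order=2)
chain.train("the cat sat on the mat".split())

print(chain.next_token_distribution(["the", "cat"]))  # seen:  {'sat': 1.0}
print(chain.next_token_distribution(["the", "dog"]))  # novel: None

# Tabulating an LLM-scale chain would need one row per possible context.
# With (say) a 50,000-token vocabulary and a 2,048-token window, that is
# 50_000 ** 2_048, roughly 10^9624 rows -- the "too huge to realistically
# compute" part of the comment above.
```

That `None` on the novel context is the whole issue the question is probing: a literal table assigns nothing to contexts it never saw, so whatever generalization the equivalent LLM shows has to come from somewhere other than table lookup.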