Hacker News
dezmou
13 days ago
on:
Ask HN: Those making $500/month on side projects i...
I love it, I just purchased a pack. I've also found that it's a great tool for testing LLMs: take a screenshot of a half-solved game, feed it to ChatGPT along with the rules, and ask it to select the next target.
tikotus
13 days ago
Thank you so much! Also, you might find this interesting regarding testing LLMs:
https://www.nicksypteras.com/blog/cbs-benchmark.html
dezmou
13 days ago
Turns out Claude Sonnet 4.5 is far better at solving those than ChatGPT 5.2.