Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
reset-password
on April 18, 2023
|
parent
|
context
|
favorite
| on:
Reddit will begin charging for access to its API
LLMs already have problems with fact vs fiction. I don't see how Reddit of all places has "valuable data" in that regard.
uptownfunk
on April 18, 2023
|
next
[–]
I think the value is in the examples it provides of language.
nekoashide
on April 18, 2023
|
prev
|
next
[–]
Top upvoted comments can filter out the useless information and then it can be trained on actual data and refined.
Arrath
on April 18, 2023
|
parent
|
next
[–]
Except when top voted comments are hivemind approved 'funny' quips/responses, or in reply to exercises in creative writing like half the posts in relationshipadvice, iwantthemanager, nuclear/pettyrevenge, etc
aydyn
on April 18, 2023
|
parent
|
prev
|
next
[–]
Is this a joke that I'm missing? Top reddit posts are frequently trash filled with misinformation.
minimaxir
on April 18, 2023
|
prev
|
next
[–]
Many popular LLMs already include large amount of Reddit comment data which is (usually) cited in their respective papers.
surgical_fire
on April 18, 2023
|
prev
[–]
Reddit also has a problem with fact vs fiction.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: