Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Take thousands of prompts, generate several responses for each of them, and have human reviewers rank the responses for each prompt from best to worst

Recently I saw an image where Indian women sat in front of computers and the caption said they were classifying "AI" responses. I guess that's true and this kind of work is the new outsourced cheap labour in the AI age.



That Indian woman's idea of acceptable and not acceptable AI responses surely vary from that of a San Fransisco tech worker, or Cape Town motorcycle mechanic, or an English teacher from Liverpool.

I really doubt the mechanical turk method is applicable or even useful for the current state of AI-generated text.


i actually disagree a lot with this. Sure, if you asked something with heavy cultural baggage that would frequently be a real concern, but when you are primarily trying to bridge the machine-human chasm, our cultural differences among the examples you gave are trivial in comparison. For instance, if you offered an AI personal assistant but the catch was that it would (at least starting out) only have the perspective of an average middle-class Indian person, it would still beat the absolute crap out of "first generation" technology like Siri or Alexa!


It could be a first pass


I personally worked as a « human trainer » for the fine tuning of ChatGPT.

The pay was 50$ per hour, which is not bad for a side job as a student.


I'd say. Where do you apply for this kind of work?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: