Jan Leike: We had a significant team of people examine ChatGPT prompts and responses, after which say if a person response was preferable to a different reaction. All of this knowledge then acquired merged into 1 instruction operate. A lot of it is identical type of thing as what we did with InstructGPT. You wish it for being helpful, you need it b