The letter S in a light blue, stylized speech bubble followed by SpeakBits
SpeakBitsThe letter S in a light blue, stylized speech bubble followed by SpeakBits
Trending
Top
New
Controversial
Search
Groups

Enjoying SpeakBits?

Support the development of it by donating to Patreon or Ko-Fi.
About
Rules
Terms
Privacy
EULA
Cookies
Blog
Have feedback? We'd love to hear it!

OpenAI Wants AI to Help Humans Train AI

wired.com
submitted
a year ago
byiareuniquetotechnology

Summary

OpenAI developed a new model by fine-tuning its most powerful offering, GPT-4, to assist human trainers tasked with assessing code. The company found that the new model, dubbed CriticGPT, could catch bugs that humans missed. It is also part of an effort to ensure that AI behaves in acceptable ways even as it becomes more capable.

Dylan Hadfield-Menell, a professor at MIT who researches ways to align AI, says the idea of having AI models help train more powerful ones has been kicking around for a while. He says it remains to be seen how generally applicable and powerful it is.

 spatula ladle hair slide toaster-0
13

4 Comments

4
eldiscipulo
a year ago
A little telling most people don't ever want to talk about RLHF since it detracts from AI being an intelligent system by itself.
3
getthatmoneyyo
a year ago
Can't ruin the illusion
2
iareuniqueOP
a year ago
This is at least their way of admitting that
2
practicalmagic
a year ago
Makes sense. Models need intervention to ensure the right output. People need to be compensated appropriately for the time and data.