Deep Reinforcement Learning
from Human Preferences
Paul F Christiano
OpenAI
paul@openai.com
Jan Leike
DeepMind
leike@google.com
Tom B Brown
Google Brain*
tombbrown@google.com
Miljan Martic
DeepMind
miljanm@google.com
Shane Legg
DeepMind
legg@google.com
Dario Amodei
OpenAI
damodei@openai.com