Policy Shaping: Integrating Human Feedback with Reinforcement Learning
Shane Griffith, Kaushik Subramanian, Jonathan Scholz, Charles L. Isbell, and Andrea Thomaz
College of Computing
更多
Georgia Institute of Technology, Atlanta, GA 30332, USA
{sgriffith7, kausubbu, jkscholz}@gatech.edu,
{isbell, athomaz}@cc.gatech.edu
收起
文档评论