Augmenting Reinforcement Learning with Human Feedback
W. Bradley Knox BRADKNOX@CS.UTEXAS.EDU
University of Texas at Austin, Department of Computer Science
Policy Shaping: Integrating Human Feedback with Reinforcement Learning
Shane Griffith, Kaushik Subramanian, Jonathan Scholz, Charles L. Isbell, and Andrea Thomaz
College of Computing
Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance
Andrea L. Thomaz and Cynthia Breazeal
MIT Media Lab
Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning
Julia Kreutzer (1), Joshua Uyheng (3), and Stefan Riezler (1, 2)
(1) Computational Linguistics and (2) IWR, Heidelberg University, Germany
Reinforcement Learning with Human Teachers: Understanding How People Want to Teach Robots
Andrea L. Thomaz, Guy Hoffman, Cynthia Breazeal