Prediction Error as Curiosity: Intrinsic Motivation
Was this section helpful?
Curiosity-driven Exploration by Self-supervised Prediction, Deepak Pathak, Pulkit Agrawal, Alexei A. Efros, Trevor Darrell, 2017International Conference on Machine Learning (ICML)DOI: 10.48550/arXiv.1705.05363 - Introduces the Intrinsic Curiosity Module (ICM), a model that uses prediction error in a learned feature space to generate intrinsic rewards for exploration in deep reinforcement learning.
Reinforcement Learning: An Introduction, Richard S. Sutton, Andrew G. Barto, 2018 (The MIT Press) - A fundamental textbook that provides theoretical foundations of reinforcement learning, including principles of exploration and intrinsic motivation.
Exploration by Random Network Distillation, Yuri Burda, Harrison Edwards, Amos Storkey, Oleg Klimov, 2018International Conference on Learning Representations (ICLR)DOI: 10.48550/arXiv.1810.12894 - Presents Random Network Distillation (RND), an alternative intrinsic motivation method that robustly addresses common challenges in curiosity-driven exploration, such as the noisy TV problem.