Reinforcement Learning

1 Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press. 2016.

2 Peter Stone. “Reinforcement Learning”. Encyclopedia of Machine Learning and Data Mining. Springer. 2017.

3 Xiang Li, Jinghuan Shang, Srijan Das, Michael Ryoo. “Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?” Advances in Neural Information Processing Systems. Vol. 35. 2022. pp. 30865–30881. https://proceedings.neurips.cc/paper_files/paper/2022/hash/c75abb33341363ee874a71f81dc45a3a-Abstract-Conference.html.

4 Richard Sutton and Andrew Barto. Reinforcement Learning: An Introduction. 2nd edition. MIT Press. 2018. Michael Hu. The Art of Reinforcement Learning: Fundamentals, Mathematics, and Implementations with Python. Apress. 2023.

5 Brandon Brown and Alexander Zai. Deep Reinforcement Learning in Action. Manning Publications. 2020.

6 Richard Sutton and Andrew Barto. Reinforcement Learning: An Introduction. 2nd edition. MIT Press. 2018. Brandon Brown and Alexander Zai. Deep Reinforcement Learning in Action. Manning Publications. 2020.

7 Richard Sutton and Andrew Barto. Reinforcement Learning: An Introduction. 2nd edition. MIT Press. 2018. B. Ravi Kiran, Ibrahim Sobh, Victor Talpaert, Patrick Mannion, Ahmad A. Al Sallab, Senthil Yogamani, and Patrick Pérez. “Deep Reinforcement Learning for Autonomous Driving: A Survey”. IEEE Transactions on Intelligent Transportation Systems. Vol. 23, No. 6. 2022. pp. 4909–4926. https://ieeexplore.ieee.org/document/9351818.

8 Sergey Levine, Aviral Kumar, George Tucker, and Justin Fu. “Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems”. 2020. https://arxiv.org/abs/2005.01643. Julian Schrittwieser, Thomas Hubert, Amol Mandhane, Mohammadamin Barekatain, Ioannis Antonoglou, and David Silver. “Online and Offline Reinforcement Learning by Planning with a Learned Model”. Advances in Neural Information Processing Systems. Vol. 34. 2021. pp. 27580–27591. https://proceedings.neurips.cc/paper_files/paper/2021/hash/e8258e5140317ff36c7f8225a3bf9590-Abstract.html.

9 Martin Puterman and Jonathan Patrick. “Dynamic Programming”. Encyclopedia of Machine Learning and Data Mining. Springer. 2017.

10 Richard Sutton and Andrew Barto. Reinforcement Learning: An Introduction. 2nd edition. MIT Press. 2018. Phil Winder. Reinforcement Learning: Industrial Applications of Intelligent Agents. O’Reilly. 2020.

11 Richard Sutton and Andrew Barto. Reinforcement Learning: An Introduction. 2nd edition. MIT Press. 2018.

12 Michael Hu. The Art of Reinforcement Learning: Fundamentals, Mathematics, and Implementations with Python. Apress. 2023.

13 Richard Sutton and Andrew Barto. Reinforcement Learning: An Introduction. 2nd edition. MIT Press. 2018.

14 Richard Sutton and Andrew Barto. Reinforcement Learning: An Introduction. 2nd edition. MIT Press. 2018. Michael Hu. The Art of Reinforcement Learning: Fundamentals, Mathematics, and Implementations with Python. Apress. 2023.

15 Richard Sutton and Andrew Barto. Reinforcement Learning: An Introduction. 2nd edition. MIT Press. 2018.

16 Julian Ibarz, Jie Tan, Chelsea Finn, Mrinal Kalakrishnan, Peter Pastor, and Sergey Levine. “How to Train Your Robot with Deep Reinforcement Learning: Lessons We Have Learned”. The International Journal of Robotics Research. Vol. 40. 2021. pp. 698–721. https://journals.sagepub.com/doi/full/10.1177/0278364920987859.

17 Saminda Wishwajith Abeyruwan, Laura Graesser, David B. D’Ambrosio, Avi Singh, Anish Shankar, Alex Bewley, Deepali Jain, Krzysztof Marcin Choromanski, and Pannag R. Sanketi. “i-Sim2Real: Reinforcement Learning of Robotic Policies in Tight Human-Robot Interaction Loops”. Proceedings of the 6th Conference on Robot Learning. PMLR. No. 205. 2023. pp. 212–224. https://proceedings.mlr.press/v205/abeyruwan23a.html.

18 Homer Rich Walke, Jonathan Heewon Yang, Albert Yu, Aviral Kumar, Jędrzej Orbik, Avi Singh, and Sergey Levine. “Don’t Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning”. Proceedings of the 6th Conference on Robot Learning. PMLR. No. 205. 2023. pp. 1652–1662. https://proceedings.mlr.press/v205/walke23a.html.

19 Nikolaj Goodger, Peter Vamplew, Cameron Foale, and Richard Dazeley. “Language Representations for Generalization in Reinforcement Learning”. Proceedings of the 13th Asian Conference on Machine Learning. PMLR. No. 157. 2021. pp. 390–405. https://proceedings.mlr.press/v157/goodger21a.html. Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, and Jacob Andreas. “Guiding Pretraining in Reinforcement Learning with Large Language Models”. Proceedings of the 40th International Conference on Machine Learning. PMLR. No. 202. 2023. pp. 8657–8677. https://proceedings.mlr.press/v202/du23f.html. Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, and Roy Fox. “Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making Using Language-Guided World Modelling”. Proceedings of the 40th International Conference on Machine Learning. PMLR. No. 202. 2023. pp. 26311–26325. https://proceedings.mlr.press/v202/nottingham23a.html.

20 Ruoyao Wang, Peter Jansen, Marc-Alexandre Côté, and Prithviraj Ammanabrolu. “ScienceWorld: Is Your Agent Smarter Than a 5th Grader?” Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022. pp. 11279–11298. https://aclanthology.org/2022.emnlp-main.775/. Peter Jansen. “A Systematic Survey of Text Worlds as Embodied Natural Language Environments”. Proceedings of the 3rd Wordplay Workshop: When Language Meets Games. 2022. pp. 1–15. https://aclanthology.org/2022.wordplay-1.1.

21 Paloma Sodhi, Felix Wu, Ethan R. Elenberg, Kilian Q. Weinberger, and Ryan McDonald. “On the Effectiveness of Offline RL for Dialogue Response Generation”. Proceedings of the 40th International Conference on Machine Learning. PMLR. No. 202. 2023. pp. 32088–32104. https://proceedings.mlr.press/v202/sodhi23a.html. Siddharth Verma, Justin Fu, Sherry Yang, and Sergey Levine. “CHAI: A Chatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning”. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2022. pp. 4471–4491. https://aclanthology.org/2022.naacl-main.332/.