Fingerprint Policy Optimisation for Robust Reinforcement Learning (original) (raw)
Related papers
Contextual Policy Optimisation
ArXiv, 2018
Robust Reinforcement Learning with Bayesian Optimisation and Quadrature
2020
Bayesian Policy Optimization for Model Uncertainty
2019
Policy Gradient Bayesian Robust Optimization for Imitation Learning
ArXiv, 2021
Parameter-exploring policy gradients
Neural Networks, 2010
Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search
2018 IEEE International Conference on Robotics and Automation (ICRA)
Gradient-Aware Model-Based Policy Search
Proceedings of the AAAI Conference on Artificial Intelligence
Learning State Features from Policies to Bias Exploration in Reinforcement Learning
PILCO: A Model-Based and Data-Efficient Approach to Policy Search
2011
Where2Start: Leveraging initial States for Robust and Sample-Efficient Reinforcement Learning
arXiv (Cornell University), 2023
Gaussian Process Policy Optimization
ArXiv, 2020
State-Dependent Exploration for Policy Gradient Methods
Lecture Notes in Computer Science, 2008
Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization
2021
Sample Efficient Bayesian Reinforcement Learning
2020
Fast Model-based Policy Search for Universal Policy Networks
ArXiv, 2022
Policy Gradients with Parameter-Based Exploration for Control
2008
Generalizing Policy Advice with Gaussian Process Bandits for Dynamic Skill Improvement
Proceedings of the AAAI Conference on Artificial Intelligence
Inferring the Optimal Policy using Markov Chain Monte Carlo
ArXiv, 2019
Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization
arXiv (Cornell University), 2022
Contextual Direct Policy Search
Journal of Intelligent and Robotic Systems, 2019
Policy Optimization Through Approximate Importance Sampling
2019
A Survey on Policy Search Algorithms for Learning Robot Controllers in a Handful of Trials
IEEE Transactions on Robotics
Model-free Policy Learning with Reward Gradients
arXiv (Cornell University), 2021
Variable Risk Control via Stochastic Optimization
Policy Learning and Evaluation with Randomized Quasi-Monte Carlo
arXiv (Cornell University), 2022
Bayesian Policy Search with Policy Priors
2011
Robust Bayesian reinforcement learning through tight lower bounds
Arxiv preprint arXiv:1106.3651, 2011
Bayesian Distributional Policy Gradients
2021
Reward Shaping for Model-Based Bayesian Reinforcement Learning
2015
Data Efficient Learning of Robust Control Policies
2018
Learning and policy search in stochastic dynamical systems with Bayesian neural networks
2020
Proximal Policy Optimization Algorithms
2017
Provably Robust Blackbox Optimization for Reinforcement Learning
2019
Useful Policy Invariant Shaping from Arbitrary Advice
2020
A Practical and Conceptual Framework for Learning in Control
2010