Fingerprint Policy Optimisation for Robust Reinforcement Learning (original) (raw)

Contextual Policy Optimisation

Supratik Paul

ArXiv, 2018

View PDFchevron_right

Robust Reinforcement Learning with Bayesian Optimisation and Quadrature

Konstantinos Chatzilygeroudis

2020

View PDFchevron_right

Bayesian Policy Optimization for Model Uncertainty

Jeongseok Lee

2019

View PDFchevron_right

Policy Gradient Bayesian Robust Optimization for Imitation Learning

Ashwin Balakrishna

ArXiv, 2021

View PDFchevron_right

Parameter-exploring policy gradients

Christian Osendorfer

Neural Networks, 2010

View PDFchevron_right

Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search

Konstantinos Chatzilygeroudis

2018 IEEE International Conference on Robotics and Automation (ICRA)

View PDFchevron_right

Gradient-Aware Model-Based Policy Search

Matteo Papini

Proceedings of the AAAI Conference on Artificial Intelligence

View PDFchevron_right

Learning State Features from Policies to Bias Exploration in Reinforcement Learning

M. Afzal Upal

View PDFchevron_right

PILCO: A Model-Based and Data-Efficient Approach to Policy Search

Marc Deisenroth

2011

View PDFchevron_right

Where2Start: Leveraging initial States for Robust and Sample-Efficient Reinforcement Learning

raoof moayedi

arXiv (Cornell University), 2023

View PDFchevron_right

Gaussian Process Policy Optimization

Tejas Narayanan

ArXiv, 2020

View PDFchevron_right

State-Dependent Exploration for Policy Gradient Methods

Martin Felder

Lecture Notes in Computer Science, 2008

View PDFchevron_right

Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization

John Dolan

2021

View PDFchevron_right

Sample Efficient Bayesian Reinforcement Learning

Divya Grover

2020

View PDFchevron_right

Fast Model-based Policy Search for Universal Policy Networks

Buddhika Laknath

ArXiv, 2022

View PDFchevron_right

Policy Gradients with Parameter-Based Exploration for Control

Christian Osendorfer

2008

View PDFchevron_right

Generalizing Policy Advice with Gaussian Process Bandits for Dynamic Skill Improvement

Jared Glover

Proceedings of the AAAI Conference on Artificial Intelligence

View PDFchevron_right

Inferring the Optimal Policy using Markov Chain Monte Carlo

Albert Qu

ArXiv, 2019

View PDFchevron_right

Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization

Baher Abdulhai

arXiv (Cornell University), 2022

View PDFchevron_right

Contextual Direct Policy Search

Luis Paulo Reis

Journal of Intelligent and Robotic Systems, 2019

View PDFchevron_right

Policy Optimization Through Approximate Importance Sampling

Peter Vrancx

2019

View PDFchevron_right

A Survey on Policy Search Algorithms for Learning Robot Controllers in a Handful of Trials

Konstantinos Chatzilygeroudis

IEEE Transactions on Robotics

View PDFchevron_right

Model-free Policy Learning with Reward Gradients

Samuele Tosatto

arXiv (Cornell University), 2021

View PDFchevron_right

Variable Risk Control via Stochastic Optimization

Roderic A. Grupen

View PDFchevron_right

Policy Learning and Evaluation with Randomized Quasi-Monte Carlo

yi-fan chen

arXiv (Cornell University), 2022

View PDFchevron_right

Bayesian Policy Search with Policy Priors

David Wingate

2011

View PDFchevron_right

Robust Bayesian reinforcement learning through tight lower bounds

Christos Dimitrakakis

Arxiv preprint arXiv:1106.3651, 2011

View PDFchevron_right

Bayesian Distributional Policy Gradients

Luchen Li

2021

View PDFchevron_right

Reward Shaping for Model-Based Bayesian Reinforcement Learning

Yung-Kyun Noh

2015

View PDFchevron_right

Data Efficient Learning of Robust Control Policies

Susmit Jha

2018

View PDFchevron_right

Learning and policy search in stochastic dynamical systems with Bayesian neural networks

Steffen Udluft

2020

View PDFchevron_right

Proximal Policy Optimization Algorithms

Prafulla Dhariwal

2017

View PDFchevron_right

Provably Robust Blackbox Optimization for Reinforcement Learning

jasmine Hsu

2019

View PDFchevron_right

Useful Policy Invariant Shaping from Arbitrary Advice

Paniz Behboudian

2020

View PDFchevron_right

A Practical and Conceptual Framework for Learning in Control

Marc Deisenroth

2010

View PDFchevron_right

Fingerprint Policy Optimisation for Robust Reinforcement Learning (original) (raw)

Related papers