Reinforcement Learning exploiting state-action equivalence (original) (raw)
Related papers
Model-Based Reinforcement Learning Exploiting State-Action Equivalence
2019
Optimal Regret Bounds for Selecting the State Representation in Reinforcement Learning
Clustering markov decision processes for continual transfer
Benjamin Rosman, Subramanian Ramamoorthy
Learning in Markov Decision Processes under Constraints
ArXiv, 2020
Near-optimal Regret Bounds for Reinforcement Learning
Journal of Machine Learning Research, 2010
Using expert knowledge to construct error bound state-action aggregations for reinforcement learning
Proceedings of the 19th Belgian-Dutch …, 2007
Unsupervised Discovery of Decision States for Transfer in Reinforcement Learning
2019
Selecting near-optimal approximate state representations in reinforcement learning
A learning algorithm for Markov decision processes with adaptive state aggregation
2000
2013
Efficient Learning in Non-Stationary Linear Markov Decision Processes
ArXiv, 2020
Online Learning in Markov Decision Processes with Changing Cost Sequences
2014
Navigating to the Best Policy in Markov Decision Processes
2021
Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs
2018
Metrics for Finite Markov Decision Processes
2004
1998
Markov Decision Processes with Long-Term Average Constraints
ArXiv, 2021
Maximum Expected Hitting Cost of a Markov Decision Process and Informativeness of Rewards
2019
Regret Bounds for Restless Markov Bandits
Lecture Notes in Computer Science, 2012
Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems Ii, 1999
Learning Successor States and Goal-Dependent Values: A Mathematical Viewpoint
2021
2019
The Advantage Regret-Matching Actor-Critic
ArXiv, 2020
Improved Exploration in Factored Average-Reward MDPs
2021
Learning Algorithms for Markov Decision Processes with Average Cost
Dimitri Bertsekas, J. Abounadi
SIAM Journal on Control and Optimization, 2001
A Provably-Efficient Model-Free Algorithm for Constrained Markov Decision Processes
arXiv (Cornell University), 2021
2018
Reinforcement Learning with Non-Markovian Rewards
Proceedings of the AAAI Conference on Artificial Intelligence, 2020
A generalized reinforcement-learning model: Convergence and applications
MACHINE LEARNING-INTERNATIONAL WORKSHOP THEN CONFERENCE-, 1996
Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs
2021
Reinforcement Learning and Markov Decision Processes
Reinforcement Learning, 2012
Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model
Machine Learning, 2013
State Representation Learning for Goal-Conditioned Reinforcement Learning
2022
Hierarchical Representation Learning for Markov Decision Processes
2021
Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes
2020