Reinforcement Learning with Non-Markovian Rewards (original) (raw)

Learning Probabilistic Reward Machines from Non-Markovian Stochastic Reward Processes

Alvaro Velasquez

2021

View PDFchevron_right

Reinforcement Learning and Markov Decision Processes

Marco A. Wiering

Reinforcement Learning, 2012

View PDFchevron_right

Learning in Markov Decision Processes under Constraints

Abhishek Gupta

ArXiv, 2020

View PDFchevron_right

HQ-Learning: Discovering Markovian subgoals for non-Markovian reinforcement learning

Marco A. Wiering

1996

View PDFchevron_right

Advice-Guided Reinforcement Learning in a non-Markovian Environment

Jean Gaglione

Proceedings of the AAAI Conference on Artificial Intelligence

View PDFchevron_right

Generalized markov decision processes: Dynamic-programming and reinforcement-learning algorithms

Csaba Szepesvari

… of International Conference of Machine Learning

View PDFchevron_right

Learning without state-estimation in partially observable Markovian decision problems

Michael Jordan

Icml, 1984

View PDFchevron_right

Extending Markov Automata with State and Action Rewards

Mark Timmer

2014

View PDFchevron_right

NMRDPP: A System for Decision-Theoretic Planning with Non-Markovian Rewards

Charles Gretton

View PDFchevron_right

Non-markovian policies in sequential decision problems

Csaba Szepesvari

Acta Cybernetica, 1998

View PDFchevron_right

Decision-Theoretic Planning with non-Markovian Rewards

Charles Gretton, J. Slaney

Journal of Artificial Intelligence Research, 2006

View PDFchevron_right

Non-Stationary Markov Decision Processes, a Worst-Case Approach using Model-Based Reinforcement Learning, Extended version

Emmanuel Rachelson

2020

View PDFchevron_right

Learning in non-stationary Partially Observable Markov Decision Processes

Doina Precup

View PDFchevron_right

Maximum reward reinforcement learning: A non-cumulative reward criterion

Chai Quek

Expert Systems with Applications, 2006

View PDFchevron_right

Implementation and Comparison of Solution Methods for Decision Processes with Non-Markovian Rewards

Charles Gretton

2003

View PDFchevron_right

Human and machine learning in non-markovian decision making.

Silvia Marchesotti, Elisa Tartaglia, Aaron Clarke, M. Herzog

View PDFchevron_right

Non-Stationary Markov Decision Processes a Worst-Case Approach using Model-Based Reinforcement Learning

Emmanuel Rachelson

2019

View PDFchevron_right

Reinforcement learning methods for continuous-time Markov decision problems

Steven Bradtke

Advances in neural information processing systems, 1995

View PDFchevron_right

The effect of eligibility traces on finding optimal memoryless policies in partially observable Markov decision processes

John Loch

Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems Ii, 1999

View PDFchevron_right

A Sliding-Window Algorithm for Markov Decision Processes with Arbitrarily Changing Rewards and Transitions

Peter Auer

2018

View PDFchevron_right

Two stochastic dynamic programming problems by model-free actor-critic recurrent-network learning in non-Markovian settings

Stuart Dreyfus

2004

View PDFchevron_right

A generalized reinforcement-learning model: Convergence and applications

Csaba Szepesvari

MACHINE LEARNING-INTERNATIONAL WORKSHOP THEN CONFERENCE-, 1996

View PDFchevron_right

Value Function Based Reinforcement Learning in Changing Markovian Environments

Laszlo Monostori

Journal of Machine Learning Research - JMLR, 2008

View PDFchevron_right

Properties of Planning with Non-Markovian Rewards

Charles Gretton

View PDFchevron_right

A simulation-based learning automata framework for solving semi-Markov decision problems under long-run average reward

A. Gosavi

IIE Transactions, 2004

View PDFchevron_right

2 Reinforcement Learning 2 . 1 Markov Decision Processes

Justus Piater

2004

View PDFchevron_right

Markov Decision Processes with Long-Term Average Constraints

mridul agarwal

ArXiv, 2021

View PDFchevron_right

Eliciting additive reward functions for Markov decision processes

Craig Boutilier

2011

View PDFchevron_right

Algorithms for Reinforcement Learning Draft of the lecture published in the Synthesis Lectures on Artificial Intelligence and Machine Learning series by Morgan & Claypool Publishers

Dmitriy Portnyagin

View PDFchevron_right

Modelling and Analysis of Markov Reward Automata

Mark Timmer

Lecture Notes in Computer Science, 2014

View PDFchevron_right

Model and Reinforcement Learning for Markov Games with Risk Preferences

Wenjie Huang

Proceedings of the AAAI Conference on Artificial Intelligence, 2020

View PDFchevron_right

Reinforcement Learning with Stochastic Reward Machines

Jan Corazza

Proceedings of the AAAI Conference on Artificial Intelligence

View PDFchevron_right

Q-learning for history-based reinforcement learning

Peter Sunehag

View PDFchevron_right

Learning Successor States and Goal-Dependent Values: A Mathematical Viewpoint

Léonard Blier

2021

View PDFchevron_right

A Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes

Taechoong Chung

IEEE Access

View PDFchevron_right