Computing behavioral distances, compositionally (original) (raw)
Abstract
ABSTRACT We propose a general definition of composition operator on Markov Decision Processes with rewards (MDPs) and identify a well behaved class of operators, called safe, that are guaranteed to be non-extensive with respect to the bisimilarity pseudometrics of Ferns et al. [10], which measure behavioral similarities between MDPs. For MDPs built using safe/non-extensive operators, we present the first method that exploits the structure of the system for (exactly) computing the bisimilarity distance on MDPs. Experimental results show significant improvements upon the non-compositional technique.
Giorgio Bacci hasn't uploaded this paper.
Let Giorgio know you want this paper to be uploaded.
Ask for this paper to be uploaded.