MDP function and sample the state based on b and the transition function. Then, PFSVI selects the observation z whose successor point b^{a*,z} lies farthest from B. By sampling according to the environment, PFSVI can explore a larger region of the belief space than FSVI. Experimental results on four benchmarks show that PFSVI can achieve solutions closer to the global optimum than FSVI and PBVI, especially on large-scale problems.
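The observation-selection step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the nested-list layout `T[a][s][s2]` and `O[a][s2][z]`, and the use of the L1 norm as the distance to the belief set B are all assumptions made for illustration. The abstract does not state which metric is used.

```python
# Hypothetical sketch of "pick the observation z whose successor belief
# b^{a,z} is farthest from the belief set B". All names, the data layout,
# and the L1 metric are assumptions, not taken from the paper.

def belief_update(b, a, z, T, O):
    """Bayes update: b'(s2) is proportional to O[a][s2][z] * sum_s T[a][s][s2] * b[s]."""
    n = len(b)
    unnorm = [
        O[a][s2][z] * sum(T[a][s][s2] * b[s] for s in range(n))
        for s2 in range(n)
    ]
    total = sum(unnorm)
    if total == 0.0:
        return None  # observation z cannot occur after taking a in b
    return [p / total for p in unnorm]


def distance_to_set(b, B):
    """Smallest L1 distance from belief b to any belief in B (assumed metric)."""
    return min(sum(abs(x - y) for x, y in zip(b, other)) for other in B)


def farthest_successor(b, a, T, O, B):
    """Return (z, successor belief, distance) maximizing the distance to B."""
    best_z, best_b, best_d = None, None, -1.0
    for z in range(len(O[a][0])):
        nb = belief_update(b, a, z, T, O)
        if nb is None:
            continue  # skip observations with zero probability
        d = distance_to_set(nb, B)
        if d > best_d:
            best_z, best_b, best_d = z, nb, d
    return best_z, best_b, best_d


# Toy two-state problem: one action that leaves the state unchanged,
# and a sensor that reports the true state with probability 0.9.
T = [[[1.0, 0.0], [0.0, 1.0]]]
O = [[[0.9, 0.1], [0.1, 0.9]]]
B = [[0.5, 0.5], [0.9, 0.1]]
z, nb, d = farthest_successor([0.5, 0.5], 0, T, O, B)
```

In this toy problem the successor for z = 0 is already in B, so the sketch picks z = 1, whose successor belief is not yet covered.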

A Probabilistic Forward Search Value Iteration Algorithm for POMDP

