A learning algorithm for Markov decision processes with adaptive state aggregation
Date: December 1, 2000
Authors :
Baras, John S.
Borkar, V.S.
Conference : Proceedings of the IEEE Conference on Decision and Control, pp. 3351-3356
We propose a simulation-based algorithm for learning good policies for a Markov decision process with unknown transition law, with aggregated states. The state aggregation itself can be adapted on a slower time scale by an auxiliary learning algorithm. Rigorous justifications are provided for both algorithms.
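
To make the idea of learning over aggregated states concrete, here is a minimal sketch in Python. It is not the algorithm from the paper: it is plain tabular Q-learning in which all states in a cluster share one row of Q-values, driven by a uniformly random behavior policy. The chain simulator `simulate_step`, the block-wise map `cluster_of`, and every parameter value are assumptions chosen for illustration. The aggregation map is held fixed here, whereas the abstract describes adapting it on a slower time scale with an auxiliary algorithm.

```python
import random


def simulate_step(state, action, n_states, rng):
    """Hypothetical simulator: noisy walk on a chain, reward 1 at the right end."""
    move = 1 if action == 1 else -1
    if rng.random() < 0.1:  # with probability 0.1 the move is reversed
        move = -move
    next_state = min(max(state + move, 0), n_states - 1)
    reward = 1.0 if next_state == n_states - 1 else 0.0
    return next_state, reward


def aggregated_q_learning(n_states=12, n_clusters=4, n_actions=2,
                          steps=20000, gamma=0.9, seed=0):
    """Q-learning where every state in a cluster shares one row of Q-values."""
    rng = random.Random(seed)
    # Fixed aggregation map (an assumption): contiguous blocks of states.
    cluster_of = [s * n_clusters // n_states for s in range(n_states)]
    q = [[0.0] * n_actions for _ in range(n_clusters)]
    visits = [[0] * n_actions for _ in range(n_clusters)]

    state = rng.randrange(n_states)
    for _ in range(steps):
        c = cluster_of[state]
        # Uniformly random behavior policy; the learner only sees samples,
        # never the transition law itself.
        action = rng.randrange(n_actions)
        next_state, reward = simulate_step(state, action, n_states, rng)

        # Diminishing step size per (cluster, action) pair.
        visits[c][action] += 1
        step_size = 1.0 / visits[c][action] ** 0.6

        # Standard Q-learning target, evaluated on the next state's cluster.
        target = reward + gamma * max(q[cluster_of[next_state]])
        q[c][action] += step_size * (target - q[c][action])
        state = next_state
    return q, cluster_of


if __name__ == "__main__":
    q, cluster_of = aggregated_q_learning()
    for c, row in enumerate(q):
        print(f"cluster {c}: Q(left)={row[0]:.3f}  Q(right)={row[1]:.3f}")
```

The learned table has one row per cluster rather than one per state, which is the memory saving that aggregation buys. An adaptive scheme would additionally update `cluster_of` using a much smaller step size than the one used for `q`.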