On Optimization Formulations and Algorithms of Markov Decision Problems

Lexing Ying, Stanford University (Mathematics and ICME)

SCAIM Seminar

January 24, 2023, 12:00 pm, ESB 4133

Markov decision problems and reinforcement learning have been active research areas in the past decade. Compared with the rapid algorithmic developments, the linear/convex programming formulations of the Markov decision problem are less well known. In the first part of the talk, we will discuss the convex optimization formulations of Markov decision problems in the primal, dual, and primal-dual forms. In the second part of the talk, we will present two new algorithms that are inspired by these optimization formulations and exhibit exponential or even super-exponential convergence.

Pizza lunch will be provided.

We acknowledge financial support from the Pacific Institute for the Mathematical Sciences (PIMS) and the UBC Institute of Applied Mathematics (IAM).

Institute of Applied Mathematics, Vancouver Campus, 311-6356 Agricultural Road, University of British Columbia, Vancouver, BC, Canada V6T 1Z2. Website: www.iam.ubc.ca. Email: iam@iam.ubc.ca