Convergence of two control problems

Title:	Convergence of two control problems
Speaker:	Dr. Yuming Zhang (张宇鸣)
Affiliation:	Auburn University, USA
Date:	2023-06-12
Time:	10:00 - 11:00
Venue:	Room 2001, Guanghua East Main Building (光华东主楼)

Abstract:	We study the exploratory Hamilton-Jacobi-Bellman (HJB) equation arising from the entropy-regularized exploratory control problem, which was formulated by Wang, Zariphopoulou and Zhou (J. Mach. Learn. Res., 21, 2020) in the context of reinforcement learning in continuous time and space. We establish the well-posedness and regularity of the viscosity solution to the equation, and we derive an explicit rate of convergence for this problem as the level of exploration tends to zero. If time permits, I will also discuss the analysis of the policy iteration algorithm used to study the control problem. This is joint work with Xun Yu Zhou, Hung Tran and Wenpin Tang.

School seminar number for this year:	880