导航
学术报告|
当前位置:首页  科学研究  学术报告
报告题目: Convergence of two control problems
报 告 人: 张宇鸣 博士
报告人所在单位: 美国Auburn大学
报告日期: 2023-06-12
报告时间: 10:00 - 11:00
报告地点: 光华东主楼2001
   
报告摘要:

We study the exploratory Hamilton--Jacobi--Bellman (HJB) equation arising from the entropy-regularized exploratory control problem, which was formulated by Wang, Zariphopoulou and Zhou (J. Mach. Learn. Res., 21, 2020) in the context of reinforcement learning in continuous time and space. We establish the well-posedness and regularity of the viscosity solution to the equation, and we derive an explicit rate of convergence for this problem as exploration diminishes to zero. If time permitted, I would also discuss the analysis of the policy iteration algorithm used to study the control problem. These are joint works with Xun Yu Zhou, Hung Tran and Wenpin Tang.

学术海报.pdf

   
本年度学院报告总序号: 880

Copyright © |2012 复旦大学数学科学学院版权所有 沪ICP备042465