Approximate policy iteration:a survey and somenew methods-中国期刊网

摘要 Weconsidertheclassicalpolicyiterationmethodofdynamicprogramming(DP),whereapproximationsandsimulationareusedtodealwiththecurseofdimensionality.Wesurveyanumberofissues:convergenceandrateofconvergenceofapproximatepolicyevaluationmethods,singularityandsusceptibilitytosimulationnoiseofpolicyevaluation,explorationissues,constrainedandenhancedpolicyiteration,policyoscillationandchattering,andoptimisticanddistributedpolicyiteration.Ourdiscussionofpolicyeva...

1Warren B.POWELL. A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications.控制理论与控制工程,2011-03.
2Greg FODERARO;Vikram RAJU;Silvia FERRARI. A model-based approximate λ-policy iteration approach to online evasive path planning and the video game Ms.Pac-Man.控制理论与控制工程,2011-03.
3Yongxin Dong;Chuanqing Gu. ON PMHSS ITERATION METHODS FOR CONTINUOUS SYLVESTER EQUATIONS.计算数学,2017-05.
4LU Xing-jiang;QIAN Chun. Some geometrical iteration methods for nonlinear equations.基础数学,2008-01.
5Guang-wei Yuan Xu-deng Hang. ACCELERATION METHODS OF NONLINEAR ITERATION FOR NONLINEAR PARABOLIC EQUATIONS.计算数学,2006-03.
6白中治. ON THE MONOTONE CONVERGENCE OF THE PROJECTED ITERATION METHODS FOR LINEAR COMPLEMENTARITY PROBLEMS.基础数学,1996-02.
7罗兴钧;陈仲英. MULTILEVEL ITERATION METHODS FOR SOLVING LINEAR ILL-POSED PROBLEMS.基础数学,2005-03.
8Abdellah Bnouhachem. A New Inexactness Criterion for Approximate Logarithmic-Quadratic Proximal Methods.基础数学,2006-01.
9Zhong-Zhi Bai. ON HERMITIAN AND SKEW-HERMITIAN SPLITTING ITERATION METHODS FOR CONTINUOUS SYLVESTER EQUATIONS.计算数学,2011-02.
10Fang Chen;Yaolin Jiang;Qingquan Liu. ON STRUCTURED VARIANTS OF MODIFIED HSS ITERATION METHODS FOR COMPLEX TOEPLITZ LINEAR SYSTEMS.计算数学,2013-01.

Approximate policy iteration:a survey and somenew methods

来源期刊

相关推荐

同分类资源更多

相关关键词

Approximate policy iteration:a survey and somenew methods

来源期刊

相关推荐

同分类资源 更多

相关关键词

同分类资源更多