Muestra métricas de impacto externas asociadas a la publicación. Para mayor detalle:
| Indexado |
|
||||
| DOI | 10.1137/22M1515550 | ||||
| Año | 2024 | ||||
| Tipo | artículo de investigación |
Citas Totales
Autores Afiliación Chile
Instituciones Chile
% Participación
Internacional
Autores
Afiliación Extranjera
Instituciones
Extranjeras
We study a stochastically perturbed version of the well-known Krasnoselskii--Mann iteration for computing fixed points of nonexpansive maps in finite dimensional normed spaces. We discuss sufficient conditions on the stochastic noise and stepsizes that guarantee almost sure convergence of the iterates towards a fixed point and derive nonasymptotic error bounds and convergence rates for the fixed-point residuals. Our main results concern the case of a martingale difference noise with variances that can possibly grow unbounded. This supports an application to reinforcement learning for average reward Markov decision processes, for which we establish convergence and asymptotic rates. We also analyze in depth the case where the noise has uniformly bounded variance, obtaining error bounds with explicit computable constants.
| Ord. | Autor | Género | Institución - País |
|---|---|---|---|
| 1 | BRAVO-GONZALEZ, MARIO | Hombre |
Universidad Adolfo Ibáñez - Chile
|
| 2 | COMINETTI-COTTI-COMETTI, ROBERTO MARIO | Hombre |
Universidad Adolfo Ibáñez - Chile
|
| Fuente |
|---|
| FONDECYT |
| Anillo |
| Fondo Nacional de Desarrollo Científico y Tecnológico |
| FONDE-CYT |
| Agradecimiento |
|---|
| The work of the first author was partially supported by FONDECYT grant 1191924 and by Anillo grant ANID/ACT210005. The work of the second author was supported by FONDECYT grant 1171501. |
| The work of the first author was partially supported by FONDECYT grant 1191924 and by Anillo grant ANID/ACT210005. The work of the second author was supported by FONDECYT grant 1171501. We are deeply indebted to four reviewers for their insightful comments on the original manuscript which prompted us to enhance the scope of the paper, notably by including the application to RVI-Q-learning. We also warmly thank our friend and colleague Crist\u00F3bal Guzm\u00E1n for stimulating discussions on this topic and, in particular, for pointing out the interpretation of (5.1) as an expected value for a random selection of the iterates and the resulting rate O(1/ \\surd4 n0) for constant stepsizes. |
| The work of the first author was partially supported by FONDECYT grant 1191924 and by Anillo grant ANID/ACT210005. The work of the second author was supported by FONDECYT grant 1171501. We are deeply indebted to four reviewers for their insightful comments on the original manuscript which prompted us to enhance the scope of the paper, notably by including the application to RVI-Q-learning. We also warmly thank our friend and colleague Crist\u00F3bal Guzm\u00E1n for stimulating discussions on this topic and, in particular, for pointing out the interpretation of (5.1) as an expected value for a random selection of the iterates and the resulting rate O(1/ \\surd4 n0) for constant stepsizes. |