Dataciencia

SciELO Chile Collection

Reinforcement Learning for Position Control Problem of a Mobile Robot

Indexed in

WoS	WOS:000564191600001
Scopus	SCOPUS_ID:85090599506

DOI

10.1109/ACCESS.2020.3018026

Year

2020

Type

research article


Abstract

As autonomous vehicles grow more complex, most of the existing control systems are proving inadequate. Reinforcement Learning is gaining traction because it is poised to overcome these difficulties in a natural way. In this approach, an agent interacts with the environment and receives rewards for appropriate actions, so it learns to improve its performance continuously. This article describes the design and development of an algorithm that controls the position of a wheeled mobile robot using Reinforcement Learning. One main advantage of this approach over traditional control algorithms is that the learning process is carried out automatically with a recursive procedure forward in time. Moreover, given the fidelity of the model for the particular implementation described in this work, the whole learning process can be carried out in simulation, which avoids damage to the actual robot during the learning stage. The work shows that position control of the robot (or similar specific tasks) can be achieved without explicit knowledge of the dynamic model of the system. The main drawback is that the learning stage can take a long time to finish, depending on the complexity of the task and the availability of adequate hardware resources. This work compares the proposed approach with traditional control laws in both simulation and real environments. The article also discusses the main effects of using different controlled variables on the performance of the developed control law.
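
The abstract does not state which learning algorithm, state variables, or reward the authors used, so the following is only a minimal sketch of the general idea (an agent learning position control in simulation from rewards, without an explicit dynamic model). It assumes tabular Q-learning on a simple unicycle kinematic model with the target at the origin. Every constant and function name here (`V`, `W`, `DT`, `TOL`, `observe`, `discretize`, `step`, `train`) is a hypothetical choice for illustration, not taken from the article.

```python
import math
import random

# All values below are illustrative assumptions, not the article's settings.
V = 0.2                  # constant forward speed (m/s)
W = 1.0                  # turn rate magnitude (rad/s)
DT = 0.1                 # simulation step (s)
TOL = 0.05               # distance at which the target counts as reached (m)
ACTIONS = (-W, 0.0, W)   # angular velocity commands: turn right, straight, turn left
N_DIST, N_ANGLE = 8, 12  # number of bins for distance and heading error


def wrap(angle):
    """Wrap an angle to the interval [-pi, pi]."""
    return math.atan2(math.sin(angle), math.cos(angle))


def observe(pose):
    """Return (distance to the origin, heading error) for pose (x, y, theta)."""
    x, y, theta = pose
    distance = math.hypot(x, y)
    heading_error = wrap(math.atan2(-y, -x) - theta)
    return distance, heading_error


def discretize(distance, heading_error):
    """Map the continuous observation to a pair of Q-table indices."""
    i = min(int(distance / 0.25), N_DIST - 1)
    j = min(int((heading_error + math.pi) / (2 * math.pi) * N_ANGLE), N_ANGLE - 1)
    return i, j


def step(pose, omega):
    """Advance the simulated robot one Euler step; return (pose, reward, done)."""
    x, y, theta = pose
    d_before, _ = observe(pose)
    theta = wrap(theta + omega * DT)
    x += V * math.cos(theta) * DT
    y += V * math.sin(theta) * DT
    new_pose = (x, y, theta)
    d_after, _ = observe(new_pose)
    done = d_after < TOL
    # Reward is the progress toward the target plus a bonus on arrival.
    reward = (d_before - d_after) + (1.0 if done else 0.0)
    return new_pose, reward, done


def train(episodes=500, max_steps=300, alpha=0.2, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behavior policy."""
    rng = random.Random(seed)
    q = [[[0.0] * len(ACTIONS) for _ in range(N_ANGLE)] for _ in range(N_DIST)]
    for _ in range(episodes):
        pose = (rng.uniform(-1, 1), rng.uniform(-1, 1), rng.uniform(-math.pi, math.pi))
        for _ in range(max_steps):
            i, j = discretize(*observe(pose))
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))  # explore
            else:
                a = max(range(len(ACTIONS)), key=lambda k: q[i][j][k])  # exploit
            pose, reward, done = step(pose, ACTIONS[a])
            ni, nj = discretize(*observe(pose))
            target = reward if done else reward + gamma * max(q[ni][nj])
            q[i][j][a] += alpha * (target - q[i][j][a])
            if done:
                break
    return q
```

Calling `q = train()` returns the learned table; the greedy command for a given pose is the entry of `ACTIONS` with the largest value in `q[i][j]`, where `(i, j) = discretize(*observe(pose))`. The learner only ever calls `step`, which is why, in this sketch, the whole training loop runs in simulation and never needs the model's equations explicitly.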

Journal

Journal	ISSN
IEEE Access	2169-3536


Research Disciplines

WOS
Computer Science, Information Systems
Telecommunications
Engineering, Electrical & Electronic

Scopus
Materials Science (All)
Computer Science (All)
Engineering (All)

SciELO
No disciplines


Authors - Affiliation

No.	Author	Gender	Institution - Country
1	FARIAS-CASTRO, GONZALO ALBERTO	Male	Pontificia Universidad Católica de Valparaíso - Chile
2	García-Ros, Gonzalo	Male	Florida Atlantic University - United States
3	Montenegro, G.	-	Pontificia Universidad Católica de Valparaíso - Chile
4	Fabregas, Ernesto	Male	Universidad Nacional de Educación a Distancia - Spain
6	Dormido-Canto, Sebastian	Male	Universidad Nacional de Educación a Distancia - Spain

Affiliation and (detected) gender of the publication's co-authors.

Origin of Identified Citations

Identified citations come from documents included in the DATACIENCIA database.

Identified citations: 5.56 %
Unidentified citations: 94.44 %

Funding

Source
Spanish Ministry of Science and Innovation
Fondo Nacional de Desarrollo Científico y Tecnológico
Spanish Ministry of Economy and Competitiveness
Ministerio de Ciencia e Innovación
Pontificia Universidad Católica del Perú
ANID
Chilean National Research and Development Agency (ANID) through the Project Fondo Nacional de Desarrollo Científico y Tecnológico (FONDECYT)
Dirección de Investigación (DI)-Pontificia Universidad Católica de Valparaíso
Chilean National Research and Development Agency

Acknowledgments

This work was supported in part by the Chilean National Research and Development Agency (ANID) through the Project Fondo Nacional de Desarrollo Científico y Tecnológico (FONDECYT) under Grant 1191188, in part by the Dirección de Investigación (DI)-Pontificia Universidad Católica de Valparaíso under Grant 039.437, in part by the Spanish Ministry of Economy and Competitiveness through the Projects under Grant RTI2018-094665-B-I00 and Grant ENE2015-64914-C3-3-R, and in part by the Spanish Ministry of Science and Innovation through the Project under Grant PID2019-108377RB-C32.
