Muestra métricas de impacto externas asociadas a la publicación. Para mayor detalle:
| Indexado |
|
||||
| DOI | 10.21437/INTERSPEECH.2024-1273 | ||||
| Año | 2024 | ||||
| Tipo | proceedings paper |
Citas Totales
Autores Afiliación Chile
Instituciones Chile
% Participación
Internacional
Autores
Afiliación Extranjera
Instituciones
Extranjeras
Human-robot interaction (HRI) is becoming a truly relevant topic imposing many challenges for state-of-the-art speech technology. This paper describes the first evaluation of speech emotion recognition (SER) technology with non-acted speech data recorded in a real indoor HRI scenario using deep learning-based beamforming technologies. The results presented show that deep learning beamforming gives in average an average concordance correlation coefficient (CCC) that is 15.03% higher than the ordinary minimum variance distortionless response (MVDR) beamformer when the SER system was trained with simulated conditions, which included an acoustic model of the testing HRI environment. Training by simulating the test scenarios and testing with real HRI static data provides on average an average CCC that is just 22.5% smaller than the ideal condition where training and testing were performed with the original MSP-Podcast database. This suggests the possibility to train SER engines with methods that emulates complex testing scenarios without recording further data.
| Revista | ISSN |
|---|---|
| 19 Th Annual Conference Of The International Speech Communication Association (Interspeech 2018), Vols 1 6 | 2308-457X |
| Ord. | Autor | Género | Institución - País |
|---|---|---|---|
| 1 | Garcia, Ricardo | - |
Universidad de Chile - Chile
|
| 2 | Mahul, Rodrigo | - |
Universidad de Chile - Chile
|
| 3 | Grágeda, Nicolás | Hombre |
Universidad de Chile - Chile
|
| 4 | Luzanto, Alejandro | Hombre |
Universidad de Chile - Chile
|
| 5 | Bohmer, Nicolas | - |
Universidad de Chile - Chile
|
| 6 | Busso, Carlos | - |
Univ Texas Dallas - Estados Unidos
Erik Jonsson School of Engineering and Computer Science - Estados Unidos |
| 7 | Yoma, Nestor Becerra | - |
Universidad de Chile - Chile
|
| 8 | Int Speech Commun Assoc | Corporación |
| Fuente |
|---|
| Fondo Nacional de Desarrollo Científico y Tecnológico |
| Agencia Nacional de Investigación y Desarrollo |
| ANID/FONDECYT (Chile) |