Muestra métricas de impacto externas asociadas a la publicación. Para mayor detalle:
| Indexado |
|
||||
| DOI | 10.1109/ICASSP.2017.7953120 | ||||
| Año | 2017 | ||||
| Tipo | proceedings paper |
Citas Totales
Autores Afiliación Chile
Instituciones Chile
% Participación
Internacional
Autores
Afiliación Extranjera
Instituciones
Extranjeras
This study analyzes signals recorded using a neck-surface accelerometer from subjects producing speech with different voice modes. The purpose is to explore if the recorded waveforms can capture the glottal vibratory patterns which can be related to the movement of the vocal folds and thus voice quality. The accelerometer waveforms do not contain the supraglottal resonances, and these characteristics make the proposed method suitable for real-life voice quality assessment and monitoring as it does not breach patient privacy. The experiments with a Gaussian mexture model classifier demonstrate that different voice qualities produce distinctly different accelerometer waveforms. The system achieved 80.2% and 89.5% for frame-and utterance-level accuracy, respectively, for classifying among modal, breathy, pressed, and rough voice modes using a speaker-dependent classifier. Finally, the article presents characteristic waveforms for each modality and discusses their attributes.
| Revista | ISSN |
|---|---|
| 2019 Ieee International Conference On Acoustics, Speech And Signal Processing (Icassp) | 1520-6149 |
| Ord. | Autor | Género | Institución - País |
|---|---|---|---|
| 1 | Borsky, Michal | Hombre |
Reykjavik Univ - Islandia
Reykjavik University - Islandia |
| 2 | Cocude, Marion | Mujer |
Reykjavik Univ - Islandia
Reykjavik University - Islandia |
| 3 | Mehta, Daryush D. | - |
MASSACHUSETTS GEN HOSP - Estados Unidos
Massachusetts General Hospital - Estados Unidos |
| 4 | ZANARTU-SALAS, MATIAS | Hombre |
Universidad Técnica Federico Santa María - Chile
|
| 5 | Gudnason, Jon | - |
Reykjavik Univ - Islandia
Reykjavik University - Islandia |
| 6 | IEEE | Corporación |
| Fuente |
|---|
| Voice Health Institute |
| National Institutes of Health (NIH) National Institute on Deafness and other Communication Disorders |
| Icelandic Centre for Research (RANNIS) under the project Model-based speech production analysis and voice quality assessment |
| Agradecimiento |
|---|
| This work is sponsored by The Icelandic Centre for Research (RANNIS) under the project Model-based speech production analysis and voice quality assessment, Grant No 152705-051. The work was performed during Marion Cocude's Erasmus+ traineeship at Reykjavik University. The authors thank J. H. Van Stan for data collection and pereptual evaluation. Additional support received from the Voice Health Institute and the National Institutes of Health (NIH) National Institute on Deafness and Other Communication Disorders under Grant R33 DC011588 (PI: R. E. Hillman). The paper's contents are solely the responsibility of the authors and do not necessarily represent the official views of the NIH. |