Synthesis of disordered voices
The context of this project is the clinical assessment of voice. Auditory and acoustic
assessment of voice (and by extension speech) is to laryngology and speech therapy
what electrocardiography is to cardiology and electroencephalography to neurology.
That is, it reports on the function of the laryngeal oscillator and the adequacy of the
produced vocal timbre by relying on methods of investigation that are non-invasive and
do not obstruct the patient’s production of speech. Synthetic speech contributes to that purpose
as a computational tool that facilitates tests and training and helps to explore and understand the genesis of
abnormal vocal qualities.
A disordered voice is a voice that is perceived as abnormal with regard to pitch, loudness or timbre, and is often the consequence of a laryngeal pathology or some physiological dysfunction. In this project, we develop a synthesizer of speech sounds which is capable of simulating the timbre of disordered voices with an acceptable level of naturalness. We follow a physics-based strategy using models of the vocal fold vibration, glottal airflow and acoustic wave propagation in the vocal tract.
Dr. Jean Schoentgen, Université Libre de Bruxelles, Belgium.
Dr. Mara Behlau, Center for Voice Studies and Federal University of São Paulo.
Dr. Glaucya Madazio, Center for Voice Studies, São Paulo.
Henrique Rodrigues Costa, undergraduate, University of Brasília.
Marina T. Englert, Master's student, Federal University of São Paulo.
Elias Amadeus, B.Sc. in Computer Science, 2015, University of Brasília.
M. Englert, G. Madazio, I. Gielow, J. C. Lucero & M. Behlau. “Perceptual error identification of human and synthesized voices”, Journal of Voice, accepted.
J. C. Lucero, J. Schoentgen, J. Haas, P. Luizard & X. Pelorson. “Self-entrainment of the right and left vocal fold oscillators”, Journal of the Acoustical Society of America, 137, 2036-2046 (2015). PDF (*copyright note) | Info
J. Schoentgen, S. Fraj & J. C. Lucero. “Testing the reliability of Grade, Roughness and Breathiness scores by means of synthetic speech stimuli”, Logopedics Phoniatrics Vocology 40, 5-13 (2015). Info
J. C. Lucero, J. Schoentgen & M. Behlau. “Physics-based synthesis of disordered voices”, Interspeech 2013 (Lyon, France, 2013).
PDF | Synthesized samples
J. C. Lucero & J. Schoentgen. “Modeling vocal fold asymmetries with coupled van der Pol oscillators”, 21st International Congress on Acoustics - ICA (Montreal, Canada, 2013).
PDF (*copyright note) | Info
J. Schoentgen & J. C. Lucero. “Synthesis by rule of disordered voices”, in: T. Drugman & T. Dutoit (eds.), Advances in Nonlinear Speech Processing (Lecture Notes in Artificial Intelligence), Springer, 120-127 (2013).
M. Englert, G. Madazio, I. Gielow, J. C. Lucero & M. Behlau. “Perceptual error identification of human and synthesized voices”, 44th Annual Symposium of the Voice Foundation (Philadelphia, USA, 2015). Poster
PDF (*copyright note) | Info
J. Schoentgen & J. C. Lucero. “Un modèle simple du débit du flux d'air glottique interactif”, 6ièmes Journées de Phonétique Clinique (Montpellier, France, 2015).
M. Behlau, G. Madazio, M. Englert, J. C. Lucero & J. Schoentgen. “Vocal fold model for disordered voice synthesis to develop auditory perception in speech-language pathologists”, Annual Convention of the American Speech-Language-Hearing Association (Orlando, USA, 2014).
M. Englert, G. Madazio, I. Gielow, J. C. Lucero & M. Behlau. “Erro de identificação perceptiva de vozes naturais e sintetizadas por ouvintes especialistas”, XXII Congresso Brasileiro de Fonoaudiologia (Joinville, 2014).
M. Behlau, G. Madazio, M. Englert, J. C. Lucero & J. Schoentgen. “A vocal fold model for disordered voice synthesis to develop auditory perception in speech language pathologists”, XXII Annual Pacific Voice Conference (Krakow, Poland, 2014). Invited presentation.
J. C. Lucero, J. Schoentgen, M. Behlau & G. Madazio. “A vocal fold model for disordered voice synthesis”, 9th International Conference on Voice Physiology and Biomechanics (Salt Lake City, USA, 2014). Poster and examples
J. Schoentgen & J. C. Lucero. “Solving the Riccati–Titze equation of the glottal airflow rate”, 9th International Conference on Voice Physiology and Biomechanics (Salt Lake City, USA, 2014).
J. Schoentgen & J. C. Lucero. “Is formant frequency jitter audible?”, 9th International Conference on Voice Physiology and Biomechanics (Salt Lake City, USA, 2014).
J. C. Lucero. “Physics-based synthesis of disordered voices”, Buenos Aires Voice Meeting 2013 (Buenos Aires, Argentina, 2013).
J. Schoentgen & J. C. Lucero. “Is jitter irregularity the missing link between measured speech cycle length jitter and perceived vocal timbre?”, 10th Pan-European Voice Conference – PEVOC (Prague, Czech Republic, 2013).
M. Behlau, G. Madazio, J. C. Lucero & J. Schoentgen. “Um novo paradigma no ensino da avaliação auditiva da voz: Uso de amostras sintetizadas”, XXI Congresso Brasileiro e II Ibero-Americano de Fonoaudiologia (Porto de Galinhas, September 2013), 157-160. PDF
* Copyright Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America.