Physical Task Stress and Speaker Variability in Voice Quality

DSpace/Manakin Repository

Physical Task Stress and Speaker Variability in Voice Quality

Show full item record

Title: Physical Task Stress and Speaker Variability in Voice Quality
Author(s):
Godin, Keith W.;
Hansen, John H. L.
Date Created: 2015-10-08
Item Type: article
Keywords: Show Keywords
Abstract: The presence of physical task stress induces changes in the speech production system which in turn produces changes in speaking behavior. This results in measurable acoustic correlates including changes to formant center frequencies, breath pause placement, and fundamental frequency. Many of these changes are due to the subject’s internal competition between speaking and breathing during the performance of the physical task, which has a corresponding impact on muscle control and airflow within the glottal excitation structure as well as vocal tract articulatory structure. This study considers the effect of physical task stress on voice quality. Three signal processing-based values which include (i) the normalized amplitude quotient (NAQ), (ii) the harmonic richness factor (HRF), and (iii) the fundamental frequency are used to measure voice quality. The effects of physical stress on voice quality depend on the speaker as well as the specific task. While some speakers do not exhibit changes in voice quality, a subset exhibits changes in NAQ and HRF measures of similar magnitude to those observed in studies of soft, loud, and pressed speech. For those speakers demonstrating voice quality changes, the observed changes tend toward breathy or soft voicing as observed in other studies. The effect of physical stress on the fundamental frequency is correlated with the effect of physical stress on the HRF (r = −0.34) and the NAQ (r = −0.53). Also, the inter-speaker variation in baseline NAQ is significantly higher than the variation in NAQ induced by physical task stress. The results illustrate systematic changes in speech production under physical task stress, which in theory will impact subsequent speech technology such as speech recognition, speaker recognition, and voice diarization systems. .
Publisher: Springer International Publishing
ISSN: 1687-4714
Link to Related Resource: http://dx.doi.org/10.1186/s13636-015-0072-7
Persistent Link: http://hdl.handle.net/10735.1/4911
Bibliographic Citation: Godin, K. W., and J. H. L. Hansen. 2015. "Physical task stress and speaker variability in voice quality." EURASIP Journal on Audio, Speech, and Music Processing 2015(29), doi:10.1186/s13636-015-0072-7.
Terms of Use: CC BY 4.0 (Attribution) License
©2015 The Authors
Sponsors: This project was funded by AFRL under contract FA8750-15-1-0205 and partially by the University of Texas at Dallas from the Distinguished University Chair in Telecommunications Engineering held by J.H.L. Hansen.

Files in this item

Files Size Format View
JECS-3626-274223.28.pdf 1.646Mb PDF View/Open Article

This item appears in the following Collection(s)


Show full item record

CC BY 4.0 (Attribution) License Except where otherwise noted, this item's license is described as CC BY 4.0 (Attribution) License