A New Front-End for Classification of Non-Speech Sounds: A Study on Human Whistle

DSpace/Manakin Repository

A New Front-End for Classification of Non-Speech Sounds: A Study on Human Whistle

Show full item record

Title: A New Front-End for Classification of Non-Speech Sounds: A Study on Human Whistle
Author(s):
Nandwana, Mahesh Kumar (UT Dallas);
Bořil, Hynek (UT Dallas);
Hansen, John H. L. (UT Dallas)
Item Type: article
Keywords: Show Keywords
Abstract: Speech/non-speech sound classification is an important problem in audio diarization, audio document retrieval and advanced human interfaces. The focus of this study is on the development of spectral and temporal acoustic features for speech/non-speech sound classification based on production differences in speech versus whistle. Seven time- and frequency-domain based features are investigated. Performance of the proposed feature set for the task of speech/whistle classification is evaluated at frame level. This evaluation utilizes support vector machine (SVM) models and Gaussian mixture models (GMM) for back-end classifiers. At the frame-level, the proposed front-end fusion gives an absolute performance gain of +15.0% and +3.1% over MFCC with SVM and GMM based classifiers, respectively. This research will benefit the development of intelligent speech interfaces for identification, recognition, and speech coding, as a preprocessing step for real world audio streams.
Publisher: International Speech and Communication Association
ISSN: 2308-457X (ISSN)
Persistent Link: http://hdl.handle.net/10735.1/5061
Bibliographic Citation: Nandwana, M. K., H. Bořil, and J. H. L. Hansen. 2015. "A new front-end for classification of non-speech sounds: A study on human whistle." INTERSPEECH 2015 (16th Annual Conference of the International Speech Communication Association) 2015, p. 1982-1986.
Terms of Use: ©2015 ISCA

Files in this item

Files Size Format View
JECS-3626-4679.56.pdf 357.3Kb PDF View/Open Article

This item appears in the following Collection(s)


Show full item record