Voice Liveliness Identification Assisted by Noise Categorization |
Author(s): |
Jayashree G Magadum , Maratha Mandal Engineering College Belagavi; Prof. Sandhya Bevoor , MMEC Belagavi |
Keywords: |
Voice Activity Detection, Perceptual Wavelet Packet Transform, Noise Classification, Support Vector Machine |
Abstract |
Voice Activity Detection (VAD) is a very important front end processing in all Speech and Audio processing applications. The performance of most if not all speech/audio processing methods is crucially dependent on the performance of Voice Activity Detection. Voice activity detection (VAD), is to detect the presence of speech in an audio signal degraded by noise, is widely applied in numerous modern speech communication systems. Since speech signals are non-stationary and contain many transient components, it is appropriate to use, perceptual wavelet packet transform (PWPT) as a tool for feature extraction especially in noisy environments. Voice activity detection (VAD) is a process, which can detect speech and non-speech segments from a audio signal. This method combines a noise robust speech processing feature extraction process together with SVM models trained in different background noises for speech/non-speech classification. A multiclass SVM is also used to classify background noises in order to select SVM model for VAD. |
Other Details |
Paper ID: IJSRDV3I40259 Published in: Volume : 3, Issue : 4 Publication Date: 01/07/2015 Page(s): 416-419 |
Article Preview |
|
|