High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

Text Independent Speech Recognition

Author(s):

Aniruddha Mohanty , Self

Keywords:

GMM, MFCCs, LPC, FFT, DFT, FT

Abstract

Voice is one of the well-researched biometrics for speaker authentication. So, in the literature a number of techniques have been proposed which make use of speaker’s voice to verify their identity, control access to services, utilize information services, voice mailing, security control for confidential information areas, and remote access to computers. Speaker recognition is the task of automatically recognizing the speaker among several reference speakers using speaker-specific information embedded in speech waves. The extraction of this information is termed as features, and matching process are implemented right after the pre-processing of the signal. In literature, techniques have been proposed where Mel-Frequency Cepstral Coefficients (MFCCs) are taken as features for modeling by the Gaussian Mixture Model (GMM) during the identification process. The GMM algorithm is one of the clustering algorithms employed for the text independent speaker recognition. In this context, by adopting the idea of hierarchical clustering and GMM, an adaptive clustering algorithm, i.e., AGMM that can determine the number of optimal clusters automatically has been proposed. Numerical experiments demonstrate that the AGMM achieves better performance than the traditional GMM where the number of clusters is fixed.

Other Details

Paper ID: IJSRDV6I100112
Published in: Volume : 6, Issue : 10
Publication Date: 01/01/2019
Page(s): 132-136

Article Preview

Download Article