Comparison of Linear Discriminant Analysis Approaches in Automatic Speech Recognition

N. Jakovljevic, D. Miskovic, M. Janev, M. Secujski, V. Delic


Speech recognition systems are commonly modelled by hidden Markov models with Gaussian mixture models as observation density functions. These models have a significant number of parameters, which usually leads to the problem of data sparsity, especially for under-resourced languages such as Serbian. One of the ways to overcome the problem of data sparsity is the reduction of the number of features. Linear discriminant analysis (LDA) and heteroscedastic LDA (HLDA) are two common ways to reduce the dimensionality in an automatic speech recognition task. The paper compares the properties of speech recognition systems for Serbian in which both techniques are applied with variable types of input features as well as the number of output features of (H)LDA. The best results are obtained in the case of HLDA with input vectors consisting of concatenations of feature vectors across 7 successive frames, where each feature vector contains 12 mel frequency cepstral coefficients (MFCCs) and normalized energy, and the number of output features is 32 or 35.



Speech recognition; linear discriminant analysis

Full Text: PDF


  • There are currently no refbacks.

Print ISSN: 1392-1215
Online ISSN: 2029-5731