Speaker Recognition using Excitation Source Parameters
Abstract
Excitation signal is used in speaker recognition. It corresponds to the frequency of oscillation of vocal cords and is one of the speaker's characteristics. Although this feature gives worse recognition results compared to the vocal tract parameters, but it is more robust to various distortions in the recording channels. As a result, pitch is commonly used in forensic investigations, where different recording channels is one of the main problems. Currently, the pitch distribution generally is modeled using histograms and calculating various distances or similarity measures between two histograms. However, pitch distribution is not Gaussian and view of the histograms and comparison results depend on the number of classes used. We model pitch distribution using Gaussian mixture models (GMM), and calculate similarity and distance measures between the GMM approximations of two comparative records. Best results were achieved using symmetric Kullback-Leibler distance.Downloads
Published
2011-01-04
How to Cite
Kamarauskas, J., & Salna, B. (2011). Speaker Recognition using Excitation Source Parameters. Elektronika Ir Elektrotechnika, 107(1), 55-58. Retrieved from https://eejournal.ktu.lt/index.php/elt/article/view/9081
Issue
Section
SYSTEM ENGINEERING, COMPUTER TECHNOLOGY
License
The copyright for the paper in this journal is retained by the author(s) with the first publication right granted to the journal. The authors agree to the Creative Commons Attribution 4.0 (CC BY 4.0) agreement under which the paper in the Journal is licensed.
By virtue of their appearance in this open access journal, papers are free to use with proper attribution in educational and other non-commercial settings with an acknowledgement of the initial publication in the journal.