Viseme Recognition System Based on Transformed Acoustic Models

A. Zgank; Z. Kacic

doi:10.5755/j01.eee.19.9.5657

Viseme Recognition System Based on Transformed Acoustic Models

Authors

A. Zgank Institute of Electronics and Telecommunications
Z. Kacic Institute of Electronics and Telecommunications

DOI:

https://doi.org/10.5755/j01.eee.19.9.5657

Keywords:

Automatic speech recognition, hidden Markov models, human computer interaction, viseme modeling

Abstract

Viseme recognition from speech is one of the methods needed to operate a talking head system, which can be used in various areas, such as mobile services and applications, gaming, the entertainment industry, and so on. This paper proposes a novel method for generating acoustic models for viseme recognition from speech. The viseme acoustic models were generated using transformations from trained phoneme acoustic models. The proposed transformation method is language-independent; only the available speech resources are needed. The viseme sequence with corresponding time information was produced as a result of recognition using context-dependent acoustic models. The evaluation of the proposed acoustic models’ transformation method was carried out on a test scenario with phonetically balanced words, in which the results were compared to the baseline viseme recognition system. The improvement in viseme accuracy was statistically significant when using the proposed method for transforming acoustic models.

DOI: http://dx.doi.org/10.5755/j01.eee.19.9.5657

Published

2013-11-07

Issue

Vol. 19 No. 9 (2013)

Section

SYSTEM ENGINEERING, COMPUTER TECHNOLOGY

License

The copyright for the paper in this journal is retained by the author(s) with the first publication right granted to the journal. The authors agree to the Creative Commons Attribution 4.0 (CC BY 4.0) agreement under which the paper in the Journal is licensed.

By virtue of their appearance in this open access journal, papers are free to use with proper attribution in educational and other non-commercial settings with an acknowledgement of the initial publication in the journal.

How to Cite

Zgank, A., & Kacic, Z. (2013). Viseme Recognition System Based on Transformed Acoustic Models. Elektronika Ir Elektrotechnika, 19(9), 93-96. https://doi.org/10.5755/j01.eee.19.9.5657