Automatic Segmetation of Phonemes using Artificial Neural Networks

Authors

  • J. Kamarauskas Institute of Mathematics and Informatics

Abstract

Automatic segmentation of phonemes is often used in speech technology. The purpose of this research is to find how the perceptron and back-propagation artificial neural networks (that can assimilate linear and non-linear connection of the pattern) distinguish between different phonemes, using various features of the speech signal used in speech or speaker recognition tasks: coefficients of linear prediction coding (LPC), cepstral coefficients, and coefficients of the Fourier transform (energy density spectrum). Artificial neural networks can be used for setting the start and end points of the word, too. They can separate not only voiced frames of the signal from noise, but also non-voiced, whose spectrum and that of noise are similar. Experiments were carried out and we can affirm that in order to segment the phonemes all the feature vectors used are suitable. However, if we want to separate different phonemes out of noise by automatically setting the start and end points of the word, the coefficients of the Fourier transform are most suitable, meanwhile cepstral coefficients do not fit. Ill. 8, bibl. 7 (in Lithuanian; summaries in English, Russian and Lithuanian).

Downloads

Published

2006-10-20

How to Cite

Kamarauskas, J. (2006). Automatic Segmetation of Phonemes using Artificial Neural Networks. Elektronika Ir Elektrotechnika, 72(8), 39-42. Retrieved from https://eejournal.ktu.lt/index.php/elt/article/view/10786

Issue

Section

T 170 ELECTRONICS