A New Soft Masking Method for Speech Enhancement in the Frequency Domain

Huan Zhao; Jun Liu; Zuo Chen; Fei Wang

doi:10.5755/j01.eee.20.2.3957

Authors

Huan Zhao Hunan University
Jun Liu Hunan University
Zuo Chen Hunan University
Fei Wang Hunan University

DOI:

https://doi.org/10.5755/j01.eee.20.2.3957

Keywords:

Ideal binary mask (IdBM), threshold, speech quality and intelligibility, residual noise

Abstract

Recently, the ideal binary mask (IdBM) method has attracted keen interest because of its superiority in improving speech intelligibility. The method processes noisy speech one time-frequency (T-F) unit at a time: if the local signal-to-noise ratio (SNR) of a unit is higher than a threshold, the unit is retained; otherwise, it is removed. The method works well in the field of computational auditory scene analysis (CASA). However, because the threshold is usually low, considerable residual noise remains. In addition, an accurate local SNR is difficult to obtain in practice. In this paper, we propose a new method to improve speech quality and intelligibility. Instead of finding a new way to estimate the local SNR, we compute the probability that the local SNR is higher than the threshold. We then multiply each T-F unit by a suitable gain to suppress the residual noise. Experimental results show that the method performs well.
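The masking rule described in the abstract can be illustrated with a short sketch. It is not the authors' implementation: the function names, the default threshold of 0 dB, and the gain floor are assumptions made for illustration, and the soft gain shown here is a generic linear mapping from a given probability, since the abstract does not state how the probability or the gain is computed.

```python
import math

def local_snr_db(speech_power, noise_power, eps=1e-12):
    """Local SNR in dB of one T-F unit from its speech and noise power."""
    return 10.0 * math.log10((speech_power + eps) / (noise_power + eps))

def ideal_binary_mask(speech_power, noise_power, threshold_db=0.0):
    """Binary mask per T-F unit: 1.0 retains the unit, 0.0 removes it.

    The inputs are 2-D lists (frames x frequency bins) of premixed speech
    and noise power. threshold_db=0.0 is an assumed default, not a value
    taken from the paper.
    """
    return [
        [1.0 if local_snr_db(s, n) > threshold_db else 0.0
         for s, n in zip(s_row, n_row)]
        for s_row, n_row in zip(speech_power, noise_power)
    ]

def soft_mask(prob_above_threshold, floor=0.1):
    """Illustrative soft gain: floor + (1 - floor) * p for each unit.

    p is the probability that the local SNR exceeds the threshold.
    This linear mapping and the floor value are hypothetical.
    """
    return [[floor + (1.0 - floor) * p for p in row]
            for row in prob_above_threshold]

def apply_mask(noisy_tf, mask):
    """Multiply each T-F unit of the noisy signal by its mask value."""
    return [[x * m for x, m in zip(x_row, m_row)]
            for x_row, m_row in zip(noisy_tf, mask)]
```

With a binary mask, every unit is either kept unchanged or discarded, so units that pass a low threshold keep all of their noise. A soft mask scales each unit by a value between the floor and 1, which is one way to attenuate residual noise without removing the unit entirely.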

Published

2014-01-28

How to Cite

Zhao, H., Liu, J., Chen, Z., & Wang, F. (2014). A New Soft Masking Method for Speech Enhancement in the Frequency Domain. Elektronika Ir Elektrotechnika, 20(2), 58-63. https://doi.org/10.5755/j01.eee.20.2.3957

Issue

Vol. 20 No. 2 (2014)

Section

SIGNAL TECHNOLOGY

License

The copyright for the paper in this journal is retained by the author(s) with the first publication right granted to the journal. The authors agree to the Creative Commons Attribution 4.0 (CC BY 4.0) agreement under which the paper in the Journal is licensed.

By virtue of their appearance in this open access journal, papers are free to use with proper attribution in educational and other non-commercial settings with an acknowledgement of the initial publication in the journal.
