A New Soft Masking Method for Speech Enhancement in the Frequency Domain
DOI:
https://doi.org/10.5755/j01.eee.20.2.3957
Keywords:
Ideal binary mask (IdBM), threshold, speech quality and intelligibility, residual noise
Abstract
Recently, the ideal binary mask (IdBM) method has attracted keen interest because of its superiority in improving speech intelligibility. The method processes noisy speech in time-frequency (T-F) units: if the local signal-to-noise ratio (SNR) of a unit is higher than a threshold, the unit is retained; otherwise, it is removed. The method works well in the field of computational auditory scene analysis (CASA). However, because the threshold is usually low, much residual noise remains. In addition, an accurate local SNR is difficult to obtain in practice. In this paper, we propose a new method to improve speech quality and intelligibility. Instead of finding a new way to estimate the local SNR, we compute the probability that the local SNR is higher than the threshold. We then multiply each T-F unit by a suitable value to suppress the residual noise. Experimental results show that the method performs well.
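The contrast between the hard IdBM decision and a probability-weighted soft mask can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not state how the probability is computed or which gain is applied, so the logistic function, the `slope` parameter, and all function names below are assumptions chosen only for illustration. The sketch also assumes that clean-speech and noise power are known per T-F unit, which holds only in the "ideal" setting.

```python
import math


def local_snr_db(speech_power, noise_power, eps=1e-12):
    """Local SNR in dB for one T-F unit (eps avoids log of zero)."""
    return 10.0 * math.log10((speech_power + eps) / (noise_power + eps))


def ideal_binary_mask(speech, noise, threshold_db=0.0):
    """Hard mask: 1.0 where the local SNR exceeds the threshold, else 0.0."""
    return [
        [1.0 if local_snr_db(s, n) > threshold_db else 0.0
         for s, n in zip(s_row, n_row)]
        for s_row, n_row in zip(speech, noise)
    ]


def soft_mask(speech, noise, threshold_db=0.0, slope=1.0):
    """Soft mask in (0, 1).

    ASSUMPTION: a logistic function of (local SNR - threshold) stands in
    for the probability that the local SNR exceeds the threshold. The
    paper's actual probability model is not given in the abstract.
    """
    return [
        [1.0 / (1.0 + math.exp(-slope * (local_snr_db(s, n) - threshold_db)))
         for s, n in zip(s_row, n_row)]
        for s_row, n_row in zip(speech, noise)
    ]


def apply_mask(noisy, mask):
    """Multiply each T-F unit of the noisy spectrum by its mask gain."""
    return [
        [x * g for x, g in zip(x_row, g_row)]
        for x_row, g_row in zip(noisy, mask)
    ]


# One frame with three frequency bins: speech-dominated, balanced, noise-dominated.
speech_power = [[4.0, 1.0, 0.01]]
noise_power = [[1.0, 1.0, 1.0]]

hard = ideal_binary_mask(speech_power, noise_power)
soft = soft_mask(speech_power, noise_power)
```

In this sketch the hard mask discards the balanced and noise-dominated bins outright, whereas the soft mask attenuates them in proportion to the assumed probability, so a bin near the threshold is scaled rather than kept or removed entirely.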
Published
2014-01-28
How to Cite
Zhao, H., Liu, J., Chen, Z., & Wang, F. (2014). A New Soft Masking Method for Speech Enhancement in the Frequency Domain. Elektronika Ir Elektrotechnika, 20(2), 58-63. https://doi.org/10.5755/j01.eee.20.2.3957
Section
SIGNAL TECHNOLOGY
License
The copyright for the paper in this journal is retained by the author(s) with the first publication right granted to the journal. The authors agree to the Creative Commons Attribution 4.0 (CC BY 4.0) agreement under which the paper in the Journal is licensed.
By virtue of their appearance in this open access journal, papers are free to use with proper attribution in educational and other non-commercial settings with an acknowledgement of the initial publication in the journal.