Abstract：This paper presents a speech enhancement algorithm that combines the ensemble empirical mode decomposition (EEMD) algorithm and the K-singular value decomposition (K-SVD) dictionary-training algorithm. The EEMD algorithm is used to obtain the intrinsic mode function (IMF) components from noisy speech. The cross-correlations and autocorrelations of each IMF are calculated from the IMF components to filter out the noisy IMF components. The transition IMF components are again decomposed with EEMD to further remove the noisy component. The remained IMFs and transition IMFs are superimposed to generate the de-noised speech. An over-complete dictionary is then trained from the clean speech by the K-SVD dictionary training algorithm. The de-noised speech is then sparse decomposed with the over-complete dictionary to obtain the enhanced speech by recovering the speech signal from sparse coefficient vectors. Tests show that the algorithm achieves better de-noising than the traditional spectral subtraction, wavelet threshold de-noising and K-SVD dictionary-training algorithms for both low signal-to-noise ratio (SNR) and high SNR environments.
Chen J, Benesty J, Huang Y, et al. New insights into the noise reduction Wiener filter[J]. IEEE Transactions on Audio, Speech, and Language Processing, 2006, 14(4):1218-1234.
Boll S. Suppression of acoustic noise in speech using spectral subtraction[J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979, 27(2):113-120.
Donoho D L. De-noising by soft-thresholding[J]. IEEE Transactions on Information Theory, 1995, 41(3):613-627.
王文波, 张晓东, 汪祥莉. 基于主成分分析的经验模态分解消噪方法[J]. 电子学报, 2013, 41(7):1425-1430. WANG Wenbo, ZHANG Xiaodong, WANG Xiangli. Empirical mode decomposition de-noising method based on principal component analysis[J]. Chinese Journal of Electronics, 2013, 41(7):1425-1430. (in Chinese)
石光明, 刘丹华, 高大化, 等. 压缩感知理论及其研究进展[J]. 电子学报, 2009, 37(5):1070-1081.SHI Guangming, LIU Danhua, GAO Dahua, et al. Advances in theory and application of compressed sensing[J]. Chinese Journal of Electronics, 2009, 37(5):1070-1081. (in Chinese)
Donoho D L. Compressed sensing[J]. IEEE Transactions on Information Theory, 2006, 52(4):1289-1306.
Michal A, Michael E, Alfred B. <em>K</em>-SVD:An algorithm for designing overcomplete dictionaries for sparse representation[J]. IEEE Transactions on Signal Processing, 2006, 54(11):4311-4322.
Huang N E, Shen Z, Long S R, et al. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis[C]//Proceedings of the Royal Society of London A:Mathematical, Physical and Engineering Sciences. London, UK:Royal Society, 1998:903-995.
Wu Z, Huang N E. Ensemble empirical mode decomposition:A noise-assisted data analysis method[J]. Advances in Adaptive Data Analysis, 2009, 1(1):1-41.
Karahanoglu N B, Erdogan H. A<sup>*</sup> Orthogonal matching pursuit:Best-first search for compressed sensing signal recovery[J]. Digital Signal Processing, 2012, 22(4):555-568.
Engan K, Aase S O, Husøy J H. Multi-frame compression:Theory and design[J]. Signal Processing, 2000, 80(10):2121-2140.
李月, 彭蛟龙, 马海涛, 等. 过渡内蕴模态函数对经验模态分解去噪结果的影响研究及改进算法[J]. 地球物理学报, 2013, 56(2):626-634. LI Yue, PENG Jiaolong, MA Haitao, et al. Study of the influence of transition IMF on EMD do-noising and improved algorithm[J]. Chinese Journal of Geophysics, 2013, 56(2):626-634. (in Chinese)
Donoho D L, Elad M, Temlyakov V N. Stable recovery of sparse overcomplete representations in the presence of noise[J]. IEEE Transactions on Information Theory, 2006, 52(1):6-18.
Li Z, Tan E C, McLoughlin I, et al. Proposal of standards for intelligibility tests of Chinese speech[J]. IEEE Proceedings-Vision, Image and Signal Processing, 2000, 147(3):254-260.