Please wait a minute...
 首页  期刊介绍 期刊订阅 联系我们 横山亮次奖 百年刊庆
 
最新录用  |  预出版  |  当期目录  |  过刊浏览  |  阅读排行  |  下载排行  |  引用排行  |  横山亮次奖  |  百年刊庆
清华大学学报(自然科学版)  2016, Vol. 56 Issue (11): 1173-1178    DOI: 10.16511/j.cnki.qhdxxb.2016.26.007
  电子工程 本期目录 | 过刊浏览 | 高级检索 |
一种改善言语清晰度的子带自适应降噪算法
梁维谦1, 郑方2, 郑佳春1, 朴志刚3
1. 集美大学 信息工程学院, 海上通信与智能电子系统福建省高等学校重点实验室, 厦门 361021;
2. 清华大学 信息技术研究院, 语音和语言技术中心, 北京 100084;
3. 厦门莱亚特医疗器械有限公司, 厦门 361009
Sub-band adaptive noise reduction algorithm to improve speech intelligibility
LIANG Weiqian1, ZHENG Fang2, ZHENG Jiachun1, PIAO Zhigang3
1. Key Laboratory of Maritime Communication and Intelligent Electronic Systems of Fujian Province, College of Information Engineering, Jimei University, Xiamen 361021, China;
2. Center for Speech and Language Technologies, Research Institute of Information Technology, Tsinghua University, Beijing 100084, China;
3. Xiamen LA and Associates Medical Equipment Co., Ltd., Xiamen 361009, China
全文: PDF(1414 KB)  
输出: BibTeX | EndNote (RIS)      
摘要 助听器对声音进行压缩放大,需要高言语清晰度的降噪算法。该文提出了一种子带自适应噪声抑制方法,通过加权重叠相加滤波器组和基于心理声学模型的子带划分、基于先验和后验信噪比的快变的非线性降噪增益、基于噪声声压级估值的慢变的增益下限阈值、基于峰值跟踪的子带增益平滑及其跟踪和释放时间系数的精细选择等算法,明显提高了言语清晰度。主观测听实验表明:该方法对输入的不同信噪比的带噪语音的言语清晰度提高约12%~45%。在EZAIRO5900数字信号处理器上实现了此方法,通过对增益公式的量化处理使得整个算法的运行效率提高约30%。
服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
梁维谦
郑方
郑佳春
朴志刚
关键词 噪声抑制子带非线性增益言语清晰度    
Abstract:Noise reduction algorithms to improve speech intelligibility are needed when sounds are compressed and amplified in hearing aids. A sub-band adaptive noise reduction algorithm was developed with a weighted overlap-add filter bank and psycho-acoustic model for the sub-band splitting. The non-linear noise reduction gains are computed with an estimated a posteriori signal to noise ratio (SNR) and an a priori SNR. The gain floors are determined based on the estimated noise level expressed as the dB sound pressure level (SPL). The final gains are smoothed between the frames by a peak detector with carefully selected attack and release time constants. Listening tests show 12% to 45% improvements in intelligibility by this algorithm for noise corrupted speech. A quantified gain table is also used to replace the non-linear gain computing when the algorithm is implemented on the EZAIRO5900 digital signal processor, with the execution cycle reduced by about 30%.
Key wordsnoise reduction    sub-band    non-linear gains    speech intelligibility
收稿日期: 2016-05-11      出版日期: 2016-11-15
ZTFLH:  TN912.35  
引用本文:   
梁维谦, 郑方, 郑佳春, 朴志刚. 一种改善言语清晰度的子带自适应降噪算法[J]. 清华大学学报(自然科学版), 2016, 56(11): 1173-1178.
LIANG Weiqian, ZHENG Fang, ZHENG Jiachun, PIAO Zhigang. Sub-band adaptive noise reduction algorithm to improve speech intelligibility. Journal of Tsinghua University(Science and Technology), 2016, 56(11): 1173-1178.
链接本文:  
http://jst.tsinghuajournals.com/CN/10.16511/j.cnki.qhdxxb.2016.26.007  或          http://jst.tsinghuajournals.com/CN/Y2016/V56/I11/1173
  图1 子带降噪算法框图
  图2 降噪增益计算的流程图
  图3 式(7)降噪增益的三维图
  图4 先验信噪比ξ(n,k)=0时,式(7)降噪增益与线性增益的对比
  图5 降噪增益下限阈值?F(k)与噪声声压级PN(n,k)的函数关系
  图6 4种算法对言语清晰度的影响对比
  图7 本文方法中不同增益计算方法对言语清晰度的影响对比
  图8 本文方法中不同频带划分对言语清晰度的影响对比
[1] Kates J M. Digital Hearing Aids[M]. San Diego:Plural Publishing INC, 2008.
[2] Loizou P C. Speech Enhancement:Theory and Practice (2nd edition)[M]. Boca Raton:CRC Press, 2013.
[3] Ephraim Y, Malah D. Speech enhancement using a minimum mean square error short-time spectral amplitude estimator[J]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1984, 32(6):1109-1121.
[4] Tsoukalas D E, Mourjopoulos J N, Kokkinakis G. Speech enhancement based on audible noise suppression[J]. IEEE Transactions on Speech and Audio Processing, 1997, 5(6):497-514.
[5] XUE Fengjie, GUO Zhaoyang, WANG Xinan. A multiband noise reduction wiener filter algorithm for hearing aid[C]//Materials and Engineering Technology. Chicago, USA:TRANS TECH Publications INC, 2015:1082-1088.
[6] ON Semiconductor. WOLA Filterbank Coprocessor:Introductory Concepts and Techniques[R/OL].[2015-04-10]. http://www.onsemi.com/pub/Collateral/AND8382-D.PDF.
[7] ON Semiconductor. Hardware Reference Manual for Ezairo 5900, M-20468-009[R]. Waterloo, Ontario, Canada:Semiconductor Components Industries LLC, 2009.
[8] Zakis J A, Hau J, Blamey P J. Environmental noise reduction configuration:Effects on preferences, satisfaction, and speech understanding[J]. International Journal of Audiology, 2009, 48(12):853-867.
[9] Brons I, Houben R, Dreschler W A. Effects of noise reduction on speech intelligibility, perceived listening effort, and personal preference in hearing-impaired listeners[J]. Trends in Hearing, 2014, 18(10):1-10.
[10] Giannoulis D, Massberg M, Reiss J D. Digital dynamic range compressor design-A tutorial and analysis[J]. Journal of Audio Engineering Society, 2012, 60(6):399-408.
[11] 北京同仁医院.普通话言语测听材料[Z/OL].[2015-01-20]. http://www.trhos.com/mstms/index5.asp.Beijing Tongren Hospital. Mandarin Speech Test Materials[Z/OL].[2015-01-20]. http://www.trhos.com/mstms/index5.asp. (in Chinese)
[12] NXP Semiconductors. CoolFlux DSP the Embedded Ultra Low Power C-programmable DSP Core[Z/OL].[2015-03-09]. http://www.coolflux.com/nxp-coolflux-dsp.
[1] 梁维谦, 郑方, 陈朝阳, 陈高鋆. 基于GSPAP的子带自适应声反馈消除算法[J]. 清华大学学报(自然科学版), 2017, 57(7): 707-712.
[2] 孙甲松, 张菁芸, 杨毅. 基于子带频谱质心特征的高效音频指纹检索[J]. 清华大学学报(自然科学版), 2017, 57(4): 382-387.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 《清华大学学报(自然科学版)》编辑部
本系统由北京玛格泰克科技发展有限公司设计开发 技术支持:support@magtech.com.cn