Estimation for the location of multiple moving sound sources in small-distance dual-microphone

GONG Qin, LIU Yi

Journal of Tsinghua University(Science and Technology) ›› 2016, Vol. 56 ›› Issue (8) : 901-907.

BIOLOGICAL SCIENCE AND ENGINEERING

Abstract

Angle measurements based on traditional auditory analysis with dual microphones have limited accuracy when the two microphones are close together and the sound sources are moving. This paper presents a method for localizing multiple moving sound sources with two closely spaced microphones. The method uses an improved gammatone filter bank for time-frequency analysis of the mixed sound samples, extracts the zero-crossing time difference (ZCTD), which carries the source location information, and then statistically analyzes the ZCTD with a Gaussian function to localize the sound sources. Tests show that the method gives accurate angle estimates for various kinds of sound sources moving at various speeds along various trajectories.
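
As an illustration only, the three stages named in the abstract (gammatone filtering, ZCTD extraction, Gaussian statistics) can be sketched in Python with NumPy. This is not the authors' implementation. It assumes a standard rather than the improved gammatone filter, a single frequency channel, a far-field model sin(theta) = c * ZCTD / d, and a single Gaussian summarized by its sample mean and standard deviation, so it covers one stationary source only. All function names and parameter values (2 cm spacing, 48 kHz sampling, 1 kHz channel) are hypothetical.

```python
import numpy as np

C = 343.0  # assumed speed of sound in air, m/s

def gammatone_ir(fc, fs, order=4, duration=0.025):
    """Impulse response of a standard gammatone filter centred at fc (Hz)."""
    t = np.arange(int(duration * fs)) / fs
    erb = 24.7 * (4.37 * fc / 1000.0 + 1.0)  # equivalent rectangular bandwidth
    b = 1.019 * erb
    g = t ** (order - 1) * np.exp(-2 * np.pi * b * t) * np.cos(2 * np.pi * fc * t)
    return g / np.sum(np.abs(g))

def upward_zero_crossings(x, fs):
    """Times (s) of negative-to-positive crossings, linearly interpolated."""
    i = np.where((x[:-1] < 0) & (x[1:] >= 0))[0]
    frac = x[i] / (x[i] - x[i + 1])
    return (i + frac) / fs

def zctd(left, right, fs, max_delay):
    """Zero-crossing time differences (right minus left), one per left crossing."""
    tl = upward_zero_crossings(left, fs)
    tr = upward_zero_crossings(right, fs)
    if tl.size == 0 or tr.size < 2:
        return np.empty(0)
    idx = np.clip(np.searchsorted(tr, tl), 1, tr.size - 1)
    prev, nxt = tr[idx - 1], tr[idx]
    nearest = np.where(np.abs(tl - prev) <= np.abs(nxt - tl), prev, nxt)
    dt = nearest - tl
    return dt[np.abs(dt) <= max_delay]  # discard physically impossible delays

def estimate_angle(left, right, fs, fc, spacing):
    """Return (angle in degrees, ZCTD standard deviation in seconds)."""
    ir = gammatone_ir(fc, fs)
    n = len(left)
    # Drop the first len(ir) samples so only fully filtered output is used.
    fl = np.convolve(left, ir)[:n][len(ir):]
    fr = np.convolve(right, ir)[:n][len(ir):]
    dt = zctd(fl, fr, fs, max_delay=spacing / C)
    if dt.size == 0:
        raise ValueError("no usable zero crossings in this channel")
    mu, sigma = dt.mean(), dt.std()  # single-Gaussian summary of the ZCTDs
    angle = np.degrees(np.arcsin(np.clip(C * mu / spacing, -1.0, 1.0)))
    return angle, sigma

# Synthetic check: one source at 30 degrees, microphones 2 cm apart.
fs, spacing, true_angle = 48000, 0.02, 30.0
tau = spacing * np.sin(np.radians(true_angle)) / C  # right channel lags by tau
t = np.arange(int(0.2 * fs)) / fs

def source(time):
    return np.sin(2 * np.pi * 1000 * time) + 0.3 * np.sin(2 * np.pi * 1200 * time)

left, right = source(t), source(t - tau)
angle, sigma = estimate_angle(left, right, fs, fc=1000.0, spacing=spacing)
print(f"estimated angle: {angle:.2f} deg, ZCTD spread: {sigma:.2e} s")
```

In this sketch a positive angle means the sound reaches the left microphone first. Handling several moving sources, as the paper does, would need at least per-frame processing across many filter channels and a multi-peak statistical model in place of the single mean.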

Key words

signal acoustics / dual-microphone / moving sound source / gammatone filter bank / zero-crossing time difference / Gaussian function

Cite this article

GONG Qin, LIU Yi. Estimation for the location of multiple moving sound sources in small-distance dual-microphone[J]. Journal of Tsinghua University(Science and Technology). 2016, 56(8): 901-907. https://doi.org/10.16511/j.cnki.qhdxxb.2016.25.032
