Tone training for Mandarin two-syllable words based on pitch projection synthesized speech
XIE Yanlu1, ZHANG Bei2, ZHANG Jinsong1,2
1. College of Information Science, Beijing Language and Culture University, Beijing 100083, China;
2. Center for Studies of Chinese as a Second Language, Beijing Language and Culture University, Beijing 100083, China
Abstract：This study uses the pitch projection method to synthesize teaching speech with the appropriate standard voice. The teaching speech is synthesized by turning lexicon tones in the learners' speech into standard tones, while keeping the segments and timbie unchanged. This simplifies the complex variations in the speech signal except for the tones. Then, the system is used for tone training Japanese students based on the synthesized Mandarin two-syllable words. The training results show that this synthesized speech method is superior to a standard voice method with improved perception and production, as well as generalized production. The training results for the synthesized speech method are far better than a control group without training. Most of the results are statistically significant. Tests also show the existence of a selective attention mechanism in the human brain when learning speech. Thus, this study provides an experimental and theoretical basis for speech synthesized methods to be integrated into computer-assisted Mandarin tone learning systems.
解焱陆, 张蓓, 张劲松. 基于音高映射合成语音的汉语双字调声调训练[J]. 清华大学学报（自然科学版）, 2017, 57(2): 170-175.
XIE Yanlu, ZHANG Bei, ZHANG Jinsong. Tone training for Mandarin two-syllable words based on pitch projection synthesized speech. Journal of Tsinghua University(Science and Technology), 2017, 57(2): 170-175.
TANG Min, WANG Chao, Seneff S. Voice transformations:From speech synthesis to mammalian vocalizations[J]. Proc of the Eurospeech, 2002, 18:357-360.
Probst K, Ke Y, Eskenazi M. Enhancing foreign language tutors:In search of the golden speaker[J]. Speech Communication, 2002, 37(3):161-173.
Nosofsky R M. Attention and learning processes in the identification and categorization of integral stimuli[J]. Journal of Experimental Psychology:Learning, Memory, and Cognition, 1987, 13(1):87-108.
Felps D, Bortfeld H, Gutierrez-Osuna R. Foreign accent conversion in computer assisted pronunciation training[J]. Speech Communication, 2009, 51(10):920-932.
Rodríguez W R, Saz O, Lleida E. A prelingual tool for the education of altered voices[J]. Speech Communication, 2012, 54(5):583-600.
ZHAO Sixuan, Koh S N, Luke K K. Accent reduction for computer-aided language learning[C]//2012 IEEE Proceedings of the 20th European Signal Processing Conference (EUSIPCO). Bucharest, 2012:335-339.
XIE Yanlu, ZHANG Jinsong, SHI Shuju. Standard speaker selection in speech synthesis for Mandarin tone learning[C]//Proceedings of the 2012 International Conference on Information Technology and Software Engineering. Heidelberg, 2013:375-383.
Peabody M, Seneff S. Towards automatic tone correction in non-native Mandarin[C]//International Symposium on Chinese Spoken Language Processing. Singapore, 2006:602-613.
Martin P. WinPitch LTL Ⅱ, a multimodal pronunciation software[C]//InSTIL/ICALL. Venice, 2004.
宋益丹. 对外汉语声调教学策略探索[J]. 语言教学与研究, 2009(3):48-53.SONG Yidan. Strategies on teaching tones in Chinese as a foreign language[J]. Language Teaching and Linguistic Studies, 2009(3):48-53. (in Chinese)
Hussein H, WEI Si, Mixdorff H, et al. Development of a computer-aided language learning system for Mandarin-tone recognition and pronunciation error detection[C]//Proceedings of the Speech Prosody. Chicago, 2010.
Kawahara H, Masuda-Katsuse I, De Cheveigne A. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F<sub>0</sub> extraction:Possible role of a repetitive structure in sounds[J]. Speech Communication, 1999, 27(3):187-207.
CHAO Yuen Ren. A Grammar of Spoken Chinese[M]. Berkeley and Los Angeles:University of California Press, 1968.
薛晶晶. 美国和泰国学习者汉语普通话阳平与上声习得的实验研究[D]. 北京:北京大学, 2013. XUE Jingjing. The Study on Mandarin Tone 2 and Tone 3 by American and Thai Speakers[D]. Beijing:Peking University, 2013. (in Chinese)
太田裕子.日本学生汉语普通话两字调的发音和感知研究[D]. 北京:北京语言大学, 2011.Ota Yuko. A study of Production and Perception of Tone Sandhi of Chinese Disyllables by Japanese Students[D]. Beijing:Beijing Language and Culture University, 2011. (in Chinese)