Please wait a minute...
 首页  期刊介绍 期刊订阅 联系我们
 
最新录用  |  预出版  |  当期目录  |  过刊浏览  |  阅读排行  |  下载排行  |  引用排行  |  百年期刊
Journal of Tsinghua University(Science and Technology)    2017, Vol. 57 Issue (2) : 170-175     DOI: 10.16511/j.cnki.qhdxxb.2017.22.010
INFORMATION ENGINEERING |
Tone training for Mandarin two-syllable words based on pitch projection synthesized speech
XIE Yanlu1, ZHANG Bei2, ZHANG Jinsong1,2
1. College of Information Science, Beijing Language and Culture University, Beijing 100083, China;
2. Center for Studies of Chinese as a Second Language, Beijing Language and Culture University, Beijing 100083, China
Download: PDF(1107 KB)  
Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks    
Abstract  This study uses the pitch projection method to synthesize teaching speech with the appropriate standard voice. The teaching speech is synthesized by turning lexicon tones in the learners' speech into standard tones, while keeping the segments and timbie unchanged. This simplifies the complex variations in the speech signal except for the tones. Then, the system is used for tone training Japanese students based on the synthesized Mandarin two-syllable words. The training results show that this synthesized speech method is superior to a standard voice method with improved perception and production, as well as generalized production. The training results for the synthesized speech method are far better than a control group without training. Most of the results are statistically significant. Tests also show the existence of a selective attention mechanism in the human brain when learning speech. Thus, this study provides an experimental and theoretical basis for speech synthesized methods to be integrated into computer-assisted Mandarin tone learning systems.
Keywords phonetic teaching      language learning      speech synthesis      pitch projection      tone     
ZTFLH:  H193.2  
  TN912.33  
  H116.4  
Issue Date: 15 February 2017
Service
E-mail this article
E-mail Alert
RSS
Articles by authors
XIE Yanlu
ZHANG Bei
ZHANG Jinsong
Cite this article:   
XIE Yanlu,ZHANG Bei,ZHANG Jinsong. Tone training for Mandarin two-syllable words based on pitch projection synthesized speech[J]. Journal of Tsinghua University(Science and Technology), 2017, 57(2): 170-175.
URL:  
http://jst.tsinghuajournals.com/EN/10.16511/j.cnki.qhdxxb.2017.22.010     OR     http://jst.tsinghuajournals.com/EN/Y2017/V57/I2/170
  
  
  
  
  
  
  
  
[1] TANG Min, WANG Chao, Seneff S. Voice transformations:From speech synthesis to mammalian vocalizations[J]. Proc of the Eurospeech, 2002, 18:357-360.
[2] Probst K, Ke Y, Eskenazi M. Enhancing foreign language tutors:In search of the golden speaker[J]. Speech Communication, 2002, 37(3):161-173.
[3] Nosofsky R M. Attention and learning processes in the identification and categorization of integral stimuli[J]. Journal of Experimental Psychology:Learning, Memory, and Cognition, 1987, 13(1):87-108.
[4] Felps D, Bortfeld H, Gutierrez-Osuna R. Foreign accent conversion in computer assisted pronunciation training[J]. Speech Communication, 2009, 51(10):920-932.
[5] Rodríguez W R, Saz O, Lleida E. A prelingual tool for the education of altered voices[J]. Speech Communication, 2012, 54(5):583-600.
[6] ZHAO Sixuan, Koh S N, Luke K K. Accent reduction for computer-aided language learning[C]//2012 IEEE Proceedings of the 20th European Signal Processing Conference (EUSIPCO). Bucharest, 2012:335-339.
[7] XIE Yanlu, ZHANG Jinsong, SHI Shuju. Standard speaker selection in speech synthesis for Mandarin tone learning[C]//Proceedings of the 2012 International Conference on Information Technology and Software Engineering. Heidelberg, 2013:375-383.
[8] Peabody M, Seneff S. Towards automatic tone correction in non-native Mandarin[C]//International Symposium on Chinese Spoken Language Processing. Singapore, 2006:602-613.
[9] Martin P. WinPitch LTL Ⅱ, a multimodal pronunciation software[C]//InSTIL/ICALL. Venice, 2004.
[10] 宋益丹. 对外汉语声调教学策略探索[J]. 语言教学与研究, 2009(3):48-53.SONG Yidan. Strategies on teaching tones in Chinese as a foreign language[J]. Language Teaching and Linguistic Studies, 2009(3):48-53. (in Chinese)
[11] Hussein H, WEI Si, Mixdorff H, et al. Development of a computer-aided language learning system for Mandarin-tone recognition and pronunciation error detection[C]//Proceedings of the Speech Prosody. Chicago, 2010.
[12] Kawahara H, Masuda-Katsuse I, De Cheveigne A. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F<sub>0</sub> extraction:Possible role of a repetitive structure in sounds[J]. Speech Communication, 1999, 27(3):187-207.
[13] CHAO Yuen Ren. A Grammar of Spoken Chinese[M]. Berkeley and Los Angeles:University of California Press, 1968.
[14] 薛晶晶. 美国和泰国学习者汉语普通话阳平与上声习得的实验研究[D]. 北京:北京大学, 2013. XUE Jingjing. The Study on Mandarin Tone 2 and Tone 3 by American and Thai Speakers[D]. Beijing:Peking University, 2013. (in Chinese)
[15] 太田裕子.日本学生汉语普通话两字调的发音和感知研究[D]. 北京:北京语言大学, 2011.Ota Yuko. A study of Production and Perception of Tone Sandhi of Chinese Disyllables by Japanese Students[D]. Beijing:Beijing Language and Culture University, 2011. (in Chinese)
[1] SHANG Manxia, LI Yiran, JIANG Ling, LI Dongfang, HUANG Zhong, ZHNAG Man, LÜ Junfu, KE Xiwei. Study on the effect of limestone on NOx emission in circulating fluidized bed combustion and particle size optimization[J]. Journal of Tsinghua University(Science and Technology), 2023, 63(12): 2033-2041.
[2] CHEN Daoxiang, LIN Peng, DING Peng, LI Guo, CHEN Tao, YU Zhuojing. Vibro-stone column filling schemes based on Group AHP[J]. Journal of Tsinghua University(Science and Technology), 2022, 62(12): 1915-1921.
[3] CAO Chong, XIE Yanlu, ZHANG Jinsong. Influence on tone perception from vowels with different formant distributions[J]. Journal of Tsinghua University(Science and Technology), 2018, 58(4): 352-356.
[4] FU Ruibo, TAO Jianhua, LI Ya, WEN Zhengqi. Automatic prosodic boundary labeling based on fusing the silence duration with the lexical features[J]. Journal of Tsinghua University(Science and Technology), 2018, 58(1): 61-66,74.
[5] WU Yuhao, JIA Jia, ZHANG Xiulong, CAI Lianhong. Assessment of pure tone audiometry based on multiple judgments[J]. Journal of Tsinghua University(Science and Technology), 2017, 57(3): 234-239.
[6] XIE Chenwei, SHI Feng, WEN Baoying. Perception of Cantonese tones[J]. Journal of Tsinghua University(Science and Technology), 2017, 57(3): 299-305.
[7] GAO Yingying, ZHU Weibin. Describing and predicting affective messages for expressive speech synthesis[J]. Journal of Tsinghua University(Science and Technology), 2017, 57(2): 202-207.
[8] GONG Qin, LIU Yi. Estimation for the location of multiple moving sound sources in small-distance dual-microphone[J]. Journal of Tsinghua University(Science and Technology), 2016, 56(8): 901-907.
[9] GU Wentao. Error patterns in fundamental frequency contours of L2 Mandarin utterances by Cantonese and English learners[J]. Journal of Tsinghua University(Science and Technology), 2016, 56(11): 1166-1172.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
Copyright © Journal of Tsinghua University(Science and Technology), All Rights Reserved.
Powered by Beijing Magtech Co. Ltd