Modeling of the tongue tip in Standard Chinese using MRI
WANG Gaowu1, DANG Jianwu2, KONG Jiangping3
1. School of Chinese Language and Literature, Beijing Normal University, Beijing 100875, China;
2. School of Computer Science and Technology, Tianjin University, Tianjin 300072, China;
3. Department of Chinese Language and Literature, Peking University, Beijing 100871, China
Abstract: Tongue tip motion in Standard Chinese was modeled using articulatory data from magnetic resonance imaging (MRI). An MRI articulatory database was developed for Standard Chinese, covering 9 vowels and 75 consonant variants. Principal component analysis (PCA) of the tongue shape was then used to identify articulatory factors. The results show that the tongue should be divided into the tongue tip and the tongue body, and that the two should be modeled separately for more precise results. The tongue tip motion is modeled with two articulatory parameters, "tongue tip protrude" and "tongue tip raise", which represent the protruding/advancing and raising/retroflexing movements of the tongue tip, respectively.
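As a rough illustration of the PCA step mentioned in the abstract, the sketch below extracts the dominant shape factors from a set of flattened tongue contours. It is not the authors' pipeline: the contours are synthetic, the two deformation modes (a protrusion-like and a raising-like pattern) are invented for the example, and the function name `pca_tongue_shapes` is hypothetical. Only the sample count of 84 (9 vowels plus 75 consonant variants) is taken from the abstract.

```python
import numpy as np


def pca_tongue_shapes(contours, n_components=2):
    """PCA of flattened contours via SVD.

    contours: array of shape (n_samples, 2 * n_points), each row holding the
    x coordinates followed by the y coordinates of one contour (assumed layout).
    Returns the mean shape, the principal components, the per-sample scores,
    and the fraction of variance explained by each retained component.
    """
    X = np.asarray(contours, dtype=float)
    mean_shape = X.mean(axis=0)
    centered = X - mean_shape
    # Rows of Vt are orthonormal directions ordered by decreasing variance.
    _, s, Vt = np.linalg.svd(centered, full_matrices=False)
    components = Vt[:n_components]
    scores = centered @ components.T
    explained = (s ** 2) / np.sum(s ** 2)
    return mean_shape, components, scores, explained[:n_components]


# Synthetic stand-in data (assumption): 84 contours of 20 points each.
rng = np.random.default_rng(0)
n_samples, n_points = 84, 20
t = np.linspace(0.0, 1.0, n_points)  # 0 = tongue root end, 1 = tongue tip end

base = np.concatenate([t, np.sin(np.pi * t)])           # neutral shape
protrude = np.concatenate([t ** 2, np.zeros(n_points)])  # tip moves forward
raise_ = np.concatenate([np.zeros(n_points), t ** 2])    # tip moves upward

weights = rng.normal(size=(n_samples, 2))
noise = 0.01 * rng.normal(size=(n_samples, 2 * n_points))
contours = base + weights @ np.vstack([protrude, raise_]) + noise

mean_shape, components, scores, explained = pca_tongue_shapes(contours)
print("variance explained by the first two components:", explained)
```

Because the synthetic data are built from exactly two deformation modes plus small noise, the first two components capture nearly all of the variance. With real MRI contours the number of meaningful factors would have to be judged from the data.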
WANG Gaowu, DANG Jianwu, KONG Jiangping. Modeling of the tongue tip in Standard Chinese using MRI[J]. Journal of Tsinghua University (Science and Technology), 2017, 57(2): 158-163.