该文主要讨论了唇凸度的定义和提取方法。根据上、下唇的运动规律不同,该文把上唇和下唇凸度分别定义为上唇或下唇外沿到上或下门齿的Euclid距离。使用运动捕获器获取发音过程中脸部标志点运动的三维坐标信息,运用奇异值分解法消除头部刚体运动和下颌的开口运动,利用置于脸部骨骼的参考点分别推算出上下门齿的空间位置,使用上唇和下唇外沿的坐标位置计算上唇或下唇凸度。结果表明:该计算方法不但在三维唇形数据上测试效果良好,同时也适用于二维唇形数据。
Abstract
The paper presents a method to measure lip protrusion. The upper and low lip movement patterns differ, so the lip protrusion is defined for the upper or lower lips as the Euclidean distance between the lip edge and the incisor. Three-dimensional lip coordinates were obtained by observing the trajectories of reference markers on human faces. The singular value decomposition (SVD) method was used to eliminate the head rigid-body movement and mouth opening movement. Then, the coordinates for the upper and lower incisors were obtained by calculating the coordinates of the reference markers pasted on the facial bony structure. Finally, lip edge coordinates were introduced to calculate the lip protrusion. The method gives good results with three-dimensional lip data and is also applicable for analyzing two-dimensional lip data.
关键词
唇凸度 /
奇异值分解 /
刚体运动 /
Euclid距离
Key words
lip protrusion /
singular value decomposition (SVD) /
rigid motion /
Euclidean distance
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] Abry C, Boë L J. ""Laws"" for lips[J]. Speech Communication, 1986, 5(1):97-104. [2] 王安红. 普通话语音视位系统初探[D]. 北京:北京语言文化大学, 2000.WANG Anhong. Primary Research on Standard Chinese Viesemes[D]. Beijing:Beijing Language and Culture University, 2000. (in Chinese) [3] 王志明. 汉语视位建模及可视语音的研究[D]. 北京:清华大学, 2003.WANG Zhiming. Research on Modeling Chinese Viseme and Visual Speech[D]. Beijing:Tsinghua University, 2003. (in Chinese) [4] 吴宗济, 林茂灿. 实验语音学概要[M]. 北京:高等教育出版社, 1989.WU Zongji, LIN Maocan. A Prime of Experimental Phonetics[M]. Beijing:Higher Education Press, 1989. (in Chinese) [5] Denis B, Badin P, Bailly G. Linear degrees of freedom in speech production:Analysis of cineradio-and labio-film data and articulatory-acoustic modeling[J]. The Journal of the Acoustical Society of America, 2001, 109(5):2165-2180. [6] Martine T, Maeda S, Carlen A J, et al. Lip protrusion/rounding dissociation in French and English consonants:/w/vs[C]//Proc of ICPhS XV. Barcelona, Spain:ISCA, 2003:1763-1766. [7] Martinoa J M D, Magalhães L P, Violaro F. Facial animation based on context-dependent visemes[J]. Computers & Graphics, 2006, 30(6):971-980. [8] 皮昕. 口腔解剖生理[M]. 北京:人民卫生出版社, 2007.PI Xin. Oral Anatomy and Physiology[M]. Beijing:People's Medical Publishing House, 2007. (in Chinese) [9] Mermelstein P. Articulatory model for the study of speech production[J]. The Journal of the Acoustical Society of America, 1973, 53(4):1070-1082. [10] Arun K S, Huang T S, Blostein S D. Least-squares fitting of two 3-D point sets[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1987, 9(5):698-700. [11] Fant G. Acoustic Theory of Speech Production:with Calculations Based on X-ray Studies of Russian Articulations[M]. Berlin:Walter de Gruyter, 1971. [12] Kass M, Witkin A, Terzopoulos D. Snakes:Active contour models[J]. International Journal of Computer Vision, 1988, 1(4):321-331.