Lip protrusion measurement based on facial skeleton data
PAN Xiaosheng1, ZHANG Menghan2, Liew Wee Chung3
1. The College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 200234, China;
2. School of Life Sciences, Fudan University, Shanghai 200438, China;
3. School of Information and Communication Technology, Griffith University, Queensland, Australia
Abstract: This paper presents a method for measuring lip protrusion. Because the upper and lower lips move in different patterns, protrusion is defined separately for the upper and lower lip as the Euclidean distance between the lip edge and the incisor. Three-dimensional lip coordinates were obtained by tracking the trajectories of reference markers on human faces. Singular value decomposition (SVD) was used to eliminate rigid-body head movement and mouth-opening movement. The coordinates of the upper and lower incisors were then computed from the coordinates of the reference markers attached to the bony structure of the face. Finally, the lip edge coordinates were introduced to calculate the lip protrusion. The method gives good results with three-dimensional lip data and is also applicable to the analysis of two-dimensional lip data.
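The two computational steps named in the abstract, SVD-based removal of rigid head motion and the Euclidean lip-to-incisor distance, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the standard least-squares fit of two 3-D point sets by SVD, that the bony-structure markers are available both in a reference frame and in the current frame, and that the incisor position is already known in the reference frame. The function names (`rigid_transform`, `remove_head_motion`, `lip_protrusion`) are hypothetical, and the removal of mouth-opening movement is not modeled here.

```python
import numpy as np


def rigid_transform(src, dst):
    """Least-squares rotation R and translation t such that dst ~ src @ R.T + t."""
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    src_c = src.mean(axis=0)
    dst_c = dst.mean(axis=0)
    # 3x3 cross-covariance matrix of the centered point sets
    H = (src - src_c).T @ (dst - dst_c)
    U, _, Vt = np.linalg.svd(H)
    # Flip the last axis if needed so that R is a proper rotation (det = +1)
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_c - R @ src_c
    return R, t


def remove_head_motion(frame_markers, ref_markers, points):
    """Map points from the current frame into the reference head pose.

    frame_markers and ref_markers are the same bony-structure markers
    (assumed rigid) observed in the current frame and the reference frame.
    """
    R, t = rigid_transform(frame_markers, ref_markers)
    return np.asarray(points, dtype=float) @ R.T + t


def lip_protrusion(lip_edge, incisor):
    """Euclidean distance between a lip edge point and an incisor point."""
    return float(np.linalg.norm(np.asarray(lip_edge, dtype=float)
                                - np.asarray(incisor, dtype=float)))
```

In use, the lip edge point of each frame would first be passed through `remove_head_motion` and then compared with the incisor position via `lip_protrusion`. The same functions accept two-dimensional data only if the diagonal correction matrix is adapted to size 2; as written, the sketch handles the three-dimensional case.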