基于脸部骨骼位置信息的唇凸度计算方法

潘晓声; 张梦翰; Liew Wee Chung

doi:10.16511/j.cnki.qhdxxb.2016.26.018

PDF(2594 KB)

清华大学学报（自然科学版） ›› 2016, Vol. 56 ›› Issue (11) : 1237-1241. DOI: 10.16511/j.cnki.qhdxxb.2016.26.018

计算机科学与技术

基于脸部骨骼位置信息的唇凸度计算方法

潘晓声¹, 张梦翰², Liew Wee Chung³

作者信息 +

Lip protrusion measurement based on facial skeleton data

PAN Xiaosheng¹, ZHANG Menghan², Liew Wee Chung³

Author information +

文章历史 +

摘要

该文主要讨论了唇凸度的定义和提取方法。根据上、下唇的运动规律不同，该文把上唇和下唇凸度分别定义为上唇或下唇外沿到上或下门齿的Euclid距离。使用运动捕获器获取发音过程中脸部标志点运动的三维坐标信息，运用奇异值分解法消除头部刚体运动和下颌的开口运动，利用置于脸部骨骼的参考点分别推算出上下门齿的空间位置，使用上唇和下唇外沿的坐标位置计算上唇或下唇凸度。结果表明：该计算方法不但在三维唇形数据上测试效果良好，同时也适用于二维唇形数据。

Abstract

The paper presents a method to measure lip protrusion. The upper and low lip movement patterns differ, so the lip protrusion is defined for the upper or lower lips as the Euclidean distance between the lip edge and the incisor. Three-dimensional lip coordinates were obtained by observing the trajectories of reference markers on human faces. The singular value decomposition (SVD) method was used to eliminate the head rigid-body movement and mouth opening movement. Then, the coordinates for the upper and lower incisors were obtained by calculating the coordinates of the reference markers pasted on the facial bony structure. Finally, lip edge coordinates were introduced to calculate the lip protrusion. The method gives good results with three-dimensional lip data and is also applicable for analyzing two-dimensional lip data.

导出引用

潘晓声, 张梦翰, Liew Wee Chung. 基于脸部骨骼位置信息的唇凸度计算方法[J]. 清华大学学报（自然科学版）. 2016, 56(11): 1237-1241 https://doi.org/10.16511/j.cnki.qhdxxb.2016.26.018

PAN Xiaosheng, ZHANG Menghan, Liew Wee Chung. Lip protrusion measurement based on facial skeleton data[J]. Journal of Tsinghua University(Science and Technology). 2016, 56(11): 1237-1241 https://doi.org/10.16511/j.cnki.qhdxxb.2016.26.018

中图分类号： H018

参考文献

[1] Abry C, Boë L J. ""Laws"" for lips[J]. Speech Communication, 1986, 5(1):97-104. [2] 王安红. 普通话语音视位系统初探[D]. 北京:北京语言文化大学, 2000.WANG Anhong. Primary Research on Standard Chinese Viesemes[D]. Beijing:Beijing Language and Culture University, 2000. (in Chinese) [3] 王志明. 汉语视位建模及可视语音的研究[D]. 北京:清华大学, 2003.WANG Zhiming. Research on Modeling Chinese Viseme and Visual Speech[D]. Beijing:Tsinghua University, 2003. (in Chinese) [4] 吴宗济, 林茂灿. 实验语音学概要[M]. 北京:高等教育出版社, 1989.WU Zongji, LIN Maocan. A Prime of Experimental Phonetics[M]. Beijing:Higher Education Press, 1989. (in Chinese) [5] Denis B, Badin P, Bailly G. Linear degrees of freedom in speech production:Analysis of cineradio-and labio-film data and articulatory-acoustic modeling[J]. The Journal of the Acoustical Society of America, 2001, 109(5):2165-2180. [6] Martine T, Maeda S, Carlen A J, et al. Lip protrusion/rounding dissociation in French and English consonants:/w/vs[C]//Proc of ICPhS XV. Barcelona, Spain:ISCA, 2003:1763-1766. [7] Martinoa J M D, Magalhães L P, Violaro F. Facial animation based on context-dependent visemes[J]. Computers & Graphics, 2006, 30(6):971-980. [8] 皮昕. 口腔解剖生理[M]. 北京:人民卫生出版社, 2007.PI Xin. Oral Anatomy and Physiology[M]. Beijing:People's Medical Publishing House, 2007. (in Chinese) [9] Mermelstein P. Articulatory model for the study of speech production[J]. The Journal of the Acoustical Society of America, 1973, 53(4):1070-1082. [10] Arun K S, Huang T S, Blostein S D. Least-squares fitting of two 3-D point sets[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1987, 9(5):698-700. [11] Fant G. Acoustic Theory of Speech Production:with Calculations Based on X-ray Studies of Russian Articulations[M]. Berlin:Walter de Gruyter, 1971. [12] Kass M, Witkin A, Terzopoulos D. Snakes:Active contour models[J]. International Journal of Computer Vision, 1988, 1(4):321-331.

PDF(2594 KB)

Accesses

Citation

Detail

段落导航

收稿日期	出版日期
2016-06-29	2016-11-15
发布日期
2016-11-15

选择文件类型/文献管理软件名称

选择包含的内容

摘要

Abstract

关键词

Key words

引用本文

{{custom_sec.title}}

{{custom_sec.title}}

参考文献

访问统计

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

摘要

Abstract

关键词

Key words

引用本文

{{custom_sec.title}}

{{custom_sec.title}}

参考文献

访问统计