Please wait a minute...
 首页  期刊介绍 期刊订阅 联系我们 横山亮次奖 百年刊庆
 
最新录用  |  预出版  |  当期目录  |  过刊浏览  |  阅读排行  |  下载排行  |  引用排行  |  横山亮次奖  |  百年刊庆
清华大学学报(自然科学版)  2016, Vol. 56 Issue (11): 1237-1241    DOI: 10.16511/j.cnki.qhdxxb.2016.26.018
  计算机科学与技术 本期目录 | 过刊浏览 | 高级检索 |
基于脸部骨骼位置信息的唇凸度计算方法
潘晓声1, 张梦翰2, Liew Wee Chung3
1. 上海师范大学 信息与机电工程学院, 上海 200234, 中国;
2. 复旦大学 生命科学学院, 上海 200438, 中国;
3. 格里菲斯大学 信息与通讯技术学院, 昆士兰, 澳大利亚
Lip protrusion measurement based on facial skeleton data
PAN Xiaosheng1, ZHANG Menghan2, Liew Wee Chung3
1. The College of Information, Mechenical and Electrical Engineering, Shanghai Normal University, Shanghai 200234, China;
2. School of Life Sciences, Fudan University, Shanghai 200438, China;
3. School of Information and Communication Technology, Griffith University, Queensland, Australia
全文: PDF(2594 KB)  
输出: BibTeX | EndNote (RIS)      
摘要 该文主要讨论了唇凸度的定义和提取方法。根据上、下唇的运动规律不同,该文把上唇和下唇凸度分别定义为上唇或下唇外沿到上或下门齿的Euclid距离。使用运动捕获器获取发音过程中脸部标志点运动的三维坐标信息,运用奇异值分解法消除头部刚体运动和下颌的开口运动,利用置于脸部骨骼的参考点分别推算出上下门齿的空间位置,使用上唇和下唇外沿的坐标位置计算上唇或下唇凸度。结果表明:该计算方法不但在三维唇形数据上测试效果良好,同时也适用于二维唇形数据。
服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
潘晓声
张梦翰
Liew Wee Chung
关键词 唇凸度奇异值分解刚体运动Euclid距离    
Abstract:The paper presents a method to measure lip protrusion. The upper and low lip movement patterns differ, so the lip protrusion is defined for the upper or lower lips as the Euclidean distance between the lip edge and the incisor. Three-dimensional lip coordinates were obtained by observing the trajectories of reference markers on human faces. The singular value decomposition (SVD) method was used to eliminate the head rigid-body movement and mouth opening movement. Then, the coordinates for the upper and lower incisors were obtained by calculating the coordinates of the reference markers pasted on the facial bony structure. Finally, lip edge coordinates were introduced to calculate the lip protrusion. The method gives good results with three-dimensional lip data and is also applicable for analyzing two-dimensional lip data.
Key wordslip protrusion    singular value decomposition (SVD)    rigid motion    Euclidean distance
收稿日期: 2016-06-29      出版日期: 2016-11-15
ZTFLH:  H018  
引用本文:   
潘晓声, 张梦翰, Liew Wee Chung. 基于脸部骨骼位置信息的唇凸度计算方法[J]. 清华大学学报(自然科学版), 2016, 56(11): 1237-1241.
PAN Xiaosheng, ZHANG Menghan, Liew Wee Chung. Lip protrusion measurement based on facial skeleton data. Journal of Tsinghua University(Science and Technology), 2016, 56(11): 1237-1241.
链接本文:  
http://jst.tsinghuajournals.com/CN/10.16511/j.cnki.qhdxxb.2016.26.018  或          http://jst.tsinghuajournals.com/CN/Y2016/V56/I11/1237
  图1 不同的唇凸度定义方法
  图2 人脸标志点位置
  图3 侧脸示意图(左)以及侧面人脸(右)
  图4 普通话“军”字的唇形参数与声学参数
[1] Abry C, Boë L J. ""Laws"" for lips[J]. Speech Communication, 1986, 5(1):97-104.
[2] 王安红. 普通话语音视位系统初探[D]. 北京:北京语言文化大学, 2000.WANG Anhong. Primary Research on Standard Chinese Viesemes[D]. Beijing:Beijing Language and Culture University, 2000. (in Chinese)
[3] 王志明. 汉语视位建模及可视语音的研究[D]. 北京:清华大学, 2003.WANG Zhiming. Research on Modeling Chinese Viseme and Visual Speech[D]. Beijing:Tsinghua University, 2003. (in Chinese)
[4] 吴宗济, 林茂灿. 实验语音学概要[M]. 北京:高等教育出版社, 1989.WU Zongji, LIN Maocan. A Prime of Experimental Phonetics[M]. Beijing:Higher Education Press, 1989. (in Chinese)
[5] Denis B, Badin P, Bailly G. Linear degrees of freedom in speech production:Analysis of cineradio-and labio-film data and articulatory-acoustic modeling[J]. The Journal of the Acoustical Society of America, 2001, 109(5):2165-2180.
[6] Martine T, Maeda S, Carlen A J, et al. Lip protrusion/rounding dissociation in French and English consonants:/w/vs[C]//Proc of ICPhS XV. Barcelona, Spain:ISCA, 2003:1763-1766.
[7] Martinoa J M D, Magalhães L P, Violaro F. Facial animation based on context-dependent visemes[J]. Computers & Graphics, 2006, 30(6):971-980.
[8] 皮昕. 口腔解剖生理[M]. 北京:人民卫生出版社, 2007.PI Xin. Oral Anatomy and Physiology[M]. Beijing:People's Medical Publishing House, 2007. (in Chinese)
[9] Mermelstein P. Articulatory model for the study of speech production[J]. The Journal of the Acoustical Society of America, 1973, 53(4):1070-1082.
[10] Arun K S, Huang T S, Blostein S D. Least-squares fitting of two 3-D point sets[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1987, 9(5):698-700.
[11] Fant G. Acoustic Theory of Speech Production:with Calculations Based on X-ray Studies of Russian Articulations[M]. Berlin:Walter de Gruyter, 1971.
[12] Kass M, Witkin A, Terzopoulos D. Snakes:Active contour models[J]. International Journal of Computer Vision, 1988, 1(4):321-331.
[1] 甘振业, 陈浩, 杨鸿武. 结合EEMD与K-SVD字典训练的语音增强算法[J]. 清华大学学报(自然科学版), 2017, 57(3): 286-292.
[2] 邢安昊, 张鹏远, 潘接林, 颜永红. 基于SVD的DNN裁剪方法和重训练[J]. 清华大学学报(自然科学版), 2016, 56(7): 772-776.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 《清华大学学报(自然科学版)》编辑部
本系统由北京玛格泰克科技发展有限公司设计开发 技术支持:support@magtech.com.cn