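Fundamental frequency characteristics of “dearing” as emotional speech

KONG Jiangping, LIN Youran

Journal of Tsinghua University (Science and Technology), 2016, Vol. 56, Issue 11: 1149-1153.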

DOI: 10.16511/j.cnki.qhdxxb.2016.26.003
Electronic Engineering

Abstract

Dearing (发嗲) is a special kind of emotional speech. From the standpoint of emotion classification, this study argues that dearing is not simply a mood or an attitude but a mode of speech marked by strong emotional activity. The study extracts the fundamental frequency (f0) characteristics of dearing and finds that the most salient one is a raised f0. The rise is not a uniform shift of the whole contour: it varies with tone category, speaker gender, and vowel, and it is accompanied by changes in the shape of the f0 contour and in the f0 range. Speech synthesis and perceptual identification experiments were used to test the role of the f0 change; they show that the raised f0 prominently reflects the activity of dearing on the arousal dimension of emotional speech. Raising f0 is a key factor in dearing, but it is neither the only feature nor a sufficient condition for listeners to identify it.
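The observation that the f0 rise depends on tone category rather than being one constant increment can be illustrated with a small sketch. The code below is not the authors' procedure: it assumes that paired mean f0 values (neutral vs. dearing) are already available per syllable, expresses each rise in semitones, and averages by group. The function names, the group labels `T1` and `T3`, and all numeric values are hypothetical and invented purely for illustration.

```python
import math
from collections import defaultdict


def semitone_shift(f0_neutral_hz, f0_dearing_hz):
    """Return the f0 change in semitones from neutral to dearing speech."""
    return 12.0 * math.log2(f0_dearing_hz / f0_neutral_hz)


def mean_shift_by_group(samples):
    """Average the semitone shift per group label (e.g. tone category).

    samples: iterable of (group, f0_neutral_hz, f0_dearing_hz) tuples.
    """
    shifts = defaultdict(list)
    for group, neutral, dearing in samples:
        shifts[group].append(semitone_shift(neutral, dearing))
    return {g: sum(v) / len(v) for g, v in shifts.items()}


# Hypothetical values, invented for illustration only (not data from the paper).
samples = [
    ("T1", 200.0, 400.0),
    ("T1", 220.0, 440.0),
    ("T3", 150.0, 225.0),
]
result = mean_shift_by_group(samples)
```

Semitones are used here because they compare ratios rather than raw Hz differences, which makes rises comparable across speakers with different baseline pitch. If the per-group averages differ, as they do for these made-up values, the rise is not a single uniform shift.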

Key words

emotional speech / dearing / fundamental frequency (f0)

Cite this article

KONG Jiangping, LIN Youran. Fundamental frequency characteristics of “dearing” as emotional speech[J]. Journal of Tsinghua University (Science and Technology), 2016, 56(11): 1149-1153. https://doi.org/10.16511/j.cnki.qhdxxb.2016.26.003
CLC number: H017

