COMPUTER SCIENCE AND TECHNOLOGY |
|
|
|
|
|
Prosody modeling for Uyghur TTS |
Gulmire Imam1, Guljamal Mamateli2, Maynur Ablitip3, Askar Hamdulla4 |
1. School of Literature, Xinjiang Normal University, Urumqi 830054, China;
2. School of Mathematical Sciences, Xinjiang Normal University, Urumqi 830054, China;
3. Xinjiang Normal University Library, Urumqi 830054, China;
4. Institute of Information Science and Engineering, Xinjiang University, Urumqi 830046, China |
|
|
Abstract The prosodic features of syllables such as duration, energy, mean pitch, maximum pitch, minimum pitch and pitch range were extracted from a Uyghur text to speech (TTS) database with analyses of their variations for different prosodic hierarchies. The pitch reset, pre-boundary lengthening, and silence duration of different prosodic boundaries were also analyzed. The results of acoustic experiments show that the pitch reset and pre-boundary lengthening are much greater as the prosodic boundary degree increases. No obvious pause can be perceived at the prosodic word (PW) boundary and the average silence duration at the prosodic phrase (PP) and intonation phrase (INP) boundaries are 154.2 and 212.8 ms.
|
Keywords
Uyghur
text to speech (TTS)
prosody structure
acoustic analysis
|
|
Issue Date: 15 December 2017
|
|
|
[1] |
姑丽加玛丽·麦麦提艾力. 基于二级语音基元及其韵律参数的UTTS技术研究与实现[D]. 乌鲁木齐:新疆大学. 2009.Guljamal Mamateli. The Two Level Speech Unit and Their Prosodic Feature Based UTTS Technologies and Implementations[D]. Urumqi:Xinjiang University, 2009. (in Chinese)
|
[2] |
热娜古丽·达古提, 艾斯卡尔·艾木都拉, 地里木拉提·吐尔逊. 维吾尔语CVC型音节韵律特征声学分析[J]. 计算机工程, 2011, 37(9):193-195.Ranagul Dagut, Askar Hamdull, Dilmurat Tursun. Acoustic analysis on prosodic feature of CVC type syllables in Uyghur language[J]. Computer Engineering, 2011, 37(9):193-195.(in Chinese)
|
[3] |
江海燕, 刘岩, 卢莉.维吾尔语词重音实验研究[J]. 民族语文, 2010(3):67-71.JIANG Haiyan, LIU Yan, LU Li. Experimental study on Uyghur accent[J]. Minority Languages of China, 2010(3):67-71. (in Chinese)
|
[4] |
祖丽皮亚·阿曼, 艾斯卡尔·艾木都拉. 维吾尔语双音节词韵律特征声学分析[J]. 中文信息学报, 2009, 23(5):104-107.Zulpiya Aman, Askar Hamdulla. Acoustic analysis of the prosodic features of the disyllabic words in Uyghur language[J]. Journal of Chinese Information Processing, 2009, 23(5):104-107. (in Chinese)
|
[5] |
祖丽皮亚·阿曼, 艾斯卡尔·艾木都拉, 地里木拉提·吐尔逊. 维吾尔语三音节词韵律特征声学分析[J]. 计算机应用, 2009(7):2032-2034.Zulpiya Aman, Askar Hamdulla, Dilmurat Tursun. Acoustic analysis of prosodic features of trisyllabic words in Uyghur language[J]. Journal of Computer Application, 2009(7):2032-2034. (in Chinese)
|
[6] |
古力米热·依玛木,艾斯卡尔·艾木都拉.维吾尔语句韵律层级的人工标注规则研究[C]//第三届全国少数民族青年自然语言信息处理、第二届全国多语言知识库建设联合学术研讨会论文集. 乌鲁木齐, 2010:179-182.Imam Gulmire, Hamdulla Askar. Research on the rules and regulation for manual labeling of prosody levels in Uyghur sentence[C]//The Research and Development of Natural Language Processing Technology Among the Minority Youth -Proceedings of the Third National Minority Youth Natural Language Information Processing and the Second National Multi-lingual Knowledge Base Construction. Urumqi, 2010:179-182.(in Chinese)
|
[7] |
王蓓, 吕士楠, 杨玉芳. 汉语语句中重读音节音高变化模式研究[J]. 声学学报, 2002, 27(3):234-240.WANG Bei, LV Shinan, YANG Yufang. The pitch movement of stressed syllable in Chinese sentences[J]. Acta Acustica, 2002, 27(3):234-240. (in Chinese)
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|