Journal of Tsinghua University(Science and Technology), 2017, Vol. 57, Issue 12: 1259-1264. DOI: 10.16511/j.cnki.qhdxxb.2017.21.026
COMPUTER SCIENCE AND TECHNOLOGY
Prosody modeling for Uyghur TTS
Gulmire Imam¹, Guljamal Mamateli², Maynur Ablitip³, Askar Hamdulla⁴
1. School of Literature, Xinjiang Normal University, Urumqi 830054, China;
2. School of Mathematical Sciences, Xinjiang Normal University, Urumqi 830054, China;
3. Xinjiang Normal University Library, Urumqi 830054, China;
4. Institute of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
Abstract  Syllable-level prosodic features (duration, energy, mean pitch, maximum pitch, minimum pitch, and pitch range) were extracted from a Uyghur text-to-speech (TTS) database, and their variation across the levels of the prosodic hierarchy was analyzed. Pitch reset, pre-boundary lengthening, and silence duration at the different prosodic boundaries were also analyzed. The acoustic experiments show that pitch reset and pre-boundary lengthening both become much greater as the prosodic boundary level increases. No obvious pause is perceived at prosodic word (PW) boundaries, whereas the average silence durations at prosodic phrase (PP) and intonation phrase (INP) boundaries are 154.2 ms and 212.8 ms, respectively.
Keywords: Uyghur; text to speech (TTS); prosody structure; acoustic analysis
CLC number (ZTFLH): TN912.33
Issue Date: 15 December 2017
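The three boundary measures named in the abstract (pitch reset, pre-boundary lengthening, and silence duration) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration, not the authors' procedure: the `Syllable` record, the `pitch_stats` and `boundary_measures` helpers, the definition of pitch reset as the mean pitch of the following syllable minus the mean pitch of the pre-boundary syllable, and the definition of lengthening as a duration ratio against syllables that precede no boundary. The numeric values are toy data, not measurements from the paper.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Dict, List, Tuple


@dataclass
class Syllable:  # hypothetical record, not the paper's data format
    duration_ms: float      # syllable duration
    f0_hz: List[float]      # pitch samples inside the syllable
    boundary: str           # boundary after the syllable: "none", "PW", "PP", "INP"
    pause_ms: float = 0.0   # silence following the syllable


def pitch_stats(syl: Syllable) -> Dict[str, float]:
    """Mean, maximum, minimum pitch and pitch range of one syllable."""
    lo, hi = min(syl.f0_hz), max(syl.f0_hz)
    return {"mean": mean(syl.f0_hz), "max": hi, "min": lo, "range": hi - lo}


def boundary_measures(syllables: List[Syllable]) -> Dict[str, Tuple[float, ...]]:
    """Average (pitch reset, lengthening ratio, silence) per boundary label."""
    # Assumed baseline: mean duration of syllables that precede no boundary.
    base = mean(s.duration_ms for s in syllables if s.boundary == "none")
    rows: Dict[str, list] = {}
    for cur, nxt in zip(syllables, syllables[1:]):
        if cur.boundary == "none":
            continue
        # Assumed definition: pitch jump from pre- to post-boundary syllable.
        reset = mean(nxt.f0_hz) - mean(cur.f0_hz)
        lengthening = cur.duration_ms / base
        rows.setdefault(cur.boundary, []).append((reset, lengthening, cur.pause_ms))
    # Average each of the three measures column-wise within a boundary label.
    return {label: tuple(mean(col) for col in zip(*vals)) for label, vals in rows.items()}


# Toy utterance (invented values, for illustration only).
toy = [
    Syllable(150.0, [200.0, 210.0], "none"),
    Syllable(150.0, [190.0, 200.0], "PW"),
    Syllable(150.0, [200.0, 210.0], "none"),
    Syllable(225.0, [160.0, 180.0], "PP", pause_ms=150.0),
    Syllable(150.0, [210.0, 230.0], "none"),
]
result = boundary_measures(toy)
```

Here `result` maps each boundary label to a tuple of averaged pitch reset (Hz), lengthening ratio, and silence duration (ms). With real annotations, comparing these tuples across PW, PP, and INP would be one way to check the trend the abstract reports.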
Cite this article:
Gulmire Imam, Guljamal Mamateli, Maynur Ablitip, Askar Hamdulla. Prosody modeling for Uyghur TTS[J]. Journal of Tsinghua University(Science and Technology), 2017, 57(12): 1259-1264.
URL: http://jst.tsinghuajournals.com/EN/10.16511/j.cnki.qhdxxb.2017.21.026 OR http://jst.tsinghuajournals.com/EN/Y2017/V57/I12/1259
Copyright © Journal of Tsinghua University(Science and Technology), All Rights Reserved.