Please wait a minute...
 首页  期刊介绍 期刊订阅 联系我们
 
最新录用  |  预出版  |  当期目录  |  过刊浏览  |  阅读排行  |  下载排行  |  引用排行  |  百年期刊
Journal of Tsinghua University(Science and Technology)    2022, Vol. 62 Issue (2) : 347-354     DOI: 10.16511/j.cnki.qhdxxb.2021.22.015
COMPUTER SCIENCE AND TECHNOLOGY |
Ensemble weighted soft voting truth inference method for crowdsourcing
ZHANG Hua1,2, SHEN Fei1, JIANG Shihao1, ZHANG Lingjun1,3, XU Hong1
1. School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China;
2. Key Laboratory of Network Multimedia Technology of Zhejiang Province, Zhejiang University, Hangzhou 310058, China;
3. Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou 310018, China
Download: PDF(2747 KB)   HTML
Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks    
Abstract  Many truth inference methods have been proposed to improve crowdsourcing quality and to obtain high-quality annotated data. Traditional truth inference uses multiple noisy labels as inputs to deduce the real labels through an aggregation strategy. This paper introduces the features of the instances that most effectively mine the useful information contained in the instances. The probability that a crowdsourcing instance belongs to each category is used to divide the crowd-sourcing dataset. An integrated meta-learning classifier is trained on the new dataset to calculate a similarity degree to get worker weights that show each worker's annotation ability for different instances. Finally, a weighted soft voting method is used to predict the labels. Tests show that this method is superior to existing truth inference algorithms for public and constructed datasets.
Keywords crowdsourcing      feature      meta-learning      classify     
Issue Date: 22 January 2022
Service
E-mail this article
E-mail Alert
RSS
Articles by authors
ZHANG Hua
SHEN Fei
JIANG Shihao
ZHANG Lingjun
XU Hong
Cite this article:   
ZHANG Hua,SHEN Fei,JIANG Shihao, et al. Ensemble weighted soft voting truth inference method for crowdsourcing[J]. Journal of Tsinghua University(Science and Technology), 2022, 62(2): 347-354.
URL:  
http://jst.tsinghuajournals.com/EN/10.16511/j.cnki.qhdxxb.2021.22.015     OR     http://jst.tsinghuajournals.com/EN/Y2022/V62/I2/347
  
  
  
  
  
  
  
[1] LI Y L, GAO J, MENG C S, et al. A survey on truth discovery[C]//Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD). San Francisco, USA, 2016:1-16.
[2] DAWID A P, SKENE A M. Maximum likelihood estimation of observer error-rates using the EM algorithm[J]. Applied Statistics, 1979, 28(1):20-28.
[3] ZHANG Y C, CHEN X, ZHOU D Y, et al. Spectral methods meet EM:A provably optimal algorithm for crowdsourcing[C]//Proceedings of 28th Annual Conference on Neural Information Processing Systems (NIPS). Montreal, Canada, 2014:1260-1268.
[4] DEMARTINI G, DIFALLAH D E, CUDRE'-MAUROUX P. ZenCrowd:Leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking[C]//Proceedings of the 21st World Wide Web Conference (WWW). New York, USA, 2012:469-478.
[5] ZHOU D Y, PLATT J C, BASU S, et al. Learning from the wisdom of crowds by minimax entropy[C]//Proceedings of 26th Annual Conference on Neural Information Processing Systems (NIPS). Lake Tahoe, USA, 2012:2195-2203.
[6] WELINDER P, BRANSON S, BELONGIE S, et al. The multidimensional wisdom of crowds[C]//Proceedings of 24th Annual Conference on Neural Information Processing Systems (NIPS). Vancouver, Canada, 2010:2424-2432.
[7] ZHANG J, SHENG V S, WU J, et al. Multi-class ground truth inference in crowdsourcing with clustering[J]. IEEE Transactions on Knowledge and Data Engineering, 2016, 28(4):1080-1085.
[8] RAYKAR V C, YU S P, ZHAO L H, et al. Learning from crowds[J]. The Journal of Machine Learning Research, 2010, 11:1297-1322.
[9] HOSPEDALES T, ANTONIOU A, MICAELLI P, et al. Meta-learning in neural networks:A survey[Z/OL]. arXiv:2004.05439v2, 2020.
[10] BRAZDIL P, GIRAUD CARRIER C, SOARES C, et al. Metalearning:Applications to data mining[M]. Berlin, Germany:Springer Science & Business Media, 2009.
[11] SALVADOR M M, BUDKA M, GABRYS B. Adapting multicomponent predictive systems using hybrid adaptation strategies with auto-WEKA in process industry[C]//Proceedings of the 33rd International Conference on Machine Learning (ICML). New York, USA, 2016:1-8.
[12] FINN C, XU K, LEVINE S. Probabilistic model-agnostic meta-learning[C]//Proceedings of 32nd Annual Conference on Neural Information Processing Systems (NIPS). Montreal, Canada, 2018:9516-9527.
[13] DIZAJI K G, HUANG H. Sentiment analysis via deep hybrid textual-crowd learning model[C]//Proceedings of 32nd AAAI Conference on Artificial Intelligence (AAAI). New Orleans, USA, 2018:1563-1570.
[14] ZHANG J, WU M, SHENG V S. Ensemble learning from crowds[J]. IEEE Transactions on Knowledge and Data Engineering, 2019, 31(8):1506-1519.
[15] TAO F N, JIANG L X, LI C Q. Label similarity-based weighted soft majority voting and pairing for crowdsourcing[J]. Knowledge and Information Systems, 2020, 62(7):2521-2538.
[16] WHITEHILL J, WU T F, BERGSMA J, et al. Whose vote should count more:Optimal integration of labels from labelers of unknown expertise[C]//Proceedings of 23rd Annual Conference on Neural Information Processing Systems (NIPS). Vancouver, Canada, 2009:2035-2043.
[17] ZHENG Y D, LI G L, LI Y B, et al. Truth inference in crowdsourcing:Is the problem solved?[J]. Proceedings of the VLDB Endowment, 2017, 10(5):541-552.
[18] ZHANG J, SHENG V S, NICHOLSON B, et al. CEKA:A tool for mining the wisdom of crowds[J]. The Journal of Machine Learning Research, 2015, 16(88):2853-2858.
[1] ZHANG Mingfang, LI Guilin, WU Chuna, WANG Li, TONG Lianghao. Estimation algorithm of driver's gaze zone based on lightweight spatial feature encoding network[J]. Journal of Tsinghua University(Science and Technology), 2024, 64(1): 44-54.
[2] LIU Yuanxin, LIAO Wenjie, LIN Yuanqing, XIE Linlin, LU Xinzheng. Influence of data features on the generative adversarial network-based intelligent design for shear wall structures[J]. Journal of Tsinghua University(Science and Technology), 2023, 63(12): 2005-2018.
[3] YANG Hongyu, ZHANG Zixin, ZHANG Liang. Network security situation assessments with parallel feature extraction and an improved BiGRU[J]. Journal of Tsinghua University(Science and Technology), 2022, 62(5): 842-848.
[4] ZHANG Dongcheng, QIANG Maoshan, JIANG Hanchen, HUANG Yujie. Mining safety hazard management collaboration features from large construction projects[J]. Journal of Tsinghua University(Science and Technology), 2022, 62(2): 208-214.
[5] SUN Yue, HE Ke, ZHANG Zhinan. Multi-source information fitting regression integrated model of coefficient of friction[J]. Journal of Tsinghua University(Science and Technology), 2022, 62(12): 1980-1988.
[6] LIU Shudong, ZHANG Jiani, CHEN Xu. Review-aware heterogeneous variational autoencoder recommendation model[J]. Journal of Tsinghua University(Science and Technology), 2022, 62(1): 88-97.
[7] ZHANG Tianyi, ZHU Zhiming, ZHU Chuanhui, SUN Bowen. Visual sensing image processing and feature information extraction for arc welding[J]. Journal of Tsinghua University(Science and Technology), 2022, 62(1): 156-162.
[8] LI Cong, MA Xun, YANG Rui, ZHANG Hui. Characterization of n-heptane annular pool fires based on flame shape and texture features[J]. Journal of Tsinghua University(Science and Technology), 2021, 61(6): 502-508.
[9] TANG Yingfu, WANG Zhongjing, ZHANG Zixiong. Registration of sand dune images using an improved SIFT and SURF algorithm[J]. Journal of Tsinghua University(Science and Technology), 2021, 61(2): 161-169.
[10] DING Ying, ZHANG Jianqin, YANG Mu, GONG Peng, JIA Lipeng, DENG Shaocun. Communicable disease transmission model for the prevention and control of COVID-19 in Wuhan City, China[J]. Journal of Tsinghua University(Science and Technology), 2021, 61(12): 1452-1461.
[11] WANG Xiaomeng, GUAN Zhibin, XIN Wei, WANG Jiajie. Source code defect detection using deep convolutional neural networks[J]. Journal of Tsinghua University(Science and Technology), 2021, 61(11): 1267-1272.
[12] CUI Junyun, CHEN Di, YUAN Ye, MA Yuliang, WANG Guoren. Online route planning algorithm in spatial crowdsourcing[J]. Journal of Tsinghua University(Science and Technology), 2020, 60(8): 672-682.
[13] WANG Xiaoxu, WANG Lizhen, WANG Jialong. Mining spatio-temporal co-location fuzzy congestion patterns from traffic datasets[J]. Journal of Tsinghua University(Science and Technology), 2020, 60(8): 683-692.
[14] ZHANG Zhanbo, WANG Zhenlei, WANG Xin. Fault detection based on orthogonal local slow features[J]. Journal of Tsinghua University(Science and Technology), 2020, 60(8): 693-700.
[15] LI Minhui, CHEN Yi, WU Baosheng. Analysis of features and factors controlling typical drainage networks in the Tibetan Plateau[J]. Journal of Tsinghua University(Science and Technology), 2020, 60(11): 951-957.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
Copyright © Journal of Tsinghua University(Science and Technology), All Rights Reserved.
Powered by Beijing Magtech Co. Ltd