Journal of Tsinghua University(Science and Technology)    2017, Vol. 57 Issue (8) : 884-891     DOI: 10.16511/j.cnki.qhdxxb.2017.22.055
COMPUTER SCIENCE AND TECHNOLOGY
Algorithm for clustering uncertain data streams based on density
HAN Donghong, SONG Ming, ZHANG Hongliang, WANG Jiaxi, WANG Jiaxing, WANG Guoren
School of Computer Science and Engineering, Northeastern University, Shenyang 110819, China
Abstract  Traditional clustering algorithms cannot cluster uncertain data streams because they do not account for data uncertainty. This paper presents a density-based clustering algorithm for uncertain data stream environments. An uncertainty metric is used to measure the distribution information in the uncertain data. The DENCLUE algorithm is then modified into an uncertain data streams variant (USDENCLUE) that handles uncertain data and minimizes the impact of data uncertainty on the clustering results. A density-based clustering algorithm is then given for uncertain data streams over a sliding window; it uses an exponential histogram of cluster features to prune clusters rapidly. The algorithm efficiently handles noisy data in evolving data streams and generates clusters of arbitrary shape, which improves clustering quality. Comparisons with the CluStream clustering algorithm on real and synthetic data sets show the efficiency and effectiveness of the algorithm.
Keywords uncertain data streams      clustering      density      sliding window     
ZTFLH:  TP301.6  
Issue Date: 15 August 2017
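The abstract gives no formulas, so the following is only a minimal sketch of the general idea it describes: a DENCLUE-style kernel density estimate in which each point's influence is scaled by an uncertainty weight, computed over a sliding window. The class name `SlidingWindowDensity`, the per-point existence probability `prob`, and the Gaussian kernel width `sigma` are all assumptions made for illustration; they are not taken from the paper. A plain fixed-size deque stands in for the exponential histogram of cluster features, which is not reproduced here.

```python
import math
from collections import deque


def gaussian_kernel(x, y, sigma):
    """Gaussian influence of a point y on location x (DENCLUE-style kernel)."""
    d2 = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-d2 / (2.0 * sigma ** 2))


class SlidingWindowDensity:
    """Keeps the most recent `window` uncertain points and estimates density.

    Each point is stored as (coords, prob). `prob` is a hypothetical
    existence probability in [0, 1] used as the uncertainty weight; the
    paper's actual uncertainty metric is not given in the abstract.
    """

    def __init__(self, window, sigma):
        # deque(maxlen=...) silently drops the oldest point when full.
        self.points = deque(maxlen=window)
        self.sigma = sigma

    def add(self, coords, prob):
        self.points.append((tuple(coords), prob))

    def density(self, x):
        # Probability-weighted sum of kernel influences over the window.
        return sum(p * gaussian_kernel(x, c, self.sigma)
                   for c, p in self.points)


w = SlidingWindowDensity(window=2, sigma=1.0)
w.add((0.0, 0.0), 1.0)
w.add((0.0, 0.0), 0.5)    # less certain point contributes half as much
w.add((10.0, 10.0), 1.0)  # window is full, so the oldest point expires
```

In this sketch a low-probability point pulls the density estimate less than a certain one, which is one simple way to limit the effect of uncertainty on density-based clusters. A streaming implementation would summarize expired data rather than store raw points.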
Cite this article:   
HAN Donghong, SONG Ming, ZHANG Hongliang, et al. Algorithm for clustering uncertain data streams based on density[J]. Journal of Tsinghua University(Science and Technology), 2017, 57(8): 884-891.
URL:  
http://jst.tsinghuajournals.com/EN/10.16511/j.cnki.qhdxxb.2017.22.055     OR     http://jst.tsinghuajournals.com/EN/Y2017/V57/I8/884
Copyright © Journal of Tsinghua University(Science and Technology), All Rights Reserved.