利用海量位置数据分析用户行为,挖掘用户的潜在价值越来越受到人们的关注。室外环境中已有较成熟的解决方案。针对室内空间中WiFi定位数据的精确度、鲁棒性不足等问题,对面向室内空间的语义轨迹提取方法进行了研究,能在减少错误、压缩原始位置数据的同时,增强轨迹的表达能力,使得更深入的室内时空数据挖掘成为可能。该文基于室内空间建模、数据清洗、事件提取和语义增强4个模块的框架提出了室内语义轨迹计算的方法,在真实数据集和模拟数据集上进行实验,结果表明:该方法能从存在误差和缺失的室内定位数据中,准确有效地挖掘和提取出含有语义信息的轨迹数据,为上层的应用分析所用。
Using massive location data to analyze user behavior and tap the potential value of customers is getting much attention. Mature solutions have been studied for outdoor space, while the accuracy and robustness of WiFi positioning data are main challenges for indoor space. The method of extracting the semantic trajectory for indoor space is studied. It can reduce error, compress original position data and enhance the expression capability of trajectory for indoor spatio-temporal data mining. Based on four modules including indoor space modeling, data cleansing, event extraction and semantic enhancement, a framework for calculating indoor semantic trajectory is proposed. Experiments are conducted on real and simulated datasets, and the results show that this method can accurately and effectively extract trajectory data containing semantic information from indoor positioning data with incompleteness and errors, which can be used for high-level analytical applications.
[1] ORELLANA D, WACHOWICZ M, ANDRIENKO N, et al. Uncovering interaction patterns in mobile outdoor gaming[C]//2009 International Conference on Advanced Geographic Information Systems & Web Services. Cancun, Mexico:IEEE, 2009:177-182.
[2] WU F, FU K, WANG Y, et al. A spatial-temporal-semantic neural network algorithm for location prediction on moving objects[J]. Algorithms, 2017, 10(2):37-37.
[3] MARKETOS G, FRENTZOS E, NTOUTSI I, et al. Building real-world trajectory warehouses[C]//Proceedings of the Seventh ACM International Workshop on Data Engineering for Wireless and Mobile Access. New York, USA:ACM, 2008:8-15.
[4] ZHENG Y. Trajectory data mining:An overview[J]. ACM Transactions on Intelligent Systems and Technology, 2015, 6(3):29-29.
[5] LEE S H, LIM I K, LEE J K. Method for improving indoor positioning accuracy using extended Kalman filter[J]. Mobile Information Systems, 2016, 2369103.
[6] WALTON L, WORBOYS M. An algebraic approach to image schemas for geographic space[C]//Proceedings of the 9th International Conference on Spatial Information Theory. Berlin, Germany:Springer-Verlag, 2009:357-370.
[7] XIE X K, LU H, PEDERSEN T B. Efficient distance-aware query evaluation on indoor moving objects[C]//2013 IEEE 29th International Conference on Data Engineering. Brisbane, QLD, Australia:IEEE, 2013:434-445.
[8] ALVARES L O, BOGORNY V, KUIJPERS B, et al. A model for enriching trajectories with semantic geographical information[C]//Proceedings of the 15th Annual ACM International Symposium on Advances in Geographic Information Systems. New York, USA:ACM, 2007:22-30.
[9] LI Q N, ZHENG Y, XIE X, et al. Mining user similarity based on location history[C]//Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. New York,USA:ACM, 2008:34-43.
[10] GONG L, SATO H, YAMAMOTO T, et al. Identification of activity stop locations in GPS trajectories by density-based clustering method combined with support vector machines[J]. Journal of Modern Transportation, 2015, 23(3):202-213.
[11] ESTER M, KRIEGEL H P, SANDER J, et al. A density-based algorithm for discovering clusters in large spatial databases with noise[C]//Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, Palo Alto, USA:AAAI Press, 1996:226-231.
[12] BIRANT D, KUT A. ST-DBSCAN:An algorithm for clustering spatial-temporal data[J]. Data & Knowledge Engineering, 2007, 60(1):208-221.
[13] LUO T, ZHENG X W, XU G L, et al. An improved DBSCAN algorithm to detect stops in individual trajectories[J]. ISPRS International Journal of Geo-Information, 2017, 6(3):63-78.
[14] YAN Z X, CHAKRABORTY D, PARENT C, et al. Semantic trajectories:Mobility data computation and annotation[J]. ACM Transactions on Intelligent Systems and Technology (TIST), 2013, 4(3):1-38.
[15] JENSEN C S, LU H, YANG B. Graph model based indoor tracking[C]//2009 Tenth International Conference on Mobile Data Management:Systems, Services and Middleware. Taipei, China:IEEE, 2009:122-131.
[16] JUNG Y, JOO M. Building information modelling (BIM) framework for practical implementation[J]. Automation in Construction, 2011, 20(2):126-133.
[17] YANG B, LU H, JENSEN C S. Probabilistic threshold k nearest neighbor queries over moving objects in symbolic indoor space[C]//Proceedings of the 13th International Conference on Extending Database Technology. New York, USA:ACM, 2010:335-346.
[18] SANDER J, ESTER M, KRIEGEL H P, et al. Density-based clustering in spatial databases:The algorithm gdbscan and its applications[J]. Data Mining and Knowledge Discovery, 1998, 2(2):169-194.
[19] LI H, LU H, CHEN X, et al. Vita:A versatile toolkit for generating indoor mobility data for real-world buildings[J]. Proceedings of the VLDB Endowment, 2016, 9(13):1453-1456.
[20] CHEN L, ÖZSU M T, ORIA V. Robust and fast similarity search for moving object trajectories[C]//Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data. New York, USA:ACM, 2005:491-502.