Analysis method for standardization reviews on Beijing enterprises
ZHU Yu1, CHEN Tao1, JI Xuewei2, ZHANG Hui2, WU Aizhi2
1. Institute of Safety Science and Technology, Department of Engineering Physics, Tsinghua University, Beijing 100084, China; 2. Beijing Academy of Safety Science and Technology, Beijing 100070, China
Abstract:There are over one million items from ten thousand enterprises in the Beijing Enterprise Standardization Review. Big data analytical methods can be used to analyze the relationships between the deduction counts of the review items because of the large data volume. The most popular method is association rules, but these are qualitative, not quantitative. Neural network, another widely used data mining method, are able to solve complex non-linear problems but requires much effort to choose the suitable inputs and targets. This article combines these two methods with the association rules used to select the inputs and targets from the review items and the neural network used to relate the inputs and the targets. A test gave a strong correlation between 3 selected review items and 8 other review items with a correlation coefficient of the fitting curve of over 0.84 between the predicted targets and the real value. Thus, this combined method can improve data mining of the enterprises review items with the result used to predict the deduction counts of the selected items.
[1] AGRAWAL R, SRIKANT R. Fast algorithms for mining association rules in large databases[C]//Proceedings of the 20th International Conference on Very Large Data Bases. Santiago de Chile, Chile:Morgan Kaufmann Publishers Inc., 1994:487-499. [2] NEBOT V, BERLANGA R. Finding association rules in semantic web data[J]. Knowledge-Based Systems, 2012, 25(1):51-62. [3] KAMSU-FOGUEM B, RIGAL F, MAUGET F. Mining association rules for the quality improvement of the production process[J]. Expert Systems with Applications, 2013, 40(4):1034-1045. [4] 李晓兰, 曹晓钟, 朱君, 等. 基于关联规则挖掘的自动站观测数据相关性分析[J]. 气象科技, 2016, 44(5):715-721.LI X L, CAO X Z, ZHU J, et al. Correlation analysis of observation data from automatic stations based on association rule mining[J]. Meteorological Science and Technology, 2016, 44(5):715-721. (in Chinese) [5] 林颖华, 陈长凤. 基于关联规则的企业财务风险评价研究[J]. 会计之友, 2017(1):32-35.LIN Y H, CHEN C F. Research on enterprise financial risk evaluation based on association rules[J]. Friends of Accounting, 2017(1):32-35. (in Chinese) [6] RUMELHART D E, MCCLELLAND J L, PDP RESEARCH GROUP. Parallel distributed processing[M]. Cambridge, MA:MIT Press, 1987. [7] REN C, AN N, WANG J Z, et al. Optimal parameters selection for BP neural network based on particle swarm optimization:A case study of wind speed forecasting[J]. Knowledge-Based Systems, 2014, 56:226-239. [8] WANG J D, FANG K J, PANG W J, et al. Wind power interval prediction based on improved PSO and BP neural network[J]. Journal of Electrical Engineering and Technology, 2017, 12(3):989-995. [9] XIAO Z, YE S J, ZHONG B, et al. BP neural network with rough set for short term load forecasting[J]. Expert Systems with Applications, 2009, 36(1):273-279. [10] ZHANG Y D, WU L N. Stock market prediction of S & P 500 via combination of improved BCO approach and BP neural network[J]. Expert Systems with Applications, 2009, 36(5):8849-8854. [11] 罗前林. 600MW超临界机组主要运行参数目标值优化[D]. 武汉:华中科技大学, 2012.LUO Q L. The target value optimization of operating parameters of 600MW supercritical units[D]. Wuhan:Huazhong University of Science and Technology, 2012. (in Chinese) [12] KARABATAK M, INCE M C. An expert system for detection of breast cancer based on association rules and neural network[J]. Expert Systems with Applications, 2009, 36(2):3465-3469. [13] 晏杰, 亓文娟. 基于Aprior & FP-growth算法的研究[J]. 计算机系统应用, 2013, 22(5):122-125.YAN J, QI W J. Research based on Aprior & FP-growth algorithm[J]. Computer Systems & Applications, 2013, 22(5):122-125. (in Chinese)