Construction of the CIM hierarchical classification semantic network based on multi-source heterogeneous data
Zhao XU, Wenxin GUAN, Gan ZHANG, Zhuozhen FANG, Weilang CAI
Journal of Tsinghua University (Science and Technology), 2025, Vol. 65, Issue 7: 1197-1208.
Objective: The city information model (CIM) is an emerging form of integrated city information that digitally represents urban objects and combines large amounts of information to guide the construction of the city as an organism. However, CIM application requirements are complex and its theoretical system is still immature: a semantic specification and a unified framework are lacking, which makes the development of CIM difficult to promote effectively. To work toward a universal semantic standard, this study proposes a CIM classification semantic network to guide CIM construction and govern CIM data.
Methods: This study designs a CIM classification semantic tree using the line classification method based on various criteria. It uses the robustly optimized bidirectional encoder representations from transformers pretraining approach (RoBERTa) model together with a clustering algorithm to merge synonyms that fall into the same cluster, and it adds new semantic knowledge to optimize the semantic tree. To further expand the application of CIM semantics, the city information model ontology (CIMO) is built from the completed semantic tree using the Stanford seven-step method; the CIMO has six main classes and nine main property attributes and enables computers to process semantic information effectively. Because CIM data come from multiple sources, this study aims to fully leverage the semantic information in building information modeling (BIM) and geographic information system (GIS) data. It therefore designs a mapping between the CIMO and multisource heterogeneous data, namely Industry Foundation Classes (IFC) and City Geography Markup Language (CityGML) data. The CIMO serves as the foundation for semantic analysis and knowledge graph construction. Coding attributes are proposed as unique identifiers for management and analysis, which improves the efficiency and accuracy of the CIM classification semantic network. Models of a primary school building and the surrounding municipal facilities are used as a case study to evaluate the semantic-tree-based CIMO and knowledge graph data governance.
Results: Based on the mapping rules, the case-study model could be converted into a knowledge graph expressed as formal triples, and the resulting Web Ontology Language (OWL) file could be understood and processed by computers. Semantic analysis could be completed using the Semantic Web Rule Language (SWRL), and logic testing could be completed with an inference engine. The CIM classification semantic network that passed the test could identify, by level and classification, the relationships between instances and the instance categories contained in a queried class. It could also operate on and update instance data to complete logical and point-to-point queries and governance, showing excellent data governance performance and clear semantic logic. The triple file of the knowledge graph was imported into a graph database for visualization and storage, which makes the graph data easier to understand and process.
Conclusions: The CIM classification semantic network proposed in this study can support the construction of multiscenario, multiprecision, and multilevel CIM systems. It standardizes the semantic expression of CIM, has good hierarchical logic and data processing functions, serves as a semantic standard, and integrates city-level and component-level data. The semantic network can provide guidance and a framework for developing CIM data governance platforms and for modeling at the city level, and it can promote the construction and development of a comprehensive CIM system that integrates geometry and semantics.
Keywords: city information model / hierarchical classification / multi-source heterogeneous data / ontology / knowledge graph
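The mapping step described in the Methods, from IFC and CityGML classes to ontology classes and then to knowledge-graph triples keyed by a coding attribute, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the abstract does not list the six CIMO classes, so the class names (`Building`, `BuildingComponent`, `MunicipalFacility`), the `cimo:` predicates, and the code format (`B-001`, `B-001-W-01`) are hypothetical. Only the source class names `IfcBuilding`, `IfcWall`, `bldg:Building`, and `tran:Road` come from the IFC and CityGML schemas.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Triple = Tuple[str, str, str]

# Hypothetical mapping from (source schema, source class) to an ontology class.
# The right-hand names are illustrative, not the paper's actual CIMO classes.
SOURCE_TO_CIMO = {
    ("IFC", "IfcBuilding"): "Building",
    ("IFC", "IfcWall"): "BuildingComponent",
    ("CityGML", "bldg:Building"): "Building",
    ("CityGML", "tran:Road"): "MunicipalFacility",
}


@dataclass(frozen=True)
class Instance:
    schema: str                        # "IFC" or "CityGML"
    source_class: str                  # class name in the source schema
    name: str                          # human-readable label
    code: str                          # hypothetical coding attribute used as unique identifier
    parent_code: Optional[str] = None  # code of the containing object, if any


def to_triples(instances: List[Instance]) -> List[Triple]:
    """Convert mapped instances into (subject, predicate, object) triples."""
    triples: List[Triple] = []
    for inst in instances:
        cimo_class = SOURCE_TO_CIMO.get((inst.schema, inst.source_class))
        if cimo_class is None:
            continue  # this sketch simply skips classes without a mapping rule
        subject = f"cimo:{inst.code}"
        triples.append((subject, "rdf:type", f"cimo:{cimo_class}"))
        triples.append((subject, "cimo:hasName", inst.name))
        triples.append((subject, "cimo:sourceSchema", inst.schema))
        if inst.parent_code is not None:
            triples.append((subject, "cimo:isPartOf", f"cimo:{inst.parent_code}"))
    return triples


def instances_of(triples: List[Triple], cimo_class: str) -> List[str]:
    """Return the subjects typed as the given ontology class, sorted by code."""
    target = f"cimo:{cimo_class}"
    return sorted(s for s, p, o in triples if p == "rdf:type" and o == target)


# Toy data loosely modeled on the case study (a school and nearby facilities).
school = Instance("IFC", "IfcBuilding", "Primary school", "B-001")
wall = Instance("IFC", "IfcWall", "Exterior wall", "B-001-W-01", parent_code="B-001")
road = Instance("CityGML", "tran:Road", "Access road", "M-001")

triples = to_triples([school, wall, road])
for triple in triples:
    print(triple)
```

Because every subject is derived from the coding attribute, a point-to-point query reduces to filtering triples by subject, and a class-level query such as `instances_of(triples, "Building")` reduces to filtering on `rdf:type`. A real pipeline would presumably serialize these triples to OWL or load them into a graph database, as the Results describe.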