Construction method of multimodal knowledge graph for safety management in hydropower underground engineering

XIANG Yunfei; LUO Yiming; NING Zeyu; LIU Yuanguang; YANG Zuobin; LI Zichang; LIN Peng

doi:10.16511/j.cnki.qhdxxb.2025.26.014

Journal of Tsinghua University (Science and Technology)

2025, Vol. 65, Issue 3: 433-445

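1. Department of Hydraulic Engineering, Tsinghua University, Beijing 100084, China;
2. China Huaneng Clean Energy Research Institute, Beijing 102209, China;
3. Sinohydro Bureau 11 Co., Ltd., Zhengzhou 450001, China;
4. Huaneng Lancang River Hydropower Inc., Kunming 650214, China;
5. Tsinghua Sichuan Energy Internet Research Institute, Chengdu 610213, China

Received date: 2024-07-02

Online published: 2025-03-07

Funding

Technology Development Project of Sinohydro Bureau 11 Co., Ltd. (KKM-SUB-2024-008); Science and Technology Project of China Huaneng Group Co., Ltd. (HNKJ23-H4); Technical Service Project of China Three Gorges Construction Engineering Corporation (WDD/0578)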

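Summary

Safety management in hydropower underground engineering faces challenges such as overlapping construction operations and dynamic resource flows. The associated safety management activities span multiple disciplines, trades, and business processes, and carrying them out efficiently requires professional knowledge and skills from different fields. However, knowledge in this domain has a complex structure and is scattered across multimodal data such as text, tables, and images. This paper therefore studies a method for constructing a multimodal knowledge graph for safety management in hydropower underground engineering, which is important for acquiring domain knowledge and providing knowledge services for similar projects. First, a large-scale, high-quality, multisource heterogeneous dataset for safety management in hydropower underground engineering is built, containing safety hazard inspection and rectification records, regulatory and institutional documents, and safety hazard images. Second, based on a large language model, knowledge is extracted using a prompt-tuning method that incorporates domain knowledge, and multimodal knowledge is linked and fused. Third, to address the differing needs of various safety management scenarios, a scenario knowledge extraction method is proposed, and knowledge graph and large language model techniques are combined to achieve retrieval-augmented generation and interpretable knowledge reasoning. Finally, a multimodal knowledge graph is constructed from data collected at several hydropower projects and validated on the Baihetan hydropower underground works, where it is used for intelligent recommendation of safety hazard rectification measures and for compliance checking of regulatory documents. The results can serve as a reference for shifting safety management in infrastructure construction from a data-driven to a knowledge-driven approach.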


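Cite this article

XIANG Yunfei, LUO Yiming, NING Zeyu, LIU Yuanguang, YANG Zuobin, LI Zichang, LIN Peng. Construction method of multimodal knowledge graph for safety management in hydropower underground engineering[J]. Journal of Tsinghua University (Science and Technology), 2025, 65(3): 433-445. DOI: 10.16511/j.cnki.qhdxxb.2025.26.014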

Abstract

[Objective] Hydropower underground engineering faces significant safety management challenges owing to overlapping construction activities, diverse process stages, and dynamic resource flows. Safety management involves multidisciplinary tasks, such as safety hazard identification and rectification, emergency response, and regulatory compliance checks, all of which require specialized domain knowledge. This knowledge is intricate, encompassing expert experience, patterns and characteristics, and management codes, and it is dispersed across multimodal data formats, including text, tables, and images. Efficiently extracting knowledge from these multimodal data sources can significantly enhance data utility and support intelligent safety management. However, owing to the diversity of data formats, the complexity of the knowledge system, and the variety of management scenarios, current research struggles with limited knowledge sources, acquisition difficulties, and poor generalization.

[Methods] This study proposes a method for constructing a multimodal knowledge graph (KG) for safety management in hydropower underground engineering. (1) A large-scale, high-quality, multisource heterogeneous dataset is built from safety hazard identification and rectification records, regulations, and images. (2) Knowledge modeling combines top-down and bottom-up approaches to define the entities, relationships, attributes, and events pertinent to safety management in hydropower underground engineering. (3) Entity and relationship information is extracted from text data using a large language model (LLM) tuned with domain knowledge; the prompts are enriched with specific examples for each entity type to cope with small sample sizes, and these demonstrations provide the model with prior knowledge. (4) Instance segmentation is used to annotate safety hazard images, and the entities identified in the images are converted into vectors. Image and text data are then linked based on semantic similarity, so that image data are integrated into the textual KG and multimodal data are transformed into multimodal knowledge. (5) The multimodal KG is stored in Neo4j, an open-source graph database management system. (6) A scenario-specific knowledge acquisition method addresses the needs of different safety management scenarios, integrating the KG with LLMs to enable retrieval-augmented generation and interpretable knowledge reasoning.

[Results] (1) More than 120 000 safety hazard records, 30 regulatory documents, and 300 000 safety hazard images were collected and used to construct a large-scale, high-quality, multisource heterogeneous dataset for safety management in hydropower underground engineering. (2) Taking a hydropower underground engineering project as an example, the constructed multimodal KG was applied to intelligent recommendation of safety hazard rectification measures and to compliance checks. (3) To recommend rectification measures, users input safety hazard information, a scene-KG is extracted from the multimodal KG, and the scene-KG is fed into an LLM to generate appropriate rectification measures. (4) For compliance checks, an inference retrieval method extended the neighboring nodes of the scene-KG to construct an inference-KG. By integrating the inference-KG with an LLM, the system retrieved relevant content from regulatory documents based on user input.

[Conclusions] The proposed method effectively extracts domain knowledge from multimodal data and applies it to safety management in hydropower underground engineering. The results serve as a reference for transitioning infrastructure construction safety management from a data-driven approach to a knowledge-driven approach.
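
Step (4) of the methods links image entities to text entities by semantic similarity. The abstract does not give the embedding model, similarity measure, or threshold, so the following is only a minimal sketch that assumes cosine similarity over precomputed vectors. The function names, the entity identifiers, the toy three-dimensional vectors, and the 0.8 threshold are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def link_image_entities(image_vectors, text_vectors, threshold=0.8):
    """Map each image entity to its most similar text entity.

    An image entity is linked only if its best score reaches the threshold
    (an assumed value, not one reported in the paper).
    """
    links = {}
    for image_id, image_vec in image_vectors.items():
        best_id, best_score = None, threshold
        for text_id, text_vec in text_vectors.items():
            score = cosine_similarity(image_vec, text_vec)
            if score >= best_score:
                best_id, best_score = text_id, score
        if best_id is not None:
            links[image_id] = (best_id, best_score)
    return links


# Hypothetical toy embeddings; real vectors would come from an encoder.
image_vectors = {
    "img_001:unguarded_edge": [0.9, 0.1, 0.0],
    "img_002:loose_cable": [0.0, 0.2, 0.9],
    "img_003:unclear": [0.5, 0.5, 0.5],
}
text_vectors = {
    "edge protection missing": [1.0, 0.0, 0.0],
    "temporary electrical wiring": [0.0, 0.0, 1.0],
    "ventilation": [0.0, 1.0, 0.0],
}

links = link_image_entities(image_vectors, text_vectors)
for image_id, (text_id, score) in links.items():
    print(f"{image_id} -> {text_id} ({score:.3f})")
```

In this toy data the third image entity is equally far from every text entity and stays unlinked, which illustrates why some threshold is needed before image nodes are merged into a textual KG.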

Keywords: hydropower underground engineering; safety management; dataset; multimodal knowledge graph; knowledge application
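
The abstract describes extracting a scene-KG for a user's hazard input, extending its neighboring nodes into an inference-KG, and passing the result to an LLM. The sketch below illustrates that retrieval pattern under stated assumptions: an in-memory list of triples stands in for the Neo4j store, hop-limited breadth-first expansion stands in for the paper's retrieval method, and every entity, relation name, and function name is hypothetical. No LLM is called; the code only assembles the prompt text.

```python
from collections import deque

# Hypothetical triples (head, relation, tail) standing in for the graph store.
TRIPLES = [
    ("Unguarded shaft opening", "LOCATED_IN", "Main powerhouse"),
    ("Unguarded shaft opening", "RECTIFIED_BY", "Install guardrail and warning sign"),
    ("Unguarded shaft opening", "GOVERNED_BY", "Regulation clause A"),
    ("Regulation clause A", "PART_OF", "Regulation document X"),
    ("Exposed cable", "RECTIFIED_BY", "Re-route and insulate cable"),
]


def extract_subgraph(triples, seed, max_hops):
    """Collect triples reachable from the seed entity within max_hops.

    Edges are treated as undirected during the breadth-first expansion.
    """
    visited = {seed}
    frontier = deque([(seed, 0)])
    selected = []
    while frontier:
        node, depth = frontier.popleft()
        if depth == max_hops:
            continue  # do not expand beyond the hop limit
        for triple in triples:
            head, _, tail = triple
            if node not in (head, tail):
                continue
            if triple not in selected:
                selected.append(triple)
            neighbor = tail if node == head else head
            if neighbor not in visited:
                visited.add(neighbor)
                frontier.append((neighbor, depth + 1))
    return selected


def build_prompt(question, triples):
    """Format retrieved triples as grounding facts for a language model."""
    facts = "\n".join(f"- {h} --{r}--> {t}" for h, r, t in triples)
    return (
        "Answer using only the facts below and cite the facts you used.\n"
        f"Facts:\n{facts}\n"
        f"Question: {question}\n"
    )


scene_kg = extract_subgraph(TRIPLES, "Unguarded shaft opening", max_hops=1)
inference_kg = extract_subgraph(TRIPLES, "Unguarded shaft opening", max_hops=2)
prompt = build_prompt("How should this hazard be rectified?", scene_kg)
print(prompt)
```

Because the prompt lists the triples that were retrieved, a generated answer can be traced back to specific graph facts, which is one plausible reading of the interpretable reasoning that the abstract mentions.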
