Journal of Tsinghua University (Science and Technology)  2022, Vol. 62 Issue (5): 978-986    DOI: 10.16511/j.cnki.qhdxxb.2022.22.023
Mechanical Engineering
Knowledge extraction and knowledge base construction method from industrial software packages
WANG Liping1,2, ZHANG Chao2, CAI Enlei2, SHI Huijie2, WANG Dong1
1. Department of Mechanical Engineering, Tsinghua University, Beijing 100084, China;
2. School of Mechanical and Electrical Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
Abstract: Domestically developed industrial software is one of the key forces supporting the innovation and development of domestic small and medium-sized enterprises. Texts associated with such software contain a large amount of manufacturing-related knowledge, but methods for extracting this knowledge and building a knowledge base from it are currently lacking. This paper presents a knowledge extraction model based on neural networks and natural language processing. The model consists of three parts: text representation, entity recognition, and relation extraction. The extracted entities and relations are modeled as a knowledge graph: ontology modeling defines the concepts related to the software, and graph data modeling maps the concepts in the ontology model onto graph data, which improves data retrieval and modeling capabilities. The data are then persistently stored in the knowledge base. Application results show that the method can be used to build an industrial software knowledge base and plays an important role in integrating manufacturing-related knowledge.
Key words: industrial software    neural network    entity recognition    relation extraction    knowledge graph
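Received: 2021-12-13      Published: 2022-04-26
Funding: National Key Research and Development Program of China (2020YFB1712303)
Corresponding author: WANG Dong, assistant research fellow, E-mail: d-wang@tsinghua.edu.cn
About the author: WANG Liping (1967-), male, professor.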
Cite this article:
WANG Liping, ZHANG Chao, CAI Enlei, SHI Huijie, WANG Dong. Knowledge extraction and knowledge base construction method from industrial software packages. Journal of Tsinghua University (Science and Technology), 2022, 62(5): 978-986.
Link to this article:
http://jst.tsinghuajournals.com/CN/10.16511/j.cnki.qhdxxb.2022.22.023  or  http://jst.tsinghuajournals.com/CN/Y2022/V62/I5/978