Acta Scientiarum Naturalium Universitatis Pekinensis ›› 2016, Vol. 52 ›› Issue (1): 89-96.DOI: 10.13209/j.0479-8023.2016.012

Previous Articles     Next Articles

A Chinese Event Trigger Inference Approach Based on Markov Logic Networks

ZHU Shaohua1,2, LI Peifeng1,2, ZHU Qiaoming1,2   

  1. 1. Natural Language Processing Lab, Soochow University, Suzhou 215006
    2. School of Computer Science and Technology, Soochow University, Suzhou 215006
  • Received:2015-06-19 Online:2016-01-20 Published:2016-01-20
  • Contact: LI Peifeng, E-mail: pfli(at)suda.edu.cn

基于MLN的中文事件触发词推理方法

朱少华1,2, 李培峰1,2, 朱巧明1,2   

  1. 1. 苏州大学自然语言处理实验室, 苏州 215006
    2. 苏州大学计算机科学与技术学院, 苏州 215006
  • 通讯作者: 李培峰, E-mail: pfli(a)suda.edu.cn
  • 基金资助:
    国家自然科学基金(61472265, 61331011)和江苏省前瞻性联合研究项目(BY2014059-08)资助

Abstract:

Previous Chinese argument extraction approaches mainly focus on feature engineering and trigger expansion, which cannot exploit inner relation between trigger mentions in same document. To address this issue, the authors bring forward a novel trigger inference mechanism based on Markov logic network. Head morpheme, the probabilities of a trigger mention fulfilling true and pseudo events from the training set and the relationships between trigger mentions are used to infer those trigger mentions with lack of effective context information or low confidences in testing set. Experimental results on the ACE 2005 Chinese corpus show that the proposed approach outperforms the baseline, with the F1 improvements of 3.65% and 2.51% in trigger identification and event type classification respectively.

Key words: trigger extraction, Markov logic network, trigger inference

摘要:

现有的中文事件触发词抽取方法大多数采用特征工程和触发词扩展方法, 无法利用同一文档中各个触发词实例之间的内在关系。为了解决上述问题, 基于马尔科夫逻辑网络(MLN), 利用核心词素, 训练语料中触发词实例填充真假事件的概率, 以及触发词实例间的关系等信息来推导测试集中缺乏有效上下文信息和低可信度的触发词实例。在ACE 2005 中文语料上的实验结果表明, 与基准系统相比, 该方法在触发词识别和事件类型分类阶段F1值分别提高3.65%和2.51%。

关键词: 触发词抽取, 马尔科夫逻辑网络, 触发词推理

CLC Number: