北京大学学报自然科学版 ›› 2020, Vol. 56 ›› Issue (1): 82-88.DOI: 10.13209/j.0479-8023.2019.091

上一篇    下一篇

融合篇章表征的事件指代消解研究

吴瑞萦, 孔芳   

  1. 苏州大学计算机科学与技术学院, 苏州 215006
  • 收稿日期:2019-05-14 修回日期:2019-09-19 出版日期:2020-01-20 发布日期:2020-01-20
  • 通讯作者: 孔芳, E-mail: kongfang(at)suda.edu.cn
  • 基金资助:
    国家自然科学基金(61876118, 61836007)资助

Event Coreference Resolution with Document Representation

WU Ruiying, KONG Fang   

  1. School of Computer Sciences and Technology, Soochow University, Suzhou 215006
  • Received:2019-05-14 Revised:2019-09-19 Online:2020-01-20 Published:2020-01-20
  • Contact: KONG Fang, E-mail: kongfang(at)suda.edu.cn

摘要:

事件指代消解任务比实体指代消解难度大, 主要原因为事件描述在非结构化文本中分布稀疏, 且不具备同指关系的单链占很大比例, 同时事件自身承载的语义信息比实体更加丰富。为了准确地抽取文本中的同指事件, 针对以上特点, 提出一种融合篇章表征的事件指代消解模型。该模型通过CRF有效地区分非事件句、单链以及同指链, 同时利用分层注意力机制捕捉句子级别和篇章级别的重要信息。在KBP2015和2016数据集上进行的事件指代消解实验验证了该模型的有效性, 在CoNLL评测标准下F1值达到43.07%。

关键词: 事件指代消解, 篇章表征, 分层注意力机制

Abstract:

Event coreference resolution is more difficult than entity coreference resolution. The main reason is that the event mentions in the unstructured texts are sparse, and most of them do not have the coreference relationship, at the same time, the semantic information carried by the event itself is richer than entity. In order to accurately extract the coreferential events in the text, for the above characteristics of event coreference resolution, an event coreference resolution platform with text representation is proposed. This platform effectively distinguishes non-event mention, single-chain and coreference event mention through CRF, and uses hierarchical attention mechanism to capture important information at sentence level and text level. Experiments on KBP2015 and 2016 datasets verify the validity of the model, and the CoNLL evaluation standard reaches 43.07% of the F1 value.

Key words: event coreference resolution, document representation, hierarchical attention mechanism