Acta Scientiarum Naturalium Universitatis Pekinensis ›› 2019, Vol. 55 ›› Issue (1): 91-97.DOI: 10.13209/j.0479-8023.2018.060

Previous Articles     Next Articles

Feature Learning by Distant Supervision for Fine-Grained Implicit Discourse Relation Identification

TANG Yuting, LI Yanbin, LIU Lu, YU Zhonghua, CHEN Li   

  1. Department of Computer Science, Sichuan University, Chengdu 610065
  • Received:2018-04-15 Revised:2018-08-20 Online:2019-01-20 Published:2019-01-20
  • Contact: CHEN Li, E-mail: cl(at)scu.edu.cn

面向细粒度隐式篇章关系识别的远距离监督特征学习算法

唐裕婷, 李艳斌, 刘露, 于中华, 陈黎   

  1. 四川大学计算机学院, 成都 610065
  • 通讯作者: 陈黎, E-mail: cl(at)scu.edu.cn
  • 基金资助:
    四川省科技支撑项目(2014GZ0063)资助

Abstract:

Aiming at the identification of Chinese fine-grained implicit discourse relation and taking the directionality characteristic in account, the authors propose a feature learning algorithm based on the distant supervision to label explicit discourse data automatically. The relative position information between conjunction and words are applied to train the intensive word representation. Then the rhetorical function of words and the directionality of relations are encoded into the representation of intensive words, which is applied to the relation classification of fine-grained implicit discourses. From the experimental studies of the proposed approach, the classification accuracy reaches 49.79%, which are better than those approaches neglecting the directionality of discourse relations.

Key words: fine-grained, implicit discourse relation, Chinese, word representation, directionality

摘要:

针对中文细粒度隐式篇章关系识别进行研究。考虑细粒度篇章关系的方向性特点, 提出一种基于远距离监督的特征学习算法。该算法使用远距离监督的方法, 自动标注显式篇章数据, 然后利用词与连词之间的相对位置信息, 训练各个词的词表达, 将词的修辞功能以及关系的方向性编码到密集词表达中, 将这样的词表达应用到细粒度隐式篇章关系分类器。实验结果表明, 在细粒度隐式篇章关系识别任务中, 该方法的分类准确率达到49.79%, 比未考虑篇章关系方向性的方法有较大程度的提高。

关键词: 细粒度, 隐式篇章关系, 中文, 词表达, 方向性