Robustness of Chinese Machine Reading Comprehension

doi:10.13209/j.0479-8023.2020.088

Acta Scientiarum Naturalium Universitatis Pekinensis ›› 2021, Vol. 57 ›› Issue (1): 16-22.DOI: 10.13209/j.0479-8023.2020.088

Previous Articles Next Articles

Robustness of Chinese Machine Reading Comprehension

LI Yeqiu¹, TANG Hongxuan¹, QIAN Jin¹, ZOU Bowei^1,2, HONG Yu^1,†

1. School of Computer Science and Technology, Soochow University, Suzhou 215000 2. Institute for Infocomm Research, Singapore 138632

Received:2020-06-08 Revised:2020-08-14 Online:2021-01-20 Published:2021-01-20
Contact: HONG Yu, E-mail: tianxianer(at)gmail.com

中文机器阅读理解的鲁棒性研究

李烨秋¹, 唐竑轩¹, 钱锦¹, 邹博伟^1,2, 洪宇^1,†

1. 苏州大学计算机科学与技术学院, 苏州 215000 2. 新加坡资讯通信研究院, 新加坡138632

通讯作者: 洪宇, E-mail: tianxianer(at)gmail.com
基金资助:
国家自然科学基金(61703293, 61672368, 61672367)和江苏高校优势学科建设工程项目资助

Abstract

Abstract:

In order to better evaluate the robustness of Machine Reading Comprehension (MRC) models, this paper builds three test sets from Dureader by automatically extracting and manually annotating, consisting of oversensitivity, over-stability, and generalization. In addition, this paper proposes a multi-task learning framework with answer extraction task and masked position prediction task. Experimental results demonstrate that proposed method gains significant robustness improvements and show the effectiveness of the three test sets on evaluating the robustness of MRC models.

Key words: machine reading comprehension, robustness, Chinese corpus

摘要：

为了更好地评价阅读理解模型的鲁棒性, 基于Dureader数据集, 通过自动抽取和人工标注的方法, 对过敏感、过稳定和泛化3个问题分别构建测试数据集。还提出基于答案抽取和掩码位置预测的多任务学习方法。实验结果表明, 所提方法能显著地提高阅读理解模型的鲁棒性, 所构建的测试集能够对模型的鲁棒性进行有效评估。

关键词: 机器阅读理解, 鲁棒性, 中文语料库

LI Yeqiu, TANG Hongxuan, QIAN Jin, ZOU Bowei, HONG Yu. Robustness of Chinese Machine Reading Comprehension[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2021, 57(1): 16-22.

李烨秋, 唐竑轩, 钱锦, 邹博伟, 洪宇. 中文机器阅读理解的鲁棒性研究[J]. 北京大学学报自然科学版, 2021, 57(1): 16-22.

Add to citation manager EndNote|Ris|BibTeX

URL: https://xbna.pku.edu.cn/EN/10.13209/j.0479-8023.2020.088

https://xbna.pku.edu.cn/EN/Y2021/V57/I1/16

[1]	ZHAO Shuaihua, LI Yanyan, CAO Jian, CAO Xixin. A BFGS-Corrected Gauss-Newton Solver for Bundle Adjustment [J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2020, 56(6): 1013-1019.
[2]	YAN Wei,WANG Yuchen,WANG Zhenyu,SHI Guangyi. Design of ESD Protection for Low Noise Amplifier through Matching Network [J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2014, 50(4): 745-752.
[3]	ZHU Xinshan,TANG Zhi. Adaptive Watermarking Based on Localized Perceptual Quality Evaluation [J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2008, 44(1): 77-86.

Robustness of Chinese Machine Reading Comprehension

中文机器阅读理解的鲁棒性研究

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 3

Recommended Articles 0

Metrics