Multi-task Semantic Matching with Self-supervised Learning

doi:10.13209/j.0479-8023.2021.101

Acta Scientiarum Naturalium Universitatis Pekinensis, 2022, Vol. 58, Issue 1: 83-90. DOI: 10.13209/j.0479-8023.2021.101

CHEN Yuan¹, QIU Xinying¹,²,†

1. School of Information Science and Technology, Guangdong University of Foreign Studies, Guangzhou 510006
2. Guangzhou Key Laboratory of Multilingual Intelligent Processing, Guangdong University of Foreign Studies, Guangzhou 510006

Received: 2021-06-08 Revised: 2021-08-14 Online: 2022-01-20 Published: 2022-01-20
Contact: QIU Xinying, E-mail: xy.qiu(at)foxmail.com

Original Chinese title: 结合自监督学习的多任务文本语义匹配方法

Funding: Supported by the National Social Science Fund of China (17BGL068) and the Natural Science Foundation of Guangdong Province (2018A030313777)

Abstract

In semantic matching, the interaction information between a pair of texts is critical for predicting the pair's matching score. This paper proposes a multi-task learning framework that incorporates self-supervised learning into deep semantic matching. Specifically, a self-supervised model is designed in which the two paired sentences regenerate each other through sequence-to-sequence generation. A multi-task learning framework then integrates the representation from the self-supervised generation task with that of the deep matching model to predict the similarity score of the texts. Experiments with nine deep matching models show that the proposed framework improves the performance of the original deep matching models.
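The abstract does not state the fusion method or the training objective, so the following is only a minimal sketch of how such a multi-task setup could be wired together, not the authors' implementation. It assumes that the two representations are fused by concatenation, that the matching label is binary, and that the total loss is the matching loss plus a weighted sum of the two generation losses (sentence A regenerating sentence B and the reverse). All names (`match_score`, `generation_nll`, `multitask_loss`, `gen_weight`) are hypothetical.

```python
import math


def sigmoid(x):
    """Logistic function mapping a real-valued logit to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def match_score(match_repr, gen_repr, weights, bias=0.0):
    """Score a text pair from two representations.

    Assumption: the matching-model representation and the self-supervised
    generation representation are fused by simple concatenation.
    """
    fused = list(match_repr) + list(gen_repr)
    if len(fused) != len(weights):
        raise ValueError("weights must match the fused representation length")
    logit = sum(w * x for w, x in zip(weights, fused)) + bias
    return sigmoid(logit)


def binary_cross_entropy(pred, label, eps=1e-12):
    """Matching loss for a binary label (assumed: 1 = match, 0 = no match)."""
    pred = min(max(pred, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(label * math.log(pred) + (1 - label) * math.log(1.0 - pred))


def generation_nll(token_probs):
    """Mean negative log-likelihood of the target tokens.

    token_probs holds the probability a sequence-to-sequence decoder
    assigned to each reference token of the sentence being regenerated.
    """
    return -sum(math.log(p) for p in token_probs) / len(token_probs)


def multitask_loss(match_pred, label, probs_a_to_b, probs_b_to_a, gen_weight=0.5):
    """Assumed joint objective: matching loss + gen_weight * generation losses."""
    l_match = binary_cross_entropy(match_pred, label)
    l_gen = generation_nll(probs_a_to_b) + generation_nll(probs_b_to_a)
    return l_match + gen_weight * l_gen


if __name__ == "__main__":
    score = match_score([0.2, -0.1], [0.4, 0.3], [1.0, 1.0, 1.0, 1.0])
    loss = multitask_loss(score, 1, [0.9, 0.8, 0.7], [0.6, 0.9])
    print(f"score={score:.4f} loss={loss:.4f}")
```

In this sketch `gen_weight` controls how strongly the auxiliary generation task shapes training. A value of 0 reduces the objective to the plain matching loss, which gives a convenient baseline for comparison.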

Key words: self-supervised learning, semantic matching, multi-task learning

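Abstract (translated from the Chinese):

Given the importance of text interaction information to semantic matching models, this paper proposes a self-supervised learning method combined with a sequence generation task. The method uses the interaction information of text pairs extracted by the self-supervised model to assist neural semantic matching models through feature augmentation, yielding a multi-task text matching model. Experimental results on nine models show that, after the self-supervised learning module is added, the performance of every original model improves to varying degrees, indicating that the proposed method can effectively improve deep text semantic matching models.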

CHEN Yuan, QIU Xinying. Multi-task Semantic Matching with Self-supervised Learning[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2022, 58(1): 83-90.

URL: https://xbna.pku.edu.cn/EN/10.13209/j.0479-8023.2021.101

https://xbna.pku.edu.cn/EN/Y2022/V58/I1/83

