Multimodal Emotion Recognition with Auxiliary Sentiment Information

Acta Scientiarum Naturalium Universitatis Pekinensis, 2020, Vol. 56, Issue (1): 75-81. DOI: 10.13209/j.0479-8023.2019.105

WU Liangqing, LIU Qiyuan, ZHANG Dong^†, WANG Jiancheng, LI Shoushan, ZHOU Guodong

School of Computer Science & Technology, Soochow University, Suzhou 215006

Received: 2019-05-22 Revised: 2019-09-19 Online: 2020-01-20 Published: 2020-01-20
Contact: ZHANG Dong, E-mail: dzhang17(at)stu.suda.edu.cn
Funding: Supported by the National Natural Science Foundation of China (61331011, 61375073)

Abstract:

Unlike previous studies of emotion analysis that use text alone, this paper performs emotion recognition on multimodal data (text and audio). To exploit the characteristics of both modalities, we propose a novel joint learning framework in which multimodal emotion classification is the main task and multimodal sentiment classification is the auxiliary task, so that sentiment information helps improve emotion recognition. First, private neural layers encode the text and audio modalities of the main task separately to learn uni-modal independent emotion representations. Second, the shared neural layers of the auxiliary task produce the auxiliary emotion representations for the main task as well as the complete uni-modal sentiment representations for the auxiliary task. For each modality, the auxiliary emotion representation is then combined with the uni-modal independent representation to obtain the complete uni-modal emotion representation of the main task. Finally, a self-attention mechanism captures the multimodal interactions in each task, fusing the text and audio representations into the final multimodal emotion and sentiment representations. Experimental results on a multimodal sentiment analysis dataset show that auxiliary sentiment information substantially improves the performance of emotion classification, and that the performance of sentiment classification also improves to some extent.

Key words: multimodal, emotion recognition, joint learning, sentiment analysis
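
The data flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the random dense layers standing in for trained encoders, the feature and hidden sizes, concatenation as the way the two representations are "combined", the parameter-free dot-product self-attention, and the mean pooling are all assumptions made for illustration, and every function and variable name is hypothetical. Classification heads and training are omitted.

```python
import math
import random

random.seed(0)

def make_layer(n_in, n_out):
    """Random weights for a toy dense layer (assumed stand-in for a trained encoder)."""
    return [[random.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]

def dense(weights, x):
    """Compute tanh(W x) for a single input vector x."""
    return [math.tanh(sum(w * v for w, v in zip(row, x))) for row in weights]

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention_fuse(reps):
    """Scaled dot-product self-attention over the modality vectors, then mean pooling.

    Assumption: no learned query/key/value projections are used here.
    """
    d = len(reps[0])
    attended = []
    for query in reps:
        scores = [sum(a * b for a, b in zip(query, key)) / math.sqrt(d) for key in reps]
        weights = softmax(scores)
        attended.append([sum(w * vec[i] for w, vec in zip(weights, reps)) for i in range(d)])
    return [sum(vec[i] for vec in attended) / len(attended) for i in range(d)]

def forward(x, private, shared):
    """x, private and shared are dicts keyed by modality name."""
    emotion_reps, sentiment_reps = [], []
    for modality in x:
        # Private layer of the main task: uni-modal independent emotion representation.
        independent = dense(private[modality], x[modality])
        # Shared layer of the auxiliary task: serves as the auxiliary emotion
        # representation and as the uni-modal sentiment representation.
        auxiliary = dense(shared[modality], x[modality])
        # Assumed combination: concatenate the two vectors.
        emotion_reps.append(independent + auxiliary)
        sentiment_reps.append(auxiliary)
    # One fusion per task.
    return self_attention_fuse(emotion_reps), self_attention_fuse(sentiment_reps)

FEATURE_DIM, HIDDEN_DIM = 6, 4  # hypothetical sizes
modalities = ("text", "audio")
private = {m: make_layer(FEATURE_DIM, HIDDEN_DIM) for m in modalities}
shared = {m: make_layer(FEATURE_DIM, HIDDEN_DIM) for m in modalities}
x = {m: [random.uniform(-1, 1) for _ in range(FEATURE_DIM)] for m in modalities}

emotion_vec, sentiment_vec = forward(x, private, shared)
print(len(emotion_vec), len(sentiment_vec))  # prints: 8 4
```

In this sketch the emotion representation is twice as wide as the sentiment representation because it concatenates the private and shared outputs. The shared layers feed both tasks, which is how sentiment supervision would be able to influence the emotion representation during joint training.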

WU Liangqing, LIU Qiyuan, ZHANG Dong, WANG Jiancheng, LI Shoushan, ZHOU Guodong. Multimodal Emotion Recognition with Auxiliary Sentiment Information[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2020, 56(1): 75-81.

Link to this article: https://xbna.pku.edu.cn/CN/10.13209/j.0479-8023.2019.105

https://xbna.pku.edu.cn/CN/Y2020/V56/I1/75
