Acta Scientiarum Naturalium Universitatis Pekinensis ›› 2023, Vol. 59 ›› Issue (4): 563-568.DOI: 10.13209/j.0479-8023.2022.116

Previous Articles     Next Articles

Consistency Assessment of Remote Sensing Dataset Based on Deep Learning

YAO Zhaoyuan1, MA Lei2, WAN Wei1, SONG Benqin2, WANG Weihong2, DENG Jiwei3, XIAO Lei1, JI Rui1, WEI Zhihao1, CUI Yaokui1,†

  1. 1. School of Earth and Space Sciences, Peking University, Beijing 100871 2. China Academic of Electronics and Information Technology, Beijing 100041 3. China Railway Design Corporation, Tianjin 300251
  • Received:2022-08-09 Revised:2022-10-12 Online:2023-07-20 Published:2023-07-20
  • Contact: CUI Yaokui, E-mail: yaokuicui(at)


姚照原1, 马磊2, 万玮1, 宋本钦2, 王卫红2, 邓继伟3, 肖蕾1, 冀锐1, 魏之皓1, 崔要奎1,†

  1. 1. 北京大学地球与空间科学学院, 北京 100871 2. 中国电子科技集团公司电子科学研究院, 北京 100041 3. 中国铁路设计集团有限公司, 天津 300251
  • 通讯作者: 崔要奎, E-mail: yaokuicui(at)


The current deep learning studies on remote sensing mainly focused on deep learning algorithms rather than deep learning datasets. This study proposes a method of dataset consistency assessment based on deep learning, in which the similarity among various types of ships from different sources (such as satellite remote sensing, 3D modeling, and web crawler) is evaluated and then used to characterize the consistency of the ship dataset. The results show that when the consistency of the dataset is the highest, the consistency by the proposed method is 1. When the consistency of the datasets is gradient, the consistency also changes. Images with similar data sources can be considered as same class, and images with greatly differences cannot be merged. Thus, the proposed method can assess the dataset consistency properly, and provide a suggestion to build an image dataset for deep learning training. 

Key words: deep learning, remote sensing, dataset, consistency


现有基于深度学习的遥感研究集中在算法开发方面, 缺乏对遥感样本库的研究。针对此问题,提出一种基于深度学习的样本库一致性评估方法, 对卫星遥感、三维建模和网络爬虫等不同来源的舰船样本库中各类样本之间的相似性进行评估。结果表明: 1) 在样本库一致性最高时, 该方法得到的一致性为1; 2) 当样本库一致性呈梯度变化时, 一致性评价结果随样本库一致性的变化而变化; 3) 数据来源相似的样本库能够进行合并, 制作方式差异较大的样本库不能合并。因此, 所提方法能够准确地评估样本库的一致性, 可为深度学习训练时样本库的选择提供参考。

关键词: 深度学习, 遥感影像, 样本库, 一致性评价