北京大学学报自然科学版 ›› 2017, Vol. 53 ›› Issue (2): 344-352. DOI: 10.13209/j.0479-8023.2016.090


基于地理关联度和证据理论的地名消歧方法研究

王星光, 张瑞洁, 张毅

  1. 北京大学遥感与地理信息系统研究所, 北京 100871
  • 收稿日期:2015-10-12 修回日期:2016-01-06 出版日期:2017-03-20 发布日期:2017-03-20
  • 通讯作者: 张毅
  • 基金资助:
    国家自然科学基金(41271385)资助

Toponym Resolution Based on Geo-relevance and D-S Theory

Xingguang WANG, Ruijie ZHANG, Yi ZHANG

  1. Institute of Remote Sensing and Geographical Information Systems, Peking University, Beijing 100871
  • Received:2015-10-12 Revised:2016-01-06 Online:2017-03-20 Published:2017-03-20
  • Contact: Yi ZHANG

摘要:

针对目前地名消歧方法普遍缺乏理论基础和统一形式化方法的现状, 以地理学第一定律为理论基础, 使用地理关联度形式化地理实体之间的邻近性。在此基础上, 提出基于证据理论的地名消歧计算模型, 用于表示与合成上下文中共现的地名证据。该模型模拟人类阅读和理解文本中时空语义的认知过程, 并为地名消歧处理提供一个统一的易扩展的形式化框架。最后, 给出本文地名消歧方法的实现算法及其实验评估。结果显示, 算法综合性能指标F1达到89.60%, 取得较好的实验效果。

关键词: 地理信息检索(GIR), 地名消歧, 地理关联度, 证据理论

Abstract:

Previous research on toponym resolution largely lacks a theoretical basis and a unified formal method. To address this, a concept of geo-relevance based on Tobler's First Law of Geography is proposed to formalize the proximity among geographic entities. On this basis, a toponym resolution computing model based on Dempster-Shafer (D-S) theory is proposed to represent and combine the evidence provided by toponyms that co-occur in context. The model simulates the human cognitive process of reading and understanding spatiotemporal semantics in text, and provides a unified, easily extensible formal framework for toponym resolution. Finally, an implementation of the method and its experimental evaluation are presented; the algorithm achieves an overall F1 score of 89.60%.

Key words: geographic information retrieval (GIR), toponym resolution, geo-relevance, Dempster-Shafer theory
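
The abstract names D-S theory as the mechanism for combining evidence from co-occurring toponyms. As a rough illustration only, the following sketch applies the standard Dempster rule of combination to two mass functions over the candidate referents of an ambiguous place name. The candidate labels, the mass values, and the function name `combine` are assumptions made for this example; the abstract does not say how the paper derives masses from geo-relevance, so this is not the authors' model.

```python
from itertools import product


def combine(m1, m2):
    """Combine two mass functions with Dempster's rule.

    Each mass function maps a frozenset of candidate referents (a focal
    element) to its mass; the masses within one function sum to 1.
    """
    combined = {}
    conflict = 0.0
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        common = a & b
        if common:
            combined[common] = combined.get(common, 0.0) + mass_a * mass_b
        else:
            # Mass assigned to contradictory hypotheses is discarded
            # and later normalized away.
            conflict += mass_a * mass_b
    norm = 1.0 - conflict
    if norm <= 1e-12:
        raise ValueError("evidence is in total conflict and cannot be combined")
    return {focal: mass / norm for focal, mass in combined.items()}


# Hypothetical candidates for one ambiguous toponym (labels are illustrative).
IL = frozenset({"Springfield_IL"})
MA = frozenset({"Springfield_MA"})
FRAME = IL | MA  # mass on the whole frame expresses ignorance

# Made-up masses standing in for evidence from two co-occurring toponyms.
evidence_1 = {IL: 0.7, FRAME: 0.3}
evidence_2 = {IL: 0.6, MA: 0.1, FRAME: 0.3}

result = combine(evidence_1, evidence_2)
best = max((f for f in result if len(f) == 1), key=result.get)
```

In this toy setup both pieces of evidence favor the same candidate, so the combined mass on it grows beyond either input, while the mass left on the whole frame shrinks. A real system would need a principled mapping from geo-relevance to mass values, which this sketch does not attempt.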
