Acta Scientiarum Naturalium Universitatis Pekinensis ›› 2018, Vol. 54 ›› Issue (2): 236-242. DOI: 10.13209/j.0479-8023.2017.151

Regular Path Queries on Large Graph Data

ZHANG Yu, ZENG Li, ZOU Lei   

  1. School of Electronics and Computer Science, Peking University, Beijing 100871
  • Received: 2017-06-05 Revised: 2017-09-05 Online: 2018-03-20 Published: 2018-03-20
  • Contact: ZOU Lei, E-mail: zoulei(at)pku.edu.cn

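Chinese title: 大规模图数据的正则路径查询 (Regular Path Queries on Large-Scale Graph Data)

ZHANG Yu (张雨), ZENG Li (曾立), ZOU Lei (邹磊)

  1. School of Electronics and Computer Science, Peking University, Beijing 100871
  • Corresponding author: ZOU Lei (邹磊), E-mail: zoulei(at)pku.edu.cn
  • Funding:
    Supported by the National Key Basic Research and Development Program of China (2016YFB1000603)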

Abstract:

The authors propose a divide-and-conquer solution built on gStore, an existing RDF query engine, for processing property path queries on large-scale graph data. The regular expression in a path query is partitioned into subqueries: strings of fixed length are preprocessed, and the remaining subqueries, which contain wildcards, are then handled by search at query time. The method prunes a large share of unpromising search at an early stage and solves the regular path matching problem efficiently on large-scale graph data. Experiments on DBpedia and LUBM confirm that the method answers queries within seconds on average.

Key words: property path, regular expression, SPARQL, RDF, gStore
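
To make the idea in the abstract concrete, the following is a minimal Python sketch of evaluating a partitioned path query on a toy graph. It is an illustration under stated assumptions, not the authors' implementation and not gStore's API: the triples, the segment encoding (`"seq"` for a fixed-length predicate sequence, `"star"` for zero or more repetitions of one predicate, standing in for the wildcard parts), and all function names are hypothetical.

```python
from collections import defaultdict, deque

# Toy RDF-style graph as (subject, predicate, object) triples (hypothetical data).
TRIPLES = [
    ("a", "knows", "b"),
    ("b", "knows", "c"),
    ("c", "worksAt", "u"),
    ("u", "partOf", "v"),
    ("v", "partOf", "w"),
]


def build_adjacency(triples):
    """Map predicate -> subject -> set of objects."""
    adj = defaultdict(lambda: defaultdict(set))
    for s, p, o in triples:
        adj[p][s].add(o)
    return adj


def build_sequence_index(adj, sequences):
    """Precompute start -> end nodes for each fixed-length predicate sequence."""
    index = {}
    for seq in sequences:
        pairs = {}
        for start in list(adj[seq[0]]):
            frontier = {start}
            for p in seq:
                frontier = {o for n in frontier for o in adj[p].get(n, ())}
            if frontier:
                pairs[start] = frontier
        index[tuple(seq)] = pairs
    return index


def star_closure(adj, predicate, starts):
    """Nodes reachable from `starts` via zero or more `predicate` edges (BFS)."""
    seen = set(starts)
    queue = deque(starts)
    while queue:
        node = queue.popleft()
        for nxt in adj[predicate].get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def evaluate(adj, index, segments, start):
    """Evaluate a partitioned path query from a single start node."""
    frontier = {start}
    for kind, value in segments:
        if kind == "seq":    # fixed-length part: look up precomputed pairs
            pairs = index[tuple(value)]
            frontier = {e for n in frontier for e in pairs.get(n, ())}
        elif kind == "star":  # wildcard part: search at query time
            frontier = star_closure(adj, value, frontier)
        else:
            raise ValueError(f"unknown segment kind: {kind}")
        if not frontier:      # stop early once no partial match survives
            break
    return frontier


# Path expression knows/knows/worksAt/partOf*, split into two segments.
adj = build_adjacency(TRIPLES)
fixed = ["knows", "knows", "worksAt"]
index = build_sequence_index(adj, [fixed])
query = [("seq", fixed), ("star", "partOf")]
print(sorted(evaluate(adj, index, query, "a")))  # ['u', 'v', 'w']
```

In this sketch the fixed-length segment is answered by a dictionary lookup, so start nodes with no precomputed match (for example `"b"`) are discarded before any graph search runs; only the starred segment needs a traversal.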

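Abstract (translated from the Chinese):

A divide-and-conquer method based on gStore, a large-scale RDF query engine, is proposed that partitions the regular expression of a query. Building on the preprocessing of fixed-length strings, the method then handles on-the-fly path queries that contain wildcards. It prunes much of the search space at the initial stage, which improves query performance, solves the problem of matching regular expressions against paths in an RDF graph, and ensures support for large-scale graph data. Experiments on DBpedia and LUBM show that the method returns path query results within seconds on average.

Key words (translated from the Chinese): property path, regular expression, SPARQL, RDF, gStore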

CLC Number: