Acta Scientiarum Naturalium Universitatis Pekinensis ›› 2018, Vol. 54 ›› Issue (2): 236-242.DOI: 10.13209/j.0479-8023.2017.151
Previous Articles Next Articles
ZHANG Yu, ZENG Li, ZOU Lei†
Received:
Revised:
Online:
Published:
Contact:
张雨, 曾立, 邹磊†
通讯作者:
基金资助:
Abstract:
The authors propose a divide-and-conquer based solution over gStore, an existing RDF search engine, to process property path query on large scale graph data. In proposed solution, regular expression is partitioned within the path query and then preprocess strings of fixed length. The authors handle the search over those subqueries of wildcards. The proposed method is able to filter lots of unpromising search and efficient on solving the regular path match problem over large scale graph data. The corresponding experiments on DBpedia and LUBM confirm that proposed method can response for queries in seconds on average.
Key words: property path, regular expression, SPARQL, RDF, gStore
摘要:
提出一种基于大规模RDF查询引擎gStore的分而治之方法, 对查询的正则表达式进行划分。在对定长字符串预处理的基础上, 再处理带有通配符的即时路径查询。该方法能够在开始阶段对搜索空间进行大量剪枝, 可以提高查询性能, 还能很好地解决正则表达式在RDF图中的路径匹配问题, 保证了对大规模图数据的支持。在DBpedia和LUBM上的实验表明, 该方法能够平均在秒级时间内返回路径查询结果。
关键词: 属性路径, 正则表达式, SPARQL, RDF, gStore
CLC Number:
TP391
ZHANG Yu, ZENG Li, ZOU Lei. Regular Path Queries on Large Graph Data[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2018, 54(2): 236-242.
张雨, 曾立, 邹磊. 大规模图数据的正则路径查询[J]. 北京大学学报(自然科学版), 2018, 54(2): 236-242.
Add to citation manager EndNote|Ris|BibTeX
URL: https://xbna.pku.edu.cn/EN/10.13209/j.0479-8023.2017.151
https://xbna.pku.edu.cn/EN/Y2018/V54/I2/236