Acta Scientiarum Naturalium Universitatis Pekinensis
Previous Articles
LEI Ming,WANG Jianyong,ZHAO Jianghua,SHAN Songwei,CHEN Baojue
Received:
Online:
Published:
雷鸣,王建勇,赵江华,单松巍,陈葆珏
Abstract: With the rapid growing of WWW, significant progress has been made in search engine research area. The evolvement of search engine and the system architecture for the 3rd generation are reviewed. More emphasis will be given on some core technologies related to search engines of the 3rd generation. For example, the massive and efficient web-crawling technology, the method of hyper-link analysis, and the user behavior analyzing technology will be described in detail. In addition, it is also presented the recent research progress of WebGather, which is a typical search engine of 3rd generation. Several research hotspots for future search engine systems are pointed out in the conclusion.
Key words: World-wide Web, search engine, information retrieval, hyper-link analysis, user behavior analyzing, World-wide Web, search engine, information retrieval, hyper-link analysis, user behavior analyzing
摘要: 论述了三代搜索引擎的发展,着重介绍了第三代搜索引擎的体系结构,详细讨论了该搜索引擎的几个核心技术——大规模搜集技术、超链分析技术和用户行为分析技术。介绍了作者参与研发的第三代搜索引擎——“天网”的研究进展,并指出了搜索引擎未来几个研究的热点方向。
关键词: WWW, 搜索引擎, 信息检索, 超链分析, 用户行为分析, WWW, 搜索引擎, 信息检索, 超链分析, 用户行为分析
CLC Number:
TP391
TP393.4
LEI Ming,WANG Jianyong,ZHAO Jianghua,SHAN Songwei,CHEN Baojue. The 3rd Generation Search Engine and WebGather Version 2.0[J]. Acta Scientiarum Naturalium Universitatis Pekinensis.
雷鸣,王建勇,赵江华,单松巍,陈葆珏. 第三代搜索引擎与天网二期[J]. 北京大学学报(自然科学版).
Add to citation manager EndNote|Ris|BibTeX
URL: https://xbna.pku.edu.cn/EN/
https://xbna.pku.edu.cn/EN/Y2001/V37/I5/734