Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
Extraction and Analysis of Chinese-Tibetan New Words from News Texts
PANG Xian, CHEN Bo, ZHAO Xiaobing
Acta Scientiarum Naturalium Universitatis Pekinensis    2025, 61 (1): 45-52.   DOI: 10.13209/j.0479-8023.2025.001
Abstract1412)   HTML    PDF(pc) (652KB)(3401)       Save
This paper proposes an effective unsupervised extraction method for news text. Combined with the unsupervised TopWORDS algorithm and the word segmentation tool PKUSEG, and aided by the heuristic word extraction method, the annual new words are extracted from Chinese and Tibetan news texts. A total of 606 new words in Chinese and 664 new words in Tibetan are extracted for 2022. In terms of efficiency, this method reduces the workload of manual selection and significantly improves the efficiency of new words extraction. In terms of effect, compared with the 2022 Chinese new words published in the “Language Situation in China: 2023”, the new words extracted by this method have obvious advantages in terms of number and language. In addition, this paper aligns the Chinese and Tibetan new words. A case study is engaged from the perspective of the development and use of new words.
Related Articles | Metrics | Comments0
Research on Continuity of Multi-Scale Space-Filling Curves
ZHAI Weixin, CHEN Bo, TONG Xiaochong, CHENG Chengqi
Acta Scientiarum Naturalium Universitatis Pekinensis    2018, 54 (2): 331-335.   DOI: 10.13209/j.0479-8023.2017.147
Abstract3202)   HTML83)    PDF(pc) (343KB)(1416)       Save

Multi-scale two-dimensional Hilbert curve is constructed, and specially the scale dimension is treated as the third dimension. The new structure embodies the multi-level characteristics and overcomes the drawback of Z sequence coding pattern, thus improving the continuity of the curve and advancing the spatial retrieval efficiency. The authors conducted two kinds of experiments based on the quad-tree model to compare the retrieval efficiency of Hilbert curve and Z curve. The consequence indicates that the multi-scale Hilbert curve performs better than Z curve, and the improvement on different data distributions vary from 15% to 30%.

Related Articles | Metrics | Comments0
Study on Globe Spatial Grid Reference System Construction
CHENG Chengqi, WU Feilong, WANG Rong, QIN Yonggang, TONG Xiaochong, CHEN Bo
Acta Scientiarum Naturalium Universitatis Pekinensis    2016, 52 (6): 1041-1049.   DOI: 10.13209/j.0479-8023.2016.051
Abstract4576)   HTML    PDF(pc) (1724KB)(5611)       Save

To supplement the deficiency of the latitude and longitude existed as location code, such as complex description, non-regional characteristics and complex computation, a globe spatial grid reference system is constructed based on GeoSOT from Peking University. The grid system, built from a perfect quadtree with one degree, one minute and one second grid, could be fit for air-earth joint action. It designs a simple and practical location coding method, which also supports distance simple calculation. It could realize multi-source spatial data integrated retrieval, and develop methods of efficient code operation, framework of spatial computing, and 3D-earth grid system. Globe Spatial Grid Reference System will definitely play an important role in the future of big spatial data applications.

Related Articles | Metrics | Comments0