中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Dictionary-Based Classical Chinese Word Segmentation and Its Application on Imperial Edicts of Jin Dynasties

文献类型:会议论文

作者Xiong, Huan2,3; Wu, Gengxuan1; Xue, Shujie1; Li, Hua1; Zhu, Tingshao2,3
出版日期2022
会议名称Human Centered Computing
会议日期不详
会议地点不详
通讯作者邮箱tszhu@psych.ac.cn (zhu, tingshao)
关键词Classical Chinese word segmentation Imperial edicts Psycholinguistic Word frequency analysis
页码Volume 13795 LNCS, Pages 153-160
英文摘要

Big data technology can play a significant role in exploring and analyzing classical Chinese literature and in enhancing our understanding and promotion of traditional culture. Analyzing psycholinguistic words used in ancient people’s self-expression texts is a good way to understand their psychological state. Based on the classical Chinese segmentation methods used by such dictionaries as CCIDict and CC-LIWC, this paper proposed a word segmentation algorithm that can better cover the ancient Chinese vocabulary used in imperial edicts. We used this algorithm to calculate the psycholinguistic words in imperial edicts of the Western and Eastern Jin Dynasties (265–420). We firstly collected 613 edicts from 18 emperors of the Western and Eastern Jin Dynasties, with a total word count of more than 45,000. After being analyzed and calculated by the dictionary-based classical Chinese word segmentation algorithm, all these words were divided into 78 categories of psycholinguistic words. By comparing the frequencies of such word categories in imperial edicts of the Western Jin (265–317) and the Eastern Jin (317–420), we found significant differences in the following five word categories: personal pronouns (p = 0.027), modal particles (p = 0.034), social process words (p = 0.016), difference words (p = 0.016), and time words (p = 0.043). Based on differences in these five categories, we analyzed the psychological changes of the Western Jin and Eastern Jin emperors. This paper thereby verified the applicability and feasibility of the dictionary-based classical Chinese word segmentation algorithm.

收录类别EI
会议录Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
源URL[http://ir.psych.ac.cn/handle/311026/44633]  
专题中国科学院心理研究所
作者单位1.Institute of Qilu Culture, Shandong Normal University, Jinan, China
2.Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
3.Institute of Psychology, Chinese Academy of Sciences, Beijing, China
推荐引用方式
GB/T 7714
Xiong, Huan,Wu, Gengxuan,Xue, Shujie,et al. Dictionary-Based Classical Chinese Word Segmentation and Its Application on Imperial Edicts of Jin Dynasties[C]. 见:Human Centered Computing. 不详. 不详.

入库方式: OAI收割

来源:心理研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。