Dictionary-Based Classical Chinese Word Segmentation and Its Application on Imperial Edicts of Jin Dynasties
文献类型:会议论文
作者 | Xiong, Huan2,3; Wu, Gengxuan1; Xue, Shujie1; Li, Hua1; Zhu, Tingshao2,3![]() |
出版日期 | 2022 |
会议名称 | Human Centered Computing |
会议日期 | 不详 |
会议地点 | 不详 |
通讯作者邮箱 | tszhu@psych.ac.cn (zhu, tingshao) |
关键词 | Classical Chinese word segmentation Imperial edicts Psycholinguistic Word frequency analysis |
页码 | Volume 13795 LNCS, Pages 153-160 |
英文摘要 | Big data technology can play a significant role in exploring and analyzing classical Chinese literature and in enhancing our understanding and promotion of traditional culture. Analyzing psycholinguistic words used in ancient people’s self-expression texts is a good way to understand their psychological state. Based on the classical Chinese segmentation methods used by such dictionaries as CCIDict and CC-LIWC, this paper proposed a word segmentation algorithm that can better cover the ancient Chinese vocabulary used in imperial edicts. We used this algorithm to calculate the psycholinguistic words in imperial edicts of the Western and Eastern Jin Dynasties (265–420). We firstly collected 613 edicts from 18 emperors of the Western and Eastern Jin Dynasties, with a total word count of more than 45,000. After being analyzed and calculated by the dictionary-based classical Chinese word segmentation algorithm, all these words were divided into 78 categories of psycholinguistic words. By comparing the frequencies of such word categories in imperial edicts of the Western Jin (265–317) and the Eastern Jin (317–420), we found significant differences in the following five word categories: personal pronouns (p = 0.027), modal particles (p = 0.034), social process words (p = 0.016), difference words (p = 0.016), and time words (p = 0.043). Based on differences in these five categories, we analyzed the psychological changes of the Western Jin and Eastern Jin emperors. This paper thereby verified the applicability and feasibility of the dictionary-based classical Chinese word segmentation algorithm. |
收录类别 | EI |
会议录 | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
![]() |
源URL | [http://ir.psych.ac.cn/handle/311026/44633] ![]() |
专题 | 中国科学院心理研究所 |
作者单位 | 1.Institute of Qilu Culture, Shandong Normal University, Jinan, China 2.Department of Psychology, University of Chinese Academy of Sciences, Beijing, China 3.Institute of Psychology, Chinese Academy of Sciences, Beijing, China |
推荐引用方式 GB/T 7714 | Xiong, Huan,Wu, Gengxuan,Xue, Shujie,et al. Dictionary-Based Classical Chinese Word Segmentation and Its Application on Imperial Edicts of Jin Dynasties[C]. 见:Human Centered Computing. 不详. 不详. |
入库方式: OAI收割
来源:心理研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。