Detection of loan words in uyghur texts
文献类型:期刊论文
作者 | Mi, Chenggang2; Yang, Yating2![]() ![]() |
刊名 | Communications in Computer and Information Science
![]() |
出版日期 | 2014 |
卷号 | 496期号:12页码:103-112 |
关键词 | Loan Words Detection Phonetic Similarity Uyghur Perceptron-based Model |
英文摘要 | For low-resource languages like Uyghur, data sparseness is always a serious problem in related information processing, especially in some tasks based on parallel texts. To enrich bilingual resources, we detect Chinese and Russian loan words from Uyghur texts according to phonetic similarities between a loan word and its corresponding donor language word. In this paper, we propose a novel approach based on perceptron model to discover loan words from Uyghur texts, which consider the detection of loan words in Uyghur as a classification procedure. The experimental results show that our method is capable of detecting the Chinese and Russian loan words in Uyghur Texts effectively |
源URL | [http://ir.xjipc.cas.cn/handle/365002/4912] ![]() |
专题 | 新疆理化技术研究所_多语种信息技术研究室 |
作者单位 | 1.University of Chinese Academy of Sciences, Beijing, China 2.Xinjiang Technical Institute of Physics & Chemistry of Chinese Academy of Sciences, Urumqi, Xinjiang, China |
推荐引用方式 GB/T 7714 | Mi, Chenggang,Yang, Yating,Wang, Lei,et al. Detection of loan words in uyghur texts[J]. Communications in Computer and Information Science,2014,496(12):103-112. |
APA | Mi, Chenggang,Yang, Yating,Wang, Lei,Li, Xiao,&Dalielihan, Kamali.(2014).Detection of loan words in uyghur texts.Communications in Computer and Information Science,496(12),103-112. |
MLA | Mi, Chenggang,et al."Detection of loan words in uyghur texts".Communications in Computer and Information Science 496.12(2014):103-112. |
入库方式: OAI收割
来源:新疆理化技术研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。