中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Detection of loan words in uyghur texts

文献类型:期刊论文

作者Mi, Chenggang2; Yang, Yating2; Wang, Lei2; Li, Xiao2; Dalielihan, Kamali2
刊名Communications in Computer and Information Science
出版日期2014
卷号496期号:12页码:103-112
关键词Loan Words Detection Phonetic Similarity Uyghur Perceptron-based Model
英文摘要

For low-resource languages like Uyghur, data sparseness is always a serious problem in related information processing, especially in some tasks based on parallel texts. To enrich bilingual resources, we detect Chinese and Russian loan words from Uyghur texts according to phonetic similarities between a loan word and its corresponding donor language word. In this paper, we propose a novel approach based on perceptron model to discover loan words from Uyghur texts, which consider the detection of loan words in Uyghur as a classification procedure. The experimental results show that our method is capable of detecting the Chinese and Russian loan words in Uyghur Texts effectively

源URL[http://ir.xjipc.cas.cn/handle/365002/4912]  
专题新疆理化技术研究所_多语种信息技术研究室
作者单位1.University of Chinese Academy of Sciences, Beijing, China
2.Xinjiang Technical Institute of Physics & Chemistry of Chinese Academy of Sciences, Urumqi, Xinjiang, China
推荐引用方式
GB/T 7714
Mi, Chenggang,Yang, Yating,Wang, Lei,et al. Detection of loan words in uyghur texts[J]. Communications in Computer and Information Science,2014,496(12):103-112.
APA Mi, Chenggang,Yang, Yating,Wang, Lei,Li, Xiao,&Dalielihan, Kamali.(2014).Detection of loan words in uyghur texts.Communications in Computer and Information Science,496(12),103-112.
MLA Mi, Chenggang,et al."Detection of loan words in uyghur texts".Communications in Computer and Information Science 496.12(2014):103-112.

入库方式: OAI收割

来源:新疆理化技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。