中国科学院机构知识库网格系统: Research on modern Uyghur Common Word extraction

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Research on modern Uyghur Common Word extraction

文献类型：期刊论文


作者	Azragul1,2,3 ; Murat, Alim1,2,3 ; Xiao, Li1,2
刊名	International Journal of Database Theory and Application
出版日期	2016
卷号	9 期号:5 页码:45-54
ISSN号	20054270
英文摘要	The key techniques and methods for the construction of modern Uyghur language (MUL) corpus are presented. The techniques and methods included MUL corpus, MUL corpus pre-processing, MUL corpus statistics, MUL stemming and MUL data analysis; on the basis of related works we then developed an enhanced modern Uyghur Common Words (UCW)-glossary. We conducted basic inspections upon the words from two perspectives namely the usage frequency and distribution. Upon developing enhanced MUCW glossary we considered the number of word types, word frequency, word length, and the number of texts used as major factors.
源URL	[http://ir.xjipc.cas.cn/handle/365002/7801]
专题	新疆理化技术研究所_多语种信息技术研究室
作者单位	1.830054, China 2.School of Computer Science, Technology Xinjiang Normal University, Urumqi, Xinjiang 3.100049, China 4.University of Chinese Academy of Sciences, Beijing 5.830011, China 6.The Xinjiang Technical Institute of Physics ans Chemistry, CAS, Xinjiang Key Laboratory of Minority Speech and Language Information Processing, Urumqi, Xinjiang
推荐引用方式 GB/T 7714	Azragul1,2,3,Murat, Alim1,2,3,Xiao, Li1,2. Research on modern Uyghur Common Word extraction[J]. International Journal of Database Theory and Application,2016,9(5):45-54.
APA	Azragul1,2,3,Murat, Alim1,2,3,&Xiao, Li1,2.(2016).Research on modern Uyghur Common Word extraction.International Journal of Database Theory and Application,9(5),45-54.
MLA	Azragul1,2,3,et al."Research on modern Uyghur Common Word extraction".International Journal of Database Theory and Application 9.5(2016):45-54.

入库方式： OAI收割

来源：新疆理化技术研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。