中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Using DragPushing to Refine Concept Index for Text Categorization

文献类型:期刊论文

作者Songbo Tan(谭松波); Xueqi Cheng(程学旗); Lilian Tang
刊名Journal of Computer Science and Technology
出版日期2006
卷号21期号:4页码:592-596
关键词Text Classification Information
英文摘要Concept index (CI) is a very fast and efficient feature extraction (FE) algorithm for text classification. The key approach in CI scheme is to express each document as a function of various concepts (centroids) present in the collection. However, the representative ability of centroids for categorizing corpus is often influenced by so-called model misfit caused by a number of factors in the FE process including feature selection to similarity measure. In order to address this issue, this work employs the ``DragPushing'' Strategy to refine the centroids that are used for concept index. We present an extensive experimental evaluation of refined concept index (RCI) on two English collections and one Chinese corpus using state-of-the-art Support Vector Machine (SVM) classifier. The results indicate that in each case, RCI-based SVM yields a much better performance than the normal CI-based SVM but lower computation cost during training and classification phases.
语种英语
公开日期2010-11-04
源URL[http://ictir.ict.ac.cn/handle/311040/839]  
专题中国科学院计算技术研究所期刊论文_2006年英文
推荐引用方式
GB/T 7714
Songbo Tan,Xueqi Cheng,Lilian Tang. Using DragPushing to Refine Concept Index for Text Categorization[J]. Journal of Computer Science and Technology,2006,21(4):592-596.
APA Songbo Tan,Xueqi Cheng,&Lilian Tang.(2006).Using DragPushing to Refine Concept Index for Text Categorization.Journal of Computer Science and Technology,21(4),592-596.
MLA Songbo Tan,et al."Using DragPushing to Refine Concept Index for Text Categorization".Journal of Computer Science and Technology 21.4(2006):592-596.

入库方式: OAI收割

来源:计算技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。