A novel kernel for text categorization
Document type: Conference paper
Authors | Zhang Lujiang ; Hu Xiaohui |
Publication date | 2012 |
Conference name | 2012 IEEE International Conference on Computer Science and Automation Engineering, CSAE 2012 |
Conference date | May 25, 2012 - May 27, 2012 |
Conference location | Zhangjiajie, China |
Keywords | Algorithms; Computer science; Support vector machines |
Pages | 186-190 |
Abstract | In this paper we propose a novel kernel for text categorization. The kernel is an inner product in the feature space generated by all word combinations of a specified length. A word combination is a collection of different words co-occurring in the same sentence. A word combination of length k is weighted by the k-th root of the product of the inverse document frequencies (IDF) of its words. We propose a computationally simple and efficient algorithm to calculate this kernel. In experiments on the 20 Newsgroups dataset, the kernel achieves better performance than the classical word kernel and the word-sequence kernel. We also assess the impact of word combination length on performance. © 2012 IEEE. |
Indexed by | EI |
Conference organizers | IEEE Beijing Section; Hunan University of Humanities, Science and Technology; Tongji University; Xiamen University; Central South University |
Proceedings | CSAE 2012 - Proceedings, 2012 IEEE International Conference on Computer Science and Automation Engineering |
Language | English |
ISBN | 9781467300865 |
Source URL | [http://ir.iscas.ac.cn/handle/311060/15762] |
Collection | Institute of Software_Institute of Software Library_Conference Papers |
Recommended citation (GB/T 7714) | Zhang Lujiang, Hu Xiaohui. A novel kernel for text categorization[C]. In: 2012 IEEE International Conference on Computer Science and Automation Engineering, CSAE 2012. Zhangjiajie, China. May 25, 2012 - May 27, 2012. |
Ingest method: OAI harvesting
Source: Institute of Software
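
The kernel described in the abstract can be illustrated with a small sketch. This is not the authors' efficient algorithm, which this record does not describe. It is a naive version that enumerates every word combination explicitly, under several assumptions: a document is a list of sentences, each sentence is a list of word tokens, a combination's feature value is its weight summed over the sentences that contain it, and the IDF values are supplied by the caller. The function names `combination_features` and `combination_kernel` are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def combination_features(doc, idf, k):
    """Map each length-k word combination in `doc` to its weighted value.

    doc: list of sentences, each a list of word tokens (assumed format).
    idf: dict word -> non-negative IDF value (supplied by the caller).
    """
    feats = Counter()
    for sentence in doc:
        # A combination consists of *different* words from one sentence,
        # so duplicates are dropped; sorting gives a canonical key order.
        words = sorted(set(sentence))
        for combo in combinations(words, k):
            # Weight: k-th root of the product of the words' IDF values.
            weight = math.prod(idf.get(w, 0.0) for w in combo) ** (1.0 / k)
            # Assumption: contributions are summed over sentences.
            feats[combo] += weight
    return feats


def combination_kernel(doc_a, doc_b, idf, k):
    """Inner product of the two documents' combination feature vectors."""
    fa = combination_features(doc_a, idf, k)
    fb = combination_features(doc_b, idf, k)
    if len(fa) > len(fb):
        fa, fb = fb, fa  # iterate over the smaller feature map
    return sum(value * fb[combo] for combo, value in fa.items() if combo in fb)


if __name__ == "__main__":
    idf = {"a": 1.0, "b": 4.0, "c": 9.0}  # toy IDF values, for illustration only
    doc_1 = [["a", "b", "c"]]
    doc_2 = [["b", "c", "b"]]
    print(combination_kernel(doc_1, doc_2, idf, k=2))
```

Explicit enumeration grows combinatorially with sentence length and k, so the sketch is only practical for short sentences and small k.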