中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Minimum entropy approach to word segmentation problems

文献类型:期刊论文

作者Wang, B; Wang, B , Chinese Acad Sci, Inst Theoret Phys, POB 2735, Beijing 100080, Peoples R China.
刊名PHYSICA A
出版日期2001
卷号293期号:40972页码:583-591
关键词Sequences
ISSN号0378-4371
英文摘要Given a sequence composed of a limited number of characters, we try to "read" it as a "text", This involves segmenting the sequence into "words". The difficulty is to distinguish good segmentation from enormous numbers of random ones. Aiming at revealing the nonrandomness of the sequence as strongly as possible, by applying maximum likelihood method, we find a quantity called segmentation entropy that can be used to fulfill the aim. Contrary to commonplace where maximum entropy principle was applied to obtain good solution, we chose to minimize the segmentation entropy to obtain good segmentation. The concept developed in this letter carl be used to study the noncoding DNA sequences, e.g,, for regulatory elements prediction, in eukaryote genomes. (C) 2001 Elsevier Science B.V. All rights reserved.
学科主题Physics
URL标识查看原文
WOS记录号WOS:000168730500023
公开日期2012-08-29
源URL[http://ir.itp.ac.cn/handle/311006/12775]  
专题理论物理研究所_理论物理所1978-2010年知识产出
通讯作者Wang, B , Chinese Acad Sci, Inst Theoret Phys, POB 2735, Beijing 100080, Peoples R China.
推荐引用方式
GB/T 7714
Wang, B,Wang, B , Chinese Acad Sci, Inst Theoret Phys, POB 2735, Beijing 100080, Peoples R China.. Minimum entropy approach to word segmentation problems[J]. PHYSICA A,2001,293(40972):583-591.
APA Wang, B,&Wang, B , Chinese Acad Sci, Inst Theoret Phys, POB 2735, Beijing 100080, Peoples R China..(2001).Minimum entropy approach to word segmentation problems.PHYSICA A,293(40972),583-591.
MLA Wang, B,et al."Minimum entropy approach to word segmentation problems".PHYSICA A 293.40972(2001):583-591.

入库方式: OAI收割

来源:理论物理研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。