Minimum entropy approach to word segmentation problems
文献类型:期刊论文
作者 | Wang, B; Wang, B , Chinese Acad Sci, Inst Theoret Phys, POB 2735, Beijing 100080, Peoples R China. |
刊名 | PHYSICA A
![]() |
出版日期 | 2001 |
卷号 | 293期号:40972页码:583-591 |
关键词 | Sequences |
ISSN号 | 0378-4371 |
英文摘要 | Given a sequence composed of a limited number of characters, we try to "read" it as a "text", This involves segmenting the sequence into "words". The difficulty is to distinguish good segmentation from enormous numbers of random ones. Aiming at revealing the nonrandomness of the sequence as strongly as possible, by applying maximum likelihood method, we find a quantity called segmentation entropy that can be used to fulfill the aim. Contrary to commonplace where maximum entropy principle was applied to obtain good solution, we chose to minimize the segmentation entropy to obtain good segmentation. The concept developed in this letter carl be used to study the noncoding DNA sequences, e.g,, for regulatory elements prediction, in eukaryote genomes. (C) 2001 Elsevier Science B.V. All rights reserved. |
学科主题 | Physics |
URL标识 | 查看原文 |
WOS记录号 | WOS:000168730500023 |
公开日期 | 2012-08-29 |
源URL | [http://ir.itp.ac.cn/handle/311006/12775] ![]() |
专题 | 理论物理研究所_理论物理所1978-2010年知识产出 |
通讯作者 | Wang, B , Chinese Acad Sci, Inst Theoret Phys, POB 2735, Beijing 100080, Peoples R China. |
推荐引用方式 GB/T 7714 | Wang, B,Wang, B , Chinese Acad Sci, Inst Theoret Phys, POB 2735, Beijing 100080, Peoples R China.. Minimum entropy approach to word segmentation problems[J]. PHYSICA A,2001,293(40972):583-591. |
APA | Wang, B,&Wang, B , Chinese Acad Sci, Inst Theoret Phys, POB 2735, Beijing 100080, Peoples R China..(2001).Minimum entropy approach to word segmentation problems.PHYSICA A,293(40972),583-591. |
MLA | Wang, B,et al."Minimum entropy approach to word segmentation problems".PHYSICA A 293.40972(2001):583-591. |
入库方式: OAI收割
来源:理论物理研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。