中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Mirpara: a svm-based software tool for prediction of most probable microrna coding regions in genome scale sequences

文献类型:期刊论文

作者Wu, Yonggan1,2,3; Wei, Bo1; Liu, Haizhou1; Li, Tianxian2; Rayner, Simon1
刊名Bmc bioinformatics
出版日期2011-04-19
卷号12页码:14
ISSN号1471-2105
DOI10.1186/1471-2105-12-107
通讯作者Rayner, simon(simon.rayner.cn@gmail.com)
英文摘要Background: micrornas are a family of similar to 22 nt small rnas that can regulate gene expression at the post-transcriptional level. identification of these molecules and their targets can aid understanding of regulatory processes. recently, hts has become a common identification method but there are two major limitations associated with the technique. firstly, the method has low efficiency, with typically less than 1 in 10,000 sequences representing mirna reads and secondly the method preferentially targets highly expressed mirnas. if sequences are available, computational methods can provide a screening step to investigate the value of an hts study and aid interpretation of results. however, current methods can only predict mirnas for short fragments and have usually been trained against small datasets which don't always reflect the diversity of these molecules. results: we have developed a software tool, mirpara, that predicts most probable mature mirna coding regions from genome scale sequences in a species specific manner. we classified sequences from mirbase into animal, plant and overall categories and used a support vector machine to train three models based on an initial set of 77 parameters related to the physical properties of the pre-mirna and its mirnas. by applying parameter filtering we found a subset of similar to 25 parameters produced higher prediction ability compared to the full set. our software achieves an accuracy of up to 80% against experimentally verified mature mirnas, making it one of the most accurate methods available. conclusions: mirpara is an effective tool for locating mirnas coding regions in genome sequences and can be used as a screening step prior to hts experiments. it is available at http://www.whiov.ac.cn/bioinformatics/mirpara
WOS关键词SUPPORT VECTOR MACHINES ; DECOMPOSITION METHODS ; IDENTIFICATION ; DROSOPHILA ; RECOGNITION ; COMPLEX ; MIRNA ; TRANSCRIPTS ; PATHWAYS ; MIRBASE
WOS研究方向Biochemistry & Molecular Biology ; Biotechnology & Applied Microbiology ; Mathematical & Computational Biology
WOS类目Biochemical Research Methods ; Biotechnology & Applied Microbiology ; Mathematical & Computational Biology
语种英语
WOS记录号WOS:000291355400001
出版者BIOMED CENTRAL LTD
URI标识http://www.irgrid.ac.cn/handle/1471x/2375905
专题武汉病毒研究所
通讯作者Rayner, Simon
作者单位1.Chinese Acad Sci, Wuhan Inst Virol, Bioinformat Grp, State Key Lab Virol, Wuhan 430071, Hubei, Peoples R China
2.Chinese Acad Sci, State Key Lab Virol, Wuhan Inst Virol, Wuhan 430071, Hubei, Peoples R China
3.Texas Tech Univ, Dept Biol Sci, Lubbock, TX 79409 USA
推荐引用方式
GB/T 7714
Wu, Yonggan,Wei, Bo,Liu, Haizhou,et al. Mirpara: a svm-based software tool for prediction of most probable microrna coding regions in genome scale sequences[J]. Bmc bioinformatics,2011,12:14.
APA Wu, Yonggan,Wei, Bo,Liu, Haizhou,Li, Tianxian,&Rayner, Simon.(2011).Mirpara: a svm-based software tool for prediction of most probable microrna coding regions in genome scale sequences.Bmc bioinformatics,12,14.
MLA Wu, Yonggan,et al."Mirpara: a svm-based software tool for prediction of most probable microrna coding regions in genome scale sequences".Bmc bioinformatics 12(2011):14.

入库方式: iSwitch采集

来源:武汉病毒研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。