Chinese Academy of Sciences Institutional Repositories Grid
SWAP-Assembler: scalable and efficient genome assembly towards thousands of cores

Document type: Journal article

Authors: Balaji, Pavan [1]; Feng, Shengzhong [5]; Wei, Yanjie [5]; Wang, Bingqiang [2]; Meng, Jintao [3,4,5]
Journal: BMC Bioinformatics
Publication date: 2014-09-10
Volume: 15; Issue: Suppl 9
Keywords: genome assembly; parallel computing; De Bruijn graph
ISSN: 1471-2105
DOI: 10.1186/1471-2105-15-S9-S2
Abstract (English):
Background: There is a widening gap between the throughput of massively parallel sequencing machines and the ability to analyze the resulting sequencing data. Traditional assembly methods require long execution times and large amounts of memory on a single workstation, which limits their use on such massive data.
Results: This paper presents a highly scalable assembler named SWAP-Assembler for processing massive sequencing data using thousands of cores, where SWAP is an acronym for Small World Asynchronous Parallel model. A mathematical description of the multi-step bi-directed graph (MSG) is provided to resolve the computational interdependence in merging edges, and a highly scalable computational framework for SWAP is developed to automatically perform the parallel computation of all operations. Graph cleaning and contig extension are also included to generate high-quality contigs. Experimental results show that SWAP-Assembler scales up to 2048 cores on the Yanhuang dataset, taking only 26 minutes, which is better than several other parallel assemblers such as ABySS, Ray, and PASHA. Results also show that SWAP-Assembler generates high-quality contigs with good N50 size and low error rate; in particular, it generated the longest N50 contig sizes for the Fish and Yanhuang datasets.
Conclusions: In this paper, we presented a highly scalable and efficient genome assembly software, SWAP-Assembler. Compared with several other assemblers, it showed very good performance in terms of scalability and contig quality. The software is available at: https://sourceforge.net/projects/swapassembler
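Language: English
WOS accession number: BMC:10.1186/1471-2105-15-S9-S2
Publisher: BioMed Central
Source URL: [http://119.78.100.204/handle/2XEOYT63/4040]
Collection: Institute of Computing Technology, Chinese Academy of Sciences, Journal Articles (English)
Corresponding author: Wei, Yanjie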
Author affiliations:
1. Argonne National Laboratory; Mathematics and Computer Science Division
2. Beijing Genomics Institute
3. University of Chinese Academy of Sciences
4. Chinese Academy of Sciences; Institute of Computing Technology
5. Chinese Academy of Sciences; Shenzhen Institutes of Advanced Technology
Recommended citation
GB/T 7714: Balaji, Pavan, Feng, Shengzhong, Wei, Yanjie, et al. SWAP-Assembler: scalable and efficient genome assembly towards thousands of cores[J]. BMC Bioinformatics, 2014, 15(Suppl 9).
APA: Balaji, Pavan, Feng, Shengzhong, Wei, Yanjie, Wang, Bingqiang, & Meng, Jintao. (2014). SWAP-Assembler: scalable and efficient genome assembly towards thousands of cores. BMC Bioinformatics, 15(Suppl 9).
MLA: Balaji, Pavan, et al. "SWAP-Assembler: scalable and efficient genome assembly towards thousands of cores". BMC Bioinformatics 15.Suppl 9 (2014).

Ingest method: OAI harvesting

Source: Institute of Computing Technology

