• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Dun, Ming (Dun, Ming.) | Li, Yunchun (Li, Yunchun.) | You, Xin (You, Xin.) | Sun, Qingxiao (Sun, Qingxiao.) | Luan, Zerong (Luan, Zerong.) | Yang, Hailong (Yang, Hailong.)

收录:

CPCI-S EI Scopus

摘要:

De novo genome assembly reconstructs the chromosomes from massive relatively short fragmented reads and serves as fundamental for studying new species where there is no reference genome. Wtdbg2 is a de novo assembler for long reads that is up to hundreds of kilobases. It is based on fuzzy-Bruijn graph (FBG) and is ten times faster than the cutting-edge assemblers such as Canu. However, the performance of wtdbg2 still requires further improvement: 1) it requires up to terabytes of memory to compute the assembly, which is infeasible to run on commodity server; 2) it requires tens of hours for assembling on large datasets such as genomes of homo sapiens. To address the above drawbacks, we propose several optimization techniques for accelerating wtdbg2 on commodity server, including a memory autotuning scheme, sequence alignment optimization and intermediate result elimination in the output procedure. We compare the optimized wtdbg2 with the original implementation and two cutting-edge assemblers on real-world datasets. The experiment results demonstrate that optimized wtdbg2 achieves maximum and average speedup of 2.31x and 1.54x respectively. In addition, our proposed optimization reduces the memory usage of wtdbg2 by 39.5% without affecting the correctness.

关键词:

Performance optimization wtdbg2 Computational biology Auto-tuning Genome assembly Load balance

作者机构:

  • [ 1 ] [Dun, Ming]Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
  • [ 2 ] [Li, Yunchun]Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
  • [ 3 ] [Li, Yunchun]Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
  • [ 4 ] [You, Xin]Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
  • [ 5 ] [Sun, Qingxiao]Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
  • [ 6 ] [Yang, Hailong]Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
  • [ 7 ] [Yang, Hailong]Beijing Univ Technol, State Key Lab Math Engn & Adv Comp, Beijing 100083, Peoples R China
  • [ 8 ] [Luan, Zerong]Beijing Univ Technol, Coll Life Sci & Bioengn, Beijing 100083, Peoples R China

通讯作者信息:

电子邮件地址:

查看成果更多字段

相关关键词:

来源 :

ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT I

ISSN: 0302-9743

年份: 2020

卷: 12452

页码: 232-246

被引次数:

WoS核心集被引频次:

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 1

归属院系:

在线人数/总访问数:642/5044490
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司