标题：A Dynamic Hashing Approach to Build the de Bruijn Graph for Genome Assembly
作者：Zhao, Kun; Liu, Weiguo; Voss, Gerrit; Mueller-Wittig, Wolfgang
作者机构：[Zhao, Kun; Liu, Weiguo] Shandong Univ, Sch Comp Sci & Technol, Jinan, Shandong, Peoples R China.; [Voss, Gerrit; Mueller-Wittig, Wolfgang] Nanyang 更多
会议名称：IEEE International Conference of Region 10 (TENCON)
会议日期：OCT 22-25, 2013
来源：2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON)
关键词：Dynamic hashing; De Bruijn graph; Genome assembly
摘要：The development of next-generation sequencing technologies has revolutionized the genome research and given rise to the explosive increase of DNA sequencing throughput. However, due to the continuing explosive growth of short-read database, these technologies face the challenges of short overlap and high throughput. The de Bruijn graph is particularly suitable for short-read assemblies, and its advantage is that the graph size will not be affected by the high redundancy of deep read coverage. With this character, the fragment assembly is cast as finding a path visiting every edge in the graph exactly once. In this paper, we present a new method to accelerate the genome assembly procedure. We have used a distributed dynamic hashing approach to construct the de Bruijn graph from short-read data. Evaluations using three paired-end datasets show that, our method outperforms previous parallel and distributed assemblers on a CPU cluster system.