K-mer Mapping and de Bruijn graphs: The case for velvet fragment assembly

Elvismary Molina de Armas; Edward Hermann Haeusler; Sergio Lifschitz; Maristela Terto de Holanda; Waldeyr Mendes Cordeiro da Silva; Paulo Cavalcanti Gomes Ferreira

doi:10.1109/BIBM.2016.7822642

2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Year: 2016, Pages: 882-889

Authors

Elvismary Molina de Armas, Departamento de Informática - PUC-Rio, Rio de Janeiro - Brasil
Edward Hermann Haeusler, Departamento de Informática - PUC-Rio, Rio de Janeiro - Brasil
Sergio Lifschitz, Departamento de Informática - PUC-Rio, Rio de Janeiro - Brasil
Maristela Terto de Holanda, Departamento de Ciência da Computação - UNB, Brasília - Brasil
Waldeyr Mendes Cordeiro da Silva, Departamento de Ciência da Computação - UNB, Brasília - Brasil
Paulo Cavalcanti Gomes Ferreira, Instituto de Bioquímica Médica Leopoldo de Meis - UFRJ, Rio de Janeiro - Brasil

Abstract

K-mer mapping, an internal step of many de novo genome fragment assembly methods, is computationally challenging because of its high main-memory consumption. In this paper we study indexing methods that address this problem in the context of plant genome assembly. We propose an ad hoc I/O cost model to analyze the performance of B+-tree and hashing index structures. We use the indexes to detect duplicate k-mers and improve execution time. Experiments with an actual RDBMS implementation on a sugarcane data set show that considerable performance gains can be obtained while reducing RAM requirements.
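
The idea of detecting duplicate k-mers through a database index can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes SQLite as a stand-in RDBMS, and the table name `kmer_map`, the column names, and the functions `kmers` and `map_kmers` are hypothetical. It also ignores reverse complements and everything else a real assembler would handle.

```python
import sqlite3


def kmers(read, k):
    """Yield every overlapping substring of length k in a read."""
    for i in range(len(read) - k + 1):
        yield read[i:i + k]


def map_kmers(reads, k):
    """Store each distinct k-mer once and count how often it occurs.

    Duplicates are detected by a lookup on the primary-key index rather
    than by an in-memory Python structure.
    """
    conn = sqlite3.connect(":memory:")
    # Assumption: a TEXT PRIMARY KEY, which SQLite backs with a
    # B-tree-family index; the paper's own schema is not given here.
    conn.execute(
        "CREATE TABLE kmer_map ("
        "kmer TEXT PRIMARY KEY, occurrences INTEGER NOT NULL)"
    )
    for read in reads:
        for kmer in kmers(read, k):
            # Index lookup: bump the counter if the k-mer is already stored.
            cur = conn.execute(
                "UPDATE kmer_map SET occurrences = occurrences + 1 "
                "WHERE kmer = ?",
                (kmer,),
            )
            if cur.rowcount == 0:
                # First occurrence: insert a new row for this k-mer.
                conn.execute(
                    "INSERT INTO kmer_map (kmer, occurrences) VALUES (?, 1)",
                    (kmer,),
                )
    result = dict(conn.execute("SELECT kmer, occurrences FROM kmer_map"))
    conn.close()
    return result
```

Calling `map_kmers(["ACGTAC", "GTACG"], 3)` stores four distinct 3-mers, three of which occur twice. Replacing `":memory:"` with a file path would keep the table on disk, which is the general direction the abstract describes for reducing RAM requirements.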

