Repository: Freie Universität Berlin, Math Department

Needle: a fast and space-efficient prefilter for estimating the quantification of very large collections of expression experiments

Darvish, Mitra and Seiler, Enrico and Mehringer, Svenja and Rahn, René and Reinert, Knut and Ponty, Yann (2022) Needle: a fast and space-efficient prefilter for estimating the quantification of very large collections of expression experiments. Bioinformatics, 38 (17). pp. 4100-4108. ISSN 1367-4803

Full text not available from this repository.

Official URL: https://doi.org/10.1093/bioinformatics/btac492

Abstract

Motivation The ever-growing size of sequencing data is a major bottleneck in bioinformatics as the advances of hardware development cannot keep up with the data growth. Therefore, an enormous amount of data is collected but rarely ever reused, because it is nearly impossible to find meaningful experiments in the stream of raw data. Results As a solution, we propose Needle, a fast and space-efficient index which can be built for thousands of experiments in <2 h and can estimate the quantification of a transcript in these experiments in seconds, thereby outperforming its competitors. The basic idea of the Needle index is to create multiple interleaved Bloom filters that each store a set of representative k-mers depending on their multiplicity in the raw data. This is then used to quantify the query. Availability and implementation https://github.com/seqan/needle.

Item Type:Article
Subjects:Mathematical and Computer Sciences > Computer Science
Divisions:Department of Mathematics and Computer Science > Institute of Computer Science > Algorithmic Bioinformatics Group
ID Code:2845
Deposited By: Anja Kasseckert
Deposited On:05 Sep 2022 10:50
Last Modified:05 Sep 2022 10:50

Repository Staff Only: item control page