Bloom Filter

Context

Bloom filters are space-saving probabilistic data structures for checking whether an element is part of a set. They require much less space than other data structures to represent data sets. However, one drawback is that they can generate false positives when queried. This data structure is well used in bioinformatics to test the existence of k-mer in genomic sequences. K-mers are indexed in a Bloom filter, and any k-mer of the same size can be queried against the structure. An efficient implementation of de-Bruijn graphs relies on the use of Bloom filters.

Code link

Comments are closed.