Genome Comparison – MASH

Context

MASH compute approximate distances between genomic sequences. First, it utilizes the MinHash technique to reduce genomes to compressed sketch representations. Then, using only the sketches, which can be thousands of times smaller, similarities between sequences can be rapidly estimated.

Code link

Reference

Comments are closed.