6533b874fe1ef96bd12d61d3
RESEARCH PRODUCT
An extension of the Burrows-Wheeler Transform and applications to sequence comparison and data compression
Antonio RestivoSabrina MantaciGiovanna RosoneMarinella Sciortinosubject
Discrete mathematicsMultisetBurrows-Wheeler transform; Data Compression; Mitochondrial genome phylogenyBurrows–Wheeler transformMultiplicity (mathematics)Mitochondrial genome phylogenyBurrows-Wheeler transformData CompressionSurjective functionConjugacy classSequence comparisonPreprocessorAlgorithmMathematicsData compressiondescription
We introduce a generalization of the Burrows-Wheeler Transform (BWT) that can be applied to a multiset of words. The extended transformation, denoted by E, is reversible, but, differently from BWT, it is also surjective. The E transformation allows to give a definition of distance between two sequences, that we apply here to the problem of the whole mitochondrial genome phylogeny. Moreover we give some consideration about compressing a set of words by using the E transformation as preprocessing.
year | journal | country | edition | language |
---|---|---|---|---|
2005-01-01 |