6533b7d5fe1ef96bd1264735
RESEARCH PRODUCT
Suffixes, Conjugates and Lyndon Words
Silvia BonomoMarinella SciortinoAntonio RestivoGiovanna RosoneSabrina Mantacisubject
MultisetReduction (recursion theory)BWT; Lyndon factorization; Suffix ArrayString (computer science)Suffix arrayLyndon words Lyndon factorization BWT Suffix array EBWT Circular words ConjugacyLexicographical orderlaw.inventionSuffix ArrayCombinatoricsBWTLyndon factorizationlawOrder (group theory)Symbol (formal)Word (group theory)Mathematicsdescription
In this paper we are interested in the study of the combinatorial aspects connecting three important constructions in the field of string algorithms: the suffix array, the Burrows-Wheeler transform (BWT) and the extended Burrows-Wheeler transform (EBWT). Such constructions involve the notions of suffixes and conjugates of words and are based on two different order relations, denoted by $\plex$ and $\pom$, that, even if strictly connected, are quite different from the computational point of view. In this study an important role is played by Lyndon words. In particular, we improve the upper bound on the number of symbol comparisons needed to establish the $\pom$ order between two primitive words by using a preliminary knowledge of the $\plex$ order of the corresponding Lyndon conjugates. Moreover, we propose an algorithm that efficiently sorts, according to the $\pom$ order, the list of conjugates of a multiset of Lyndon words. Finally, we show that the Lyndon factorization of a word helps the construction of its suffix array, allowing a reduction of the number of symbol comparisons needed to lexicographically sort the suffixes of the word.
year | journal | country | edition | language |
---|---|---|---|---|
2013-01-01 |