Block Trees
Nov 21, 2020·
,
,
,
,
,
,
,
·
0 min read
Djamal Belazzougui
data:image/s3,"s3://crabby-images/0b788/0b7882883eede5f2a323e3d212d59fa58fe95d99" alt="Manuel Cáceres"
Manuel Cáceres
Travis Gagie
Pawel Gawrychowski
Juha Kärkkäinen
Gonzalo Navarro
Alberto Ordóñez
Simon J. Puglisi
Yasuo Tabei
data:image/s3,"s3://crabby-images/be540/be5407d87af52f4a392ad13e85b15e8414615ed1" alt=""
Abstract
Let string $S[1..n]$ be parsed into $z$ phrases by the Lempel-Ziv algorithm. The corresponding compression algorithm encodes $S$ in $O(z)$ space, but it does not support random access to $S$. We introduce a data structure, the block tree, that represents $S$ in $O(z \log(n/z))$ space and extracts any symbol of $S$ in time $O(\log(n/z))$, among other space-time tradeoffs. The structure also supports other queries that are useful for building compressed data structures on top of $S$. Further, block trees can be built in linear time and in a scalable manner. Our experiments show that block trees offer relevant space-time tradeoffs compared to other compressed string representations for highly repetitive strings.
Type
Publication
In Journal of Computer and System Sciences