Skip to main content Accessibility help
×
Hostname: page-component-77c89778f8-vsgnj Total loading time: 0 Render date: 2024-07-17T13:28:31.995Z Has data issue: false hasContentIssue false

13 - Recent Trends

Published online by Cambridge University Press:  05 September 2016

Gonzalo Navarro
Affiliation:
Universidad de Chile
Get access

Summary

Throughout the book we have covered a number of compact data structures that are well established and tested, and their basic aspects can be considered stable. This chapter is conceived as an epilogue, where we take a look to the future. We describe some recent trends that, while not mature or general enough to deserve a thorough treatment in the book, are certainly promising and likely to be the focus of intense research in the upcoming years. Therefore, the chapter may also serve to guide the readers looking for hot research topics. Its writing style is different from the other chapters, as we discuss the bibliography together with the main material.

First, we consider encoding data structures. These ensure only that a certain set of queries of interest can be answered; the actual data cannot be recovered. Encodings can offer significant space reductions with respect to the data entropy and are interesting because they have the potential to reach the minimum space needed just to answer the desired queries; nothing superfluous is stored. They also define a new concept of entropy, in terms not only of a universe of objects, but also of a set of queries posed on them. Encodings for a few problems exist already, but they are very recent and mostly in a theoretical stage.

Second, we consider repetitive document collections. Many of the largest text collections arising in applications these days are highly repetitive, and thus very compressible if one applies the right compression methods. The compact data structures we have seen, however, focus on statistical compression, which is insensitive to this kind of compressibility. New text indexes building on other compression principles are being designed, although they are generally slower than the classical ones and still do not reach the exact entropy bounds as current statistical methods do.

Finally, we consider secondary memory. Compact data structures use less space than classical ones and thus may fit in smaller and faster memories. However, they have generally less locality of reference, thus they are slower than classical structures when competing in the same level of the memory hierarchy. This is particularly relevant when the datasets are huge and the structures must operate on disk, where the space usage is not so important and locality of reference is crucial.

Type
Chapter
Information
Compact Data Structures
A Practical Approach
, pp. 501 - 548
Publisher: Cambridge University Press
Print publication year: 2016

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Abeliuk, A., Cánovas, R., and Navarro, G., (2013). Practical compressed suffix trees. Algorithms, 6(2), 319–351.Google Scholar
Agarwal, P. K. and Erickson, J. (1999). Geometric range searching and its relatives. In Advances in Discrete and Computational Geometry, volume 223 of Contemporary Mathematics, pages 1–56. AMS Press.
Al-Hafeedh, A., Crochemore, M., Ilie, L., Kopylov, E., Smyth, W. F., Tischler, G., and Yusufu, M. (2012). A comparison of index-based Lempel-Ziv LZ77 factorization algorithms. ACMComputing Surveys, 45(1), article 5.Google Scholar
Alstrup, S., Bender, M. A., Demaine, E. D., Farach-Colton, M., Munro, J. I., Rauhe, T., and Thorup, M. (2002). Efficient tree layout in a multilevel memory hierarchy. CoRR, cs/0211010v2. http:// arxiv.org/abs/cs.DS/0211010.
Arge, L. (2002). External memory data structures. In Handbook of Massive Data Sets, chapter 9, pages 313–357. Kluwer Academic Publishers.
Arge, L., Brodal, G. S., Fagerberg, R., and Laustsen, M. (2005). Cache-oblivious planar orthogonal range searching and counting. In Proc. 21st ACMSymposium on Computational Geometry (SoCG), pages 160–169.Google Scholar
Arroyuelo, D. and Navarro, G., (2007). A Lempel-Ziv text index on secondary storage. In Proc. 18th Annual Symposium on Combinatorial Pattern Matching (CPM), LNCS 4580, pages 83–94.Google Scholar
Arroyuelo, D., Navarro, G., and Sadakane, K. (2012). Stronger Lempel-Ziv based compressed text indexing. Algorithmica, 62(1), 54–101.Google Scholar
Arroyuelo, D., Davoodi, P., and Rao, S. S. (2016). Succinct dynamic cardinal trees. Algorithmica, 74(2), 742–777.Google Scholar
Baeza-Yates, R., Barbosa, E. F., and Ziviani, N. (1996). Hierarchies of indices for text searching. Information Systems, 21(6), 497–514.Google Scholar
Belazzougui, D., Gagie, T., Gog, S., Manzini, G., and Sirén, J. (2014). Relative FM-indexes. In Proc. 21st International Symposium on String Processing and Information Retrieval (SPIRE), LNCS 8799, pages 52–64.Google Scholar
Belazzougui, D., Puglisi, S. J., and Tabei, Y. (2015a). Access, rank, select in grammar-compressed strings. In Proc. 23rd Annual European Symposium on Algorithms (ESA), LNCS 9294, pages 142– 154.Google Scholar
Belazzougui, D., Cunial, F., Gagie, T., Prezza, N., and Raffinot, M. (2016). Practical combinations of repetition-aware data structures. CoRR, abs/1604.06002. http://arxiv.org/abs/1604.06002.
Belazzougui, D., Cunial, F., Gagie, T., Prezza, N., and Raffinot, M. (2015b). Composite repetitionaware data structures. In Proc. 26th Annual Symposium on Combinatorial Pattern Matching (CPM), LNCS 9133, pages 26–39.Google Scholar
Belazzougui, D., Gagie, T., Gawrychowski, P., Kärkkäinen, J., Ordónez, A., Puglisi, S. J., and Tabei, Y. (2015c). Queries on LZ-bounded encodings. In Proc. 25th Data Compression Conference (DCC), pages 83–92.Google Scholar
Bell, T. C., Cleary, J., and Witten, I. H. (1990). Text Compression. Prentice Hall.
Bille, P., Landau, G. M., Raman, R., Sadakane, K., Rao, S. S., and Weimann, O. (2015). Random access to grammar-compressed strings and trees. SIAM Journal on Computing, 44(3), 513–539.Google Scholar
Blumer, A., Blumer, J., Haussler, D., McConnell, R. M., and Ehrenfeucht, A. (1987). Complete inverted files for efficient text retrieval and analysis. Journal of the ACM, 34(3), 578–595.Google Scholar
Brodal, G. S. and Fagerberg, R. (2006). Cache-oblivious string dictionaries. In Proc. 17th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 581–590.Google Scholar
Charikar, M., Lehman, E., Liu, D., Panigrahy, R., Prabhakaran, M., Sahai, A., and Shelat, A. (2005). The smallest grammar problem. IEEE Transactions on Information Theory, 51(7), 2554–2576.Google Scholar
Chen, G., Puglisi, S. J., and Smyth, W. F. (2008). Lempel-Ziv factorization using less time & space. Mathematics in Computer Science, 1, 605–623.Google Scholar
Chiang, Y.-J., Goodrich, M. T., Grove, E. F., Tamassia, R., Vengroff, D. E., and Vitter, J. S. (1995). External-memory graph algorithms. In Proc. 6th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 139–149.Google Scholar
Chien, Y.-F., Hon, W.-K., Shah, R., Thankachan, S. V., and Vitter, J. S. (2015). Geometric BWT: compressed text indexing via sparse suffixes and range searching. Algorithmica, 71(2), 258–278.Google Scholar
Chiu, S.-Y., Hon, W.-K., Shah, R., and Vitter, J. S. (2010). I/O-efficient compressed text indexes: From theory to practice. In Proc. 20th Data Compression Conference (DCC), pages 426–434.Google Scholar
Clark, D. R. and Munro, J. I. (1996). Efficient suffix trees on secondary storage. In Proc. 7th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 383–391.Google Scholar
Claude, F. and Navarro, G. (2012). Improved grammar-based compressed indexes. In Proc. 19th International Symposium on String Processing and Information Retrieval (SPIRE), LNCS 7608, pages 180–192.Google Scholar
Claude, F., Farina, A., Martínez-Prieto, M., and Navarro, G., (2010). Compresse. q-gram indexing for highly repetitive biological sequences. In Proc. 10th IEEE Conference on Bioinformatics and Bioengineering (BIBE), pages 86–91.Google Scholar
Claude, F., Farina, A., Martínez-Prieto, M., and Navarro, G., (2016). Universal indexes for highly repetitive document collections. Information Systems, 61, 1–23.Google Scholar
Colussi, L. and de Col, A. (1996). A time and space efficient data structure for string searching on large texts. Information Processing Letters, 58(5), 217–222.Google Scholar
Davoodi, P., Navarro, G., Raman, R., and Rao, S. S. (2014). Encoding range minima and range top-2 queries. Philosophical Transactions of the Royal Society A, 372(20130131).Google Scholar
Demaine, E. D., Iacono, J., and Langerman, S. (2015). Worst-case optimal tree layout in external memory. Algorithmica, 72(2), 369–378.Google Scholar
Deorowicz, S. and Grabowski, S. (2011). Robust relative compression of genomes with random access. Bioinformatics, 27, 2979–2986.Google Scholar
Dillabaugh, C., He, M., and Maheshwari, A. (2012). Succinct and I/O efficient data structures for traversal in trees. Algorithmica, 63(1–2), 201–223.Google Scholar
Dillabaugh, C., He, M., Maheshwari, A., and Zeh, N. (2016). I/O-efficient path traversal in succinct planar graphs. Algorithmica. Early view, DOI 10.1007/s00453-015-0086-7.
Do, H. H., Jansson, J., Sadakane, K., and Sung, W.-K. (2014). Fast relative Lempel-Ziv self-index for similar sequences. Theoretical Computer Science, 532, 14–30.Google Scholar
Farach, M. and Thorup, M. (1995). String matching in Lempel-Ziv compressed strings. In Proc. 27th ACM Symposium on Theory of Computing (STOC), pages 703–712.Google Scholar
Farzan, A. and Munro, J. I. (2014). A uniform paradigm to succinctly encode various families of trees. Algorithmica, 68(1), 16–40.Google Scholar
Ferrada, H., Gagie, T., Gog, S., and Puglisi, S. J. (2014). Relative Lempel-Ziv with constant-time random access. In Proc. 21st International Symposium on String Processing and Information Retrieval (SPIRE), LNCS 8799, pages 13–17.Google Scholar
Ferragina, P., and Grossi, R. (1999). The string B-tree:Anewdata structure for string search in external memory and its applications. Journal of the ACM, 46(2), 236–280.Google Scholar
Ferragina, P., and Manzini, G., (2005). Indexing compressed texts. Journal of the ACM, 52(4), 552–581.Google Scholar
Ferragina, P., Manzini, G., Mäkinen, V., and Navarro, G., (2007). Compressed representations of sequences and full-text indexes. ACM Transactions on Algorithms, 3(2), article 20.Google Scholar
Fischer, J. (2011). Combined data structure for previous- and next-smaller-values. Theoretical Computer Science, 412(22), 2451–2456.Google Scholar
Fischer, J. and Heun, V. (2011). Space-efficient preprocessing schemes for range minimum queries on static arrays. SIAM Journal on Computing, 40(2), 465–492.Google Scholar
Fischer, J., Mäkinen, V., and Navarro, G., (2009). Faster entropy-bounded compressed suffix trees. Theoretical Computer Science, 410(51), 5354–5364.Google Scholar
Fischer, J., I, T., and Köppl, D. (2015). Lempel Ziv computation in small space (LZ-CISS). In Proc. 26th Annual Symposium on Combinatorial Pattern Matching (CPM), LNCS 9133, pages 172– 184.Google Scholar
Gagie, T. and Puglisi, S. J. (2015). Searching and indexing genomic databases via kernelization. Frontiers in Bioengineering and Biotechnology, 3(12).Google Scholar
Gagie, T., Gawrychowski, P., Kärkkäinen, J., Nekrich, Y., and Puglisi, S. J. (2012). A faster grammarbased self-index. In Proc. 6th International Conference on Language and Automata Theory and Applications (LATA), LNCS 7183, pages 240–251.Google Scholar
Gagie, T., Hoobin, C., and Puglisi, S. J. (2014a). Block graphs in practice. In Proc. 2nd International Conference on Algorithms for Big Data (ICABD), pages 30–36.Google Scholar
Gagie, T., Gawrychowski, P., Kärkkäinen, J., Nekrich, Y., and Puglisi, S. J. (2014b). LZ77-based self-indexing with faster pattern matching. In Proc. 11th Latin American Theoretical Informatics Symposium (LATIN), LNCS 8392, pages 731–742.Google Scholar
Gagie, T., Gawrychowski, P., and Puglisi, S. J. (2015a). Approximate pattern matching in LZ77- compressed texts. Journal of Discrete Algorithms, 32, 64–68.Google Scholar
Gagie, T., Navarro, G., Puglisi, S. J., and Sirén, J. (2015b). Relative compressed suffix trees. CoRR, abs/1508.02550. http://arxiv.org/abs/1508.02550.
Gawrychowski, P. and Nicholson, P. K. (2015a). Encodings of range maximum-sum segment queries and applications. In Proc. 26th Annual Symposium on Combinatorial Pattern Matching (CPM), LNCS 9133, pages 196–206.Google Scholar
Gawrychowski, P. and Nicholson, P. K. (2015b). Optimal encodings for range top-k, selection, and min-max. In Proc. 42nd International Colloquium on Automata, Languages, and Programming (ICALP), Part I, LNCS 9134, pages 593–604.Google Scholar
Geary, R. F., Raman, R., and Raman, V. (2006). Succinct ordinal trees with level-ancestor queries. ACM Transactions on Algorithms, 2(4), 510–534.Google Scholar
Gil, J. and Itai, A. (1999). How to pack trees. Journal of Algorithms, 32(2), 108–132.Google Scholar
Gog, S., Moffat, A., Culpepper, J. S., Turpin, A., and Wirth, A. (2014). Large-scale pattern search using reduced-space on-disk suffix arrays. IEEE Transactions on Knowledge and Data Engineering, 26(8), 1918–1931.Google Scholar
Golin, M. J., Iacono, J., Krizanc, D., Raman, R., Rao, S. S., and Shende, S. (2016). Encoding 2-D range maximum queries. Theoretical Computer Science, 609, 316–327.Google Scholar
González, R. and Navarro, G., (2009). A compressed text index on secondary memory. Journal of Combinatorial Mathematics and Combinatorial Computing, 71, 127–154.Google Scholar
González, R., Navarro, G., and Ferrada, H. (2014). Locally compressed suffix arrays. ACM Journal of Experimental Algorithmics, 19(1), article 1.Google Scholar
Goto, K. and Bannai, H. (2013). Simpler and faster Lempel Ziv factorization. In Proc. 23rd Data Compression Conference (DCC), pages 133–142.Google Scholar
Goto, K. and Bannai, H. (2014). Space efficient linear time Lempel-Ziv factorization for small alphabets. In Proc. 24th Data Compression Conference (DCC), pages 163–172.Google Scholar
Grossi, R. and Italiano, G. F. (1999). Efficient cross-trees for external memory. In External Memory Algorithms and Visualization, DIMACS Series in Discrete Mathematics and Theoretical Computer Science. AMS Press.
Grossi, R., Iacono, J., Navarro, G., Raman, R., and Rao, S. S. (2013). Encodings for range selection and top-k queries. In Proc. 21st Annual European Symposium on Algorithms (ESA), LNCS 8125, pages 553–564.Google Scholar
He, M., Munro, J. I., and Rao, S. S. (2012). Succinct ordinal trees based on tree covering. ACM Transactions on Algorithms, 8(4), article 42.Google Scholar
Hon, W.-K., Shah, R., and Vitter, J. S. (2006). Ordered pattern matching: Towards full-text retrieval. Technical Report TR-06-008, Purdue University.
Hon, W.-K., Shah, R., Thankachan, S. V., and Vitter, J. S. (2009). On entropy-compressed text indexing in external memory. In Proc. 16th International Symposium on String Processing and Information Retrieval (SPIRE), LNCS 5721, pages 75–89.Google Scholar
Huang, S., Lam, T. W., Sung, W.-K., Tam, S.-L., and Yiu, S.-M. (2010). Indexing similar DNA sequences. In Proc. 6th International Conference on Algorithmic Aspects in Information and Management (AAIM), LNCS 6124, pages 180–190.Google Scholar
Hutchinson, D. A., Maheshwari, A., and Zeh, N. (2003). An external memory data structure for shortest path queries. Discrete Applied Mathematics, 126, 55–82.Google Scholar
Jansson, J., Sadakane, K., and Sung, W.-K. (2015). Linked dynamic tries with applications to LZcompression in sublinear time and space. Algorithmica, 71(4), 969–988.Google Scholar
Jez, A. (2015). Approximation of grammar-based compression via recompression. Theoretical Computer Science, 592, 115–134.Google Scholar
Jez, A. (2016). A really simple approximation of smallest grammar. Theoretical Computer Science, 616, 141–150.Google Scholar
Jo, S. and Rao, S. S. (2015). Simultaneous encodings for range and next/previous larger/smaller value queries. In Proc. 21st International Conference on Computing and Combinatorics (COCOON), LNCS 9198, pages 648–660.Google Scholar
Jorgensen, A. G. and Larsen, K. G. (2011). Range selection and median: Tight cell probe lower bounds and adaptive data structures. In Proc. 22nd Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 805–813.Google Scholar
Kärkkäinen, J. and Ukkonen, E. (1996). Lempel-Ziv parsing and sublinear-size index structures for string matching. In Proc. 3rd South American Workshop on String Processing (WSP), pages 141–155.Google Scholar
Kärkkäinen, J., Kempa, D., and Puglisi, S. J. (2013a). Lightweight Lempel-Ziv parsing. In Proc. 12th International Symposium on Experimental Algorithms (SEA), pages 139–150.Google Scholar
Kärkkäinen, J., Kempa, D., and Puglisi, S. J. (2013b). Linear time Lempel-Ziv factorization: Simple, fast, small. In Proc. 24th Annual Symposium on Combinatorial Pattern Matching (CPM), LNCS 7922, pages 189–200.Google Scholar
Kärkkäinen, J., Kempa, D., and Puglisi, S. J. (2014). Lempel-Ziv parsing in external memory. In Proc. 24th Data Compression Conference (DCC), pages 153–162.Google Scholar
Kempa, D. and Puglisi, S. J. (2013). Lempel-Ziv factorization: Simple, fast, practical. In Proc. 15th Workshop on Algorithm Engineering and Experiments (ALENEX), pages 103–112.Google Scholar
Kieffer, J. C. and Yang, E.-H. (2000). Grammar-based codes: A new class of universal lossless source codes. IEEE Transactions on Information Theory, 46(3), 737–754.Google Scholar
Köppl, D. and Sadakane, K. (2016). Lempel-Ziv computation in compressed space (LZ-CICS). In Proc. 26th Data Compression Conference (DCC), pages 3–12.Google Scholar
Kosaraju, S. R. and Manzini, G., (1999). Compression of low entropy strings with Lempel-Ziv algorithms. SIAM Journal on Computing, 29(3), 893–911.Google Scholar
Kreft, S. and Navarro, G., (2013). On compressing and indexing repetitive sequences. Theoretical Computer Science, 483, 115–133.Google Scholar
Kuruppu, S., Puglisi, S. J., and Zobel, J. (2010). Relative Lempel-Ziv compression of genomes for large-scale storage and retrieval. In Proc. 17th International Symposium on String Processing and Information Retrieval (SPIRE), LNCS 6393, pages 201–206.Google Scholar
Kuruppu, S., Puglisi, S. J., and Zobel, J. (2011). Reference sequence construction for relative compression of genomes. In Proc. 18th International Symposium on String Processing and Information Retrieval (SPIRE), LNCS 7024, pages 420–425.Google Scholar
Kuruppu, S., Beresford-Smith, B., Conway, T. C., and Zobel, J. (2012). Iterative dictionary construction for compression of large DNA data sets. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 9, 137–149.Google Scholar
Larsson, J. and Moffat, A. (2000). Off-line dictionary-based compression. Proceedings of the IEEE, 88(11), 1722–1732.Google Scholar
Lempel, A. and Ziv, J. (1976). On the complexity of finite sequences. IEEE Transactions on Information Theory, 22(1), 75–81.Google Scholar
Mäkinen, V. (2003). Compact suffix array – A space-efficient full-text index. Fundamenta Informaticae, 56(1-2), 191–210.Google Scholar
Mäkinen, V. (2008). Personal communication.
Mäkinen, V. and Navarro, G., (2004). Compressed compact suffix arrays. In Proc. 15th Annual Symposium on Combinatorial Pattern Matching (CPM), LNCS 3109, pages 420–433.Google Scholar
Mäkinen, V. and Navarro, G., (2005). Succinct suffix arrays based on run-length encoding. Nordic Journal of Computing, 12(1), 40–66.Google Scholar
Mäkinen, V.,Navarro, G., and Sadakane, K. (2004).Advantages of backward searching – efficient secondary memory and distributed implementation of compressed suffix arrays. In Proc. 15th Annual International Symposium on Algorithms and Computation (ISAAC), LNCS 3341, pages 681–692.Google Scholar
Mäkinen, V., Navarro, G., Sirén, J., and Välimäki, N. (2010). Storage and retrieval of highly repetitive sequence collections. Journal of Computational Biology, 17(3), 281–308.Google Scholar
Maruyama, S., Sakamoto, H., and Takeda, M. (2012). An online algorithm for lightweight grammarbased compression. Algorithms, 5(2), 214–235.Google Scholar
Maruyama, S., Nakahara, M., Kishiue, N., and Sakamoto, H. (2013a). ESP-index: A compressed index based on edit-sensitive parsing. Journal of Discrete Algorithms, 18, 100–112.Google Scholar
Maruyama, S., Tabei, Y., Sakamoto, H., and Sadakane, K. (2013b). Fully-online grammar compression. In Proc. 20th International Symposium on String Processing and Information Retrieval (SPIRE), pages 218–229.Google Scholar
Moffat, A., Puglisi, S. J., and Sinha, R. (2009). Reducing space requirements for disk resident suffix arrays. In Proc. 14th International Conference on Database Systems for Advanced Applications (DASFAA), pages 730–744.Google Scholar
Munro, J. I., Raman, V., and Storm, A. J. (2001). Representing dynamic binary trees succinctly. In Proc. 12th Annual ACM-SIAM Symposium on Discrete Algorithm (SODA), pages 529–536.Google Scholar
Na, J. C. and Park, K. (2004). Simple implementation of String B-trees. In Proc. 11th International Symposium on String Processing and Information Retrieval (SPIRE), LNCS 3246, pages 214–215.Google Scholar
Na, J. C., Park, H., Lee, S., Hong, M., Lecroq, T., Mouchard, L., and Park, K. (2013a). Suffix array of alignment: A practical index for similar data. In Proc. 20th International Symposium on String Processing and Information Retrieval (SPIRE), LNCS 8214, pages 243–254.Google Scholar
Na, J. C., Park, H., Crochemore, M., Holub, J., Iliopoulos, C. S., Mouchard, L., and Park, K. (2013b). Suffix tree of alignment: An efficient index for similar data. In Proc. 24th International Workshop on Combinatorial Algorithms (IWOCA), LNCS 8288, pages 337–348.Google Scholar
Navarro, G., (2004). Indexing text using the Ziv-Lempel trie. Journal of Discrete Algorithms, 2(1), 87–114.Google Scholar
Navarro, G., and Ordónez, A. (2016). Faster compressed suffix trees for repetitive text collections. Journal of Experimental Algorithmics, 21(1), article 1.8.Google Scholar
Navarro, G., and Thankachan, S. V. (2016). Optimal encodings for range majority queries. Algorithmica, 74(3), 1082–1098.Google Scholar
Navarro, G., Raman, R., and Rao, S. S. (2014). Asymptotically optimal encodings for range selection. In Proc. 34th Annual Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTCS), pages 291–302.Google Scholar
Nevill-Manning, C., Witten, I., and Maulsby, D. (1994). Compression by induction of hierarchical grammars. In Proc. 4th Data Compression Conference (DCC), pages 244–253.Google Scholar
Nicholson, P. K. and Raman, R. (2015). Encoding nearest largest values. In Proc. 26th Annual Symposium on Combinatorial Pattern Matching (CPM), LNCS 9133, pages 385–395.Google Scholar
Nishimoto, T., I, T., Inenaga, S., Bannai, H., and Takeda, M. (2015). Dynamic index, LZ factorization, and LCE queries in compressed space. CoRR, abs/1504.06954. http://arxiv.org/abs/1504.06954.
Ohlebusch, E. and Gog, S. (2011). Lempel-Ziv factorization revisited. In Proc. 22nd Annual Symposium on Combinatorial Pattern Matching (CPM), LNCS 6661, pages 15–26.Google Scholar
Orlandi, A. and Venturini, R. (2016). Space-efficient substring occurrence estimation. Algorithmica, 74(1), 65–90.Google Scholar
Pătraşcu, M. and Thorup, M. (2006). Time-space trade-offs for predecessor search. In Proc. 38th Annual ACM Symposium on Theory of Computing (STOC), pages 232–240.Google Scholar
Policriti, A. and Prezza, N. (2016). Computing LZ77 in run-compressed space. In Proc. 26th Data Compression Conference (DCC), pages 23–32.Google Scholar
Raman, R. (2015). Encoding data structures. In Proc. 9th International Workshop on Algorithms and Computation (WALCOM), LNCS 8973, pages 1–7.Google Scholar
Rodeh, M., Pratt, V. R., and Even, S. (1981). Linear algorithm for data compression via string matching. Journal of the ACM, 28(1), 16–24.Google Scholar
Russo, L. M. S. and Oliveira, A. L. (2008). A compressed self-index using a Ziv-Lempel dictionary. Information Retrieval, 11(4), 359–388.Google Scholar
Rytter, W. (2003). Application of Lempel-Ziv factorization to the approximation of grammar-based compression. Theoretical Computer Science, 302(1–3), 211–222.Google Scholar
Sakamoto, H. (2005). A fully linear-time approximation algorithm for grammar-based compression. Journal of Discrete Algorithms, 3(2-4), 416–430.Google Scholar
Samet, H. (2006). Foundations of Multidimensional and Metric Data Structures. Morgan Kaufmann.
Sheinwald, D. (1994). On the Ziv-Lempel proof and related topics. Proceedings of the IEEE, 82, 866–871.Google Scholar
Sinha, R., Puglisi, S. J., Moffat, A., and Turpin, A. (2008). Improving suffix array locality for fast pattern matching on disk. In Proc. ACM International Conference on Management of Data (SIGMOD), pages 661–672.Google Scholar
Skala, M. (2013). Array range queries. In Space-Efficient Data Structures, Streams, and Algorithms – Papers in Honor of J. Ian Munro on the Occasion of His 66th Birthday, LNCS 8066, pages 333–350. Springer.Google Scholar
Storer, J. A. (1977). NP-completeness results concerning data compression. Technical Report 234, Department of Electrical Engineering and Computer Science, Princeton University.
Storer, J. A. and Szymanski, T. G. (1982). Data compression via textual substitution. Journal of the ACM, 29(4), 928–951.Google Scholar
Subramanian, S. and Ramaswamy, S. (1995). The P-range tree: A new data structure for range searching in secondary memory. In Proc. 6th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 378–387.Google Scholar
Szpankowski, W. (1993). A generalized suffix tree and its (un)expected asymptotic behaviors. SIAM Journal on Computing, 22(6), 1176–1198.Google Scholar
Vitter, J. S. (2008). Algorithms and Data Structures for External Memory. Now Publishers.
Wyner, A. and Ziv, J. (1994). The sliding-window Lempel-Ziv algorithm is asymptotically optimal. Proceedings of the IEEE, 82, 872–877.Google Scholar
Yamamoto, J., I, T., Bannai, H., Inenaga, S., and Takeda, M. (2014). Faster compact on-line Lempel- Ziv factorization. In Proc. 31st International Symposium on Theoretical Aspects of Computer Science (STACS), LIPIcs 25, pages 675–686.Google Scholar
Yang, X., Wang, B., Li, C., Wang, J., and Xie, X. (2013). Efficient direct search on compressed genomic data. In Proc. 29th IEEE International Conference on Data Engineering (ICDE), pages 961–972.Google Scholar
Ziv, J. and Lempel, A. (1977). A universal algorithm for sequential data compression. IEEE Transactions on Information Theory, 23(3), 337–343.Google Scholar
Ziv, J. and Lempel, A. (1978). Compression of individual sequences via variable length coding. IEEE Transactions on Information Theory, 24(5), 530–536.Google Scholar

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

  • Recent Trends
  • Gonzalo Navarro, Universidad de Chile
  • Book: Compact Data Structures
  • Online publication: 05 September 2016
  • Chapter DOI: https://doi.org/10.1017/CBO9781316588284.014
Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

  • Recent Trends
  • Gonzalo Navarro, Universidad de Chile
  • Book: Compact Data Structures
  • Online publication: 05 September 2016
  • Chapter DOI: https://doi.org/10.1017/CBO9781316588284.014
Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

  • Recent Trends
  • Gonzalo Navarro, Universidad de Chile
  • Book: Compact Data Structures
  • Online publication: 05 September 2016
  • Chapter DOI: https://doi.org/10.1017/CBO9781316588284.014
Available formats
×