Efficient parallel and incremental parsing of practical context-free languages

JEAN-PHILIPPE BERNARDY; KOEN CLAESSEN

doi:10.1017/S0956796815000131

Efficient parallel and incremental parsing of practical context-free languages

Part of: JFP Research Articles

Published online by Cambridge University Press: 23 July 2015

JEAN-PHILIPPE BERNARDY and

KOEN CLAESSEN

Show author details

JEAN-PHILIPPE BERNARDY: Affiliation:
Chalmers University of Technology & University of Gothenburg, Sweden (e-mail: bernardy@chalmers.se, koen@chalmers.se)
KOEN CLAESSEN: Affiliation:
Chalmers University of Technology & University of Gothenburg, Sweden (e-mail: bernardy@chalmers.se, koen@chalmers.se)

Article contents

Abstract
References

Rights & Permissions

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

We present a divide-and-conquer algorithm for parsing context-free languages efficiently. Our algorithm is an instance of Valiant's (1975; General context-free recognition in less than cubic time. J. Comput. Syst. Sci.10(2), 308–314), who reduced the problem of parsing to matrix multiplications. We show that, while the conquer step of Valiant's is O(n3), it improves to O(log2n) under certain conditions satisfied by many useful inputs that occur in practice, and if one uses a sparse representation of matrices. The improvement happens because the multiplications involve an overwhelming majority of empty matrices. This result is relevant to modern computing: divide-and-conquer algorithms with a polylogarithmic conquer step can be parallelized relatively easily.

Type: Articles
Information: Journal of Functional Programming , Volume 25 , 2015 , e10

DOI: https://doi.org/10.1017/S0956796815000131 [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2015

References

Allison, L. (1992) Lazy dynamic-programming can be eager. Inform. Process. Lett. 43 (4), 207–212.CrossRef Google Scholar

Bernardy, J.-P. (2008) Yi: An editor in Haskell for Haskell. In Proceedings of the 1st ACM SIGPLAN Symposium on Haskell. ACM, pp. 61–62.CrossRef Google Scholar

Bernardy, J.-P. (2009) Lazy functional incremental parsing. In Proceedings of the 2nd ACM SIGPLAN Symposium on Haskell. ACM, pp. 49–60.CrossRef Google Scholar

Bernardy, J.-P. and Claessen, K. (2013) Efficient divide-and-conquer parsing of practical context-free languages. In Proceedings of the 18th ACM SIGPLAN International Conference on Funct. Programming, pp. 111–122.CrossRef Google Scholar

Bird, R. (1986) An Introduction to the Theory of Lists. Programming Research Group, Oxford University Comp. Laboratory.Google Scholar

Burckhardt, S., Leijen, D., Sadowski, C., Yi, J. & Ball, T. (2011) Two for the price of one: A model for parallel and incremental computation. In Proceedings of the 2011 ACM International Conference on Object Oriented Programming Systems Languages and Applications. ACM, pp. 427–444.CrossRef Google Scholar

Chomsky, N. (1959) On certain formal properties of grammars. Inform. Control 2 (2), 137–167.CrossRef Google Scholar

Chytil, M., Crochemore, M., Monien, B. & Rytter, W. (1991) On the parallel recognition of unambiguous context-free languages. Theor. Comput. Sci. 81 (2), 311–316.CrossRef Google Scholar

Claessen, K. (2004) Parallel parsing processes. J. Funct. Program. 14 (6), 741–757.CrossRef Google Scholar

Cocke, J. (1969) Programming Languages and their Compilers: Preliminary Notes. Courant Institute of Mathematical Sci., New York University.Google Scholar

Cormen, T. H., Leiserson, C. E., Rivest, R. L. & Stein, C. (2001) Introduction to Algorithms, 2nd ed.MIT press.Google Scholar

Forsberg, M. & Ranta, A.BNFC Quick reference, chapter Appendix A, London: College Publications, pp. 175–192.Google Scholar

Free Software Foundation. (1991) Gnu general public license.Google Scholar

Gibbons, J. (1996) The third homomorphism theorem. J. Funct. Program. 6 (4), 657–665.CrossRef Google Scholar

Hinze, R. & Paterson, R. (2006) Finger trees: A simple general-purpose data structure. J. Funct. Program. 16 (2), 197–218.CrossRef Google Scholar

Hughes, R. J. M. & Swierstra, S. D. (2003) Polish parsers, step by step. In Proceedings of the Eighth ACM SIGPLAN International Conference on Funct. Programming. ACM, pp. 239–248.CrossRef Google Scholar

Kasami, T. (1965) An Efficient Recognition and Syntax Analysis Algorithm for Context-Free Languages. Technical Report, DTIC Document.Google Scholar

Lange, M. and Leiß, H. (2009) To CNF or not to CNF? An efficient yet presentable version of the CYK algorithm. Inform. Didactica 8, 2008–2010.Google Scholar

Morita, K., Morihata, A., Matsuzaki, K., Hu, Z. & Takeichi, M. (2007) Automatic inversion generates divide-and-conquer parallel programs. ACM SIGPLAN Not. 42 (6), 146–155.CrossRef Google Scholar

Okhotin, A. (2014) Parsing by matrix multiplication generalized to boolean grammars. Theor. Comput. Sci. 516 (0), 101–120.CrossRef Google Scholar

O'Sullivan, B. (2013) The Criterion benchmarking library.Google Scholar

Rytter, W. and Giancarlo, R. (1987) Optimal parallel parsing of bracket languages. Theor. Comput. Sci. 53 (2), 295–306.CrossRef Google Scholar

Sikkel, K. and Nijholt, A. (1997) Parsing of Context-Free Languages. Berlin: Springer-Verlag, pp. 61–100.Google Scholar

Strassen, V. (1969) Gaussian elimination is not optimal. Numer. Math. 13, 354–356. DOI: 10.1007/BF02165411.CrossRef Google Scholar

Tomita, M. (1986) Efficient Parsing for Natural Language. Dordrecht: Kluwer Academic Publishers.CrossRef Google Scholar

Valiant, L. (1975) General context-free recognition in less than cubic time. J. Comput. Syst. Sci. 10 (2), 308–314.CrossRef Google Scholar

Wagner, T. A. and Graham, S. L. (1998) Efficient and flexible incremental parsing. ACM Trans. Program. Lang. Syst. 20 (5), 980–1013.CrossRef Google Scholar

Younger, D. (1967) Recognition and parsing of context-free languages in time n ³. Inform. Control 10 (2), 189–208.CrossRef Google Scholar

Submit a response

Discussions

No Discussions have been published for this article.

Article contents

Efficient parallel and incremental parsing of practical context-free languages

Abstract

References

Discussions

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests