Choosing the content of textual summaries of large time-series data sets

JIN YU; EHUD REITER; JIM HUNTER; CHRIS MELLISH

doi:10.1017/S1351324905004031

Abstract

Natural Language Generation (NLG) can be used to generate textual summaries of numeric data sets. In this paper we develop an architecture for generating short (a few sentences) summaries of large (100KB or more) time-series data sets. The architecture integrates pattern recognition, pattern abstraction, selection of the most significant patterns, microplanning (especially aggregation), and realisation. We also describe and evaluate SumTime-Turbine, a prototype system which uses this architecture to generate textualsummaries of sensor data from gas turbines.

Crossref Citations

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Reiter, Ehud Sripada, Somayajulu Hunter, Jim Yu, Jin and Davy, Ian 2005. Choosing words in computer-generated weather forecasts. Artificial Intelligence, Vol. 167, Issue. 1-2, p. 137.

Carenini, Giuseppe Ng, Raymond T. and Pauls, Adam 2006. Interactive multimedia summaries of evaluative text. p. 124.

Batyrshin, Ildar Z. and Sheremetov, Leonid 2007. Forging New Frontiers: Fuzzy Pioneers I. Vol. 217, Issue. , p. 217.

Spärck Jones, Karen 2007. Automatic summarising: The state of the art. Information Processing & Management, Vol. 43, Issue. 6, p. 1449.

Ferres, Leo Verkhogliad, Petro Lindgaard, Gitte Boucher, Louis Chretien, Antoine and Lachance, Martin 2007. Improving accessibility to statistical graphs. p. 67.

Batyrshin, Ildar and Sheremetov, Leonid 2007. Theoretical Advances and Applications of Fuzzy Logic and Soft Computing. Vol. 42, Issue. , p. 209.

Batyrshin, Ildar Sheremetov, Leonid and Herrera-Avelar, Raul 2007. Perception-based Data Mining and Decision Making in Economics and Finance. Vol. 36, Issue. , p. 85.

Batyrshin, I.Z. and Sheremetov, L.B. 2008. Perception-based approach to time series data mining. Applied Soft Computing, Vol. 8, Issue. 3, p. 1211.

Carberry, Sandra and Elzer, Stephanie 2008. Computational Intelligence in Multimedia Processing: Recent Advances. Vol. 96, Issue. , p. 191.

Wu, Peng Carberry, Sandra Chester, Daniel and Elzer, Stephanie 2008. Foundations of Intelligent Systems. Vol. 4994, Issue. , p. 399.

Portet, François Reiter, Ehud Gatt, Albert Hunter, Jim Sripada, Somayajulu Freer, Yvonne and Sykes, Cindy 2009. Automatic generation of textual summaries from neonatal intensive care data. Artificial Intelligence, Vol. 173, Issue. 7-8, p. 789.

Molina, Martin and Stent, Amanda 2009. Generating Descriptions that Summarize Geospatial and Temporal Data. p. 485.

Matheson, Donald Sripada, Somayujulu and Coghill, George M 2010. Moving from data to text using causal statements in explanatory narratives. p. 1.

Kacprzyk, Janusz Wilbik, Anna and Zadrożny, Sławomir 2010. An approach to the linguistic summarization of time series using a fuzzy quantifier driven aggregation. International Journal of Intelligent Systems, p. n/a.

MOLINA, MARTIN and STENT, AMANDA 2010. A KNOWLEDGE-BASED METHOD FOR GENERATING SUMMARIES OF SPATIAL MOVEMENT IN GEOGRAPHIC AREAS. International Journal on Artificial Intelligence Tools, Vol. 19, Issue. 04, p. 393.

Abu Doush, Iyad Pontelli, Enrico Son, Tran Cao Simon, Dominic and Ma, Ou 2010. Multimodal Presentation of Two-Dimensional Charts. ACM Transactions on Accessible Computing, Vol. 3, Issue. 2, p. 1.

2010. The Handbook of Computational Linguistics and Natural Language Processing. p. 655.

Wanner, Leo Bohnet, Bernd Bouayad-Agha, Nadjet Lareau, François and Nicklaß, Daniel 2010. MARQUIS: GENERATION OF USER-TAILORED MULTILINGUAL AIR QUALITY BULLETINS. Applied Artificial Intelligence, Vol. 24, Issue. 10, p. 914.

Molina, Martin Parodi, Enrique and Stent, Amanda 2010. Combining Text and Graphics for Interactive Exploration of Behavior Datasets. p. 150.

Wu, Peng Carberry, Sandra Elzer, Stephanie and Chester, Daniel 2010. Diagrammatic Representation and Inference. Vol. 6170, Issue. , p. 220.

Download full list

Article contents

Choosing the content of textual summaries of large time-series data sets

Abstract

Access options

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Article contents

Choosing the content of textual summaries of large time-series data sets

Abstract

Access options

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests