Opinion Summarization and Search

Bing Liu

doi:10.1017/CBO9781139084789.010

As discussed in Chapter 2, in most sentiment analysis applications, one needs to study opinions from many people because due to the subjective nature of opinions, looking at only the opinion from a single person is usually not sufficient. To understand a large number of opinions, some form of summary is necessary. Definition 2.14 in Section 2.2 defined a structured opinion summary called aspect-based summary, also known as feature-based summary in Hu and Liu (2004) and Liu et al. (2005). Much of the opinion summarization research is based on this definition. This form of summary is also widely used in industry. For example, both Microsoft Bing and Google Product Search use this form of summary in their opinion analysis systems.

In general, opinion summarization can be seen as a kind of multidocument text summarization. Traditional text summarization has been studied extensively in NLP (Das, 2007). However, an opinion summary is quite different from a conventional single document or multidocument summary (of factual information). The reason is that an opinion summary should (1) be centered on entities and aspects and sentiments about them and (2) be quantitative. Traditional single document summarization produces a short document from a long document by extracting some “important” sentences, while traditional multidocument summarization finds differences among documents and discards repeated information. Neither of them explicitly captures different topics/entities and their aspects discussed in the documents, nor do they have a quantitative perspective. The “importance” of a sentence in traditional text summarization is typically defined operationally based on the summarization algorithms and measures used in each system. Opinion summary, on the other hand, can be defined formally in a structured form and represented as structured objects (see Definition 2.14). Even for output opinion summaries that are short text documents, there should still be explicit structures in them.

After discussing summarization, we move the topic of opinion search or retrieval in this chapter.

Book contents

9 - Opinion Summarization and Search

Summary

Access options

Book contents

9 - Opinion Summarization and Search

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive