SongRecommend: From summarization to recommendation

SWATI TATA; BARBARA DI EUGENIO

doi:10.1017/S1351324912000253

Published online by Cambridge University Press: 28 September 2012

SWATI TATA: Versay Solutions, Chicago, IL, USA. E-mail: swtata@gmail.com
BARBARA DI EUGENIO: Department of Computer Science, University of Illinois, Chicago, IL, USA. E-mail: bdieugen@uic.edu

Abstract

In recent years, the availability of too much information has become a fact of life for anybody connected with the Internet. The same is true for music: because of the penetration of portable devices and the availability of millions of tracks on the web, individual music collections have become unwieldy. Users need tools to help search their own song collections, and to recommend songs they may be interested in. Whereas recommendation systems have been developed for a variety of products, a music recommendation system presents special challenges, including the ability to recommend individual songs, as opposed to entire albums, even if only full album reviews are available on-line. SongRecommend, our music recommendation system, combines information extraction and generation techniques to produce summaries of reviews of individual songs from album reviews. We present a number of evaluations for SongRecommend: intrinsic evaluations of the extraction components, and of the informativeness of the summaries; and a user study of the impact of the song review summaries on users’ decision-making processes. When presented with the summary, users were able to make quicker decisions, and their choices were more varied. Whereas the smaller size of the summary has an impact on time-on-task, users do not appear to choose a specific recommendation only based on number of words. Our work demonstrates that state-of-the-art techniques in Natural Language Processing can be integrated into an effective end-to-end system.

Type: Articles
Information: Natural Language Engineering, Volume 20, Issue 1, January 2014, pp. 29–67

DOI: https://doi.org/10.1017/S1351324912000253
Copyright: Copyright © Cambridge University Press 2012
