Mining of Massive Datasets

Jure Leskovec; Anand Rajaraman; Jeffrey David Ullman

doi:10.1017/CBO9781139924801

Last updated 20/06/24: Online ordering is currently unavailable due to technical issues. We apologise for any delays responding to customers while we resolve this. For further updates please visit our website: https://www.cambridge.org/news-and-insights/technical-incident

Skip to main content Accessibility help

Home
Books
Mining of Massive Datasets

Mining of Massive Datasets

- Get access
  
  Buy a print copy
  
  Check if you have access via personal or institutional login
  
  Log in Register
Cited by 633
Cited by
- 633
Crossref Citations

This Book has been cited by the following publications. This list is generated based on data provided by Crossref.

Juhola, Martti and Grönfors, Tapio 2014. Encyclopedia of Information Science and Technology, Third Edition. p. 306.

CrossRef

Google Scholar

Vossen, Gottfried 2014. Big data as the new enabler in business and other intelligence. Vietnam Journal of Computer Science, Vol. 1, Issue. 1, p. 3.

CrossRef

Google Scholar

Ghasemi, Ahmad Masnadi‐Shirazi, Mohammad Ali Biguesh, M. and Qassemi, Foad 2014. Channel assignment based on bee algorithms in multi‐hop cognitive radio networks. IET Communications, Vol. 8, Issue. 13, p. 2356.

CrossRef

Google Scholar

Grigore, Radu and Kiefer, Stefan 2015. Computer Aided Verification. Vol. 9206, Issue. , p. 290.

CrossRef

Google Scholar

Szekely, Pedro Knoblock, Craig A. Slepicka, Jason Philpot, Andrew Singh, Amandeep Yin, Chengye Kapoor, Dipsy Natarajan, Prem Marcu, Daniel Knight, Kevin Stallard, David Karunamoorthy, Subessware S. Bojanapalli, Rajagopal Minton, Steven Amanatullah, Brian Hughes, Todd Tamayo, Mike Flynt, David Artiss, Rachel Chang, Shih-Fu Chen, Tao Hiebel, Gerald and Ferreira, Lidia 2015. The Semantic Web - ISWC 2015. Vol. 9367, Issue. , p. 205.

CrossRef

Google Scholar

Drakopoulos, Georgios and Megalooikonomou, Vasileios 2015. On the weight sparsity of multilayer perceptrons. p. 1.

CrossRef

Google Scholar

Cazzanti, Luca and Pallotta, Giuliana 2015. Mining maritime vessel traffic: Promises, challenges, techniques. p. 1.

CrossRef

Google Scholar

Broome, Barbara D. Hanratty, Timothy P. Hall, David L. Llinas, James Knoblock, Craig A. and Szekely, Pedro 2015. A scalable architecture for extracting, aligning, linking, and visualizing multi-Int data. Vol. 9499, Issue. , p. 949907.

CrossRef

Google Scholar

Lillo-Castellano, J.M. Mora-Jimenez, I. Moreno-Gonzalez, R. Montserrat-Garcia-de-Pablo, M. Garcia-Alberola, A. and Rojo-Alvarez, J.L. 2015. Big-data analytics for Arrhythmia Classification using data compression and kernel methods. p. 661.

CrossRef

Google Scholar

Balachandran, Vipin 2015. Query by example in large-scale code repositories. p. 467.

CrossRef

Google Scholar

Pokorný, Jaroslav Škoda, Petr Zelinka, Ivan Bednárek, David Zavoral, Filip Kruliš, Martin and Šaloun, Petr 2015. Big Data in Complex Systems. Vol. 9, Issue. , p. 29.

CrossRef

Google Scholar

Qayumi, Karima 2015. Multi-agent Based Intelligence Generation from Very Large Datasets. p. 502.

CrossRef

Google Scholar

Saravanan, S. 2015. Design of large-scale Content-based recommender system using hadoop MapReduce framework. p. 302.

CrossRef

Google Scholar

Silvestre, Guthemberg Sauvanaud, Carla Kaâniche, Mohamed and Kanoun, Karama 2015. Software Engineering for Resilient Systems. Vol. 9274, Issue. , p. 114.

CrossRef

Google Scholar

Pinto, Diogo Costa, Pedro Camacho, Rui and Costa, Vítor Santos 2015. Discovery Science. Vol. 9356, Issue. , p. 201.

CrossRef

Google Scholar

Yang, Xin-She Lee, Sanghyuk Lee, Sangmin and Theera-Umpon, Nipon 2015. Information Analysis of High-Dimensional Data and Applications. Mathematical Problems in Engineering, Vol. 2015, Issue. , p. 1.

CrossRef

Google Scholar

Hanusniak, Vladimir Svalec, Marian Branicky, Juraj Takac, Lubos and Zabovsky, Michal 2015. Exploitation of Hadoop framework for point cloud geographic data storage system. p. 197.

CrossRef

Google Scholar

Sabrina, Puspita Nurul and Saptawati, G. A. Putri 2015. Multiple MapReduce and derivative projected database: New approach for supporting PrefixSpan scalability. p. 148.

CrossRef

Google Scholar

Johnson, Richard A. and Wichern, Dean 2015. Wiley StatsRef: Statistics Reference Online. p. 1.

CrossRef

Google Scholar

Yoon, Clara E. O’Reilly, Ossian Bergen, Karianne J. and Beroza, Gregory C. 2015. Earthquake detection through computationally efficient similarity search. Science Advances, Vol. 1, Issue. 11,

CrossRef

Google Scholar

Download full list

2nd edition
Jure Leskovec, Stanford University, California, Anand Rajaraman, Milliways Laboratories, California, Jeffrey David Ullman, Stanford University, California

Publisher:: Cambridge University Press
Online publication date:: December 2014
Print publication year:: 2014
Online ISBN:: 9781139924801
DOI:: https://doi.org/10.1017/CBO9781139924801

Subjects:: Knowledge Management, Databases and Data Mining, Computer Science, Computational Statistics, Machine Learning and Information Science, Statistics and Probability

Information

Contents

Metrics

Written by leading authorities in database and Web technologies, this book is essential reading for students and practitioners alike. The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and can be applied successfully to even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. Other chapters cover the PageRank idea and related tricks for organizing the Web, the problems of finding frequent itemsets and clustering. This second edition includes new and extended coverage on social networks, machine learning and dimensionality reduction.

References

Metrics

Altmetric attention score

Total number of HTML views: 0

Total number of PDF views: 0 *

Loading metrics...

Total views: 0 *

Loading metrics...

* Views captured on Cambridge Core between #date#. This data will be updated every 24 hours.

Usage data cannot currently be displayed.

Mining of Massive Datasets

This Book has been cited by the following publications. This list is generated based on data provided by Crossref.

Book description

Refine List

Actions for selected content:

Contents

Frontmatter
pp i-iv

Contents
pp v-viii

Preface
pp ix-xii

1 - Data Mining
pp 1-18

2 - MapReduce and the New Software Stack
pp 19-67

3 - Finding Similar Items
pp 68-122

4 - Mining Data Streams
pp 123-153

5 - Link Analysis
pp 154-190

6 - Frequent Itemsets
pp 191-227

7 - Clustering
pp 228-266

8 - Advertising on the Web
pp 267-291

9 - Recommendation Systems
pp 292-324

10 - Mining Social-Network Graphs
pp 325-383

11 - Dimensionality Reduction
pp 384-414

12 - Large-Scale Machine Learning
pp 415-458

Index
pp 459-467

Metrics

Altmetric attention score

Full text views

Book summary page views

Mining of Massive Datasets

Book description

Refine List

Actions for selected content:

Save Search

Contents

Metrics

Altmetric attention score

Full text views

Book summary page views