Search

Part One - Frameworks for Scaling Up Machine Learning
Edited by Ron Bekkerman, Mikhail Bilenko, John Langford
Book:

Scaling up Machine Learning

Published online:

05 February 2012

Print publication:

30 December 2011, pp 21-22
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Preface
Edited by Ron Bekkerman, Mikhail Bilenko, John Langford
Book:

Scaling up Machine Learning

Published online:

05 February 2012

Print publication:

30 December 2011, pp xv-xvi
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This book attempts to aggregate state-of-the-art research in parallel and distributed machine learning. We believe that parallelization provides a key pathway for scaling up machine learning to large datasets and complex methods. Although large-scale machine learning has been increasingly popular in both industrial and academic research communities, there has been no singular resource covering the variety of approaches recently proposed. We did our best to assemble the most representative contemporary studies in one volume. While each contributed chapter concentrates on a distinct approach and problem, together with their references they provide a comprehensive view of the field.
We believe that the book will be useful to the broad audience of researchers, practitioners, and anyone who wants to grasp the future of machine learning. To smooth the ramp-up for beginners, the first five chapters provide introductory material on machine learning algorithms and parallel computing platforms. Although the book gets deeply technical in some parts, the reader is assumed to have only basic prior knowledge of machine learning and parallel/distributed computing, along with college-level mathematical maturity. We hope that an engineering undergraduate who is familiar with the notion of a classifier and had some exposure to threads, MPI, or MapReduce will be able to understand the majority of the book's content. We also hope that a seasoned expert will find this book full of new, interesting ideas to inspire future research in the area.

Subject Index
Edited by Ron Bekkerman, Mikhail Bilenko, John Langford
Book:

Scaling up Machine Learning

Published online:

05 February 2012

Print publication:

30 December 2011, pp 471-475
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Part Four - Applications
Edited by Ron Bekkerman, Mikhail Bilenko, John Langford
Book:

Scaling up Machine Learning

Published online:

05 February 2012

Print publication:

30 December 2011, pp 371-372
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Frontmatter
Edited by Ron Bekkerman, Mikhail Bilenko, John Langford
Book:

Scaling up Machine Learning

Published online:

05 February 2012

Print publication:

30 December 2011, pp i-iv
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Part Three - Alternative Learning Settings
Edited by Ron Bekkerman, Mikhail Bilenko, John Langford
Book:

Scaling up Machine Learning

Published online:

05 February 2012

Print publication:

30 December 2011, pp 281-282
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Scaling up Machine Learning

Parallel and Distributed Approaches
Edited by Ron Bekkerman, Mikhail Bilenko, John Langford
Published online:

05 February 2012

Print publication:

30 December 2011
- Book
- - Get access
    
    Buy a print copy
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
This book presents an integrated collection of representative approaches for scaling up machine learning and data mining methods on parallel and distributed computing platforms. Demand for parallelizing learning algorithms is highly task-specific: in some settings it is driven by the enormous dataset sizes, in others by model complexity or by real-time performance requirements. Making task-appropriate algorithm and platform choices for large-scale machine learning requires understanding the benefits, trade-offs and constraints of the available options. Solutions presented in the book cover a range of parallelization platforms from FPGAs and GPUs to multi-core systems and commodity clusters, concurrent programming frameworks including CUDA, MPI, MapReduce and DryadLINQ, and learning settings (supervised, unsupervised, semi-supervised and online learning). Extensive coverage of parallelization of boosted trees, SVMs, spectral clustering, belief propagation and other popular learning algorithms, and deep dives into several applications, make the book equally useful for researchers, students and practitioners.

1 - Scaling Up Machine Learning: Introduction
- By Ron Bekkerman, LinkedIn Corporation, Mountain View, CA, USA, Mikhail Bilenko, Microsoft Research, Redmond, WA, USA, John Langford, Yahoo! Research, New York, NY, USA
Edited by Ron Bekkerman, Mikhail Bilenko, John Langford
Book:

Scaling up Machine Learning

Published online:

05 February 2012

Print publication:

30 December 2011, pp 1-20
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Distributed and parallel processing of very large datasets has been employed for decades in specialized, high-budget settings, such as financial and petroleum industry applications. Recent years have brought dramatic progress in usability, cost effectiveness, and diversity of parallel computing platforms, with their popularity growing for a broad set of data analysis and machine learning tasks.
The current rise in interest in scaling up machine learning applications can be partially attributed to the evolution of hardware architectures and programming frameworks that make it easy to exploit the types of parallelism realizable in many learning algorithms. A number of platforms make it convenient to implement concurrent processing of data instances or their features. This allows fairly straightforward parallelization of many learning algorithms that view input as an unordered batch of examples and aggregate isolated computations over each of them.
Increased attention to large-scale machine learning is also due to the spread of very large datasets across many modern applications. Such datasets are often accumulated on distributed storage platforms, motivating the development of learning algorithms that can be distributed appropriately. Finally, the proliferation of sensing devices that perform real-time inference based on high-dimensional, complex feature representations drives additional demand for utilizing parallelism in learning-centric applications. Examples of this trend include speech recognition and visual object detection becoming commonplace in autonomous robots and mobile devices.

Contributors
Edited by Ron Bekkerman, Mikhail Bilenko, John Langford
Book:

Scaling up Machine Learning

Published online:

05 February 2012

Print publication:

30 December 2011, pp xi-xiv
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Contents
Edited by Ron Bekkerman, Mikhail Bilenko, John Langford
Book:

Scaling up Machine Learning

Published online:

05 February 2012

Print publication:

30 December 2011, pp v-x
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Part Two - Supervised and Unsupervised Learning Algorithms
Edited by Ron Bekkerman, Mikhail Bilenko, John Langford
Book:

Scaling up Machine Learning

Published online:

05 February 2012

Print publication:

30 December 2011, pp 107-108
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Search Results

Refine search

Refine search

Actions for selected content:

11 results

Part One - Frameworks for Scaling Up Machine Learning

Preface

Summary

Subject Index

Part Four - Applications

Frontmatter

Part Three - Alternative Learning Settings

Scaling up Machine Learning

1 - Scaling Up Machine Learning: Introduction

Summary

Contributors

Contents

Part Two - Supervised and Unsupervised Learning Algorithms

Search Results

Refine search

Refine search

Actions for selected content:

Save Search

11 results

Summary

Scaling up Machine Learning

Summary