Dimension-reduction: PCA/KPCA and feature selection

S. Y. Kung

doi:10.1017/CBO9781139176224.005

Part II - Dimension-reduction: PCA/KPCA and feature selection

Published online by Cambridge University Press: 05 July 2014

S. Y. Kung

Show author details

S. Y. Kung: Affiliation:
Princeton University, New Jersey

Book contents

Get access

Summary

This part contains two chapters concerning reduction of the dimension of the feature space, which plays a vital role in improving learning efficiency as well as prediction performance.

Chapter 3 covers the most prominent subspace projection approach, namely the classical principal component analysis (PCA), cf. Algorithm 3.1. Theorems 3.1 and 3.2 establish the optimality of PCA for both the minimum reconstruction error and maximum entropy criteria. The optimal error and entropy attainable by PCA are given in closed form. Algorithms 3.2, 3.3, and 3.4 describe the numerical procedures for the computation of PCA via the data matrix, scatter matrix, and kernel matrix, respectively.

Given a finite training dataset, the PCA learning model meets the LSP condition, thus the conventional PCA model can be kernelized. When a nonlinear kernel is adopted, it further extends to the kernel-PCA (KPCA) learning model. The KPCA algorithms can be presented in intrinsic space or empirical space (see Algorithms 3.5 and 3.6). For several real-life datasets, visualization via KPCA shows more visible data separability than that via PCA. Moreover, KPCA is closely related to the kernel-induced spectral space, which proves instrumental for error analysis in unsupervised and supervised applications.

Chapter 4 explores various aspects of feature selection methods for supervised and unsupervised learning scenarios. It presents several filtering-based and wrapper-based methods for feature selection, a popular method for dimension reduction.

Type: Chapter
Information: Kernel Methods and Machine Learning , pp. 77 - 78

DOI: https://doi.org/10.1017/CBO9781139176224.005 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2014

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

Part II - Dimension-reduction: PCA/KPCA and feature selection

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive