rent-now

Rent More, Save More! Use code: ECRENTAL

5% off 1 book, 7% off 2 books, 10% off 3+ books

9780262514125

Semi-supervised Learning

by ; ;
  • ISBN13:

    9780262514125

  • ISBN10:

    0262514125

  • Edition: 1st
  • Format: Paperback
  • Copyright: 2010-01-22
  • Publisher: The MIT Press

Note: Supplemental materials are not guaranteed with Rental or Used book purchases.

Purchase Benefits

  • Free Shipping Icon Free Shipping On Orders Over $35!
    Your order must be $35 or more to qualify for free economy shipping. Bulk sales, PO's, Marketplace items, eBooks and apparel do not qualify for this offer.
  • eCampus.com Logo Get Rewarded for Ordering Your Textbooks! Enroll Now
List Price: $48.00 Save up to $13.80
  • Rent Book $34.20
    Add to Cart Free Shipping Icon Free Shipping

    TERM
    PRICE
    DUE
    USUALLY SHIPS IN 3-5 BUSINESS DAYS
    *This item is part of an exclusive publisher rental program and requires an additional convenience fee. This fee will be reflected in the shopping cart.

How To: Textbook Rental

Looking to rent a book? Rent Semi-supervised Learning [ISBN: 9780262514125] for the semester, quarter, and short term or search our site for other textbooks by Chapelle, Olivier; Scholkopf, Bernhard; Zien, Alexander. Renting a textbook can save you up to 90% from the cost of buying.

Summary

In the field of machine learning, semi-supervised learning (SSL) occupies the middle ground, between supervised learning (in which all training examples are labeled) and unsupervised learning (in which no label data are given). Interest in SSL has increased in recent years, particularly because of application domains in which unlabeled data are plentiful, such as images, text, and bioinformatics. This first comprehensive overview of SSL presents state-of-the-art algorithms, a taxonomy of the field, selected applications, benchmark experiments, and perspectives on ongoing and future research. Semi-Supervised Learningfirst presents the key assumptions and ideas underlying the field: smoothness, cluster or low-density separation, manifold structure, and transduction. The core of the book is the presentation of SSL methods, organized according to algorithmic strategies. After an examination of generative models, the book describes algorithms that implement the low-density separation assumption, graph-based methods, and algorithms that perform two-step learning. The book then discusses SSL applications and offers guidelines for SSL practitioners by analyzing the results of extensive benchmark experiments. Finally, the book looks at interesting directions for SSL research. The book closes with a discussion of the relationship between semi-supervised learning and transduction. Olivier Chapelle and Alexander Zien are Research Scientists and Bernhard Schölkopf is Professor and Director at the Max Planck Institute for Biological Cybernetics in Tübingen. Schölkopf is coauthor of Learning with Kernels(MIT Press, 2002) and is a coeditor of Advances in Kernel Methods: Support Vector Learning(1998), Advances in Large-Margin Classifiers(2000), and Kernel Methods in Computational Biology(2004), all published by The MIT Press.

Author Biography

Olivier Chapelle is Senior Research Scientist in Machine Learning at Yahoo.

Bernhard Schölkopf is Director at the Max Planck Institute for Intelligent Systems in Tübingen, Germany. He is coauthor of Learning with Kernels (2002) and is a coeditor of Advances in Kernel Methods: Support Vector Learning (1998), Advances in Large-Margin Classifiers (2000), and Kernel Methods in Computational Biology (2004), all published by the MIT Press.

Alexander Zien is Senior Analyst in Bioinformatics at LIFE Biosystems GmbH, Heidelberg.

Table of Contents

Series Forewordp. xi
Prefacep. xiii
Introduction to Semi-Supervised Learningp. 1
Supervised, Unsupervised, and Semi-Supervised Learningp. 1
When Can Semi-Supervised Learning Work?p. 4
Classes of Algorithms and Organization of This Bookp. 8
Generative Models
A Taxonomy for Semi-Supervised Learning Methodsp. 15
The Semi-Supervised Learning Problemp. 15
Paradigms for Semi-Supervised Learningp. 17
Examplesp. 22
Conclusionsp. 31
Semi-Supervised Text Classification Using EMp. 33
Introductionp. 33
A Generative Model for Textp. 35
Experminental Results with Basic EMp. 41
Using a More Expressive Generative Modelp. 43
Overcoming the Challenges of Local Maximap. 49
Conclusions and Summaryp. 54
Risks of Semi-Supervised Learningp. 57
Do Unlabled Data Improve or Degrade Classification Performance?p. 57
Understanding Unlabeled Data: Asymptotic Biasp. 59
The Asymptotic Analysis of Generative Smei-Supervised Learningp. 63
The Value of Labeled and Unlabeled Datap. 67
Finite Sample Effectsp. 69
Model Search and Robustnessp. 70
Conclusionp. 71
Probabilistic Semi-Supervised Cluster with Constraintsp. 73
Introductionp. 74
HMRF Model for Semi-Supervised Clusteringp. 75
HMRF-KMeans Algorithmp. 81
Active Learning for Constraint Acquistionp. 93
Experimental Resultsp. 96
Related Workp. 100
Conclusionsp. 101
Low-Density Separation
Transductive Support Vector Machinesp. 105
Introductionp. 105
Transductive Support Vector Machinesp. 108
Why Use Margin on the Test Set?p. 111
Experiments and Applications of the TSVMsp. 112
Solving the TSVM Optimization Problemp. 114
Connection to Related Approachesp. 116
Summary and Conclusionsp. 116
Semi-Supervised Learning Using Semi-Definite Programmingp. 119
Relaxing SVM transductionp. 119
An Approximation for Speedupp. 126
General Semi-Supervised Learning Settingsp. 128
Empirical Resultsp. 129
Summary and Outlookp. 133
Appendix: The Extended Schur Complement Lemmap. 134
Gaussian Processes and the Null-Category Noise Modelp. 137
Introductionp. 137
The Noise Modelp. 141
Process Model and the Effect of the Null-Categoryp. 143
Posterior Inference and Predictionp. 145
Resultsp. 147
Discussionp. 149
Entropy Regularizationp. 151
Introductionp. 151
Derivation of the Criterionp. 152
Optimization Algorithmsp. 155
Related Methodsp. 158
Experimentsp. 160
Conclusionp. 166
Appendix: Proof of Theorem 9.1p. 166
Data-Dependent Regularizationp. 169
Introductionp. 169
Information Regularization on Metric Spacesp. 174
Information Regularization and Relational Datap. 182
Discussionp. 189
Graph-Based Models
Label Propogation and Quadratic Criterionp. 193
Introductionp. 193
Label Propogation on a Similarity Graphp. 194
Quadratic Cost Criterionp. 198
From Transduction to Inductionp. 205
Incorporating Class Prior Knowledgep. 205
Curse of Dimensionality for Semi-Supervised Learningp. 206
Discussionp. 215
The Geometric Basis of Semi-Supervised Learningp. 217
Introductionp. 217
Incorporating Geometry in Regularizationp. 220
Algorithmsp. 224
Data-Dependent Kernels for Semi-Supervised Learningp. 229
Linear Methods for Large-Scale Semi-Supervised Learningp. 231
Connections to Other Algorithms and Related Workp. 232
Future Directionsp. 234
Discrete Regularizationp. 237
Introductionp. 237
Discrete Analysisp. 239
Discrete Regularizationp. 245
Conclusionp. 249
Semi-Supervised Learning with Conditional Harmonic Mixingp. 251
Introductionp. 251
Conditional Harmonic Mixingp. 255
Learning in CHM Modelsp. 256
Incorporating Prior Knowledgep. 261
Learning the Conditionalsp. 261
Model Averagingp. 262
Experimentsp. 263
Conclusionsp. 273
Change of Representation
Graph Kernels by Spectral Transformsp. 277
The Graph Laplacianp. 278
Kernels by Spectral Transformsp. 280
Kernel Alignmentp. 281
Optimizing Alignment Using QCQP for Semi-Supervised Learningp. 282
Semi-Supervised Kernels with Order Restraintsp. 283
Experimental Resultsp. 285
Conclusionp. 289
Spectral Methods for Dimensionality Reductionp. 293
Introductionp. 293
Linear Methodsp. 295
Graph-Based Methodsp. 297
Kernel Methodsp. 303
Discussionp. 306
Modifying Distancesp. 309
Introductionp. 309
Estimating DBD Metricsp. 312
Computing DBD Metricsp. 321
Semi-Supervised Learning Using Density-Based Metricsp. 327
Conclusions and Future Workp. 329
Semi-Supervised Learning in Practice
Large-Scale Algorithmsp. 333
Introductionp. 333
Cost Approximationsp. 334
Subset Selectionp. 337
Discussionp. 340
Semi-Supervised Protein Classification Using Cluster Kernelsp. 343
Introductionp. 343
Representation and Kernels for Protein Sequencesp. 345
Semi-Supervised Kernels for Protein Sequencesp. 348
Experimentsp. 352
Discussionp. 358
Prediction of Protein Function from Networksp. 361
Introductionp. 361
Graph-Based Semi-Supervised Learningp. 364
Combining Multiple Graphsp. 366
Experiments on Function Prediction of Proteinsp. 369
Conclusion and Outlookp. 374
Analysis of Benchmarksp. 377
The Benchmarkp. 377
Application of SSL Methodsp. 383
Results and Discussionp. 390
Perspectives
An Augmented PAC Model for Semi-Supervised Learningp. 397
Introductionp. 398
A Formal Frameworkp. 400
Sample Complexity Resultsp. 403
Algorithmic Resultsp. 412
Related Models and Discussionp. 416
Metric-Based Approaches for Semi-Supervised Regression and Classificationp. 421
Introductionp. 421
Metric Structure of Supervised Learningp. 423
Model Selectionp. 426
Regularizationp. 436
Classificationp. 445
Conclusionp. 449
Transductive Inference and Semi-Supervised Learningp. 453
Problem Settingsp. 453
Problem of Generalization in Inductive and Transductive Inferencep. 455
Structure of the VC Bounds and Transductive Inferencep. 457
The Symmetrization Lemma and Transductive Inferencep. 458
Bounds for Transductive Inferencep. 459
The Structural Risk Minimization Principle for Induction and Transductionp. 460
Combinatorics in Transductive Inferencep. 462
Measures of Size of Equivalence Classesp. 463
Algorithms for Inductive and Transductive SVMsp. 465
Semi-Supervised Learningp. 470
Conclusion: Transductive Inference and the New Problems of Inferencep. 470
Beyond Transduction: Selective Inferencep. 471
A Discussion of Semi-Supervised Learning and Transductionp. 473
Referencesp. 479
Notation and Symbolsp. 499
Contributorsp. 503
Indexp. 509
Online Index
Table of Contents provided by Publisher. All Rights Reserved.

Supplemental Materials

What is included with this book?

The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.

The Used, Rental and eBook copies of this book are not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.

Rewards Program