did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

We're the #1 textbook rental company. Let us show you why.

9780470596692

Optimal Learning

by ;
  • ISBN13:

    9780470596692

  • ISBN10:

    0470596694

  • Edition: 1st
  • Format: Hardcover
  • Copyright: 2012-04-17
  • Publisher: Wiley
  • Purchase Benefits
  • Free Shipping Icon Free Shipping On Orders Over $35!
    Your order must be $35 or more to qualify for free economy shipping. Bulk sales, PO's, Marketplace items, eBooks and apparel do not qualify for this offer.
  • eCampus.com Logo Get Rewarded for Ordering Your Textbooks! Enroll Now
List Price: $146.08 Save up to $0.73
  • Buy New
    $145.35
    Add to Cart Free Shipping Icon Free Shipping

    PRINT ON DEMAND: 2-4 WEEKS. THIS ITEM CANNOT BE CANCELLED OR RETURNED.

Supplemental Materials

What is included with this book?

Summary

This text presents optimal learning techniques with applications in energy, homeland security, health, sports, transportation science, biomedical research, biosurveillance, stochastic optimization, high technology, and complex resource allocation problems. The coverage utilizes a relatively new class of algorithmic strategies known as approximate dynamic programming, which merges dynamic programming (Markov decision processes), math programming (linear, nonlinear, and integer), simulation, and statistics. It features mathematical techniques that are applicable to a variety of situations, from identifying promising drug candidates to figuring out the best evacuation plan in the event of a natural disaster.

Author Biography

Warren B. Powell, PhD, is Professor of Operations Research and Financial Engineering at Princeton University, where he is founder and Director of CASTLE Laboratory, a research unit that works with industrial partners to test new ideas found in operations research. The recipient of the 2004 INFORMS Fellow Award, Dr. Powell is the author of Approximate Dynamic Programming: Solving the Curses of Dimensionality, Second Edition (Wiley). Ilya O. Ryzhov, PhD, is Assistant Professor in the Department of Decision, Operations, and Information Technologies at the Robert H. Smith School of Business at the University of Maryland. He has made fundamental contributions to bridge the fields of ranking and selection with multiarmed bandits and optimal learning with mathematical programming.

Table of Contents

Prefacep. xv
Acknowledgmentsp. xix
The Challenges of Learningp. 1
Learning the Best Pathp. 2
Areas of Applicationp. 4
Major Problem Classesp. 12
The Different Types of Learningp. 13
Learning from Different Communitiesp. 16
Information Collection Using Decision Treesp. 18
A Basic Decision Treep. 18
Decision Tree for Offline Learningp. 20
Decision Tree for Online Learningp. 21
Discussionp. 25
Website and Downloadable Softwarep. 26
Goals of this Bookp. 26
Problemsp. 27
Adaptive Learningp. 31
The Frequentist Viewp. 32
The Bayesian Viewp. 33
The Updating Equations for Independent Beliefsp. 34
The Expected Value of Informationp. 36
Updating for Correlated Normal Priorsp. 38
Bayesian Updating with an Uninformative Priorp. 41
Updating for Non-Gaussian Priorsp. 42
The Gamma-Exponential Modelp. 43
The Gamma-Poisson Modelp. 44
The Pareto-Uniform Modelp. 45
Models for Learning Probabilities*p. 46
Learning an Unknown Variance*p. 49
Monte Carlo Simulationp. 51
Why Does It Work?*p. 54
Derivation of ¿p. 54
Derivation of Bayesian Updating Equations for Independent Beliefsp. 55
Bibliographic Notesp. 57
Problemsp. 57
The Economics of Informationp. 61
An Elementary Information Problemp. 61
The Marginal Value of Informationp. 65
An information Acquisition Problemp. 68
Bibliographic Notesp. 70
Problemsp. 70
Ranking and Selectionp. 71
The Modelp. 72
Measurement Policiesp. 75
Deterministic Versus Sequential Policiesp. 75
Optimal Sequential Policiesp. 76
Heuristic Policiesp. 77
Evaluating Policiesp. 81
More Advanced Topics*p. 83
An Alternative Representation of the Probability Spacep. 83
Equivalence of Using True Means and Sample Estimatesp. 84
Bibliographic Notesp. 85
Problemsp. 85
The Knowledge Gradientp. 89
The Knowledge Gradient for Independent Beliefsp. 90
Computationp. 91
Some Properties of the Knowledge Gradientp. 93
The Four Distributions of Learningp. 94
The Value of Information and the S-Curve Effectp. 95
Knowledge Gradient for Correlated Beliefsp. 98
Anticipatory Versus Experiential Learningp. 103
The Knowledge Gradient for Some Non-Gaussian Distributionsp. 105
The Gamma-Exponential Modelp. 105
The Gamma-Poisson Modelp. 108
The Pareto-Uniform Modelp. 109
The Beta-Bernoulli Modelp. 111
Discussionp. 113
Relatives of the Knowledge Gradientp. 114
Expected Improvementp. 114
Linear Loss*p. 115
The Problem of Priorsp. 118
Discussionp. 120
Why Does It Work?*p. 120
Derivation of the Knowledge Gradient Formulap. 120
Bibliographic Notesp. 125
Problemsp. 125
Bandit Problemsp. 139
The Theory and Practice of Gittins Indicesp. 141
Gittins Indices in the Beta-Bernoulli Modelp. 142
Gittins Indices in tie Normal-Normal Modelp. 145
Approximating Gittins Indicesp. 147
Variations of Bandit Problemsp. 148
Upper Confidence Boundingp. 149
The Knowledge Gradient for Bandit Problemsp. 151
The Basic Ideap. 151
Some Experimental Comparisonsp. 153
Non-Normal Modelsp. 156
Bibliographic Notesp. 157
Problemsp. 157
Elements of a Learning Problemp. 163
The States of our Systemp. 164
Types of Decisionsp. 166
Exogenous Informationp. 167
Transition Functionsp. 168
Objective Functionsp. 168
Designing Versus Controllingp. 169
Measurement Costsp. 170
Objectivesp. 170
Evaluating Policiesp. 175
Discussionp. 177
Bibliographic Notesp. 178
Problemsp. 178
Linear Belief Modelsp. 181
Applicationsp. 182
Maximizing Ad Clicksp. 182
Dynamic Pricingp. 184
Housing Loansp. 184
Optimizing Dose Responsep. 185
A Brief Review of Linear Regressionp. 186
The Normal Equationsp. 186
Recursive Least Squaresp. 187
A Bayesian Interpretationp. 188
Generating a Priorp. 189
The Knowledge Gradient for a Linear Modelp. 191
Application to Drug Discoveryp. 192
Application to Dynamic Pricingp. 196
Bibliographic Notesp. 200
Problemsp. 200
Subset Selection Problemsp. 203
Applicationsp. 205
Choosing a Subset Using Ranking and Selectionp. 207
Setting Prior Means and Variancesp. 207
Two Strategies for Setting Prior Covariancesp. 208
Larger Setsp. 209
Using Simulation to Reduce the Problem Sizep. 210
Computational Issuesp. 212
Experimentsp. 213
Very Large Setsp. 214
Bibliographic Notesp. 216
Problemsp. 216
Optimizing a Scalar Functionp. 219
Deterministic Measurementsp. 219
Stochastic Measurementsp. 223
The Modelp. 223
Finding the Posterior Distributionp. 224
Choosing the Measurementp. 226
Discussionp. 229
Bibliographic Notesp. 229
Problemsp. 229
Optimal Biddingp. 231
Modeling Customer Demandp. 233
Some Valuation Modelsp. 233
The Logit Modelp. 234
Bayesian Modeling for Dynamic Pricingp. 237
A Conjugate Prior for Choosing Between Two Demand Curvesp. 237
Moment Matching for Nonconjugate Problemsp. 239
An Approximation for the Logit Modelp. 242
Bidding Strategiesp. 244
An Idea From Multi-Armed Banditsp. 245
Bayes-Greedy Biddingp. 245
Numerical Illustrationsp. 247
Why Does It Work?*p. 251
Moment Matching for Pareto Priorp. 251
Approximating the Logistic Expectationp. 252
Bibliographic Notesp. 253
Problemsp. 254
Stopping Problemsp. 255
Sequential Probability Ratio Testp. 255
The Secretary Problemp. 261
Setupp. 261
Solutionp. 262
Bibliographic Notesp. 266
Problemsp. 266
Active Learning in Statisticsp. 269
Deterministic Policiesp. 270
Sequential Policies for Classificationp. 274
Uncertainty Samplingp. 274
Query by Committeep. 275
Expected Error Reductionp. 277
A Variance-Minimizing Policyp. 277
Mixtures of Gaussiansp. 280
Estimating Parametersp. 280
Active Learningp. 282
Bibliographic Notesp. 283
Simulation Optimizationp. 285
Indifference Zone Selectionp. 288
Batch Proceduresp. 288
Sequential Proceduresp. 290
The 0-1 Procedure: Connection to Linear Lossp. 292
Optimal Computing Budget Allocationp. 293
Indifference-Zone Versionp. 293
Linear Loss Versionp. 295
When Does It Work?p. 295
Model-Based Simulated Annealingp. 296
Other Areas of Simulation Optimizationp. 298
Bibliographic Notesp. 299
Learning in Mathematical Programmingp. 301
Applicationsp. 303
Piloting a Hot Air Balloonp. 303
Optimizing a Portfoliop. 308
Network Problemsp. 309
Discussionp. 313
Learning on Graphsp. 313
Alternative Edge Selection Policiesp. 317
Learning Costs for Linear Programs*p. 318
Bibliographic Notesp. 324
Optimizing Over Continuous Measurementsp. 325
The Belief Modelp. 327
Updating Equationsp. 328
Parameter Estimationp. 330
Sequential Kriging Optimizationp. 332
The Knowledge Gradient for Continuous Parameters*p. 334
Maximizing the Knowledge Gradientp. 334
Approximating the Knowledge Gradientp. 335
The Gradient of the Knowledge Gradientp. 336
Maximizing the Knowledge Gradientp. 338
The KGCP Policyp. 339
Efficient Global Optimizationp. 340
Experimentsp. 341
Extension to Higher-Dimensional Problemsp. 342
Bibliographic Notesp. 343
Learning With a Physical Statep. 345
Introduction to Dynamic Programmingp. 347
Approximate Dynamic Programmingp. 348
The Exploration vs. Exploitation Problemp. 350
Discussionp. 351
Some Heuristic Learning Policiesp. 352
The Local Bandit Approximationp. 353
The Knowledge Gradient in Dynamic Programmingp. 355
Generalized Learning Using Basis Functionsp. 355
The Knowledge Gradientp. 358
Experimentsp. 361
An Expected Improvement Policyp. 363
Bibliographic Notesp. 364
Indexp. 381
Table of Contents provided by Ingram. All Rights Reserved.

Supplemental Materials

What is included with this book?

The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.

The Used, Rental and eBook copies of this book are not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.

Rewards Program