did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

We're the #1 textbook rental company. Let us show you why.

9781118010648

Data Analysis What Can Be Learned From the Past 50 Years

by
  • ISBN13:

    9781118010648

  • ISBN10:

    1118010647

  • Edition: 1st
  • Format: Hardcover
  • Copyright: 2011-04-11
  • Publisher: Wiley
  • Purchase Benefits
  • Free Shipping Icon Free Shipping On Orders Over $35!
    Your order must be $35 or more to qualify for free economy shipping. Bulk sales, PO's, Marketplace items, eBooks and apparel do not qualify for this offer.
  • eCampus.com Logo Get Rewarded for Ordering Your Textbooks! Enroll Now
List Price: $157.81 Save up to $0.79
  • Buy New
    $157.02
    Add to Cart Free Shipping Icon Free Shipping

    PRINT ON DEMAND: 2-4 WEEKS. THIS ITEM CANNOT BE CANCELLED OR RETURNED.

Supplemental Materials

What is included with this book?

Summary

This book explores the many provocative questions concerning the fundamentals of data analysis. It is based on the time-tested experience of one of the gurus of the subject matter. Why should one study data analysis? How should it be taught? What techniques work best, and for whom? How valid are the results? How much data should be tested? Which machine languages should be used, if used at all? Emphasis on apprenticeship (through hands-on case studies) and anecdotes (through real-life applications) are the tools that Peter J. Huber uses in this volume. Concern with specific statistical techniques is not of immediate value; rather, questions of strategy when to use which technique are employed. Central to the discussion is an understanding of the significance of massive (or robust) data sets, the implementation of languages, and the use of models. Each is sprinkled with an ample number of examples and case studies. Personal practices, various pitfalls, and existing controversies are presented when applicable. The book serves as an excellent philosophical and historical companion to any present-day text in data analysis, robust statistics, data mining, statistical learning, or computational statistics.

Author Biography

Peter J. Huber, PhD, is a world-renowned statistician who has published four books and more than seventy journal articles in the areas of statistics and data analysis. He has held academic positions at Harvard University, Massachusetts Institute of. Technology, Cornell University, and ETH Zurich (Switzerland), and has made significant research contributions in the areas of robust statistics, computational statistics, and strategies in data analysis. A Fellow of the Institute of Mathematical Statistics and the American Academy of Arts and Sciences, Dr. Huber is the coauthor of Robust Statistics, Second Edition, also published by Wiley.

Table of Contents

Prefacep. xi
What is Data Analysis?p. 1
Tukey's 1962 paperp. 3
The Path of Statisticsp. 5
Strategy Issues in Data Analysisp. 11
Strategy in Data Analysisp. 11
Philosophical issuesp. 13
On the theory of data analysis and its teachingp. 14
Science and data analysisp. 15
Economy of forcesp. 16
Issues of sizep. 17
Strategic planningp. 21
Planning the data collectionp. 21
Choice of data and methods.p. 22
Systematic and random errorsp. 23
Strategic reservesp. 24
Human factorsp. 25
The stages of data analysisp. 26
Inspectionp. 26
Error checkingp. 27
Modificationp. 30
Comparisonp. 30
Modeling and Model fittingp. 30
Simulationp. 31
What-if analysesp. 32
Interpretationp. 32
Presentation of conclusionsp. 32
Tools required for strategy reasonsp. 33
Ad hoc programmingp. 33
Graphicsp. 34
Record keepingp. 35
Creating and keeping orderp. 35
Massive Data Setsp. 37
Introductionp. 38
Disclosure: Personal experiencesp. 39
What is massive? A classification of sizep. 39
Obstacles to scalingp. 40
Human limitations: visualizationp. 40
Human - machine interactionsp. 41
Storage requirementsp. 41
Computational complexityp. 42
Conclusionsp. 43
On the structure of large data setsp. 43
Types of datap. 43
How do data sets grow?p. 44
On data organizationp. 44
Derived data setsp. 45
Data base management and related issuesp. 46
Data archivingp. 48
The stages of a data analysisp. 49
Planning the data collectionp. 49
Actual collectionp. 50
Data accessp. 50
Initial data checkingp. 50
Data analysis properp. 51
The final product: presentation of arguments and conclusionsp. 51
Examples and some thoughts on strategyp. 52
Volume reductionp. 55
Supercomputers and software challengesp. 56
When do we need a Concorde?p. 57
General Purpose Data Analysis and Supercomputersp. 57
Languages, Programming Environments and Data-based Prototypingp. 58
Summary of conclusionsp. 59
Languages for Data Analysisp. 61
Goals and purposesp. 62
Natural languages and computing languagesp. 64
Natural languagesp. 64
Batch languagesp. 65
Immediate languagesp. 67
Language and literaturep. 68
Object orientation and related structural issuesp. 69
Extremism and compromises, slogans and realityp. 71
Some conclusionsp. 73
Interface issuesp. 74
The command line interfacep. 75
The menu interfacep. 78
The batch interface and programming environmentsp. 80
Some personal experiencesp. 81
Miscellaneous issuesp. 82
On building blocksp. 82
On the scope of namesp. 83
On notationp. 83
Book-keeping problemsp. 84
Requirements for a general purpose immediate languagep. 85
Approximate Modelsp. 89
Modelsp. 89
Bayesian modelingp. 92
Mathematical statistics and approximate modelsp. 94
Statistical significance and physical relevancep. 96
Judicious use of a wrong modelp. 97
Composite modelsp. 98
Modeling the length of dayp. 99
The role of simulationp. 111
Summary of conclusionsp. 112
Pitfallsp. 113
Simpson's paradoxp. 114
Missing datap. 116
The Case of the Babylonian Lunar Sixp. 118
X-ray crystallographyp. 126
Regression of Y on X or of X on Y?p. 129
Create order in datap. 133
General considerationsp. 134
Principal component methodsp. 135
Principal component methods: Jury datap. 137
Multidimensional scalingp. 145
Multidimensional scaling: the methodp. 145
Multidimensional scaling: a synthetic examplep. 145
Multidimensional scaling: map reconstructionp. 147
Correspondence analysisp. 147
Correspondence analysis: the methodp. 147
Kültepe eponymsp. 148
Further examples: marketing and Shakespearean playsp. 156
Multidimensional scaling vs. Correspondence analysisp. 160
Hodson's grave datap. 162
Plato datap. 168
More case studiesp. 177
A nutshell examplep. 178
Shape invariant modelingp. 182
Comparison of point configurationsp. 184
The cyclodecane conformationp. 186
The Thomson problemp. 189
Notes on numerical optimizationp. 190
Referencesp. 195
Indexp. 205
Table of Contents provided by Ingram. All Rights Reserved.

Supplemental Materials

What is included with this book?

The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.

The Used, Rental and eBook copies of this book are not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.

Rewards Program