did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

We're the #1 textbook rental company. Let us show you why.

9780470222805

Making Sense of Data II A Practical Guide to Data Visualization, Advanced Data Mining Methods, and Applications

by ;
  • ISBN13:

    9780470222805

  • ISBN10:

    0470222808

  • Edition: 1st
  • Format: Paperback
  • Copyright: 2009-02-03
  • Publisher: Wiley
  • Purchase Benefits
List Price: $123.68 Save up to $0.62
  • Buy New
    $123.06
    Add to Cart Free Shipping Icon Free Shipping

    PRINT ON DEMAND: 2-4 WEEKS. THIS ITEM CANNOT BE CANCELLED OR RETURNED.

Supplemental Materials

What is included with this book?

Summary

This second installment in the Making Sense of Data series continues to explore a diverse range commonly used approaches to making and communicating decisions from data. Delving into more technical topics, this book equips readers with advanced data mining methods that are needed to engineering, finance, and the social sciences.

Author Biography

Glenn J. Myatt, PhD, is cofounder of Leadscope, Inc. and a Partner of Myatt & Johnson, Inc., a consulting company that focuses on business intelligence application development delivered through the Internet. Dr. Myatt is the author of Making Sense of Data: A Practical Guide to Exploratory Data Analysis and Data Mining, also published by Wiley. WAYNE P. JOHNSON, MSc., is cofounder of Leadscope, Inc. and a Partner of Myatt & Johnson, Inc. Mr. Johnson has over two decades of experience in the design and development of large software systems, and his current professional interests include human–computer interaction, information visualization, and methodologies for contextual inquiry.

Table of Contents

Prefacep. xi
Introductionp. 1
Overviewp. 1
Definitionp. 1
Preparationp. 2
Overviewp. 2
Accessing Tabular Datap. 3
Accessing Unstructured Datap. 3
Understanding the Variables and Observationsp. 3
Data Cleaningp. 6
Transformationp. 7
Variable Reductionp. 9
Segmentationp. 10
Preparing Data to Applyp. 10
Analysisp. 11
Data Mining Tasksp. 11
Optimizationp. 12
Evaluationp. 12
Model Forensicsp. 13
Deploymentp. 13
Outline of Bookp. 14
Overviewp. 14
Data Visualizationp. 14
Clusteringp. 15
Predictive Analyticsp. 15
Applicationsp. 16
Softwarep. 16
Summaryp. 16
Further Readingp. 17
Data Visualizationp. 19
Overviewp. 19
Visualization Design Principlesp. 20
General Principlesp. 20
Graphics Designp. 20
Anatomy of a Graphp. 28
Tablesp. 32
Simple Tablesp. 32
Summary Tablesp. 32
Two-Way Contingency Tablesp. 34
Supertablesp. 34
Univariate Data Visualizationp. 36
Bar Chartp. 36
Histogramsp. 37
Frequency Polygramp. 41
Box Plotsp. 41
Dot Plotp. 43
Stem-and Leaf Plotp. 44
Quantile Plotp. 46
Quantile-Quantile Plotp. 48
Bivariate Data Visualizationp. 49
Scatterplotp. 49
Multivariate Data Visualizationp. 50
Histogram Matrixp. 52
Scatterplot Matrixp. 54
Multiple Box Plotp. 56
Trellis Plotp. 56
Visualizing Groupsp. 59
Dendrogramsp. 59
Decision Treesp. 60
Cluster Image Mapsp. 60
Dynamic Techniquesp. 63
Overviewp. 63
Data Brushingp. 64
Nearness Selectionp. 65
Sorting and Rearrangingp. 65
Searching and Filteringp. 65
Summaryp. 65
Further Readingp. 66
Clusteringp. 67
Overviewp. 67
Distance Measuresp. 75
Overviewp. 75
Numeric Distance Measuresp. 77
Binary Distance Measuresp. 79
Mixed Variablesp. 84
Other Measuresp. 86
Agglomerative Hierarchical Clusteringp. 87
Overviewp. 87
Simple Linkagep. 87
Complete Linkagep. 92
Average Linkagep. 93
Other Methodsp. 96
Selecting Groupsp. 96
Partitioned-Based Clusteringp. 98
Overviewp. 98
k-Meansp. 98
Worked Examplep. 100
Miscellaneous Partitioned-Based Clusteringp. 101
Fuzzy Clusteringp. 103
Overviewp. 103
Fuzzy k-Meansp. 103
Worked Examplesp. 104
Summaryp. 109
Further Readingp. 110
Predictive Analyticsp. 111
Overviewp. 111
Predictive Modelingp. 111
Testing Model Accuracyp. 116
Evaluating Regression Models' Predictive Accuracyp. 117
Evaluating Classification Models' Predictive Accuracyp. 119
Evaluating Binary Models' Predictive Accuracyp. 120
ROC Chartsp. 122
Lift Chartp. 124
Principal Component Analysisp. 126
Overviewp. 126
Principal Componentsp. 126
Generating Principal Componentsp. 127
Interpretation of Principal Componentsp. 128
Multiple Linear Regressionp. 130
Overviewp. 130
Generating Modelsp. 130
Predictionp. 136
Analysis of Residualsp. 136
Standard Errorp. 139
Coefficient of Multiple Determinationp. 140
Testing the Model Significancep. 142
Selecting and Transforming Variablesp. 143
Discriminant Analysisp. 145
Overviewp. 145
Discriminant Functionp. 146
Discriminant Analysis Examplep. 146
Logistic Regressionp. 151
Overviewp. 151
Logistic Regression Formulap. 151
Estimating Coefficientsp. 153
Assessing and Optimizing Resultsp. 156
Naïve Bayes Classifiersp. 157
Overviewp. 157
Bayes Theorem and the Independence Assumptionp. 158
Independence Assumptionp. 158
Classification Processp. 159
Summaryp. 161
Further Readingp. 163
Applicationsp. 165
p. Overview
Sales and Marketingp. 166
Industry-Specific Data Miningp. 169
Financep. 169
Insurancep. 171
Retailp. 172
Telecommunicationsp. 173
Manufacturingp. 174
Entertainmentp. 175
Pharamaceuticalsp. 177
Healthcarep. 178
micro RNA Data Analysis Case? Studyp. 181
Defining the Problemp. 181
Preparing the Datap. 181
Analysisp. 183
Credit Scoring Case Studyp. 192
Defining the Problemp. 192
Preparing the Datap. 192
Analysisp. 199
Deploymentp. 203
Data Mining Nontabular Datap. 203
Overviewp. 203
Data Mining Chemical Data 203
Data Mining Textp. 210
Further Readingp. 213
Marticesp. 215
Overview of Matricesp. 215
Matrix Additionp. 215
Matrix Multiplicationp. 216
Transpose of a Matrixp. 217
Inverse of a Matrixp. 217
Softwarep. 219
Software Overviewp. 219
Software Objectivesp. 219
Access and Installationp. 221
User Interface Overviewp. 221
Data Preparationp. 223
Overviewp. 223
Reading in Datap. 224
Searching the Datap. 225
Variable Characterizationp. 237
Removing Observations and Variablesp. 228
Clearing the Datap. 228
Transforming the Datap. 230
Segmentationp. 235
Principal Component Analysisp. 236
Tables and Graphsp. 238
Overviewp. 238
Contingency Tablesp. 239
Summary Tablesp. 240
Graphsp. 242
Graph Matricesp. 246
Statisticsp. 246
Overviewp. 246
Descriptive Statisticsp. 248
Confidence Intervalsp. 248
Hypothesis Testsp. 249
Chi-Square Testp. 250
ANOVAp. 251
comparative Statisticsp. 251
Groupingp. 253
Overviewp. 253
Clusteringp. 254
Associative Rulesp. 257
Decision Treesp. 258
Predictionp. 261
Overviewp. 261
Linear Regressionp. 263
Discriminant Analysisp. 265
Logistic Regressionp. 266
Naive Bayesp. 267
kNNp. 269
CARTp. 269
Neural Networksp. 270
Apply Modelp. 271
Bibliographyp. 273
Indexp. 279
Table of Contents provided by Ingram. All Rights Reserved.

Supplemental Materials

What is included with this book?

The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.

The Used, Rental and eBook copies of this book are not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.

Rewards Program