did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

did-you-know? rent-now

Amazon no longer offers textbook rentals. We do!

We're the #1 textbook rental company. Let us show you why.

9780387202754

Regression Methods In Biostatistics: Linear, Logistic, Survival, and Repeated Measures Models

by ; ; ;
  • ISBN13:

    9780387202754

  • ISBN10:

    0387202757

  • Edition: 1st
  • Format: Hardcover
  • Copyright: 2004-12-30
  • Publisher: Springer Verlag
  • Purchase Benefits
List Price: $129.00 Save up to $88.84
  • Digital
    $87.02
    Add to Cart

    DURATION
    PRICE

Supplemental Materials

What is included with this book?

Summary

This new book provides a unified, in-depth, readable introduction to the multipredictor regression methods most widely used in biostatistics: linear models for continuous outcomes, logistic models for binary outcomes, the Cox model for right-censored survival times, repeated-measures models for longitudinal and hierarchical outcomes, and generalized linear models for counts and other outcomes. Treating these topics together takes advantage of all they have in common. The authors point out the many-shared elements in the methods they present for selecting, estimating, checking, and interpreting each of these models. They also show that these regression methods deal with confounding, mediation, and interaction of causal effects in essentially the same way. The examples, analyzed using Stata, are drawn from the biomedical context but generalize to other areas of application. While a first course in statistics is assumed, a chapter reviewing basic statistical methods is included. Some advanced topics are covered but the presentation remains intuitive. A brief introduction to regression analysis of complex surveys and notes for further reading are provided. For many students and researchers learning to use these methods, this one book may be all they need to conduct and interpret multipredictor regression analyses. The authors are on the faculty in the Division of Biostatistics, Department of Epidemiology and Biostatistics, University of California, San Francisco, and are authors or co-authors of more than 200 methodological as well as applied papers in the biological and biomedical sciences. The senior author, Charles E. McCulloch, is head of the Division and author of Generalized Linear Mixed Models (2003), Generalized, Linear, and Mixed Models (2000), and Variance Components (1992). From the reviews: "This book provides a unified introduction to the regression methods listed in the title...The methods are well illustrated by data drawn from medical studies...A real strength of this book is the careful discussion of issues common to all of the multipredictor methods covered." Journal of Biopharmaceutical Statistics, 2005 "This book is not just for biostatisticians. It is, in fact, a very good, and relatively nonmathematical, overview of multipredictor regression models. Although the examples are biologically oriented, they are generally easy to understand and follow...I heartily recommend the book" Technometrics, February 2006 "Overall, the text provides an overview of regression methods that is particularly strong in its breadth of coverage and emphasis on insight in place of mathematical detail. As intended, this well-unified approach should appeal to students who learn conceptually and verbally." Journal of the American Statistical Association, March 2006

Table of Contents

Preface vii
1 Introduction
1(6)
1.1 Example: Treatment of Back Pain
1(1)
1.2 The Family of Multipredictor Regression Methods
2(1)
1.3 Motivation for Multipredictor Regression
3(1)
1.3.1 Prediction
3(1)
1.3.2 Isolating the Effect of a Single Predictor
3(1)
1.3.3 Understanding Multiple Predictors
3(1)
1.4 Guide to the Book
4(3)
2 Exploratory and Descriptive Methods
7(22)
2.1 Data Checking
7(1)
2.2 Types Of Data
8(1)
2.3 One-Variable Descriptions
8(9)
2.3.1 Numerical Variables
9(7)
2.3.2 Categorical Variables
16(1)
2.4 Two-Variable Descriptions
17(6)
2.4.1 Outcome Versus Predictor Variables
18(1)
2.4.2 Continuous Outcome Variable
18(3)
2.4.3 Categorical Outcome Variable
21(2)
2.5 Multivariable Descriptions
23(3)
2.6 Problems
26(3)
3 Basic Statistical Methods
29(40)
3.1 t-Test and Analysis of Variance
29(6)
3.1.1 t-Test
30(1)
3.1.2 One- and Two-Sided Hypothesis Tests
30(1)
3.1.3 Paired t-Test
31(1)
3.1.4 One-Way Analysis of Variance (ANOVA)
32(1)
3.1.5 Pairwise Comparisons in ANOVA
33(1)
3.1.6 Multi-Way ANOVA and ANCOVA
33(1)
3.1.7 Robustness to Violations of Assumptions
33(2)
3.2 Correlation Coefficient
35(1)
3.3 Simple Linear Regression Model
36(8)
3.3.1 Systematic Part of the Model
36(2)
3.3.2 Random Part of the Model
38(1)
3.3.3 Assumptions About the Predictor
38(1)
3.3.4 Ordinary Least Squares Estimation
39(1)
3.3.5 Fitted Values and Residuals
40(1)
3.3.6 Sums of Squares
40(1)
3.3.7 Standard Errors of the Regression Coefficients
41(1)
3.3.8 Hypothesis Tests and Confidence Intervals
42(1)
3.3.9 Slope, Correlation Coefficient, and R²
43(1)
3.4 Contingency Table Methods for Binary Outcomes
44(10)
3.4.1 Measures of Risk and Association for Binary Outcomes
44(3)
3.4.2 Tests of Association in Contingency Tables
47(2)
3.4.3 Predictors With Multiple Categories
49(2)
3.4.4 Analyses Involving Multiple Categorical Predictors
51(3)
3.5 Basic Methods for Survival Analysis
54(8)
3.5.1 Right Censoring
54(1)
3.5.2 Kaplan-Meier Estimator of the Survival Function
55(2)
3.5.3 Interpretation of Kaplan-Meier Curves
57(1)
3.5.4 Median Survival
58(1)
3.5.5 Cumulative Incidence Function
59(1)
3.5.6 Comparing Groups Using the Logrank Test
60(2)
3.6 Bootstrap Confidence Intervals
62(1)
3.7 Interpretation of Negative Findings
63(2)
3.8 Further Notes and References
65(1)
3.9 Problems
65(2)
3.10 Learning Objectives
67(2)
4 Linear Regression
69(64)
4.1 Example: Exercise and Glucose
70(2)
4.2 Multiple Linear Regression Model
72(4)
4.2.1 Systematic Part of the Model
72(1)
4.2.2 Random Part of the Model
73(2)
4.2.3 Generalization of R² and r
75(1)
4.2.4 Standardized Regression Coefficients
75(1)
4.3 Categorical Predictors
76(7)
4.3.1 Binary Predictors
76(1)
4.3.2 Multilevel Categorical Predictors
77(2)
4.3.3 The F-Test
79(1)
4.3.4 Multiple Pairwise Comparisons Between Categories
80(2)
4.3.5 Testing for Trend Across Categories
82(1)
4.4 Confounding
83(1)
4.4.1 Causal Effects and Counterfactuals
84(1)
4.4.2 A Linear Model for the Counterfactual Experiment
85(1)
4.4.3 Confounding of Causal Effects
87(1)
4.4.4 Randomization Assumption
88(1)
4.4.5 Conditions for Confounding of Causal Effects
89(1)
4.4.6 Control of Confounding
89(1)
4.4.7 Range of Confounding Patterns
90(1)
4.4.8 Diagnostics for Confounding in a Sample
91(1)
4.4.9 Confounding Is Difficult To Rule Out
92(1)
4.4.10 Adjusted vs. Unadjusted βs
93(1)
4.4.11 Example: BMI and LDL
93(2)
4.5 Mediation
95(3)
4.5.1 Modeling Mediation
96(1)
4.5.2 Confidence Intervals for Measures of Mediation
97(1)
4.5.3 Example: BMI, Exercise, and Glucose
97(1)
4.6 Interaction
98(11)
4.6.1 Causal Effects and Interaction
99(1)
4.6.2 Modeling Interaction
100(1)
4.6.3 Overall Causal Effect in the Presence of Interaction
100(1)
4.6.4 Example: Hormone Therapy and Statin Use
101(2)
4.6.5 Example: BMI and Statin Use
103(2)
4.6.6 Interaction and Scale
105(1)
4.6.7 Example: Hormone Therapy and Baseline LDL
106(2)
4.6.8 Details
108(1)
4.7 Checking Model Assumptions and Fit
109(18)
4.7.1 Linearity
109(5)
4.7.2 Normality
114(3)
4.7.3 Constant Variance
117(4)
4.7.4 Outlying, High Leverage, and Influential Points
121(4)
4.7.5 Interpretation of Results for Log-Transformed Variables
125(2)
4.7.6 When to Use Transformations
127(1)
4.8 Summary
127(1)
4.9 Further Notes and References
127(1)
4.10 Problems
128(3)
4.11 Learning Objectives
131(2)
5 Predictor Selection
133(24)
5.1 Diagramming the Hypothesized Causal Model
135(2)
5.2 Prediction
137(3)
5.2.1 Bias-Variance Trade-off
137(1)
5.2.2 Estimating Prediction Error
138(1)
5.2.3 Screening Candidate Models
139(1)
5.2.4 Classification and Regression Trees (CART)
139(1)
5.3 Evaluating a Predictor of Primary Interest
140(4)
5.3.1 Including Predictors for Face Validity
141(1)
5.3.2 Selecting Predictors on Statistical Grounds
141(1)
5.3.3 Interactions With the Predictor of Primary Interest
141(1)
5.3.4 Example: Incontinence as a Risk Factor for Falling
142(1)
5.3.5 Randomized Experiments
142(2)
5.4 Identifying Multiple Important Predictors
144(3)
5.4.1 Ruling Out Confounding Is Still Central
145(1)
5.4.2 Cautious Interpretation Is Also Key
146(1)
5.4.3 Example: Risk Factors for Coronary Heart Disease
146(1)
5.4.4 Allen-Cady Modified Backward Selection
147(1)
5.5 Some Details
147(6)
5.5.1 Collinearity
147(2)
5.5.2 Number of Predictors
149(1)
5.5.3 Alternatives to Backward Selection
150(1)
5.5.4 Model Selection and Checking
151(1)
5.5.5 Model Selection Complicates Inference
152(1)
5.6 Summary
153(1)
5.7 Further Notes and References
154(1)
5.8 Problems
155(1)
5.9 Learning Objectives
156(1)
6 Logistic Regression
157(54)
6.1 Single Predictor Models
158(9)
6.1.1 Interpretation of Regression Coefficients
162(2)
6.1.2 Categorical Predictors
164(3)
6.2 Multipredictor Models
167(16)
6.2.1 Likelihood Ratio Tests
170(3)
6.2.2 Confounding
173(2)
6.2.3 Interaction
175(5)
6.2.4 Prediction
180(1)
6.2.5 Prediction Accuracy
181(2)
6.3 Case-Control Studies
183(5)
6.3.1 Matched Case-Control Studies
187(1)
6.4 Checking Model Assumptions and Fit
188(8)
6.4.1 Outlying and Influential Points
188(2)
6.4.2 Linearity
190(2)
6.4.3 Model Adequacy
192(3)
6.4.4 Technical Issues in Logistic Model Fitting
195(1)
6.5 Alternative Strategies for Binary Outcomes
196(7)
6.5.1 Infectious Disease Transmission Models
196(2)
6.5.2 Regression Models Based on Excess and Relative Risks
198(2)
6.5.3 Nonparametric Binary Regression
200(1)
6.5.4 More Than Two Outcome Levels
201(2)
6.6 Likelihood
203(3)
6.7 Summary
206(1)
6.8 Further Notes and References
207(1)
6.9 Problems
207(2)
6.10 Learning Objectives
209(2)
7 Survival Analysis
211(42)
7.1 Survival Data
211(4)
7.1.1 Why Linear and Logistic Regression Won't Work
211(1)
7.1.2 Hazard Function
212(1)
7.1.3 Hazard Ratio
213(2)
7.1.4 Proportional Hazards Assumption
215(1)
7.2 Cox Proportional Hazards Model
215(1)
7.2.1 Proportional Hazards Models
215(1)
7.2.2 Parametric vs. Semi-Parametric Models
216(3)
7.2.3 Hazard Ratios, Risk, and Survival Times
219(1)
7.2.4 Hypothesis Tests and Confidence Intervals
219(2)
7.2.5 Binary Predictors
221(1)
7.2.6 Multilevel Categorical Predictors
221(3)
7.2.7 Continuous Predictors
224(2)
7.2.8 Confounding
226(1)
7.2.9 Mediation
227(1)
7.2.10 Interaction
227(2)
7.2.11 Adjusted Survival Curves for Comparing Groups
229(2)
7.2.12 Predicted Survival for Specific Covariate Patterns
231(1)
7.3 Extensions to the Cox Model
231(7)
7.3.1 Time-Dependent Covariates
231(3)
7.3.2 Stratified Cox Model
234(4)
7.4 Checking Model Assumptions and Fit
238(7)
7.4.1 Log-Linearity
238(1)
7.4.2 Proportional Hazards
238(7)
7.5 Some Details
245(4)
7.5.1 Bootstrap Confidence Intervals
245(1)
7.5.2 Prediction
246(1)
7.5.3 Adjusting for Non-Confounding Covariates
246(1)
7.5.4 Independent Censoring
247(1)
7.5.5 Interval Censoring
247(1)
7.5.6 Left Truncation
248(1)
7.6 Summary
249(1)
7.7 Further Notes and References
249(1)
7.8 Problems
250(1)
7.9 Learning Objectives
251(2)
8 Repeated Measures Analysis
253(38)
8.1 A Simple Repeated Measures Example: Fecal Fat
254(5)
8.1.1 Model Equations for the Fecal Fat Example
256(1)
8.1.2 Correlations Within Subjects
257(2)
8.1.3 Estimates of the Effects of Pill Type
259(1)
8.2 Hierarchical Data
259(3)
8.2.1 Analysis Strategies for Hierarchical Data
259(3)
8.3 Longitudinal Data
262(4)
8.3.1 Analysis Strategies for Longitudinal Data
262(1)
8.3.2 Example: Birthweight and Birth Order
262(3)
8.3.3 When To Use Repeated Measures Analyses
265(1)
8.4 Generalized Estimating Equations
266(8)
8.4.1 Birthweight and Birth Order Revisited
266(2)
8.4.2 Correlation Structures
268(2)
8.4.3 Working Correlation and Robust Standard Errors
270(1)
8.4.4 Hypothesis Tests and Confidence Intervals
271(2)
8.4.5 Use of xtgee for Clustered Logistic Regression
273(1)
8.5 Random Effects Models
274(7)
8.5.1 Re-Analysis of Birthweight and Birth Order
276(2)
8.5.2 Prediction
278(1)
8.5.3 Logistic Model for Low Birthweight
279(2)
8.5.4 Marginal Versus Conditional Models
281(1)
8.6 Example: Cardiac Injury Following Brain Hemorrhage
281(5)
8.6.1 Bootstrap Confidence Intervals
283(3)
8.7 Summary
286(1)
8.8 Further Notes and References
286(1)
8.9 Problems
287(1)
8.10 Learning Objectives
288(3)
9 Generalized Linear Models
291(14)
9.1 Example: Treatment for Depression
291(4)
9.1.1 Statistical Issues
292(1)
9.1.2 Model for the Mean Response
292(1)
9.1.3 Choice of Distribution
293(1)
9.1.4 Interpreting the Parameters
294(1)
9.1.5 Further Notes
295(1)
9.2 Example: Costs of Phototherapy
295(2)
9.2.1 Model for the Mean Response
296(1)
9.2.2 Choice of Distribution
297(1)
9.2.3 Interpreting the Parameters
297(1)
9.3 Generalized Linear Models
297(4)
9.3.1 Example: Risky Drug Use Behavior
298(2)
9.3.2 Relationship of Mean to Variance
300(1)
9.3.3 Nonlinear Models
300(1)
9.4 Summary
301(1)
9.5 Further Notes and References
301(1)
9.6 Problems
302(1)
9.7 Learning Objectives
303(2)
10 Complex Surveys 305(12)
10.1 Example: NHANES
307(1)
10.2 Probability Weights
307(3)
10.3 Variance Estimation
310(4)
10.3.1 Design Effects
312(1)
10.3.2 Simplification of Correlation Structure
313(1)
10.3.3 Other Methods of Variance Estimation
313(1)
10.4 Summary
314(1)
10.5 Further Notes and References
314(1)
10.6 Problems
315(1)
10.7 Learning Objectives
316(1)
11 Summary 317(6)
11.1 Introduction
317(1)
11.2 Selecting Appropriate Statistical Methods
318(1)
11.3 Planning and Executing a Data Analysis
319(2)
11.3.1 Analysis Plans
319(1)
11.3.2 Choice of Software
320(1)
11.3.3 Record Keeping and Organization
320(1)
11.3.4 Data Security
320(1)
11.3.5 Consulting a Statistician
321(1)
11.3.6 Use of Internet Resources
321(1)
11.4 Further Notes and References
321(2)
References 323(10)
Index 333

Supplemental Materials

What is included with this book?

The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.

The Used, Rental and eBook copies of this book are not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.

Rewards Program