Piscataway/New
Applied Categorical Data Analysis
Spring
2007
3
Credits
SPH,
Room 2A, Wednesdays, 6:10-9:00 p.m.
Instructor: Pamela Ohman Strickland, Ph.D.
TA: Susan
Huyck, PhD candidate in Biostatistics
Required:
|
Agresti,
A. (1996) An Introduction to
Categorical Data Analysis. John
Wiley & Sons. |
|
Highly Recommended: |
Stokes,
M.E., |
Prerequisites:
Introduction to Biostatistics, Biostatistics Computing, Regression
Methods for Public Health Studies
Public health studies, especially those involving questionnaires, contain large amounts of categorical data. This class provides an introduction to descriptive and inferential statistics for univariate and multivariate categorical data with applications to epidemiological and clinical studies. For 2 and 3-way contingency tables, measures of association and tests for homogeneity between populations and independence of variables are presented. Related tests of trend for ordinal data are studied. Loglinear and logistic regression analyses are investigated for data sets with both nominal and ordinal variables.
Computing Language: SAS
By the end of this course, students will be able to:
1. Formulate appropriate
statistical hypotheses for examining cross-classified data from public health
and clinical studies
2. Justify the basic
theoretical models for categorical data
3. Create and/or actively
participate in the design and analysis plan for a study involving categorical
data, whether nominal or ordinal in nature
4. Conduct and/or actively
participate in the analysis of categorical data
5. Interpret results from
contingency tables or generalized linear models that evaluate relationships
between categorical variables
January 17 |
Introduction, Distributions and SamplingExamples
to be Used in the Course of the Semester,
Definitions Poisson,
Binomial and Multinomial Distributions Inference
about univariate proportions |
|
January 24 |
Two-way Contingency TablesProbability
Structures for 2-way tables, Chi-square tests of independence Comparing
proportions, risk ratios and odds ratios |
|
January 30 |
Two-way Contingency Tables with Ordinal DataContinuous
or Categorical? Testing
independence for ordinal data Measures
of Association for ordinal data |
|
February 7 |
Analyses for Matched Pairs of Categorical DataDefining the
hypothesis of interest, Testing independence
Three-way TablesJustification
for studying three-way relationships, partial association, Cochran-Mantel-Haenszel
Methods, Common odds ratios |
|
February 14 |
Generalized Linear ModelsComponents
of a generalized linear model, linear regression, models for binary data and
Poisson regression Models for count and rate data, model inference and model
fitting |
|
February 21 |
Logistic RegressionInterpreting
the logistic regression model, inference for logistic regression, model
checking, logit models with qualitative and/or ordinal predictors, multiple
logistic regression Project
Plan due. |
|
February 28 |
Logistic Regression, Continued |
|
March 7 |
Mid-term |
|
March 21 |
Multi-category Logit ModelsLogit
Models for Nominal Responses, cumulative logit models for ordinal responses,
paired-category logits for ordinal responses |
|
March 28 |
Loglinear Models for Contingency TablesLoglinear
models for two- and three-way tables, inference for loglinear models,
loglinear models for higher dimensions, the loglinear-logit connection |
|
April 4 |
Building and Applying Logit and Loglinear ModelsAssociation
graphs and collapsibility, modeling ordinal associations, tests of
conditional independence, effects of sparse data |
|
April 11 |
Models for Matched PairsComparing
dependent proportions, logistic regression for matched pairs, symmetry and
quasi-symmetry models for square tables, comparing marginal distributions,
analyzing rater agreement |
|
April 18 |
Conditional Logistic Regression (Final Project Reports Due)
|
|
April 25 |
Final Exam |
|
May 1 |
Revised Final Report Due |
Evaluation
1. Homework 25%
2. Class Attendance and
Participation 5%
3. Exams 45%
4. Final Project Write-Up 15%
5. Peer Review of Final Project 10%
Homework policies
There are two options for the final project.
1) Data Analysis
Complete a full analysis of
a set of data that contains an ordinal or multinomial response and at least one
ordinal covariate. This data set may not
be one that you are using for a fieldwork project or dissertation. Present two alternative analyses, one of
which must be a generalized linear model.
Compare and contrast the two approaches.
2) Report on a Statistical
Paper
Report on a statistical
paper addressing the analysis of categorical or ordinal data.
Final Project Peer Review
Final drafts of the Paper
will be due on April 18th.
Each student will be randomly assigned three other students’ papers to
review. Using a template, each student
will be asked to rate and comment on these three other papers. Statistical methods will be used to combine
these ratings and create a final ranking of all papers. 10% of your final grade will be based on
these peer reviews. (Please note that if
you do not hand in your own peer reviews for other students’ papers, you will
get a 0 out of this 10%. If you do not
have a paper to hand out to other students for review on April 18th,
you will also get 0 out of this 10%. No
exceptions will be made!)
You may use these peer
reviews to revise your own paper for a final submission of the project due on
May 1st.