Past Coursework
Analysis of Categorical Data
- A comprehensive overview of methods of analysis for binary and other
discrete response data, with applications to epidemiological and
clinical studies. Topics discussed include 2x2 tables, tests of
independence, measures of association, power and sample size
determination, stratification and matching in design and analysis,
interrater agreement, logistic regression analysis. We used SAS
throughout the course.
Statistical Computing with SAS
- This course provided introduction to the basics of statistical
programming with SAS (Statistical Analysis System). The course proceeded
from understanding data types, how to move data into and out of SAS, how
SAS holds data, what a SAS data set looks like, to the basic programming
concepts. We expanded our programming vocabulary, incorporated logic
structures into programming, and investigated the use of existing SAS
Procedures.
Data Science I
- Contemporary biostatistics and data analysis depends on the mastery
of tools for computation, visualization, dissemination, and
reproducibility in addition to proficiency in traditional statistical
techniques using R Studio. The goal of this course was to provide
training in the elements of a complete pipeline for data analysis.
Applied Regression I
- This course provided an introduction to the basics of regression
analysis. The class proceeded systematically from the examination of the
distributional qualities of the measures of interest, to assessing the
appropriateness of the assumption of linearity, to issues related to
variable inclusion, model fit, interpretation, and regression
diagnostics. We will primarily use scalar notation (i.e. we used limited
matrix notation). We used SAS throughout the course.
Current Coursework
Applied Regression II
- This course will introduce the statistical methods for analyzing
censored data, non-normally distributed response data, and repeated
measurements data that are commonly encountered in medical and public
health research. Topics include estimation and comparison of survival
curves, regression models for survival data, logit models, log-linear
models, and generalized estimating equations. Examples are drawn from
the health sciences. We are using R and SAS throughout the course.
Data Science II
- The course will introduce: concepts and methods in statistical
learning; classification and regression techniques beyond linear
methods; exploratory data analysis using methods in unsupervised
learning; various statistical learning methods using R; a pipeline for
predictive modeling: data preprocessing, model training, model
interpretation.
Public Health GIS
- The course will introduce students to the theory, software, and
analytical techniques needed to effectively use GIS in public health.
This includes the ability to visualize spatial data, identify
significant clusters, utilize appropriate geoprocessing tools, and
integrate spatial processes into regression analysis. By the end of the
course, students will have a solid understanding of how GIS can be used
to support decision making and analysis in public health.