Module 1 (Weeks 1 & 2) Question: What common background understanding do I need to get started in improving my ability to critically and quantitatively reason about health questions?
|
Week 1: July 22-July 28
|
Question
|
Statistical ideas
|
Statistical skills
|
Videos
|
Homework
|
Quiz
|
What is the scientific method?
|
Experiments generate evidence that supports some competing hypotheses more than others.
|
Specify hypotheses
|
1.1. Scientific method
|
1A
|
1A
|
How do we quantify evidence?
|
Likelihood ratio updates the prior odds to obtain posterior odds
|
Calculate odds from a probability;
Calculate a probability from an odds.
|
|
1A
|
1A
|
What does it mean to say: “this drug prevents heart attacks” or “obesity increases the cost of medical care”
|
The meaning of “cause” can usefully be made precise in terms of potential outcomes.
|
Determine whether a statement is “causal” or not.
|
1.2. On “cause”
|
1A
|
1A
|
What is probability?
|
Probability means long-run frequency or alternately is a quantification of the strength of belief; joint, marginal, conditional probabilities
|
Calculate probabilities from data in a 2x2 table.
|
1.3. On probability
|
1A
|
1A
|
Week 2: July 29-August 4
|
Question
|
Statistical ideas
|
Statistical skills
|
Videos
|
Homework
|
Quiz
|
What is a probability distribution for a random variable?
|
Probability distribution is a function that takes a possible value (discrete) or set of values (continuous) as input and returns the probability the random variable assumes the input value(s).
|
Calculate specified probabilities from a given probability distribution.
|
1.4. Probability distributions.
|
1B
|
1B
|
What is the binomial distribution? (What is meant by “independence” of two events?)
|
A mathematical model for INDEPENDENT coin tossing-like systems in nature.
|
Given the number of tosses (n) and probability of success on each INDEPENDENT toss (p), calculate the probability of observing x out of n successes for any x between 0 and n.
|
1.5. Binomial distribution
|
1B
|
1B
|
What is the Gaussian (“normal”) distribution and how is it derived?
|
Mathematical model for the probability distribution of the average of independent random variables.
|
Approximate the probability a Gaussian random variable with specified mean and variances falls in any given interval.
|
1.6. Gaussian distribution
|
1B
|
1B
|
Module 2 (Weeks 3 & 4) Question: How do the average medical care costs for people with a major smoking-caused disease (MSCD) differ from those for people without MSCDs who are otherwise similar?
|
Week 3: August 5-August 11
|
Question
|
Statistical ideas
|
Statistical skills
|
Videos
|
Homework
|
Quiz
|
What is the probability distribution of medical expenditures for persons with/without a MSCD?
|
Distribution of medical expenditures; summaries of location, spread, and shape; transforming the variable to better understand the distribution.
|
Calculate a mean, median, quartiles, standard deviation; Make and interpret a stem and leaf plot;
Make and interpret a boxplot.
|
2.1. Distribution of medical expenditures
|
2A
|
2A
|
What is the probability distribution of an independent sample of size n drawn from the observed distribution of medical expenditures
|
Simple random sampling; Stratified sampling; Central limit theorem.
| |
2.2. Sampling and the Central Limit Theorem
|
2A
|
2A
|
What is a confidence interval for the population mean?
|
Confidence intervals as interval estimates for an unknown population mean (the truth we seek).
|
Calculate a 95% or 99% confidence interval for an unknown population mean from a sample of n independent observations drawn from the population.
|
2.3. Confidence intervals.
|
2A
|
2A
|
Week 4: August 12-August 18
|
How does the distribution of medical expenditures differ for persons with vs without a MSCD?
|
Comparing distributions.
Differences of means;
Standardized mean differences;
Ratios.
|
Make and interpret a Q-Q plot.
Calculate and interpret t-statistic.
|
2.4. Comparing distributions
|
2B
|
2B
|
How does the distribution of medical expenditures differ across age and poverty strata?
|
Comparing distributions across several strata; table of pair-wise standardized mean differences
|
Interpreting multiple boxplots after variable transformation
|
2.4. Comparing distributions
|
2B
|
2B
|
How does the distribution of medical expenditures differ for persons with vs without a MSCD of similar age and poverty level?
|
Stratification by covariates;
Estimation within strata.
|
Calculate a t-statistics for each stratum; interpret results across strata
|
2.5. Pooling across strata
|
2B
|
2B
|
When is it warranted to pool results across strata and how do we do it?
|
“Effect modification” or “interaction”; Pooling multiple estimates, giving greater weight to the more precise estimates.
|
Estimate the overall effect of MSCD on medical expenditures from the stratum specific values
|
2.5. Pooling across strata
|
2B
|
2B
|
How does the pooled result compare to the simple comparison in 2.4? Why are they different?
|
Confounding when we fail to compare otherwise similar populations
| |
2.6. What we learned in module 2.
|
2B
|
2B
|
Module 3 (Week 5) Question: What is the rate of infant survival during the first 26 weeks of life in southern Nepal and how does the rate of survival vary by infant’s gestational age, sex, or being a singleton versus twin birth?
|
Week 5: August 19-August 25
|
Question
|
Statistical ideas
|
Statistical skills
|
Videos
|
Homework
|
Quiz
|
What is an odds ratio; what is a log odds ratio; why is the log odds ratio so commonly used in health research?
|
Defining parameters with unbounded support; odds approximate the risk; invariance of the odds ratio
|
Estimate an odds ratio from a 2x2 table with 95% confidence interval
|
3.1. Odds ratio estimation
|
3A
|
3A
|
How does child survival depend upon sex and twin status?
|
2x2 tables; log odds “model” or “logistic regression”
|
Apply odds ratio estimation to answer a substantive question.
|
3.2. Odd ratio estimation
|
3A
|
3A
|
How does the log odds of child survival depend upon gestational age?
|
Study outcome expressed as a function of a continuous predictor variable - regression
|
Display the dependence of the log odds of survival on gestational age
|
3.3. Introduction to logistic regression
|
3A
|
3A
|
|