Basic of Epidemiology
Definition
- Study of:
- distribution
- determinants
- diseases
- health-related states
- events
- Plus:
- measures to control or prevent the disease
PSM Scientists

Contributions | Contributors | Mnemonic |
John Snow | • Father of epidemiology • Father of modern epidemiology • Cholera epidemic in London ↳ Used spot maps, natural experiment | • Main Father → GOT → Snow |
Fracastorius | • Founder of epidemiology | • Found → Free of cost |
Dr. John M Last | • Coined 'Epidemiology' • Definition of epidemiology | • Last coin • Last definition |
Dr. John Graunt | • Father of medical statistics | • Status people grunts |
David Sacket | • Father of evidence-based medicine | • Evidence of sachet |
James Lind | • Prevented scurvy using citrus fruits • Conducted the first human experiment | ㅤ |
- Father of Public health → Cholera
Scientist | Contribution |
Ronald Ross | • Worked on malaria transmission • Identified Anopheles mosquito as vector |
Alfred Grotjahn | • Introduced the concept of social pathology |
C.E.A. Winslow | • Defined public health |
Edwin Chadwick | • Led the Great Sanitary Awakening (UK) • Instrumental in Public Health Act, 1848 |
Fracastorius | • Founder of epidemiology • Described syphilis transmission (sexual route) • Proposed Theory of Contagion |
David Morley | • Developed growth charts for child health monitoring |
Microbiology scientists
Louis Pasteur

- Mnemonic: FATHER CAR
- Developed vaccines for:
- Cholera.
- Anthrax.
- Rabies.
- Louis → Liquid (Liquid media)
- Vaccine CAR (coined vaccine, Germ cell theory) → Autoclave
- Pasteurisation → Fermentation
- Father of modern microbiology.
- Founded Pasteur Institute in Paris.
- Developed Pasteurisation (for milk).
- Key contributions:
- Liquid media.
- Fermentation Principle.
- Autoclave.
- Disapproved Theory of abiogenesis.
- Proposed Germ cell Theory.
- Coined the term "vaccine"
Robert Koch

- Father of modern microbiology.
- Significant contributions:
- Mnemonic:
- Koch organisms (koch bacilli - tb, cholera organism)
- are solid (solid culture media) → hang (hanging drop motility) → apply paint (aniline dye)
- Koch postulates.
- Discovery of Koch Bacilli (Tuberculosis).
- Identification of Cholera organism.
- Development of Solid culture media.
- Application of Aniline dye colour.
- Introduction of Hanging drop motility.
Koch Postulates
- Old (4) Postulates:
- Constant association of causative organisms with disease.
- Isolation in culture media is possible.
- Culture growth inoculated in animals should produce the same lesion.
- Re-isolation from the experimental animals is possible.
- New (1) Postulate:
- Humans should produce antibodies in serum whenever there is an infection (antigen).
- Exceptions from Postulates:
- Mycobacterium leprae.
- Treponema Pallidum.
- N. Gonorrhoea.
- My pallu gone
Paul Ehrlich

- Identified the Ehrlichia organism.
- Father of chemotherapy.
- Developed Toxin-antitoxin standardisation.
- Developed the Acid-fast stain/Ziehl Neelsen stain.
Other Scientists
- Walter Reed
- Worked on yellow fever
- Identified Aedes mosquito as vector
- Walter → Water → aedes → yellow fever
- Joseph Lister:
- Father of Antiseptic surgery.
- Used carbolic acid.
- Anton Von Leeuwenhoek:
- Father of the Light microscope (unilocular)
- First to observe "little animalcules"
- Ernst Ruska:
- Invented the Electron microscope
- Edward Jenner:
- Developed the first vaccine for smallpox.
- Coined the term "vaccination"
- Wardil (Edward) kidakkunna small (smallpox) kidsnu vaccination () cheyynm
- Karry B Mullis:
- Developed PCR.
- KBM PCR 3 letters
- H C Gram:
- Developed Gram staining.
- Kleinberger:
- Discovered L forms (cell wall deficient forms).
- K- L
- Alexander Fleming:
- Discovered Penicillin.
- Barbara McClintock:
- Discovered Transposons (Jumping genes)
- Tick tok - Jumping genes
Nobel Prizes During Covid Era
- HCYV mechanism:
- Michael Houghton
- Harvey J. Alter
- Charles M. Rice

Tools of Measurement
Ratio
- Numerator & denominator are separate entities.
- tio → two → 2 entities
Examples:
- Maternal Mortality Ratio (MMR) =
- (Maternal deaths / Live births) × 1,00,000
- No. of healthy / No. of sick
- Dead / Alive
Non-Ratio
Rate
- Numerator is part of denominator
- ate a part of big part
- Expressed in relation to time
- Uses multiplicative factors (1000, 10,000, etc.)
- Examples:
- Maternal Mortality Rate =
- Incidence = No. of new cases / Time
- Birth rate
- Death rate
(Maternal deaths / Women in reproductive age) × 1,00,000
Proportion
- Numerator is part of denominator
- No account of time
- Expressed in percentage
- Multiplier in proportion is always 100.
- Examples:
- Prevalence = Total no. of existing cases
- 2° attack rate
Incidence vs. Prevalence
Prevalence = Incidence × Duration
Prevalence | Incidence |
New + Old cases (total caseload) | New cases only |
Proportion | Rate |
Cross-sectional study | Cohort study |
Increases with duration | No effect |
For administrative and planning purposes | For prevention and control of disease |
All cases (new + old) / Total population × 100 | New cases / Population at risk × 1000 |
Impact of Treatment/Prevention on Incidence & Prevalence
Scenario | Incidence | Prevalence |
1. Effective treatment (cures disease) | Same | ↓ (due to ↓ D) |
2. Treatment prevents death but not cure (e.g., ART) | Same | ↑ (due to ↑ D) |
3. Treatment cures disease | Same | ↓ (due to ↓ D) |
4. Disease is easily curable or fatal | Same | ↓ (short D) |
5. New preventive modality (e.g., vaccine) | ↓ | ↓ (due to ↓ I) |
6. Primordial prevention | ↓↓ | ↓↓ |
Measurement of Mortality
Indicator | Formula | Measurement |
Crude death rate = 6.0 | Total death (D/t any cause) / Mid year population x 1000 Mid-year : July 1st. | Rate |
Cause specific death rate | Death d/t a disease / Mid year population x 1000 | Rate |
Case fatality rate (Virulence of a disease) | Death d/t a disease / Total cases of the disease x 100 Does not account for time. | Proportion Rabies → 100% |
Proportional mortality rate | Death d/t a disease / Total deaths (All causes) x 100 | Proportion |
- fatality, mortality = proportion = x 100
Standardization
- Comparing different population groups with different variables (age, gender, etc.).
Indirect standardization | Direct standardization |
When age-specific death rates are not available. Choose a reference population Standardized mortality ratio (SMR) is calculated. | When age-specific death rates are available. Age-standardized death rate Choose a reference population |
Standardized mortality ratio (SMR)
- To compare mortality in a study population with that in a standard or general population, assuming the same age structure.
- SMR = Observed Deaths (OD) x 100
Expected Deaths (ED) - Mnemonic: Standard aytt mortality undakkanam enkil → Odichittt Edikkanam (OD/ED)
- SMR > 100:
- OD > ED.
- SMR < 100:
- ED > OD.
Age-standardized death rate
- (Number of expected deaths/ Total standard population) x 1,00,000
- To compare mortality rates between two populations with different age structures.
- It adjusts for age to make a fair comparison.
Epidemiological Methods
Epidemiological Study Design

- ODA EaRN Evidence




A. Observational (Mnemonic: ODA)
1. Descriptive
- Buzzword
- Distribution of a disease in terms of time, place and person
- Formulation of hypothesis
- Describe a disease → No comparison group
- Collection, Formulation, presentation
- Types
- Case study:
- Single case description.
- Case series:
- Multiple cases description
2. Analytical
- Testing of hypothesis
- Buzzword → Determinant
- Analyse → Comparison group is always present
B. Experimental
- Confirmation of hypothesis
Experimental (Intervention) Studies
Type | Synonyms | Unit of Study |
Randomized Control Trial | Clinical trial | Patients Intention to treat 🗸 |
Field Trial | Vaccine trial | Healthy people |
Community Trial | Community study | Communities Preventive Trials 🗸 |
C. Evidence-based medicine
- Systematic review
- Meta analysis
Q. Studying the distribution of disease or health-related characteristics in the human population and identifying characteristics with which disease seem to be associated is
a. Descriptive epidemiology
b. Experimental epidemiology
c. Analytical epidemiology
d. Interventional epidemiology
a. Descriptive epidemiology
b. Experimental epidemiology
c. Analytical epidemiology
d. Interventional epidemiology
ANS
Descriptive
Q. Which of the following is an example of a longitudinal observational and analytical study?
a. Case-control study
b. Cohort study
c. Randomized controlled trial
d. Ecological study
ANS
Cohort study
Q. Best study of first choice for assessment of Unknown or New Disease with no etiological hypothesis?
A. Cohort study B. Case control
C. Cross sectional D. Descriptive epidemiology
A. Cohort study B. Case control
C. Cross sectional D. Descriptive epidemiology
ANS
D. Descriptive epidemiology
Q. A study is designed to evaluate the feasibility of acupuncture in children with chronic headache. Sixty children with chronic headache are recruited for the study. In addition to their usual therapy, all children are treated with acupuncture three times a week for 2 months. Which of the following best describes this study design?
A. Case control B. Case series
C. Cross-sectional D. Historical cohort E. Randomized clinical trial
ANS
B. Case series
Hypothesis
- An assumption that is yet to be verified.
Descriptive Studies :

Important
- Buzzword → Distribution of a disease in terms of time, place and person
- No comparison group
Time distribution
- Short term trend
- Outbreaks
- Epidemics
- Periodic trend
- Cyclic trends:
- Dengue: 1-3 years
- Measles: 2-3 years
- Rubella: 6-9 years
- Influenza: 7-10 years
- Long term trend
- AKA secular trends.
- Means progressive change in disease occurrence
- Epidemiological transition
- Shift from an era of communicable diseases to non-communicable diseases.
- ↑↑ cases of non-communicable diseases
Person distribution
- Age, gender, marital status, social class.
- Age distribution:
- Chicken pox: 1-10 years
- Measles: 2 yr – 3 years
- Typhoid: 5-19 years
- Diphtheria: <5-7 years
- Rubella: 6 - 8 yrs
- Developing countries: 3-10 years
- Developed countries: >15 years
Place distribution
- Stomach Ca: Japan
- Breast Ca: India
- Cervical Ca: India
- Yellow Fever: Africa
Analytical Epidemiology
- Buzzword → Determinant, Testing of hypothesis

ㅤ | Cross sectional | Case control | Cohort | Ecological |
Study group | Total population/ Sample | Cases: People with disease. Control: People with no disease. | Exposed to R/F. Not exposed to R/F. | Population (Unit of study) |
Method | Survey | Interview | Follow up | 3rd party data |
Assess | Prevalence | Risk factor (R/F) | Incidence | Correlation b/w variables |
AKA | Snapshot/ Transverse | Retrospective (Effect → Cause) | Prospective & Retrospective (Cause →Effect) | Correlational |
Parameters calculated | ㅤ | Odds ratio | Risk ratios AR, RR, PAR | ㅤ |
Bias | Selection, criteria | Selection, recall | Selection, attrition, Hawthorne | Ecological fallacy |
Use | ㅤ | Rare diseases | Rare risk factors | Compare population |

1. Case Control

- To study rare diseases.
- Cases → disease present
- Control → disease absent
Steps of a Case-Control Study
- Take SMEAr
- Selection of cases and control
- For 1 case:
- maximum 4 controls can be taken
- if the cost of collecting cases & control equal
- 1:1 is also sufficient
- Matching
- Measurement of Exposure
- Analysis and interpretation
A landmark Case control study
- Doll and Hill study
- Relation between Lung cancer and smoking
Many children from a particular community coming to a hospital were detected to have acute lymphoblastic leukemia (ALL). It was assumed that it is due to the presence of cytotoxic waste in the water of that community. If a case-control study has to be done to find whether the chemical and ALL are associated, what will be taken as the control?
A. Children from the area exposed, but unaffected with the disease
B. Children from the area not exposed and affected with the disease
C. Children coming to your OPD, who do not have the disease
D. All children with ALL irrespective of exposure status
ANS


Odds Ratio (OR):
- Cross product ratio.
- (a x d) / (b x c)
ㅤ | Disease + | Disease - |
Risk factor + | a | b |
Risk factor - | c | d |
- Mnemonic: Against (X → cross) Odds
- So, if odds ratio is 1.5
- This can be explained as “people with myocardial infarction are 1.5 times more likely to be exposed to smoking than non smokers”
- Because it is a case control study, which starts with outcome.
- DO NOT INTERPRET AS 👉 “people with smoking are 1.5 times more likely to develop mi.”
2. Cohort (Longitudinal) study


What type of study is being conducted to investigate the potential link between exposure to aniline dye and bladder cancer in workers who have been employed in the industry for more than 20 years? The study involves two groups:
- one group includes workers directly involved in handling the dye,
- while the other group consists of office clerks who are not directly exposed to the dye.
- The duration of occupation was recorded from available records.
A. Retrospective cohort study
B. Prospective cohort study
C. Case-control study
D. Intervention and response
B. Prospective cohort study
C. Case-control study
D. Intervention and response
ANS
Retrospective cohort study
Relative Risk (RR):
- AKA Risk ratio.
- Incidence in exposed (IE)
Incidence in non-exposed (INE) - = a/(a+b)
c/(c+d)
Interpreting OR & RR:
OR/RR | Association |
= 1 | No |
< 1 | Negative (Protective factor) |
> 1 | Positive (Risk factor) |
- ⇒ Confidence interval should not include 1
Formula
- NNT = 1 / ARR
- Lower NNT = better drug
- ARR (Absolute Risk Reduction) = ARC - ART
- ARC = Absolute risk in control group
- ART = Absolute risk in treatment group
- AR = Absolute risk of bad outcome like death
- Death in group
Total in group
Other Related Formulas
- RRR (Relative Risk Reduction) = (ARC - ART) / ARC = 1 - RR
- RR (Relative Risk) = ART / ARC
Example 1

Calculate Risk factors ?
✅ Significant Risk Factors
(OR > 1 + Confidence Interval does not include 1):
- Early menarche: OR = 1.3 (1.2–1.9)
- BRCA Mutation: OR = 1.9 (1.1–2.8)
- Obesity: OR = 1.25 (1.1–1.7)
❌ Not significant
(Confidence Interval includes 1 or OR < 1):
- High socioeconomic status: OR = 1.4 (0.5–3.5) → Not significant (CI includes 1)
- Age < 50 yrs: OR = 0.04 (0.01–0.32) → Protective factor
- Smoking: OR = 0.5 (0.02–0.9) → Protective factor
- Alcohol: OR = 1.0 (0.8–1.8) → Not significant
- No family history of Ca. breast: OR = 0.3 (0.01–0.9) → Protective factor

Project MONICA
- Related to heart disease
Mixed cohort study
- Bidirectional / Ambispective Cohort Study
- Has both retrospective and prospective components

- Example (Study starting in 2023)
- Retrospective:
- Collect past data
- Divide into exposed and non-exposed groups
- Check incidence of skin rash at baseline
- Prospective:
- Follow-up from 2023 → Continue tracking
Q. The relative risks for vaccines 1 and 2 are 0.5 and 2.0 respectively. Identify which of the following graphs represents them correctly.

ANS
- 1 and 2
- 1 → 0.5, 2
2 → 0.5, 2
3 → 0, 2
Q. A study conducted on 46 mothers who delivered deformed babies showed that 41 were found to have thalidomide during their early pregnancy; what kind of study is being done here?
A. Prospective cohort study
B. Retrospective cohort study
C. Case-control study
D. Descriptive study
A. Prospective cohort study
B. Retrospective cohort study
C. Case-control study
D. Descriptive study
ANS
Case-control study → Study started from a rare disease
Nested Case-Control Study

- Definition
- Smaller case-control study within a larger cohort.
- Start as a cohort study → Develop into case control study
- For rare/expensive tests,
- only analyze cases and a subset of controls

- Example
- Population: 100 children born in Delhi hospitals
- Initial data/specimen collection from mothers:
- Blood samples
- Urine samples
- Stem cells
- Prenatal exposure details
- Study type: Prospective cohort (followed for 50 years)
- Observation
- 10 children developed disease → Cases
- 90 children did not → Eligible pool for controls
- Advantages
- Prospective
- Temporality maintained
- Recall bias eliminated
- Cost-effective
- Limited testing only
- Suitable for rare investigations
- Useful for biological precursor studies
- Efficient:
- No need to test entire cohort
Attributable Risk:
- measures the absolute difference OR disparity in the incidence of a disease between exposed and non-exposed
- IE - INE
IE
- % of disease d/t risk factor under study among exposed group.
- risk difference
- Community physician/epidemiologist
Population Attributable Risk:
- I(total) - INE
I(total)
- % decline in disease if risk factor was lowered in general population.
- Provides an estimate of amount by which the disease could be reduced in any population if suspected factor was eliminated or modified
- Used to design health programs.
- Policymakers (smoking cessation program)
IMP: Attributable risk of an exposure refers to which of the following options?
(A) To what an extent the disease under study can be attributed to an exposure
(B) To what an extent the exposure can correlated to the disease under the study.
(C) To what an extent the disease under study can be prevented
(D) None of the above
(B) To what an extent the exposure can correlated to the disease under the study.
(C) To what an extent the disease under study can be prevented
(D) None of the above
ANS
A
Which of the following statements accurately reflects the findings of the multivariate analysis presented in the table below, regarding the correlation between the risk of blindness and variables such as age, literacy rate, residence, and gender?

- The female population is at greater risk of developing blindness.
- People dwelling in rural areas are at increased risk of developing blindness
- The risk of developing blindness increases with age
- Illiteracy leads to an increased risk of developing blindness
Note:
- If p value < 0.05 → accept the study
- If odds ratio > 1 → strong association
ANS

Number Needed to Treat (NNT)
- NNT = Number of patients to treat to prevent one additional bad outcome
(e.g. pain, death, MI, stroke, haemorrhage)
Formula
- NNT = 1 / ARR
- Lower NNT = better drug
- ARR (Absolute Risk Reduction) = ARC - ART
- ARC = Absolute risk in control group
- ART = Absolute risk in treatment group
- AR = Absolute risk of bad outcome like death
- Death in group
Total in group
Other Related Formulas
- RRR (Relative Risk Reduction) = (ARC - ART) / ARC = 1 - RR
- RR (Relative Risk) = ART / ARC
✅ Ischemic stroke prevention example
Group | Stroke (+) | Stroke (–) | Total |
A (Treatment) | 17 | 83 | 100 |
B (Control) | 22 | 78 | 100 |
- ARC = 22 / 100 = 0.22
- ART = 17 / 100 = 0.17
- ARR = 0.22 - 0.17 = 0.05
- NNT = 1 / 0.05 = 20
→ 20 patients needed to treat to prevent 1 ischemic stroke in 10 years
Experimental Epidemiology
Types:
Non-randomized | Randomized |
• Natural experiments • Preventive trials (Vaccine trials). • Before after trials (Pre-post study design) | • Clinical trials. • Risk factor trials. • Cessation experiments. • Trial of ecological agents. |
Randomization:

- 1st step
- Selection (Eg 200 people)
- Not random, so bias prone
- 2nd step
- Allocation into groups.
- Randomisation is done at this stage
- Removes selection bias & confounding factors
- Known and equal chance.
Evidence-Based Medicine
- Research on previous study design.
- Systematic reviews
- Preliminary studies to assess factors, associations, and other variables.
- Meta-analysis (Better)
- Best level of causal association.
- Systematic study + statistical approach.
- To find the final effect of an intervention, or risk factor.
- Approach through forest plot.
Clinical Trials:

Licensing Authority:
- USA: FDA
- India: DCGI (Drug Controller General of India)

Phase | Participants | Key Objectives | Study Design |
Preclinical | In animals | • Pre-clinical study CPCSEA approval • Clinic (Preclini) for Animals in Sea (CPCSEA) | ㅤ |
Phase 0 | 10–15 healthy volunteers | • Use Radiolabeled substances • [Pharmacokinetics/ Pharmacodynamics] • "Micro-dosing study." • For Expensive/ toxic drugs • [Conducted on humans] • micrO, radiO, tOxic drugs = 0 | • Exploratory Maximum drug amount: 100 mcg or (1/100)th of Human Equivalent Dose, whichever is lower. |
Phase I | Healthy volunteers | Determine • Maximum Tolerated Dose (MTD) • Toxicity • NOT Efficacy • Safety • Tolerability • I → T → Toxicity, Tolerability, safeTy, mTd | ㅤ |
Phase II | Patients 100–300 patients | • First test of efficacy • Safety, Dose range refinement | • Single blind studies • Max failure |
Phase III | Patients (up to 5000) | • Confirm efficacy = Efficacy trial • Compare safety and efficacy ↳ with old standard or placebo drug | • Multicentric • Double blind studies |
Phase IV | Post-market patients | • Identify rare and long-term adverse effects | • Post-marketing studies • Blackbox warnings |
Phase V | ㅤ | • Pharmaco-epidemiology | ㅤ |
- Preclinical study
- CPSEA approval
- Clinic (Preclini) for Animals in Sea (CPCSEA)
- Phase 0
- IND application filing:
- Submit new experimental drug application to CDSCO
- for a human clinical trial.
- 0 = IND (0 discovered by India)
- Phase 2
- Central Drugs Standard Control Organisation (CDSCO)
- After Phase 3
- NDA filing (3 letter = Phase 3)
- leading to the drug entering the market.
- Blackbox warnings
- Anti depressants = Suicidal effect
- No Suicide risk → Clozapine & Lithium
- Monteleukast = Depression & Suicide
Guidelines for Epidemiological Studies:
Study | Name | Guidelines description | ㅤ |
Randomized control trials | CONSORT | Consolidated Standards of Reporting Trials | Concert → Random arrangement |
Observational studies | STROBE | Strengthening the Reporting of Observational studies in Epidemiology | Strong observation |
Systematic reviews or Meta Analysis | PRISMA | Preferred Reporting Items for Systematic Reviews and Meta Analyses | Prismatic reviews Ma → meta analysis |
Case reports | CARE | Case Report | Take CARE of the CASE |
Qualitative research | SRQR | Standards for Reporting Qualitative Research | Sarkar → needs Quality |
Quality improvement studies | SQUIRE | Standards for Reporting Quality Improvement Reporting Excellence | Squire → helper → to improve quality |
Diagnostic/Prognostic studies | STARD | Standards for Reporting of Diagnostic Accuracy | Start Diagnosing |
Clinical trial protocols | SPIRIT | Standard Protocol Items: Recommendations for Interventional Trials | Try (Trial) spirit |
Quality of observational studies analysed for meta-analysis | MOOSE | Meta-analyses Of Observational Studies in Epidemiology | Meta moose |
Evidence based

1. Systematic Review
- A literature review that:
- Collects data
- Critically analyses multiple research studies
- Includes quantitative synthesis of all studies
- Steps of Systematic Review
- QSER
- Identifying Questions
- Set eligibility criteria
- Searching & Selecting studies
- Extract/Abstract Data
- Analysis / Result interpretation
2. Meta analysis
- One step ahead of systematic review
- Performs quantitative analysis
- Combines previous research systematically
- Arrives at overall conclusion
- Steps
- QSER
- Identifying Questions
- Set eligibility criteria
- Searching & Selecting studies
- Extract/Abstract Data
- Analysis / Result interpretation
- Outcome: Forest Plot

Funnel Plot


- Drawn for:
- Systematic review
- Meta-analysis
- Used to:
- Check publication bias
- Assess study quality
Forest Plot (Blobbogram)

- Only for Meta-analysis
- Visual display of effect sizes across studies
- Horizontal line → Confidence interval
Interpretation
- If Interval includes 1 → Not significant
- If Interval excludes 1 → Significant association
- OR > 1 → Risk factor (positive association)
- OR = 1 → No association
- OR < 1 → Protective factor (inverse association)
- Diamond → Represents overall/pooled estimate
- Example: OR = 2.2 (1.9 - 2.7) → Significant, Risk factor
Levels of Evidence/Hierarchy of Epidemiology:


- Best study: Confirmation of hypothesis
- Evidence based medicine
- Meta analysis → Best
- Systemic reviews
- Experimental studies
- Test hypothesis
- Cohort studies
- Case-Control studies
- Unknown disease: To formulate hypothesis
- Case-Series/Reports, Editorials, Opinions
- Animal studies
Bias, Confounding & Casual Associations
Errors:
- Random errors: Sampling errors.
- Systematic errors: Bias
- Even if we do everything systematically → still error occurs → Systematic error
Bias:
Bias | Description |
Selection bias | • Preferred selection. • Removed by randomisation. |
Attrition bias | • Loss to follow up. • In Cohort studies > experimental trials • Mnemonic: Atresia → Lost |
Recall bias | • Differential ability to recall between cases and controls. • In Case control studies |
Berksonian bias | • Birth of son in hospital • Differential hospital admission rates. • In Hospital based case control studies |
Neyman bias | • Man can die or not die • In Case control, experimental trials • Differential mortality pattern b/w groups. • AKA incidence prevalence bias. |
Hawthorne bias | • In Cohort studies • Subjects modify behavior when observed. • Example: Smokers quit smoking during a study. |
Pygmalion bias | • Researchers strong beliefs • Removed by blinding • Researchers strong beliefs Pig is male |

Ecological fallacy:
- Results of an ecological study are not applicable at individual level.
Rx of bias:
- Blinding
- BLINDING → Remove Bias, not confounder
- Randomisation → both B and C (RBC)
- Blinding → Only B (BB)
- Matching → Only C (MC)
- Types
- Single blinding
- Subject is blinded.
- Double blinding
- M/C done
- Subject + doctor are blinded.
- Machine used.
- Triple blinding
- Statistician/analyser + doctor + subjects are blinded.
- Best blinding.
- Researcher → not blind
Confounding


Effect Modifier:
- A variable that changes the strength or direction (positive or negative)
- of the observed effect of a risk factor on disease status
Confounder:
- 3rd variable, unequal distribution.
Criteria for a confounder:
- (All 3 required)
- Associated with risk factor.
- Associated with disease.
- Can lead to disease directly or indirectly.
- Example: Nicotine in smoke
Rx of confounding:
Type of confounder | Removed by |
Known | Matching |
Unknown | RSSR • R: Randomisation (M/C done). • S: Standardization • S: Stratification (Simplest). • R: Regression (Best). |
- Randomisation is universal treatment for both selection bias and confounding factor
- BLINDING → Remove Bias not confounder
- Randomisation → both B and C (RBC)
- Blinding → Only B (BB)
- Matching → Only C (MC)
Matching
- Matching means Cases and control are similar to almost all the factors that can affect the outcome of the study, except the disease under study
- Matching eliminates all the known confounders in the study.
Mnemonic:
- Confound → Kalyana alochana
- If know the person → match the person
- If no one is there
- find Standard () Random (m/c) person
- Stratify () → list according to preference → easy (engineers/doctors first) → simplest
- Regress () → cross out bad people → difficult but best thing to do


Study of Choice
Study | Indication |
Cohort | • Rare risk factor • Establish natural course of disease • Incidence • Multiple outcomes of risk factor • Long time taking |
Case control | • Rare disease • Multiple risk factors for disease |
Nested case control ↳ Type of analytical study | • Rare investigation (Expensive) |
Case control, Cross sectional | • Less time, less expensive |
Cross sectional | • Prevalence |
Ecological | • Correlation of variables |
Descriptive | • Hypothesis formulation • Formula → describe |
Analytical | • Hypothesis testing • Analyse and test |
Evidence based medicine | • Hypothesis validation & confirmation |
Hill's criteria of Causal Association:
- Temporality:
- Most essential criteria.
- RF should precede disease
- Biological plausibility.
- Biological explanation
- Coherence of association.
- Every time, RF should lead to disease
- Validity of association.
- Specificity, sensitivity, OR, RR
- Dose response relationship.
- ↑ Rf α ↑ Disease
- Specificity of association:
- Same RF → Same Disease every time
- most difficult to prove.
- Least essential criteria

Validity:
- Internal:
- Means study gives similar results on repeating
- Measure of robustness of study
- Based on bias → If bias + , Internal validity low
- External:
- It shows generalisability of study
- Study results can be applied/extrapolated to larger population.
- Based on sample size >> bias.
RE α 1 / (Precision α Reproducibility/ Reliability)
- Precisely () random () alkkare kuthiyal reproduce () cheyyam
SE α 1 / (Accuracy α Validity)
- Vaccurat (Validity → Accuracy) systematic (systematic error)



- Validity:
- Hit the board
- Results are within a desired range.
- Accuracy:
- Bull’s eye
- Nearness/closeness to the actual true value.
- Reliability:
- Repeatability
- Out of the board, but same result when repeated
- Dependable, reproducible, repeatable results.
- Note:
- Serial interval:
- Proxy indicator for incubation period.
- CFR:
- Denote virulence of a disease.
Ecological study
- Type: Observational analytical study
- Also called:
- Correlational study
- Geographical study
- Population study
- Aggregate study
Design
- Exposure and outcome assessed simultaneously
- Uses secondary data (already collected data) from private or third parties
- Commonly used in nutritional services
- Represented using scatter diagrams to show correlation
Unit of Study
- Population, not individuals
Limitation
- Ecological fallacy:
- Error of applying population-level results to individuals
- Population trends may not reflect individual-level associations
- Example
- Countries with high daily meat consumption show more colon cancer cases
- But people with colon cancer may not necessarily be meat-eaters

A research group designed a study to investigate the epidemiology of syphilis. The investigators examined per capita income and rates of syphilis in Delhi, Hyderabad, Noida, and Mumbai. Data on city-wide syphilis rates were provided by each city's health agency. The investigators ultimately found that the number of new cases of syphilis was higher in low-income neighbourhoods. The study is best described as which of the following?
Ans:
- We are studying cities - population.
- Relying on secondary data
- So, this is an ecological study
What type of study is it when a person collects data on lung cancer patients from government hospitals and the corresponding number of cigarette packets sold during the same time period, in order to examine the relationship between smoking and lung cancer?
A. Cross sectional
B. Ecological
C. Experimental
D. Quasi-experimental
A. Cross sectional
B. Ecological
C. Experimental
D. Quasi-experimental
Explanation
- Both Cross sectional and Ecological study → Collect data at same point
- If data is collected from individuals → Cross sectional study
- If data is collected from a data bank like hospital record, health agency records or a shop → Ecological study
Question based study
Drug | Interpretation |
A | Inferior |
B | Possibly inferior / Inconclusive |
C | Equivalent (crosses 0) |
D | Equivalent (clear) |
E | Equivalent but borderline superior |
F | Likely superior |
G | Clearly superior |

Understanding the graph:
- X-axis: Effectiveness difference between test drug and comparator.
- 0 = no difference.
- +δ = margin for superiority.
- –δ = margin for inferiority.
- Zone of Equivalency:
- If confidence interval lies inside this zone


