❓
Understanding Causal Inference
  • A Guide to Causal Inference
  • Table of Contents
  • About-us
  • Preface
  • What is Causality?
  • Why bother with Causality?
  • Origin of Causality
  • Statistical Inference Vs Causal Inference
  • Decision-Making
  • Why we need Causality?
    • Leaders in the Industry
  • Key Causal Terms and FAQ
  • Assumptions
    • Causal Assumptions
  • Bias
    • Selection Bias
    • Correlation is not Causation
      • Simpsons Paradox
  • Causal Graphs
    • Colliders
    • Confounders
    • Mediators
    • Back Door Paths
    • Front Door Paths
    • Structural Causal Model
    • do-calculus
    • Graph Theory
    • Build your DAG
    • Testable Implications
    • Limitations of Causal Graphs
  • Counterfactuals
    • Potential Outcomes Framework
  • Modeling for Causal Inference
    • Experimental Data
      • Randomization
        • Problems with Randomization
        • A/B Testing
          • Experiment
    • Non-Experimental / Observational Data
      • Instrumental Variables
      • Weighting
        • Inverse Propensity Weighting
      • Propensity Score
      • Sensitivity Analysis
      • Regression Discontinuity
      • Matching
      • Stratification
        • Methods
        • Implications
  • Tools and Libraries
    • DoWhy
      • Do-Sampler
      • EconML
      • Workflow
    • Causal Graphical Models
    • CausalInference
    • Dagitty
    • Other Libraries
  • Limitations of Causal Inference
    • Fundamental Problem of Causal Inference
  • Real-World Implementations
  • What's Next
  • References
Powered by GitBook
On this page

Was this helpful?

  1. Modeling for Causal Inference
  2. Non-Experimental / Observational Data
  3. Stratification

Implications

PreviousMethodsNextTools and Libraries

Last updated 4 years ago

Was this helpful?

Adding visualization here showing stratification:

Example: Comparing the Click-Through Rates(CTR) for A page and B page.

  • Analysis without stratification:

A performs a little better than B

  • **Analysis on removal of over sampling for Old people in the data set.

B performs better than A

  • Analysis when we stratify on one of the features("Age"):

We see that the performance of A and B depends on the age of the person and that is the reason we were getting flawed inferences earlier.

****

Go to the for reference!!!

Notebook