Causal Graphs
Last updated
Last updated
Causal graphs also referred to as DAG - Directed Acyclic Graphs!!!
There are 2 types of graphs:
Directed Graphs: When the arrow points to the direction of the cause of a variable. There has a particular direction of the path between the variables. Directed graphs have edges with direction.
UnDirected Graphs: When there is a line between the variables showing that there is an association between them. Undirected graphs have edges that do not have a direction.
A Directed Graph can be of 2 types:
Cyclic Graphs: A cyclic graph is a graph in which the only repeated vertices are the first and last vertices.
Acyclic Graphs: An acyclic graph is a graph without cycles. When following the graph from node to node, you will never visit the same node twice.
Causal graphs or DAG's are the graphs with :
Directed paths (No undirected paths) between variables.
Acyclic graphs
Note: Why Acyclic?
A causal arrow represents the time associated with the node, which increases as we follow the paths in the graph so that there is no path in order to return to the same node. Causality represents that a particular event or treatment affects the future and cant affect the past, hence there are no causal Loops.
We require the graph to be acyclic to prevent "causal loops". A DAG must not contain a feedback loop where a variable causes itself.
Causal graphs provide a way to visualize the causal relationships between the treatment, the outcome, and all the biases (confounder bias, selection bias, and others) altogether. Causal Graphs represent association and causation both, simultaneously. They help us think better. They allow us to think in a new way about solving a problem.
Note: Causal Graphs are essential because they explicitly define the underlying assumptions.
Causal DAGs consist of three elements:
Variables (nodes, vertices)
Arrows (directed edges, arcs): possible direct causal effects. The arrows order the variables in time.
Missing arrows; sharp assumptions about absent direct causal effects.
Helpful in:
Identifying variables to control for
Explicit the assumptions about origin of data
Applications:
Model testing
Structure learning
Reduce "what if" questions to symbolic calculus(Testable implications)
Reducing scientific questions to symbolic calculus(Testable implications)