Do operator: Difference between revisions

Latest revision as of 23:11, 14 February 2019

The do operator is used in causal inference to denote an intervention. Given random variables $X, Y$ , we write $Pr (Y = y ∣ d o (X = x))$ to mean the probability that $Y = y$ given we intervene and set $X$ to be $x$ . In some texts, this is abbreviated to $Pr (y ∣ \hat{x})$ (this notation assumes that the random variables corresponding to the individual values are clear from context). The notation ${Pr}_{x} (y)$ is also used.

In general $Pr (Y = y ∣ d o (X = x))$ is not the same as conditioning on $X = x$ , i.e. $Pr (Y = y ∣ X = x)$ . Note also that in the expression $d o (X = x)$ , the subexpression $X = x$ does not mean the event where the random variable $X$ takes on the value $x$ , i.e. the event ${ω \in Ω : X (ω) = x}$ . Thus, inside a do operator, the standard notational convention of probability theory does not hold. To stress the point, suppose the event $X = x$ can be specified in another way, such as by the event $Z = z$ . In this case, since $X = x$ and $Z = z$ are exactly the same set, the probabilities involving them, such as $Pr (X = x)$ vs $Pr (Z = z)$ and $Pr (Y = y ∣ X = x)$ vs $Pr (Y = y ∣ Z = z)$ , should all be the same, but I don't think $Pr (Y = y ∣ d o (X = x))$ and $Pr (Y = y ∣ d o (Z = z))$ need be the same (check this).

The do operator is used extensively in the do calculus.

History

Pearl: "An equivalent notation, using $s e t (x)$ instead of $d o (x)$ , was used in Pearl (1995a). The $d o (x)$ notation was first used in Goldszmidt and Pearl (1992) and is gaining in popular support. Lauritzen (2001) used $P (y ∣ X \leftarrow x)$ . The expression $P (y ∣ d o (x))$ is equivalent in intent to $P (Y_{x} = y)$ in the potential-outcome model introduced by Neyman (1923) and Rubin (1974) and to the expression $P [(X = x) ◻ \to (Y = y)]$ in the counter-factual theory of Lewis (1973b)."^[1]

References

↑ Judea Pearl. Causality. p. 70, footnote 2

[1] Judea Pearl. Causality. p. 70, footnote 2

[1]

@@ Line 1: / Line 1: @@
 The '''''do'' operator''' is used in causal inference to denote an intervention. Given [[random variable]]s <math>X,Y</math>, we write <math>\Pr(Y=y \mid \mathit{do}(X=x))</math> to mean the [[probability]] that <math>Y=y</math> given we intervene and set <math>X</math> to be <math>x</math>. In some texts, this is abbreviated to <math>\Pr(y\mid\hat x)</math> (this notation assumes that the random variables corresponding to the individual values are clear from context). The notation <math>\Pr_x(y)</math> is also used.
-In general <math>\Pr(Y=y \mid \mathit{do}(X=x))</math> is not the same as conditioning on <math>X=x</math>, i.e. <math>\Pr(Y=y \mid X=x)</math>.
+In general <math>\Pr(Y=y \mid \mathit{do}(X=x))</math> is not the same as conditioning on <math>X=x</math>, i.e. <math>\Pr(Y=y \mid X=x)</math>. Note also that in the expression <math>\mathit{do}(X=x)</math>, the subexpression <math>X=x</math> does not mean the event where the random variable <math>X</math> takes on the value <math>x</math>, i.e. the event <math>\{\omega\in\Omega : X(\omega) = x\}</math>. Thus, inside a ''do'' operator, the standard notational convention of probability theory does not hold. To stress the point, suppose the event <math>X=x</math> can be specified in another way, such as by the event <math>Z=z</math>. In this case, since <math>X=x</math> and <math>Z=z</math> are exactly the same set, the probabilities involving them, such as <math>\Pr(X=x)</math> vs <math>\Pr(Z=z)</math> and <math>\Pr(Y=y \mid X=x)</math> vs <math>\Pr(Y=y \mid Z=z)</math>, should all be the same, but I don't think <math>\Pr(Y=y \mid \mathit{do}(X=x))</math> and <math>\Pr(Y=y \mid \mathit{do}(Z=z))</math> need be the same (check this).
 The ''do'' operator is used extensively in the [[Do calculus|''do'' calculus]].
+==History==
+Pearl: "An equivalent notation, using <math>\mathit{set}(x)</math> instead of <math>\mathit{do}(x)</math>, was used in Pearl (1995a). The <math>\mathit{do}(x)</math> notation was first used in Goldszmidt and Pearl (1992) and is gaining in popular support. Lauritzen (2001) used <math>P ( y \mid X \leftarrow x)</math>. The expression <math>P(y \mid \mathit{do}(x))</math> is equivalent in intent to <math>P(Y_x = y)</math> in the potential-outcome model introduced by Neyman (1923) and Rubin (1974) and to the expression <math>P[(X = x) \mathbin{\Box\!\!\rightarrow} (Y = y)]</math> in the counter-factual theory of Lewis (1973b)."<ref>Judea Pearl. ''Causality''. p. 70, footnote 2</ref>
+==References==
+<references/>
 [[Category:Causal inference]]