Shattering

Let $X$ be a set of objects one wants to label as 0 or 1, and let $H$ be a class of functions from $X$ to $\{0, 1\}$, called a hypothesis class. In other words, $H \subseteq \{0, 1\}^{X}$. Let $S = \{p_{1}, \dots, p_{n}\} \subseteq X$ be a finite set of points in $X$. The hypothesis class $H$ shatters $S$ iff for every label assignment $f : S \to \{0, 1\}$ there exists some $h \in H$ such that $h(p) = f(p)$ for all $p \in S$.

Writing $h|_{S}$ for the restriction of $h$ to $S$, the definition can be restated in two equivalent ways:

The hypothesis class $H$ shatters $S$ iff for every label assignment $f : S \to \{0, 1\}$ there exists some $h \in H$ such that $h|_{S} = f$.

The hypothesis class $H$ shatters $S$ iff $\{h|_{S} : h \in H\} = \{0, 1\}^{S}$.
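As a rough illustration of the restriction form, the following sketch checks shattering by brute force for a finite list of hypotheses and a list of distinct points. The names `restrictions`, `shatters`, and `thresholds` are hypothetical helpers chosen for this example, and the threshold class is just an assumed toy hypothesis class.

```python
from itertools import product

def restrictions(hypotheses, points):
    """Label tuples (h(p_1), ..., h(p_n)) realized on `points` by some hypothesis."""
    return {tuple(h(p) for p in points) for h in hypotheses}

def shatters(hypotheses, points):
    """True iff every labeling in {0, 1}^S is the restriction of some hypothesis."""
    all_labelings = set(product((0, 1), repeat=len(points)))
    return restrictions(hypotheses, points) == all_labelings

# Assumed toy hypothesis class: thresholds h_t(x) = 1 iff x >= t, for t = 0, ..., 5.
thresholds = [lambda x, t=t: int(x >= t) for t in range(6)]

one_point = shatters(thresholds, [2])      # both labels of a single point are realized
two_points = shatters(thresholds, [1, 3])  # the labeling (1, 0) is never realized
```

Here `one_point` is `True` and `two_points` is `False`: no threshold labels the smaller point 1 and the larger point 0, so the set of restrictions falls short of $\{0, 1\}^{S}$.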

Since a function into $\{0, 1\}$ can be identified with the subset of its domain that it maps to 1, the definition can also be stated purely in terms of sets. Let $H' = \{h^{-1}(\{1\}) : h \in H\}$ be the family of regions that the hypothesis class can label as positive.

The hypothesis class $H$ shatters $S$ iff for each subset $E$ of $S$ there exists some set $A \in H'$ such that $E = A \cap S$.

The hypothesis class $H$ shatters $S$ iff $P(S) = \{A \cap S : A \in H'\}$, where $P(S)$ denotes the power set of $S$.
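The set form can be checked the same way. The sketch below compares the traces $A \cap S$ against the power set of $S$. The helper names `powerset`, `traces`, and `shatters_sets` are hypothetical, and the family of integer intervals is an assumed toy example of $H'$.

```python
from itertools import combinations

def powerset(s):
    """All subsets of s, as frozensets."""
    items = list(s)
    return {frozenset(c) for r in range(len(items) + 1) for c in combinations(items, r)}

def traces(positive_regions, s):
    """The family {A ∩ S : A in H'}."""
    s = frozenset(s)
    return {frozenset(region) & s for region in positive_regions}

def shatters_sets(positive_regions, s):
    """True iff the traces on s are exactly the power set of s."""
    return traces(positive_regions, s) == powerset(s)

# Assumed toy family H': integer intervals [a, b] inside {0, ..., 4}.
# range(a, b + 1) is empty when a > b, so the empty region is included too.
domain = range(5)
intervals = [frozenset(range(a, b + 1)) for a in domain for b in domain]

two_ok = shatters_sets(intervals, {1, 3})       # every subset of {1, 3} is a trace
three_bad = shatters_sets(intervals, {0, 2, 4})  # {0, 4} is not a trace
```

With these definitions `two_ok` is `True` and `three_bad` is `False`, because any interval containing both 0 and 4 also contains 2.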

References

https://en.wikipedia.org/wiki/Shattered_set

Understanding Machine Learning, chapter on VC dimension.