The Well-Ordering Theorem

Variants of the well-ordering theorem involving ordinals and cardinals

Ordinals

As a recap in ordinal theory, an ordinal α\alpha is any set which is transitive (xα    xαx \in \alpha \implies x \subseteq \alpha) and xyx \in y is a well-order in α\alpha. If we denote x+1=x{x}x + 1 = x \cup \{x\}, it follows that if α\alpha is an ordinal, so is α+1\alpha + 1. And if SS is a set of ordinals, S\bigcup S is also an ordinal. \emptyset is an ordinal and we label it 0. Any nonzero ordinal is either a successor (α=β+1\alpha = \beta + 1 for some ordinal β\beta) or limit ordinal (α\alpha is the union of all ordinals βα\beta \in \alpha).

It then follows that the class of all ordinals is well-ordered by xyx \in y. This means any nonempty class of ordinals has a minimum element, and given any two ordinals αβ\alpha \ne \beta, either α<β\alpha < \beta or α>β\alpha > \beta, where the order here is \in. The transfinite recursion principle allows us to inductively construct transfinite sequences, which are essentially families indexed by some ordinal.

Given a well-ordered set WW, we denote W(x)={yW : y<x}W(x) = \{ y \in W \ : \ y < x \}. We shall say two well-orders W1W2W_1 \simeq W_2 are isomorphic if there exists a bijective function f:W1W2f : W_1 \to W_2 which is order-preserving. The isomorphism theorem of well-orders states that, given any W1,W2W_1,W_2 well-ordered sets, one of three cases will always happen:

  1. W1W2W_1 \simeq W_2

  2. W1(x)W2W_1(x) \simeq W_2 for some xW1x \in W_1

  3. W1W2(y)W_1 \simeq W_2(y) for some yW2y \in W_2

Since the class of ordinals is a proper class, and every ordinal itself is well-ordered, we have the main result that every well-ordered set must be isomorphic to some ordinal.

Cardinals

If we denote the relation \sim on the class of all sets to mean XYX \sim Y iff there exists a bijection f:XYf : X \to Y, then this is an equivalence relation. A cardinal is any equivalence class of this relation. We shall denote the equivalence class containing XX (known as the cardinality of XX) as card(X)\card{X}.

If we define a relation XYX \le Y on the class of all sets to mean “there exists an injective function f:XYf : X \to Y”, then this relation is cardinality-preserving. This means, if XXX \sim X' and YYY \sim Y', then XY    XYX \le Y \iff X' \le Y'. Because of this, this relation can be induced into a relation on the cardinals, where card(X)card(Y)\card{X} \le \card{Y} iff XYX \le Y.

We have the following properties

  1. (Reflexivity) card(X)card(X)\card{X} \le \card{X}

  2. (Transitivity) If card(X)card(Y)\card{X} \le \card{Y} and card(Y)card(Z)\card{Y} \le \card{Z}, then card(X)card(Z)\card{X} \le \card{Z}.

  3. (Antisymmetry) If card(X)card(Y)\card{X} \le \card{Y} and card(Y)card(X)\card{Y} \le \card{X}, then card(X)=card(Y)\card{X} = \card{Y}. This follows from the Cantor-Schröder-Bernstein theorem.

Evidently, this proves \le is a partial order on the set of cardinals. But if we now inspect the set of ordinals, we know that every ordinal must lie inside a cardinal. By transfinite recursion, we can define the initial ordinals, which are ordinals α\alpha such that for all β<α\beta < \alpha, we have card(β)<card(α)\card{\beta} < \card{\alpha} (meaning it is the smallest ordinal representing its cardinality).

Every ordinal has a corresponding unique initial ordinal which has the same cardinality, but the converse isn’t true in ZF theory. Similarly, every set XX can be associated with a least ordinal α\alpha such that α≰X\alpha \not\le X (in the sense that no injective function f:αXf : \alpha \to X exists), which is known as Hartogs number and can be proven with ZF theory. But in ZF theory, that doesn’t imply XαX \le \alpha, since we can’t use this fact to effectively construct an injective function f:Xαf : X \to \alpha.

Well-ordering theorem

All of the results above are true under ZF, even without the axiom of regularity. We now introduce different versions of the well-ordering theorem (which is only a theorem assuming some other version of the axiom of choice).

Well-ordering theorem. Every set can be well-ordered.

Proposition. The following statements are all equivalent.

  1. Every set can be well-ordered.

  2. Axiom of choice

  3. Every set is bijective to some ordinal.

  4. Every cardinal contains some initial ordinal.

  5. The \le relation on cardinals is a total order.

Proof. (1) ⇒ (2) If X\bigcup X can be well-ordered, then every element of XX has a least element. Thus, the map f:XXf : X \setminus \emptyset \to \bigcup X defined as f(x)=minxf(x) = \min x is a choice function.

(2) ⇒ (3) Let ff be a choice function on P(X)\mathcal{P}(X). By transfinite recursion, let x0Xx_0 \in X be anything, and for any ordinal α1\alpha \ge 1,

xα=f(Xβ<αxβ)x_\alpha = f\left(X \setminus \bigcup_{\beta < \alpha} x_\beta\right)

unless X=β<αxβX = \bigcup_{\beta < \alpha} x_\beta, in which case just let xα=x0x_\alpha = x_0. This must happen eventually, as otherwise this would be an injective sequence from all ordinals to XX (making XX a proper class). If α>0\alpha > 0 is the least ordinal where xα=x0x_\alpha = x_0, then (xβ)β<α(x_\beta)_{\beta < \alpha} is an injective transfinite sequence and contains every element of XX, so g:αXg : \alpha \to X defined as g(β)=xβg(\beta) = x_\beta is bijective.

(3) ⇒ (4) Let κ\kappa be a cardinal and XκX \in \kappa. If XX is bijective to α\alpha, then it must be bijective to the initial ordinal α0\alpha_0 associated with α\alpha, so α0card(X)=κ\alpha_0 \in \card{X} = \kappa.

(4) ⇒ (5) Let κ\kappa and κ\kappa' be cardinals with respective initial ordinals ακ\alpha \in \kappa and ακ\alpha' \in \kappa'. Then αα    κκ\alpha \le \alpha' \implies \kappa \le \kappa' and αα    κκ\alpha' \le \alpha \implies \kappa' \le \kappa. Since the ordinals are totally-ordered, so are the cardinals.

(5) ⇒ (1) Let α\alpha be the Hartogs number of a set XX. Since α≰X\alpha \not\le X, by totality, there must be an injective function f:Xαf : X \to \alpha. But then f(X)f(X), being a subset of an ordinal, must be well-ordered, which induces a well-order on XX where x<y    f(x)<f(y)x < y \iff f(x) < f(y). \square