Sperner's theorem (original) (raw)

From Wikipedia, the free encyclopedia

Theorem on the largest antichain of sets

Sperner's theorem, in discrete mathematics, describes the largest possible families of finite sets none of which contain any other sets in the family. It is one of the central results in extremal set theory. It is named after Emanuel Sperner, who published it in 1928.

This result is sometimes called Sperner's lemma, but the name "Sperner's lemma" also refers to an unrelated result on coloring triangulations. To differentiate the two results, the result on the size of a Sperner family is now more commonly known as Sperner's theorem.

A family of sets in which none of the sets is a strict subset of another is called a Sperner family, or an antichain of sets, or a clutter. For example, the family of _k_-element subsets of an _n_-element set is a Sperner family. No set in this family can contain any of the others, because a containing set has to be strictly bigger than the set it contains, and in this family all sets have equal size. The value of k that makes this example have as many sets as possible is n/2 if n is even, or either of the nearest integers to n/2 if n is odd. For this choice, the number of sets in the family is ( n ⌊ n / 2 ⌋ ) {\displaystyle {\tbinom {n}{\lfloor n/2\rfloor }}} {\displaystyle {\tbinom {n}{\lfloor n/2\rfloor }}}.

Sperner's theorem states that these examples are the largest possible Sperner families over an _n_-element set. Formally, the theorem states that,

  1. for every Sperner family S whose union has a total of n elements, | S | ≤ ( n ⌊ n / 2 ⌋ ) , {\displaystyle |S|\leq {\binom {n}{\lfloor n/2\rfloor }},} {\displaystyle |S|\leq {\binom {n}{\lfloor n/2\rfloor }},} and
  2. equality holds if and only if S consists of all subsets of an _n_-element set that have size ⌊ n / 2 ⌋ {\displaystyle \lfloor n/2\rfloor } {\displaystyle \lfloor n/2\rfloor } or all that have size ⌈ n / 2 ⌉ {\displaystyle \lceil n/2\rceil } {\displaystyle \lceil n/2\rceil }.

Sperner's theorem can also be stated in terms of partial order width. The family of all subsets of an _n_-element set (its power set) can be partially ordered by set inclusion; in this partial order, two distinct elements are said to be incomparable when neither of them contains the other. The width of a partial order is the largest number of elements in an antichain, a set of pairwise incomparable elements. Translating this terminology into the language of sets, an antichain is just a Sperner family, and the width of the partial order is the maximum number of sets in a Sperner family. Thus, another way of stating Sperner's theorem is that the width of the inclusion order on a power set is ( n ⌊ n / 2 ⌋ ) {\displaystyle {\binom {n}{\lfloor n/2\rfloor }}} {\displaystyle {\binom {n}{\lfloor n/2\rfloor }}}.

A graded partially ordered set is said to have the Sperner property when one of its largest antichains is formed by a set of elements that all have the same rank. In this terminology, Sperner's theorem states that the partially ordered set of all subsets of a finite set, partially ordered by set inclusion, has the Sperner property.

There are many proofs of Sperner's theorem, each leading to different generalizations (see Anderson (1987)).

The following proof is due to Lubell (1966). Let sk denote the number of _k_-sets in S. For all 0 ≤ kn,

( n ⌊ n / 2 ⌋ ) ≥ ( n k ) {\displaystyle {n \choose \lfloor {n/2}\rfloor }\geq {n \choose k}} {\displaystyle {n \choose \lfloor {n/2}\rfloor }\geq {n \choose k}}

and, thus,

s k ( n ⌊ n / 2 ⌋ ) ≤ s k ( n k ) . {\displaystyle {s_{k} \over {n \choose \lfloor {n/2}\rfloor }}\leq {s_{k} \over {n \choose k}}.} {\displaystyle {s_{k} \over {n \choose \lfloor {n/2}\rfloor }}\leq {s_{k} \over {n \choose k}}.}

Since S is an antichain, we can sum over the above inequality from k = 0 to n and then apply the LYM inequality to obtain

∑ k = 0 n s k ( n ⌊ n / 2 ⌋ ) ≤ ∑ k = 0 n s k ( n k ) ≤ 1 , {\displaystyle \sum _{k=0}^{n}{s_{k} \over {n \choose \lfloor {n/2}\rfloor }}\leq \sum _{k=0}^{n}{s_{k} \over {n \choose k}}\leq 1,} {\displaystyle \sum _{k=0}^{n}{s_{k} \over {n \choose \lfloor {n/2}\rfloor }}\leq \sum _{k=0}^{n}{s_{k} \over {n \choose k}}\leq 1,}

which means

| S | = ∑ k = 0 n s k ≤ ( n ⌊ n / 2 ⌋ ) . {\displaystyle |S|=\sum _{k=0}^{n}s_{k}\leq {n \choose \lfloor {n/2}\rfloor }.} {\displaystyle |S|=\sum _{k=0}^{n}s_{k}\leq {n \choose \lfloor {n/2}\rfloor }.}

This completes the proof of part 1.

To have equality, all the inequalities in the preceding proof must be equalities. Since

( n ⌊ n / 2 ⌋ ) = ( n k ) {\displaystyle {n \choose \lfloor {n/2}\rfloor }={n \choose k}} {\displaystyle {n \choose \lfloor {n/2}\rfloor }={n \choose k}}

if and only if k = ⌊ n / 2 ⌋ {\displaystyle k=\lfloor {n/2}\rfloor } {\displaystyle k=\lfloor {n/2}\rfloor } or ⌈ n / 2 ⌉ , {\displaystyle \lceil {n/2}\rceil ,} {\displaystyle \lceil {n/2}\rceil ,} we conclude that equality implies that S consists only of sets of sizes ⌊ n / 2 ⌋ {\displaystyle \lfloor {n/2}\rfloor } {\displaystyle \lfloor {n/2}\rfloor } or ⌈ n / 2 ⌉ . {\displaystyle \lceil {n/2}\rceil .} {\displaystyle \lceil {n/2}\rceil .} For even n that concludes the proof of part 2.

For odd n there is more work to do, which we omit here because it is complicated. See Anderson (1987), pp. 3–4.

There are several generalizations of Sperner's theorem for subsets of P ( E ) , {\displaystyle {\mathcal {P}}(E),} {\displaystyle {\mathcal {P}}(E),} the poset of all subsets of E.

A chain is a subfamily { S 0 , S 1 , … , S r } ⊆ P ( E ) {\displaystyle \{S_{0},S_{1},\dots ,S_{r}\}\subseteq {\mathcal {P}}(E)} {\displaystyle \{S_{0},S_{1},\dots ,S_{r}\}\subseteq {\mathcal {P}}(E)} that is totally ordered, i.e., S 0 ⊂ S 1 ⊂ ⋯ ⊂ S r {\displaystyle S_{0}\subset S_{1}\subset \dots \subset S_{r}} {\displaystyle S_{0}\subset S_{1}\subset \dots \subset S_{r}} (possibly after renumbering). The chain has r + 1 members and length r. An r-chain-free family (also called an r-family) is a family of subsets of E that contains no chain of length r. Erdős (1945) proved that the largest size of an _r_-chain-free family is the sum of the r largest binomial coefficients ( n i ) {\displaystyle {\binom {n}{i}}} {\displaystyle {\binom {n}{i}}}. The case r = 1 is Sperner's theorem.

_p_-compositions of a set

[edit]

In the set P ( E ) p {\displaystyle {\mathcal {P}}(E)^{p}} {\displaystyle {\mathcal {P}}(E)^{p}} of _p_-tuples of subsets of E, we say a _p_-tuple ( S 1 , … , S p ) {\displaystyle (S_{1},\dots ,S_{p})} {\displaystyle (S_{1},\dots ,S_{p})} is ≤ another one, ( T 1 , … , T p ) , {\displaystyle (T_{1},\dots ,T_{p}),} {\displaystyle (T_{1},\dots ,T_{p}),} if S i ⊆ T i {\displaystyle S_{i}\subseteq T_{i}} {\displaystyle S_{i}\subseteq T_{i}} for each i = 1,2,...,p. We call ( S 1 , … , S p ) {\displaystyle (S_{1},\dots ,S_{p})} {\displaystyle (S_{1},\dots ,S_{p})} a p-composition of E if the sets S 1 , … , S p {\displaystyle S_{1},\dots ,S_{p}} {\displaystyle S_{1},\dots ,S_{p}} form a partition of E. Meshalkin (1963) proved that the maximum size of an antichain of _p_-compositions is the largest _p_-multinomial coefficient ( n n 1 n 2 … n p ) , {\displaystyle {\binom {n}{n_{1}\ n_{2}\ \dots \ n_{p}}},} {\displaystyle {\binom {n}{n_{1}\ n_{2}\ \dots \ n_{p}}},} that is, the coefficient in which all n i are as nearly equal as possible (i.e., they differ by at most 1). Meshalkin proved this by proving a generalized LYM inequality.

The case p = 2 is Sperner's theorem, because then S 2 = E ∖ S 1 {\displaystyle S_{2}=E\setminus S_{1}} {\displaystyle S_{2}=E\setminus S_{1}} and the assumptions reduce to the sets S 1 {\displaystyle S_{1}} {\displaystyle S_{1}} being a Sperner family.

No long chains in _p_-compositions of a set

[edit]

Beck & Zaslavsky (2002) combined the Erdös and Meshalkin theorems by adapting Meshalkin's proof of his generalized LYM inequality. They showed that the largest size of a family of _p_-compositions such that the sets in the _i_-th position of the _p_-tuples, ignoring duplications, are _r_-chain-free, for every i = 1 , 2 , … , p − 1 {\displaystyle i=1,2,\dots ,p-1} {\displaystyle i=1,2,\dots ,p-1} (but not necessarily for i = p), is not greater than the sum of the r p − 1 {\displaystyle r^{p-1}} {\displaystyle r^{p-1}} largest _p_-multinomial coefficients.

Projective geometry analog

[edit]

In the finite projective geometry PG(d, F q) of dimension d over a finite field of order q, let L ( p , F q ) {\displaystyle {\mathcal {L}}(p,F_{q})} {\displaystyle {\mathcal {L}}(p,F_{q})} be the family of all subspaces. When partially ordered by set inclusion, this family is a lattice. Rota & Harper (1971) proved that the largest size of an antichain in L ( p , F q ) {\displaystyle {\mathcal {L}}(p,F_{q})} {\displaystyle {\mathcal {L}}(p,F_{q})} is the largest Gaussian coefficient [ d + 1 k ] ; {\displaystyle {\begin{bmatrix}d+1\\k\end{bmatrix}};} {\displaystyle {\begin{bmatrix}d+1\\k\end{bmatrix}};} this is the projective-geometry analog, or q-analog, of Sperner's theorem.

They further proved that the largest size of an _r_-chain-free family in L ( p , F q ) {\displaystyle {\mathcal {L}}(p,F_{q})} {\displaystyle {\mathcal {L}}(p,F_{q})} is the sum of the r largest Gaussian coefficients. Their proof is by a projective analog of the LYM inequality.

No long chains in _p_-compositions of a projective space

[edit]

Beck & Zaslavsky (2003) obtained a Meshalkin-like generalization of the Rota–Harper theorem. In PG(d, F q), a Meshalkin sequence of length p is a sequence ( A 1 , … , A p ) {\displaystyle (A_{1},\ldots ,A_{p})} {\displaystyle (A_{1},\ldots ,A_{p})} of projective subspaces such that no proper subspace of PG(d, F q) contains them all and their dimensions sum to d − p + 1 {\displaystyle d-p+1} {\displaystyle d-p+1}. The theorem is that a family of Meshalkin sequences of length p in PG(d, F q), such that the subspaces appearing in position i of the sequences contain no chain of length r for each i = 1 , 2 , … , p − 1 , {\displaystyle i=1,2,\dots ,p-1,} {\displaystyle i=1,2,\dots ,p-1,} is not more than the sum of the largest r p − 1 {\displaystyle r^{p-1}} {\displaystyle r^{p-1}} of the quantities

[ d + 1 n 1 n 2 … n p ] q s 2 ( n 1 , … , n p ) , {\displaystyle {\begin{bmatrix}d+1\\n_{1}\ n_{2}\ \dots \ n_{p}\end{bmatrix}}q^{s_{2}(n_{1},\ldots ,n_{p})},} {\displaystyle {\begin{bmatrix}d+1\\n_{1}\ n_{2}\ \dots \ n_{p}\end{bmatrix}}q^{s_{2}(n_{1},\ldots ,n_{p})},}

where [ d + 1 n 1 n 2 … n p ] {\displaystyle {\begin{bmatrix}d+1\\n_{1}\ n_{2}\ \dots \ n_{p}\end{bmatrix}}} {\displaystyle {\begin{bmatrix}d+1\\n_{1}\ n_{2}\ \dots \ n_{p}\end{bmatrix}}} (in which we assume that d + 1 = n 1 + ⋯ + n p {\displaystyle d+1=n_{1}+\cdots +n_{p}} {\displaystyle d+1=n_{1}+\cdots +n_{p}}) denotes the _p_-Gaussian coefficient

[ d + 1 n 1 ] [ d + 1 − n 1 n 2 ] ⋯ [ d + 1 − ( n 1 + ⋯ + n p − 1 ) n p ] {\displaystyle {\begin{bmatrix}d+1\\n_{1}\end{bmatrix}}{\begin{bmatrix}d+1-n_{1}\\n_{2}\end{bmatrix}}\cdots {\begin{bmatrix}d+1-(n_{1}+\cdots +n_{p-1})\\n_{p}\end{bmatrix}}} {\displaystyle {\begin{bmatrix}d+1\\n_{1}\end{bmatrix}}{\begin{bmatrix}d+1-n_{1}\\n_{2}\end{bmatrix}}\cdots {\begin{bmatrix}d+1-(n_{1}+\cdots +n_{p-1})\\n_{p}\end{bmatrix}}}

and

s 2 ( n 1 , … , n p ) := n 1 n 2 + n 1 n 3 + n 2 n 3 + n 1 n 4 + ⋯ + n p − 1 n p , {\displaystyle s_{2}(n_{1},\ldots ,n_{p}):=n_{1}n_{2}+n_{1}n_{3}+n_{2}n_{3}+n_{1}n_{4}+\cdots +n_{p-1}n_{p},} {\displaystyle s_{2}(n_{1},\ldots ,n_{p}):=n_{1}n_{2}+n_{1}n_{3}+n_{2}n_{3}+n_{1}n_{4}+\cdots +n_{p-1}n_{p},}

the second elementary symmetric function of the numbers n 1 , n 2 , … , n p . {\displaystyle n_{1},n_{2},\dots ,n_{p}.} {\displaystyle n_{1},n_{2},\dots ,n_{p}.}