Closed-subgroup theorem (original) (raw)

Group theory theorem

In mathematics, the closed-subgroup theorem (sometimes referred to as Cartan's theorem) is a theorem in the theory of Lie groups. It states that if H is a closed subgroup of a Lie group G, then H is an embedded Lie group with the smooth structure (and hence the group topology) agreeing with the embedding.[1]''U''%5F⊂%5F'''g'''%5Fsuch%5Fthat%5F''U''%5F×%5F''H''%5F→%5F''G'',%5F%28''X'',%5F''H''%29%5F→%5Fe''X''''H''%5Fis%5Fan%5Fanalytic%5Fbijection%5Fonto%5Fan%5Fopen%5Fneighborhood%5Fof%5F''H''%5Fin%5F''G''-2" title="null">[2][3]One of several results known as Cartan's theorem, it was first published in 1930 by Élie Cartan,[4] who was inspired by John von Neumann's 1929 proof of a special case for groups of linear transformations.[5][6]

Let G be a Lie group with Lie algebra g {\displaystyle {\mathfrak {g}}} {\displaystyle {\mathfrak {g}}}. Now let H be an arbitrary closed subgroup of G. It is necessary to show that H is a smooth embedded submanifold of G. The first step is to identify something that could be the Lie algebra of H, that is, the tangent space of H at the identity. The challenge is that H is not assumed to have any smoothness and therefore it is not clear how one may define its tangent space. To proceed, define the "Lie algebra" h {\displaystyle {\mathfrak {h}}} {\displaystyle {\mathfrak {h}}} of H by the formula h = { X ∣ e t X ∈ H , ∀ t ∈ R } . {\displaystyle {\mathfrak {h}}=\left\{X\mid e^{tX}\in H,\,\,\forall t\in \mathbf {R} \right\}.} {\displaystyle {\mathfrak {h}}=\left\{X\mid e^{tX}\in H,\,\,\forall t\in \mathbf {R} \right\}.}

It is not difficult to show that h {\displaystyle {\mathfrak {h}}} {\displaystyle {\mathfrak {h}}} is a Lie subalgebra of g {\displaystyle {\mathfrak {g}}} {\displaystyle {\mathfrak {g}}}.[7] In particular, h {\displaystyle {\mathfrak {h}}} {\displaystyle {\mathfrak {h}}} is a subspace of g {\displaystyle {\mathfrak {g}}} {\displaystyle {\mathfrak {g}}}, which one might hope to be the tangent space of H at the identity. For this idea to work, however, h {\displaystyle {\mathfrak {h}}} {\displaystyle {\mathfrak {h}}} must be big enough to capture some interesting information about H. If, for example, H were some large subgroup of G but h {\displaystyle {\mathfrak {h}}} {\displaystyle {\mathfrak {h}}} turned out to be zero, h {\displaystyle {\mathfrak {h}}} {\displaystyle {\mathfrak {h}}} would not be helpful.

The key step, then, is to show that h {\displaystyle {\mathfrak {h}}} {\displaystyle {\mathfrak {h}}} actually captures all the elements of H that are sufficiently close to the identity. That is to say, it is necessary to prove the following critical lemma:

**Lemma—Take a small neighborhood U of the origin in g {\displaystyle {\mathfrak {g}}} {\displaystyle {\mathfrak {g}}} such that the exponential map sends U diffeomorphically onto some neighborhood V {\displaystyle V} {\displaystyle V} of the identity in G, and let log: VU be the inverse of the exponential map. Then there is some smaller neighborhood WV such that if h belongs to WH, then log(h) belongs to h {\displaystyle {\mathfrak {h}}} {\displaystyle {\mathfrak {h}}}.[8]

Once this has been established, one can use exponential coordinates on W, that is, writing each gW (not necessarily in H) as g = e X for X = log(g). In these coordinates, the lemma says that X corresponds to a point in H precisely if X belongs to h ⊂ g {\displaystyle {\mathfrak {h}}\subset {\mathfrak {g}}} {\displaystyle {\mathfrak {h}}\subset {\mathfrak {g}}}. That is to say, in exponential coordinates near the identity, H looks like h ⊂ g {\displaystyle {\mathfrak {h}}\subset {\mathfrak {g}}} {\displaystyle {\mathfrak {h}}\subset {\mathfrak {g}}}. Since h {\displaystyle {\mathfrak {h}}} {\displaystyle {\mathfrak {h}}} is just a subspace of g {\displaystyle {\mathfrak {g}}} {\displaystyle {\mathfrak {g}}}, this means that h ⊂ g {\displaystyle {\mathfrak {h}}\subset {\mathfrak {g}}} {\displaystyle {\mathfrak {h}}\subset {\mathfrak {g}}} is just like RkRn, with k = dim ⁡ ( h ) {\displaystyle k=\dim({\mathfrak {h}})} {\displaystyle k=\dim({\mathfrak {h}})} and n = dim ⁡ ( g ) {\displaystyle n=\dim({\mathfrak {g}})} {\displaystyle n=\dim({\mathfrak {g}})}. Thus, we have exhibited a "slice coordinate system" in which HG looks locally like RkRn, which is the condition for an embedded submanifold.[9]

It is worth noting that Rossmann shows that for any subgroup H of G (not necessarily closed), the Lie algebra h {\displaystyle {\mathfrak {h}}} {\displaystyle {\mathfrak {h}}} of H is a Lie subalgebra of g {\displaystyle {\mathfrak {g}}} {\displaystyle {\mathfrak {g}}}.[10] Rossmann then goes on to introduce coordinates[11] on H that make the identity component of H into a Lie group. It is important to note, however, that the topology on H coming from these coordinates is not the subset topology. That is to say, the identity component of H is an immersed submanifold of G but not an embedded submanifold.

In particular, the lemma stated above does not hold if H is not closed.

Example of a non-closed subgroup

[edit]

The torus G. Imagine a bent helix laid out on the surface picturing H. If a = p_⁄_q in lowest terms, the helix will close up on itself at (1, 1) after p rotations in φ and q rotations in θ. If a is irrational, the helix winds indefinitely.

For an example of a subgroup that is not an embedded Lie subgroup, consider the torus and an "irrational winding of the torus". G = T 2 = { ( e 2 π i θ 0 0 e 2 π i ϕ ) | θ , ϕ ∈ R } , {\displaystyle G=\mathbb {T} ^{2}=\left\{\left.{\begin{pmatrix}e^{2\pi i\theta }&0\\0&e^{2\pi i\phi }\end{pmatrix}}\right|\theta ,\phi \in \mathbf {R} \right\},} {\displaystyle G=\mathbb {T} ^{2}=\left\{\left.{\begin{pmatrix}e^{2\pi i\theta }&0\\0&e^{2\pi i\phi }\end{pmatrix}}\right|\theta ,\phi \in \mathbf {R} \right\},}and its subgroup H = { ( e 2 π i θ 0 0 e 2 π i a θ ) | θ ∈ R } with Lie algebra h = { ( i θ 0 0 i a θ ) | θ ∈ R } , {\displaystyle H=\left\{\left.{\begin{pmatrix}e^{2\pi i\theta }&0\\0&e^{2\pi ia\theta }\end{pmatrix}}\right|\theta \in \mathbf {R} \right\}{\text{with Lie algebra }}{\mathfrak {h}}=\left\{\left.{\begin{pmatrix}i\theta &0\\0&ia\theta \end{pmatrix}}\right|\theta \in \mathbf {R} \right\},} {\displaystyle H=\left\{\left.{\begin{pmatrix}e^{2\pi i\theta }&0\\0&e^{2\pi ia\theta }\end{pmatrix}}\right|\theta \in \mathbf {R} \right\}{\text{with Lie algebra }}{\mathfrak {h}}=\left\{\left.{\begin{pmatrix}i\theta &0\\0&ia\theta \end{pmatrix}}\right|\theta \in \mathbf {R} \right\},}with a irrational. Then H is dense in G and hence not closed.[12] In the relative topology, a small open subset of H is composed of infinitely many almost parallel line segments on the surface of the torus. This means that H is not locally path connected. In the group topology, the small open sets are single line segments on the surface of the torus and H is locally path connected.

The example shows that for some groups H one can find points in an arbitrarily small neighborhood U in the relative topology _τ_r of the identity that are exponentials of elements of h, yet they cannot be connected to the identity with a path staying in U. The group (H, _τ_r) is not a Lie group. While the map exp : h → (H, τ_r) is an analytic bijection, its inverse is not continuous. That is, if Uh corresponds to a small open interval −_ε < θ < ε, there is no open V ⊂ (H, _τ_r) with log(V) ⊂ U due to the appearance of the sets V. However, with the group topology _τ_g, (H, _τ_g) is a Lie group. With this topology the injection ι : (H, _τ_g) → G is an analytic injective immersion, but not a homeomorphism, hence not an embedding. There are also examples of groups H for which one can find points in an arbitrarily small neighborhood (in the relative topology) of the identity that are not exponentials of elements of h.[14] For closed subgroups this is not the case as the proof below of the theorem shows.

Because of the conclusion of the theorem, some authors chose to define linear Lie groups or matrix Lie groups as closed subgroups of GL(n, R) or GL(n, C).[15] In this setting, one proves that every element of the group sufficiently close to the identity is the exponential of an element of the Lie algebra.[8] (The proof is practically identical to the proof of the closed subgroup theorem presented below.) It follows every closed subgroup is an embedded submanifold of GL(n, C)[16]

The homogeneous space construction theorem—If HG is a closed Lie subgroup, then G/H, the left coset space, has a unique real-analytic manifold structure such that the quotient map π:GG/H is an analytic submersion. The left action given by _g_1 ⋅ (g_2_H) = (_g_1_g_2)H turns G/H into a homogeneous _G_-space.

The closed subgroup theorem now simplifies the hypotheses considerably, a priori widening the class of homogeneous spaces. Every closed subgroup yields a homogeneous space.

In a similar way, the closed subgroup theorem simplifies the hypothesis in the following theorem.

If X is a set with transitive group action and the isotropy group or stabilizer of a point xX is a closed Lie subgroup, then X has a unique smooth manifold structure such that the action is smooth.

Conditions for being closed

[edit]

A few sufficient conditions for HG being closed, hence an embedded Lie group, are given below.

An embedded Lie subgroup HG is closed[23] so a subgroup is an embedded Lie subgroup if and only if it is closed. Equivalently, H is an embedded Lie subgroup if and only if its group topology equals its relative topology.[24]

John von Neumann in 1929 proved the theorem in the case of matrix groups as given here. He was prominent in many areas, including quantum mechanics, set theory and the foundations of mathematics.

The proof is given for matrix groups with G = GL(n, R) for concreteness and relative simplicity, since matrices and their exponential mapping are easier concepts than in the general case. Historically, this case was proven first, by John von Neumann in 1929, and inspired Cartan to prove the full closed subgroup theorem in 1930.[5][6] The proof for general G is formally identical,[25] except that elements of the Lie algebra are left invariant vector fields on G and the exponential mapping is the time one flow of the vector field. If HG with G closed in GL(n, R), then H is closed in GL(n, R), so the specialization to GL(n, R) instead of arbitrary G ⊂ GL(n, R) matters little.

Proof of the key lemma

[edit]

We begin by establishing the key lemma stated in the "overview" section above.

Endow g with an inner product (e.g., the Hilbert–Schmidt inner product), and let h be the Lie algebra of H defined as h = {X ∈ M_n_(R) = g | e tXHtR}. Let s = {Sg | (S, T) = 0 ∀Th}, the orthogonal complement of h. Then g decomposes as the direct sum g = sh, so each Xg is uniquely expressed as X = S + T with Ss, Th.

Define a map Φ : g → GL(n, R) by (S, T) ↦ e S e T. Expand the exponentials, Φ ( S , T ) = e t S e t T = I + t S + t T + O ( t 2 ) , {\displaystyle \Phi (S,T)=e^{tS}e^{tT}=I+tS+tT+O(t^{2}),} {\displaystyle \Phi (S,T)=e^{tS}e^{tT}=I+tS+tT+O(t^{2}),}and the pushforward or differential at 0, Φ∗(S, T) = ⁠d/_dt_⁠Φ(tS, tT)|t = 0 is seen to be S + T, i.e. Φ∗ = Id, the identity. The hypothesis of the inverse function theorem is satisfied with Φ analytic, and thus there are open sets _U_1 ⊂ g, _V_1 ⊂ GL(n, R) with 0 ∈ _U_1 and I ∈ _V_1 such that Φ is a real-analytic bijection from _U_1 to _V_1 with analytic inverse. It remains to show that _U_1 and _V_1 contain open sets U and V such that the conclusion of the theorem holds.

Consider a countable neighborhood basis Β at 0 ∈ g, linearly ordered by reverse inclusion with _B_1 ⊂ _U_1.[a] Suppose for the purpose of obtaining a contradiction that for all i, Φ(B i) ∩ H contains an element h i that is not on the form h i = e T i, T ih. Then, since Φ is a bijection on the B i, there is a unique sequence X i = S i + T i, with 0 ≠ S is and T ih such that X iB i converging to 0 because Β is a neighborhood basis, with e S i_e_T i = h i. Since e T iH and h iH, e_S_iH as well.

Normalize the sequence in s, Y i = ⁠S i/||S i||⁠. It takes its values in the unit sphere in s and since it is compact, there is a convergent subsequence converging to Ys.[26] The index i henceforth refers to this subsequence. It will be shown that e tYH, ∀tR. Fix t and choose a sequence m i of integers such that m i ||S i|| → t as i → ∞. For example, m i such that m i ||S i|| ≤ t ≤ (m i + 1) ||S i|| will do, as S i → 0. Then ( e S i ) m i = e m i S i = e m i ‖ S i ‖ Y i → e t Y . {\displaystyle (e^{S_{i}})^{m_{i}}=e^{m_{i}S_{i}}=e^{m_{i}\|S_{i}\|Y_{i}}\rightarrow e^{tY}.} {\displaystyle (e^{S_{i}})^{m_{i}}=e^{m_{i}S_{i}}=e^{m_{i}\|S_{i}\|Y_{i}}\rightarrow e^{tY}.}

Since H is a group, the left hand side is in H for all i. Since H is closed, e tYH, ∀t,[27] hence Yh. This is a contradiction. Hence, for some i the sets U = Β_i_ and V = Φ(Β_i_) satisfy e _U_∩h = HV and the exponential restricted to the open set (Uh) ⊂ h is in analytic bijection with the open set Φ(U) ∩ HH. This proves the lemma.

Proof of the theorem

[edit]

For ji, the image in H of B j under Φ form a neighborhood basis at I. This is, by the way it is constructed, a neighborhood basis both in the group topology and the relative topology. Since multiplication in G is analytic, the left and right translates of this neighborhood basis by a group element gG gives a neighborhood basis at g. These bases restricted to H gives neighborhood bases at all hH. The topology generated by these bases is the relative topology. The conclusion is that the relative topology is the same as the group topology.

Next, construct coordinate charts on H. First define _φ_1 : e(U) ⊂ Gg, g ↦ log(g). This is an analytic bijection with analytic inverse. Furthermore, if hH, then _φ_1(h) ∈ h. By fixing a basis for g = hs and identifying g with Rn, then in these coordinates _φ_1(h) = (_x_1(h), ..., x m(h), 0, ..., 0), where m is the dimension of h. This shows that (e U, _φ_1) is a slice chart. By translating the charts obtained from the countable neighborhood basis used above one obtains slice charts around every point in H. This shows that H is an embedded submanifold of G.

Moreover, multiplication m, and inversion i in H are analytic since these operations are analytic in G and restriction to a submanifold (embedded or immersed) with the relative topology again yield analytic operations m : H × HG and i : H × HG.[28] But since H is embedded, m : H × HH and i : H × HH are analytic as well.[29]

  1. ^ For this one can choose open balls, Β = {B k | diam(B k) = ⁠1/k + _m_⁠, kN} for some large enough m such that _B_1 ⊂ _U_1. Here the metric obtained from the Hilbert–Schmidt inner product is used.

  2. ^ Lee 2003, Theorem 20.10. Lee states and proves this theorem in all generality.

  3. ''U''%5F⊂%5F'''g'''%5Fsuch%5Fthat%5F''U''%5F×%5F''H''%5F→%5F''G'',%5F%28''X'',%5F''H''%29%5F→%5Fe''X''''H''%5Fis%5Fan%5Fanalytic%5Fbijection%5Fonto%5Fan%5Fopen%5Fneighborhood%5Fof%5F''H''%5Fin%5F''G''%5F2-0" title="null">^ Rossmann 2002, Theorem 1, Section 2.7 Rossmann states the theorem for linear groups. The statement is that there is an open subset Ug such that U × HG, (X, H) → e_X_ H is an analytic bijection onto an open neighborhood of H in G.

  4. ^ Hall 2015, For linear groups, Hall proves a similar result in Corollary 3.45.

  5. ^ Cartan 1930, § 26.

  6. ^ a b von Neumann 1929.

  7. ^ a b Bochner 1958.

  8. ^ Hall 2015, Theorem 3.20.

  9. ^ a b Hall 2015, Theorem 3.42.

  10. ^ Lee 2003, Chapter 5.

  11. ^ Rossmann 2002, Chapter 2, Proposition 1 and Corollary 7.

  12. ^ Rossmann 2002, Section 2.3.

  13. ^ Lee 2003, Example 7.3.

  14. ^ Rossmann 2002.

  15. ^ E.g. Hall 2015. See definition in Chapter 1.

  16. ^ Hall 2015, Corollary 3.45.

  17. ^ Rossmann 2002, Problem 1. Section 2.7.

  18. ^ Rossmann 2002, Problem 3. Section 2.7.

  19. ^ Rossmann 2002, Problem 4. Section 2.7.

  20. ^ Rossmann 2002, Problem 5. Section 2.7.

  21. ^ Hall 2015, The result follows from Theorem 5.6.

  22. ^ Hall 2015, Exercise 14 in Chapter 5.

  23. ^ Lee 2003, Corollary 15.30 (or 8.30 in 978-0-387-95448-6 978-0-387-21752-9).

  24. ^ Rossmann 2002, Problem 2. Section 2.7.

  25. ^ See for instance Lee 2003 Chapter 21

  26. ^ Willard 1970, By problem 17G, s is sequentially compact, meaning every sequence has a convergent subsequence.

  27. ^ Willard 1970, Corollary 10.5.

  28. ^ Lee 2003, Proposition 8.22.

  29. ^ Lee 2003, Corollary 8.25.