Hurwitz's theorem (composition algebras) (original) (raw)
Non-associative algebras with positive-definite quadratic form
In mathematics, Hurwitz's theorem is a theorem of Adolf Hurwitz (1859–1919), published posthumously in 1923, solving the Hurwitz problem for finite-dimensional unital real non-associative algebras endowed with a nondegenerate positive-definite quadratic form. The theorem states that if the quadratic form defines a homomorphism into the positive real numbers on the non-zero part of the algebra, then the algebra must be isomorphic to the real numbers, the complex numbers, the quaternions, or the octonions, and that there are no other possibilities. Such algebras, sometimes called Hurwitz algebras, are examples of composition algebras.
The theory of composition algebras has subsequently been generalized to arbitrary quadratic forms and arbitrary fields.[1] Hurwitz's theorem implies that multiplicative formulas for sums of squares can only occur in 1, 2, 4 and 8 dimensions, a result originally proved by Hurwitz in 1898. It is a special case of the Hurwitz problem, solved also in Radon (1922). Subsequent proofs of the restrictions on the dimension have been given by Eckmann (1943) using the representation theory of finite groups and by Lee (1948) and Chevalley (1954) using Clifford algebras. Hurwitz's theorem has been applied in algebraic topology to problems on vector fields on spheres and the homotopy groups of the classical groups[2] and in quantum mechanics to the classification of simple Jordan algebras.[3]
Euclidean Hurwitz algebras
[edit]
A Hurwitz algebra or composition algebra is a finite-dimensional not necessarily associative algebra A with identity endowed with a nondegenerate quadratic form q such that q(a b) = q(a) q(b). If the underlying coefficient field is the reals and q is positive-definite, so that (a, b) = 1/2[q(a + b) − q(a) − q(b)] is an inner product, then A is called a Euclidean Hurwitz algebra or (finite-dimensional) normed division algebra.[4]
If A is a Euclidean Hurwitz algebra and a is in A, define the involution and right and left multiplication operators by
a ∗ = − a + 2 ( a , 1 ) 1 , L ( a ) b = a b , R ( a ) b = b a . {\displaystyle a^{*}=-a+2(a,1)1,\quad L(a)b=ab,\quad R(a)b=ba.}
Evidently the involution has period two and preserves the inner product and norm. These operators have the following properties:
- the involution is an antiautomorphism, i.e. (ab)* = b*a*
- aa* = ‖_a_‖2 1 = a*a
- L(a*) = L(a)*, R(a*) = R(a)*, so that the involution on the algebra corresponds to taking adjoints
- Re (ab) = Re (ba) if Re x = (x + x*)/2 = (x, 1)1
- Re (ab)c = Re a(bc)
- L(_a_2) = L(a)2, R(_a_2) = R(a)2, so that A is an alternative algebra.
These properties are proved starting from the polarized version of the identity (ab, ab) = (a, a)(b, b):
2 ( a , b ) ( c , d ) = ( a c , b d ) + ( a d , b c ) . {\displaystyle \displaystyle {2(a,b)(c,d)=(ac,bd)+(ad,bc).}}
Setting b = 1 or d = 1 yields L(a*) = L(a)* and R(c*) = R(c)*.
Hence Re(ab) = (ab, 1)1 = (a, b*)1 = (ba, 1)1 = Re(ba).
Similarly Re (ab)c = ((ab)c,1)1 = (ab, c*)1 = (b, a* c*)1 = (bc,a*)1 = (a(bc),1)1 = Re a(bc).
Hence ((ab)*, c) = (ab, c*) = (b, a*c*) = (1, b*(a*c*)) = (1, (b*a*)c*) = (b*a*, c), so that (ab)* = b*a*.
By the polarized identity ‖_a_‖2 (c, d) = (ac, ad) = (a* (ac), d) so L(a*) L(a) = L(‖_a_‖2). Applied to 1 this gives a*a = ‖_a_‖2 1. Replacing a by a* gives the other identity.
Substituting the formula for a* in L(a*) L(a) = L(a*a) gives L(a)2 = L(_a_2). The formula R(_a_2) = R(a)2 is proved analogously.
It is routine to check that the real numbers R, the complex numbers C and the quaternions H are examples of associative Euclidean Hurwitz algebras with their standard norms and involutions. There are moreover natural inclusions R ⊂ C ⊂ H.
Analysing such an inclusion leads to the Cayley–Dickson construction, formalized by A.A. Albert. Let A be a Euclidean Hurwitz algebra and B a proper unital subalgebra, so a Euclidean Hurwitz algebra in its own right. Pick a unit vector j in A orthogonal to B. Since (j, 1) = 0, it follows that j* = −j and hence _j_2 = −1. Let C be subalgebra generated by B and j. It is unital and is again a Euclidean Hurwitz algebra. It satisfies the following Cayley–Dickson multiplication laws:
C = B ⊕ B j , ( a + b j ) ∗ = a ∗ − b j , ( a + b j ) ( c + d j ) = ( a c − d ∗ b ) + ( b c ∗ + d a ) j . {\displaystyle \displaystyle {C=B\oplus Bj,\,\,\,(a+bj)^{*}=a^{*}-bj,\,\,\,(a+bj)(c+dj)=(ac-d^{*}b)+(bc^{*}+da)j.}}
B and Bj are orthogonal, since j is orthogonal to B. If a is in B, then j a = a* j, since by orthogonal 0 = 2(j, a*) = ja − a*j. The formula for the involution follows. To show that B ⊕ B j is closed under multiplication Bj = jB. Since Bj is orthogonal to 1, (bj)* = −bj.
- b(cj) = (cb) j since (b, j) = 0 so that, for x in A, (b(cj), x) = (b( jx), j(cj)) = −(b( jx), c*) = −(cb, ( jx)*) = −((cb) j, x*) = ((cb) j, x).
- ( jc)b = j(bc) taking adjoints above.
- (bj)(cj) = −c*b since (b, cj) = 0, so that, for x in A, ((bj)(cj), x) = −((cj)x*, bj) = (bx*, (cj) j) = −(c*b, x).
Imposing the multiplicativity of the norm on C for a + bj and c + dj gives:
( ‖ a ‖ 2 + ‖ b ‖ 2 ) ( ‖ c ‖ 2 + ‖ d ‖ 2 ) = ‖ a c − d ∗ b ‖ 2 + ‖ b c ∗ + d a ‖ 2 , {\displaystyle \displaystyle {(\|a\|^{2}+\|b\|^{2})(\|c\|^{2}+\|d\|^{2})=\|ac-d^{*}b\|^{2}+\|bc^{*}+da\|^{2},}}
which leads to
( a c , d ∗ b ) = ( b c ∗ , d a ) . {\displaystyle \displaystyle {(ac,d^{*}b)=(bc^{*},da).}}
Hence d(ac) = (da)c, so that B must be associative.
This analysis applies to the inclusion of R in C and C in H. Taking O = H ⊕ H with the product and inner product above gives a noncommutative nonassociative algebra generated by J = (0, 1). This recovers the usual definition of the octonions or Cayley numbers. If A is a Euclidean algebra, it must contain R. If it is strictly larger than R, the argument above shows that it contains C. If it is larger than C, it contains H. If it is larger still, it must contain O. But there the process must stop, because O is not associative. In fact H is not commutative and a(bj) = (ba) j ≠ (ab) j in O.[5]
Theorem. The only Euclidean Hurwitz algebras are the real numbers, the complex numbers, the quaternions and the octonions.
The proofs of Lee (1948) and Chevalley (1954) use Clifford algebras to show that the dimension N of A must be 1, 2, 4 or 8. In fact the operators L(a) with (a, 1) = 0 satisfy L(a)2 = −‖a_‖2 and so form a real Clifford algebra. If a is a unit vector, then L(a) is skew-adjoint with square −_I. So N must be either even or 1 (in which case A contains no unit vectors orthogonal to 1). The real Clifford algebra and its complexification act on the complexification of A, an N-dimensional complex space. If N is even, N − 1 is odd, so the Clifford algebra has exactly two complex irreducible representations of dimension 2_N_/2 − 1. So this power of 2 must divide N. It is easy to see that this implies N can only be 1, 2, 4 or 8.
The proof of Eckmann (1943) uses the representation theory of finite groups, or the projective representation theory of elementary abelian 2-groups, known to be equivalent to the representation theory of real Clifford algebras. Indeed, taking an orthonormal basis e i of the orthogonal complement of 1 gives rise to operators U i = L(e i)satisfying
U i 2 = − I , U i U j = − U j U i ( i ≠ j ) . {\displaystyle U_{i}^{2}=-I,\quad U_{i}U_{j}=-U_{j}U_{i}\,\,(i\neq j).}
This is a projective representation of a direct product of N − 1 groups of order 2. (N is assumed to be greater than 1.) The operators U i by construction are skew-symmetric and orthogonal. In fact Eckmann constructed operators of this type in a slightly different but equivalent way. It is in fact the method originally followed in Hurwitz (1923).[6] Assume that there is a composition law for two forms
( x 1 2 + ⋯ + x N 2 ) ( y 1 2 + ⋯ + y N 2 ) = z 1 2 + ⋯ + z N 2 , {\displaystyle \displaystyle {(x_{1}^{2}+\cdots +x_{N}^{2})(y_{1}^{2}+\cdots +y_{N}^{2})=z_{1}^{2}+\cdots +z_{N}^{2},}}
where z i is bilinear in x and y. Thus
z i = ∑ j = 1 N a i j ( x ) y j {\displaystyle \displaystyle {z_{i}=\sum _{j=1}^{N}a_{ij}(x)y_{j}}}
where the matrix T(x) = (a ij) is linear in x. The relations above are equivalent to
T ( x ) T ( x ) t = x 1 2 + ⋯ + x N 2 . {\displaystyle \displaystyle {T(x)T(x)^{t}=x_{1}^{2}+\cdots +x_{N}^{2}.}}
Writing
T ( x ) = T 1 x 1 + ⋯ + T N x N , {\displaystyle \displaystyle {T(x)=T_{1}x_{1}+\cdots +T_{N}x_{N},}}
the relations become
T i T j t + T j T i t = 2 δ i j I . {\displaystyle \displaystyle {T_{i}T_{j}^{t}+T_{j}T_{i}^{t}=2\delta _{ij}I.}}
Now set V i = (T N)t T i. Thus V N = I and the _V_1, ... , V N − 1 are skew-adjoint, orthogonal satisfying exactly the same relations as the U i's:
V i 2 = − I , V i V j = − V j V i ( i ≠ j ) . {\displaystyle \displaystyle {V_{i}^{2}=-I,\quad V_{i}V_{j}=-V_{j}V_{i}\,\,(i\neq j).}}
Since V i is an orthogonal matrix with square −I on a real vector space, N is even.
Let G be the finite group generated by elements v i such that
v i 2 = ε , v i v j = ε v j v i ( i ≠ j ) , {\displaystyle \displaystyle {v_{i}^{2}=\varepsilon ,\quad v_{i}v_{j}=\varepsilon v_{j}v_{i}\,\,(i\neq j),}}
where ε is central of order 2. The commutator subgroup [G, _G_] is just formed of 1 and ε. If N is odd this coincides with the center while if N is even the center has order 4 with extra elements γ = v_1...v N − 1 and ε_γ. If g in G is not in the center its conjugacy class is exactly g and εg. Thus there are2_N − 1 + 1 conjugacy classes for N odd and 2_N − 1 + 2 for N even. G has | G / [G, G_] | = 2_N − 1 1-dimensional complex representations. The total number of irreducible complex representations is the number of conjugacy classes. So since N is even, there are two further irreducible complex representations. Since the sum of the squares of the dimensions equals |G| and the dimensions divide |G|, the two irreducibles must have dimension 2(N − 2)/2. When N is even, there are two and their dimension must divide the order of the group, so is a power of two, so they must both have dimension 2(N − 2)/2. The space on which the V i's act can be complexified. It will have complex dimension N. It breaks up into some of complex irreducible representations of G, all having dimension 2(N − 2)/2. In particular this dimension is ≤ N, so N is less than or equal to 8. If N = 6, the dimension is 4, which does not divide 6. So N can only be 1, 2, 4 or 8.
Applications to Jordan algebras
[edit]
Let A be a Euclidean Hurwitz algebra and let M n(A) be the algebra of n-by-n matrices over A. It is a unital nonassociative algebra with an involution given by
( x i j ) ∗ = ( x j i ∗ ) . {\displaystyle \displaystyle {(x_{ij})^{*}=(x_{ji}^{*}).}}
The trace Tr(X ) is defined as the sum of the diagonal elements of X and the real-valued trace byTrR(X ) = Re Tr(X ). The real-valued trace satisfies:
Tr R X Y = Tr R Y X , Tr R ( X Y ) Z = Tr R X ( Y Z ) . {\displaystyle \operatorname {Tr} _{\mathbf {R} }XY=\operatorname {Tr} _{\mathbf {R} }YX,\qquad \operatorname {Tr} _{\mathbf {R} }(XY)Z=\operatorname {Tr} _{\mathbf {R} }X(YZ).}
These are immediate consequences of the known identities for n = 1.
In A define the associator by
[ a , b , c ] = a ( b c ) − ( a b ) c . {\displaystyle \displaystyle {[a,b,c]=a(bc)-(ab)c.}}
It is trilinear and vanishes identically if A is associative. Since A is an alternative algebra [a, a, _b_] = 0 and [b, a, _a_] = 0. Polarizing it follows that the associator is antisymmetric in its three entries. Furthermore, if a, b or c lie in R then [a, b, _c_] = 0. These facts imply that _M_3(A) has certain commutation properties. In fact if X is a matrix in _M_3(A) with real entries on the diagonal then
[ X , X 2 ] = a I , {\displaystyle \displaystyle {[X,X^{2}]=aI,}}
with a in A. In fact if Y = [X, X 2], then
y i j = ∑ k , ℓ [ x i k , x k ℓ , x ℓ j ] . {\displaystyle \displaystyle {y_{ij}=\sum _{k,\ell }[x_{ik},x_{k\ell },x_{\ell j}].}}
Since the diagonal entries of X are real, the off-diagonal entries of Y vanish. Each diagonal entry of Y is a sum of two associators involving only off diagonal terms of X. Since the associators are invariant under cyclic permutations, the diagonal entries of Y are all equal.
Let H n(A) be the space of self-adjoint elements in M n(A) with product X ∘Y = 1/2(X Y + Y X) and inner product (X, Y ) = TrR(X Y ).
Theorem. H n(A) is a Euclidean Jordan algebra if A is associative (the real numbers, complex numbers or quaternions) and n ≥ 3 or if A is nonassociative (the octonions) and n = 3.
The exceptional Jordan algebra _H_3(O) is called the Albert algebra after A.A. Albert.
To check that H n(A) satisfies the axioms for a Euclidean Jordan algebra, the real trace defines a symmetric bilinear form with (X, X) = Σ ‖x ij_‖2. So it is an inner product. It satisfies the associativity property (Z_∘_X, Y ) = (X, Z_∘_Y ) because of the properties of the real trace. The main axiom to check is the Jordan condition for the operators L(X) defined by L(X)Y = X ∘_Y:
[ L ( X ) , L ( X 2 ) ] = 0. {\displaystyle \displaystyle {[L(X),L(X^{2})]=0.}}
This is easy to check when A is associative, since M n(A) is an associative algebra so a Jordan algebra with X ∘Y = 1/2(X Y + Y X). When A = O and n = 3 a special argument is required, one of the shortest being due to Freudenthal (1951).[7]
In fact if T is in _H_3(O) with Tr T = 0, then
D ( X ) = T X − X T {\displaystyle \displaystyle {D(X)=TX-XT}}
defines a skew-adjoint derivation of _H_3(O). Indeed,
Tr ( T ( X ( X 2 ) ) − T ( X 2 ( X ) ) ) = Tr T ( a I ) = Tr ( T ) a = 0 , {\displaystyle \operatorname {Tr} (T(X(X^{2}))-T(X^{2}(X)))=\operatorname {Tr} T(aI)=\operatorname {Tr} (T)a=0,}
so that
( D ( X ) , X 2 ) = 0. {\displaystyle (D(X),X^{2})=0.}
Polarizing yields:
( D ( X ) , Y ∘ Z ) + ( D ( Y ) , Z ∘ X ) + ( D ( Z ) , X ∘ Y ) = 0. {\displaystyle (D(X),Y\circ Z)+(D(Y),Z\circ X)+(D(Z),X\circ Y)=0.}
Setting Z = 1 shows that D is skew-adjoint. The derivation property D(X ∘Y) = D(X)∘Y + X_∘_D(Y) follows by this and the associativity property of the inner product in the identity above.
With A and n as in the statement of the theorem, let K be the group of automorphisms of E = H n(A) leaving invariant the inner product. It is a closed subgroup of O(E) so a compact Lie group. Its Lie algebra consists of skew-adjoint derivations. Freudenthal (1951) showed that given X in E there is an automorphism k in K such that k(X) is a diagonal matrix. (By self-adjointness the diagonal entries will be real.) Freudenthal's diagonalization theorem immediately implies the Jordan condition, since Jordan products by real diagonal matrices commute on M n(A) for any non-associative algebra A.
To prove the diagonalization theorem, take X in E. By compactness k can be chosen in K minimizing the sums of the squares of the norms of the off-diagonal terms of k(X ). Since K preserves the sums of all the squares, this is equivalent to maximizing the sums of the squares of the norms of the diagonal terms of k(X ). Replacing X by k X, it can be assumed that the maximum is attained at X. Since the symmetric group S n, acting by permuting the coordinates, lies in K, if X is not diagonal, it can be supposed that _x_12 and its adjoint x_21 are non-zero. Let T be the skew-adjoint matrix with (2, 1) entry a, (1, 2) entry −_a* and 0 elsewhere and let D be the derivation ad T of E. Let k t = exp tD in K. Then only the first two diagonal entries in X(t) = k t X differ from those of X. The diagonal entries are real. The derivative of _x_11(t) at t = 0 is the (1, 1) coordinate of [T, _X_], i.e. a* _x_21 + _x_12 a = 2(_x_21, a). This derivative is non-zero if a = _x_21. On the other hand, the group k t preserves the real-valued trace. Since it can only change _x_11 and _x_22, it preserves their sum. However, on the line x + y = constant, _x_2 + _y_2 has no local maximum (only a global minimum), a contradiction. Hence X must be diagonal.
- ^ See:
- ^ See:
- ^ Jordan, von Neumann & Wigner 1934
- ^ Faraut & Koranyi 1994, p. 82
- ^ Faraut & Koranyi 1994, pp. 81–86
- ^ See:
- Hurwitz 1923, p. 11
- Herstein 1968, pp. 141–144
- ^ See:
- Faraut & Koranyi 1994, pp. 88–91
- Postnikov 1986
- Albert, A. A. (1934), "On a certain algebra of quantum mechanics", Ann. of Math., 35 (1): 65–73, doi:10.2307/1968118, JSTOR 1968118
- Chevalley, C. (1954), The algebraic theory of spinors and Clifford algebras, Columbia University Press
- Eckmann, Beno (1943), "Gruppentheoretischer Beweis des Satzes von Hurwitz–Radon über die Komposition quadratischer Formen", Comment. Math. Helv., 15: 358–366, doi:10.1007/bf02565652, S2CID 123322808
- Eckmann, Beno (1989), "Hurwitz–Radon matrices and periodicity modulo 8", Enseign. Math., 35: 77–91, archived from the original on 2013-06-16
- Eckmann, Beno (1999), "Topology, algebra, analysis—relations and missing links", Notices Amer. Math. Soc., 46: 520–527
- Faraut, J.; Koranyi, A. (1994), Analysis on symmetric cones, Oxford Mathematical Monographs, Oxford University Press, ISBN 978-0198534778
- Freudenthal, Hans (1951), Oktaven, Ausnahmegruppen und Oktavengeometrie, Mathematisch Instituut der Rijksuniversiteit te Utrecht
- Freudenthal, Hans (1985), "Oktaven, Ausnahmegruppen und Oktavengeometrie", Geom. Dedicata, 19: 7–63, doi:10.1007/bf00233101, S2CID 121496094 (reprint of 1951 article)
- Herstein, I. N. (1968), Noncommutative rings, Carus Mathematical Monographs, vol. 15, Mathematical Association of America, ISBN 978-0883850152
- Hurwitz, A. (1898), "Über die Composition der quadratischen Formen von beliebig vielen Variabeln", Goett. Nachr.: 309–316
- Hurwitz, A. (1923), "Über die Komposition der quadratischen Formen", Math. Ann., 88 (1–2): 1–25, doi:10.1007/bf01448439, S2CID 122147399
- Jacobson, N. (1968), Structure and representations of Jordan algebras, American Mathematical Society Colloquium Publications, vol. 39, American Mathematical Society
- Jordan, P.; von Neumann, J.; Wigner, E. (1934), "On an algebraic generalization of the quantum mechanical formalism", Ann. of Math., 35 (1): 29–64, doi:10.2307/1968117, JSTOR 1968117
- Lam, Tsit-Yuen (2005), Introduction to Quadratic Forms over Fields, Graduate Studies in Mathematics, vol. 67, American Mathematical Society, ISBN 978-0-8218-1095-8, MR 2104929, Zbl 1068.11023
- Lee, H. C. (1948), "Sur le théorème de Hurwitz-Radon pour la composition des formes quadratiques", Comment. Math. Helv., 21: 261–269, doi:10.1007/bf02568038, S2CID 121079375, archived from the original on 2014-05-03
- Porteous, I.R. (1969), Topological Geometry, Van Nostrand Reinhold, ISBN 978-0-442-06606-2, Zbl 0186.06304
- Postnikov, M. (1986), Lie groups and Lie algebras. Lectures in geometry. Semester V, Mir
- Radon, J. (1922), "Lineare scharen orthogonaler matrizen", Abhandlungen aus dem Mathematischen Seminar der Universität Hamburg, 1: 1–14, doi:10.1007/bf02940576, S2CID 120583389
- Rajwade, A. R. (1993), Squares, London Mathematical Society Lecture Note Series, vol. 171, Cambridge University Press, ISBN 978-0-521-42668-8, Zbl 0785.11022
- Schafer, Richard D. (1995) [1966], An introduction to non-associative algebras, Dover Publications, ISBN 978-0-486-68813-8, Zbl 0145.25601
- Shapiro, Daniel B. (2000), Compositions of quadratic forms, De Gruyter Expositions in Mathematics, vol. 33, Walter de Gruyter, ISBN 978-3-11-012629-7, Zbl 0954.11011
- Baez, John C. (2002), "The octonions", Bull. Amer. Math. Soc., 39 (2): 145–205, arXiv:math/0105155, doi:10.1090/S0273-0979-01-00934-X, S2CID 586512
- Conway, John H.; Smith, Derek A. (2003), On quaternions and octonions: their geometry, arithmetic, and symmetry, A K Peters, ISBN 978-1568811345
- Kantor, I.L.; Solodovnikov, A.S. (1989), "Normed algebras with an identity. Hurwitz's theorem.", Hypercomplex numbers. An elementary introduction to algebras, Trans. A. Shenitzer (2nd ed.), Springer-Verlag, p. 121, ISBN 978-0-387-96980-0, Zbl 0669.17001
- Max Koecher & Reinhold Remmert (1990) "Composition Algebras. Hurwitz's Theorem — Vector-Product Algebras", chapter 10 of Numbers by Heinz-Dieter Ebbinghaus et al., Springer, ISBN 0-387-97202-1
- Springer, T. A.; F. D. Veldkamp (2000), Octonions, Jordan Algebras and Exceptional Groups, Springer-Verlag, ISBN 978-3-540-66337-9