Today we will give four proofs of the classification of the (finite-dimensional complex continuous) irreducible representations of (which you’ll recall we assumed way back in this previous post). As a first step, it turns out that the finite-dimensional representation theory of compact groups looks a lot like the finite-dimensional representation theory of finite groups, and this will be a major boon to three of the proofs. The last proof will instead proceed by classifying irreducible representations of the Lie algebra
.
At the end of the post we’ll briefly describe what we can conclude from all this about electrons orbiting a hydrogen atom.
Generalities
Below, “representation” means “finite-dimensional complex continuous representation.”
Let be a compact group and suppose that, one way or another, we have found a (left-invariant) Haar measure on
, normalized to have total measure
. Such measures exist for all compact groups but are non-trivial to construct in general; in the case of
the Haar measure can be straightforwardly described as the measure on
inherited from Lebesgue measure on
divided by the volume of
:
.
It follows that given a representation of , we can define the averaging operator
.
As in the case of finite groups, the averaging operator is a projection from onto its invariant subspace. It follows that we can average an inner product on
so that it is
-invariant, hence WLOG we talk about unitary representations only and Maschke’s theorem holds: representations are completely reducible. Schur’s theorem holds in this setting with exactly the same proof as usual.
Also as for finite groups, taking the trace of a representation defines its character , a continuous class function
. Taking characters is additive under direct sum, multiplicative under tensor product, and conjugate under taking duals. Moreover, given two representations
on vector spaces
we can define the inner hom
(what I’ve denoted by something like
in previous posts, but I think this notation is less confusing), which explicitly is the space of linear transformations
with action given by
and which abstractly is determined by the adjunction
.
In particular, , so the invariant subspace of
can be identified with the space of
-morphisms
. On the other hand,
is isomorphic to
, hence if
denote the corresponding characters,
has character
. Since the trace of the averaging operator gives the dimension of its image (the invariant subspace of a representation), it follows that
and combining this result with Schur’s lemma, the orthogonality relations follow exactly as for finite groups. In particular, a representation of is uniquely determined up to isomorphism by its character.
The irreducible representations of
has an obvious
-dimensional irreducible representation which we’ll denote by
. To specify the character of any representation of
, it suffices to specify its restriction to the maximal torus of elements of the form
since every element is conjugate to an element of the torus, and the character of
is then
.
From we can construct some additional representations: we have
where the former has dimension
and the latter has dimension
. It is not hard to see that the abelianization of
is trivial, so
is trivial, hence
is self-dual and quaternionic. It follows that
cannot have a one-dimensional summand, hence is an irreducible representation of dimension
. If
has eigenvalues
, then its action on
has eigenvalues
, so the character of this representation is given by
.
Note that the double cover gives an irreducible
-dimensional real representation of
which extends to a
-dimensional complex representation. In this representation
acts by rotation by
, where
, and it follows that the character of this representation agrees with
, hence that the two are isomorphic.
What can we say about higher-dimensional representations? Well, given , a standard strategy for constructing more representations is to apply various functors to
. Here it will be productive to consider the symmetric powers
, getting the hint from
. Since
, these representations have dimension
. Note that
is the trivial representation by definition. Recall that if
is a diagonalizable linear operator with eigenvalues
, then
has eigenvalues the products of all unordered
-tuples of eigenvalues of
(and this is easily proven by exhibiting the corresponding eigenvectors). It follows that the characters of the representations
are given by
.
If , the above is therefore equal to
where
is the
Chebyshev polynomial of the second kind.
Proposition: All of the representations are irreducible.
Proof. Since has real character, it is self-dual, so we can identify
with the space of homogeneous polynomials of degree
on
. If
is equipped with an invariant inner product, then letting
denotes an orthonormal basis for
,
acts on the symmetric algebra
of polynomial functions on as follows:
.
(The matrix given is the matrix of an element of with respect to an orthonormal basis
such that
is the corresponding dual basis.) Let
be a nonzero homogeneous polynomial of degree
. A minimal invariant subspace
containing
must also contain
for every
, and by taking appropriate linear combinations of these polynomials,
contains every monomial in
. Thus we may assume WLOG that
is a single monomial
. But for sufficiently small
the action of
sends
to a polynomial all of whose coefficients are nonzero, hence
contains every monomial in
, so
as desired.
It follows that the characters are orthogonal and satisfy
. This can also be checked with an explicit formula for integrating a class function on
, but we will not need to do this.
The representations all have real characters, so are all self-dual. A computation of the characters of both sides, or an explicit argument with bases, shows that
.
It follows by induction that any irreducible subrepresentation of is isomorphic to
for some
. Since they are all self-dual, these are in some sense all of the representations of
obtainable from
via universal methods.
Theorem: Every irreducible representation of is isomorphic to
for some
.
Proof 1
Our first proof is based on the following important observation: since the character of a representation of is determined by its restriction to any maximal torus
, the restriction functor
is essentially injective. So we should try to understand the second category in order to understand what kind of characters are possible.
Theorem: Regard as the unit complex numbers
. Then every irreducible representation of
is isomorphic to the representation
for some
.
This result is closely related to the existence of Fourier series; see Pontryagin duality for a general discussion.
Proof. Since is abelian it is true as for finite groups that any irreducible representation has dimension
(since any eigenvector of a non-identity element spans an invariant subspace). Thus an irreducible representation is just a continuous homomorphism
. By compactness the image of this map is contained in
, so an irreducible representation is just a continuous homomorphism
. By path lifting,
lifts to a continuous homomorphism
, which must therefore be of the form
for some real
. This gives
which is a homomorphism if and only if , and the conclusion follows.
Any irreducible representation of
decomposes into the direct sum of finitely many irreducible representations of a given maximal torus
. Suppose the representation
occurs
times. Then the character of
must be given by
where has eigenvalues
. That is, it must be a Laurent polynomial in
with integer coefficients. Moreover, since we can swap the order of the eigenvalues, it follows that
is invariant under the substitution
, so it must be a symmetric Laurent polynomial in
:
.
However, it is not hard to see that the characters of the symmetric powers
give a
-basis of the symmetric Laurent polynomials in
, so it follows that there exist
such that
.
In particular, for some
, hence
for some
as desired.
In general, maximal tori are very important in the classification and representation theory of compact Lie groups. The Lie-algebraic analogue of a maximal torus is a Cartan subalgebra, which plays a corresponding role in the classification and representation theory of semisimple Lie algebras.
Proof 2
Our second and third proofs are both motivated by the standard result that if is a finite group and
a faithful representation of
, then every irreducible representation of
occurs in
for some
. This result can be proven in many ways, some of which don’t generalize to compact groups, and in fact the result is not true as stated for compact groups: the example of
and
the representation
shows that we need to at least consider
for integers
. This result is true (see for example this MO question), but Proof 2 will not be enough to prove it in general. Nevertheless, it works in the special case of
.
First, note that if and only if they have the same eigenvalues (which are determined by their real part), hence if and only if they are conjugate in
. So letting
denote the eigenvalues
of
, the space of conjugacy classes of
is naturally identified with
, and
separates points on this space. It follows by the complex Stone-Weierstrass theorem that the smallest subalgebra of the algebra of class functions containing
and closed under conjugation is dense in the space of continuous class functions with the uniform norm. But since
is self-dual, this algebra is spanned by the characters of the representations
, which are all direct sums of the representations
.
It follows that given an irreducible representation , we can find a sequence of class functions which are linear combinations of the characters
converging uniformly to
. It again follows that
for some
, and again we are done.
This proof is not enough to give the general result about faithful representations because does not necessarily separate conjugacy classes in general. However, it is still possible to use the Stone-Weierstrass theorem to prove the general result, and this is the subject of the next proof.
Proof 3
Our third proof relies on a companion result to the orthogonality relations for characters. For us, a matrix coefficient of a compact group is a continuous function
of the form
for a unitary representation
and vectors
.
Theorem (orthogonality for matrix coefficients): Let be matrix coefficients of a compact Lie group
, where
are irreducible unitary representations on
with characters
. If
are non-isomorphic, then
.
(The full statement of the orthogonality relations includes an expression for the inner product of matrix coefficients from the same irreducible representation, but there is a constant in it that I can’t figure out, and in any case we won’t need it.)
Proof. Our convention is that inner products are conjugate-linear in the first variable. Fix and
. We can write
.
For fixed , the integrand gives a bilinear map
(since it is linear in
but conjugate-linear in
); moreover, the natural action of
by precomposition gives a representation isomorphic to
on the space of such bilinear maps, and the above integral computes precisely the averaging operator on this representation. If
is not isomorphic to
, the corresponding tensor product does not have any copies of the irreducible representation, so the averaging operator sends any vector to zero and the conclusion follows.
Theorem: Let be a compact group with a faithful representation
. Then the span of the matrix coefficients of the representations
is dense in
.
Proof. The sum of any two matrix coefficients of a given representation is another matrix coefficient of the same representation
. Furthermore, the complex conjugate of a matrix coefficient of
is a matrix coefficient of
, and the product of a matrix coefficient of
and a matrix coefficient of
is a matrix coefficient of
. It follows that if
has a faithful representation
, then by the complex Stone-Weierstrass theorem, the space spanned by the matrix coefficients of the representations
is dense in
. In particular, if
is any other irreducible representation, then we can approximate its matrix coefficients with the above matrix coefficients, and we find by orthogonality that
is a subrepresentation of
for some
.
Proof 4
For our final proof we will provisionally assume that all representations are smooth so that we can pass from a representation to the induced map on Lie algebras
. As it turns out, all continuous homomorphisms between Lie groups are automatically smooth (although we won’t prove this), so we can assume this WLOG.
Recall that a representation of a Lie algebra is a homomorphism of Lie algebras
for
a (finite-dimensional complex) vector space. There is an obvious notion of direct sum of representations. As for groups, we say that a representation is irreducible if it has no non-trivial
-invariant subspaces.
Proposition: Let be a Lie group all of whose elements are contained in a one-parameter subgroup of
,
a smooth representation, and
the induced representation of
. Then
is irreducible as a representation of
if and only if it is irreducible as a representation of
.
Proof. By assumption, is irreducible if and only if it has no non-trivial subspaces invariant under all of the one-parameter subgroups of
. Let
be such a one-parameter subgroup. Let
be a nonzero vector. Then
and
.
It follows that any point in the minimal -invariant subspace containing
is a limit of points in the minimal
-invariant subspace containing
, and vice versa, hence that the two coincide.
In particular, we now know that the representations of associated to
are all irreducible. Moreover, given a representation of
which is known to come from a representation of
, by exponentiating we can recover the action of a maximal torus, hence the character of the representation. It follows that if we show that every irreducible representation of
is of the form
, we have also shown the same statement for
. (Note that we don’t need to know how to lift representations of
to representations of
to do this.)
Recall that can be explicitly presented as the imaginary quaternions under the inherited bracket
.
(I’m not using the symbols because we’re about to extend scalars to
.) Since we only care about complex representations of this real Lie algebra, we can pass to the complexification
(a technique which highlights the algebraic convenience of working with Lie algebras), and now that we are working over we can ask for a more convenient basis of
. One idea is to look at the adjoint action
. By inspection the eigenvectors of this linear transformation are
with eigenvalues
respectively. So one convenient choice of basis is
, in which the relations are given by
.
This basis is convenient for the following fundamental reason. Let be any representation of
. (We won’t distinguish between an element of
and its action on
.) Suppose
is an eigenvector of
of eigenvalue
. Then
and similarly
hence is (zero or) an eigenvector of
of eigenvalue
and
is (zero or) an eigenvector of
of eigenvalue
! The eigenspaces spanned by these eigenvectors are known as the weight spaces of
, and correspond to the decomposition of
under the action of a maximal torus of
. But unlike in the first proof above using maximal tori, we can now use elements of the Lie algebra to move between weight spaces.
The sequence of vectors are all eigenvectors with distinct eigenvalues, so since
is finite-dimensional the sequence must eventually terminate. Hence we may assume WLOG that
. In this case
is known as a highest weight vector. Since
, it follows that
is a -invariant subspace of
. If
is irreducible it must therefore be all of
. If
, we conclude that
.
Hence has a basis of eigenvectors of
with eigenvalues
. In particular, it follows that the trace of
acting on
is
.
But since and the trace of a commutator is zero, it follows that we must have
. The resulting weights are precisely the weights of
, and it follows that
as desired.
Note that, unlike all of the other proofs, this one explicitly constructs the irreducible representations (at least as representations of the Lie algebra) without requiring that we knew what they were in advance.
The representation theory of
An electron orbiting a hydrogen atom can be described by four quantum numbers, which are really labels for a direct sum decomposition of the corresponding Hilbert space of states. Using what we now know about the representation theory of , we can explain two of these quantum numbers, although we will do so in more detail in a later post.
Keeping in mind the double cover , any irreducible representation of
pulls back to an irreducible representation of
. The ones we get in this way are precisely the ones in which
acts trivially, and by inspection these are the irreducible representations
. Hence
has exactly one irreducible representation of each odd dimension
. (Of course all of the irreducible representations of
are projective representations of
, so we should expect the even-dimensional ones to also be important in quantum mechanics.)
It follows that in any quantum mechanical system with physical symmetry, the eigenstates corresponding to a particular energy eigenvalue will organize themselves into clumps with an odd number of members. For electrons around a hydrogen atom, the corresponding clumps are referred to in chemistry as subshells and delineated by their azimuthal quantum number
, which measures orbital angular momentum. In chemistry, subshells are also traditionally referred to by letters based on what the corresponding atomic spectra looked like:
(sharp) refers to
,
(principal) refers to
,
(diffuse) refers to
,
(fundamental) refers to
,
and so forth. Experimentally, subshells hold
electrons,
subshells hold
electrons,
subshells hold
electrons,
subshells hold
electrons, and so forth. This is off by exactly a factor of
from what can be predicted purely on the basis of
symmetry (where we would expect
electrons, respectively), and this is because there is a second symmetry here coming from spin. Nevertheless, the representation theory of
alone is already enough to address abstractly the origin of one quantum number, and in fact by fixing a maximal torus and considering its eigenvectors we get another one, the magnetic quantum number
. The reason it takes values between
and
is that these are the possible weights of
.
Since an argument based on symmetry is independent of the choice of Hamiltonian, we cannot use symmetry alone to figure out what the energy eigenspaces are, so we still can’t explain the principal quantum number without actually looking at the Hamiltonian, and we also haven’t explained spin. But two out of four isn’t bad!
[…] Example. Let and let be the defining representation. The character is just the trace of regarded as a matrix, which completely determines its conjugacy class; consequently, already separates points, and to understand Haar measure on the conjugacy classes of it suffices to understand the moments of . But these are just the dimensions of the invariant subspaces of the tensor powers of . The explicit description […]
Very nice explanation, and clearly stated. My chemistry is now pretty faint, but I do recall the electron shells having certain shapes. Can representation theory help in saying anything about them?
Also, is it fair to say that the electrons in the same orbital have lost their own ‘individuality’? That is, if I put an electron in an orbital, and then later on take one out, can I meaningfully ask whether it is the same electron I put in and get a straight answer?
1) Yes. The keyword here is “spherical harmonic.” 2) My understanding is that two particles of the same type do not have an individual existence in quantum mechanics. They are identical on a fundamental level.
Thanks for the tip. Sure, electrons are fundamentally identical, but surely that doesn’t mean that we can’t distinguish them under certain situations. Take for example two electrons separated by some large distance. Even with the uncertainty principle at work, I think the two ‘identical in every way’ electrons can still be distinguished in space. Or am I making a silly blunder here?
I suppose.
How do you like your internship? Are you studying GARCH and ARIMA models?
Quite impressive to me -I am not fluent in representation theory.
Thanks alot for the post.
It would be even nicer with (more) references to your sources and possible extensions of the results -which are numerous I suspect. But this has already been alot of work, thanks.
The classical results of Lie theory extending these results can be found in just about any good text on Lie theory, e.g. Fulton and Harris or Serre.