Someone who has just read the previous post on how exponentiating quaternions gives a nice parameterization of might object as follows: “that’s nice and all, but there has to be a general version of this construction for more general Lie groups, right? You can’t always depend on the nice properties of division algebras.” And that someone would be right. Today we’ll begin to describe the appropriate generalization, the exponential map from a Lie algebra to its Lie group. To simplify the exposition, we’ll restrict to the case of matrix groups; that is, nice subgroups of
for
or
, which will allow us to mostly avoid differential geometry.
The theory of Lie groups and Lie algebras is regarded to be one of the most beautiful in mathematics, and it is also fundamental to many areas, so today’s post is an extended discussion motivating the definition of a Lie algebra. In the next post we will actually do something with them.
For studying the hydrogen atom, our interest in Lie algebras comes from the following. If a Lie group acts smoothly on a smooth manifold
, its Lie algebra acts by differential operators on the space
of smooth functions, and these differential operators are the “infinitesimal generators” which give us conserved quantities for the evolution of a quantum system on
(in the case that
consists of symmetries of the Hamiltonian). Despite the fact that Lie algebras are commonly sold as a tool for understanding Lie groups, arguably in quantum mechanics the Lie algebra of symmetries of a Hamiltonian is more fundamental. This is important in sitations where the Lie algebra can sometimes exist without an associated Lie group.
Symmetries
To describe what intuitive notion a Lie algebra is supposed to capture, let’s return to the intuitive notion a group is supposed to capture. There are several ways to convince yourself that the group axioms perfectly capture what is intuitively meant by (global) symmetry. First, Cayley’s theorem guarantees that abstract groups (sets with a binary operation satisfying certain axioms) are the same thing as concrete groups (permutations of some set, generally intended to preserve some structure). Second, the group axioms correspond perfectly to the axioms an equivalence relation satisfies: given any group acting on a set
, define the relation
on
. Then
- The fact that
is closed under multiplication is equivalent to the fact that
is transitive.
- The fact that
has an identity is equivalent to the fact that
is reflexive.
- The fact that
has inverses is equivalent to the fact that
is symmetric.
And, of course, function composition is also associative. In fact, groups, group actions, and equivalence relations all have a common generalization in groupoids, which are in turn special categories. So there is an overwhelming amount of abstract evidence that the group axioms are “right.”
When studying abstract groups, a general method is to study them via generators and relations. If the group is finitely generated or, better yet, finitely presented, this gives a completely finitary description of the group. However, the groups we are currently interested in, Lie groups, are (a priori) infinitary objects, and there does not seem to be any hope of describing such a group in terms of generators and relations.
However, this attitude only makes sense if we assume that our groups do not have any extra structure; in particular, we are assuming that they are discrete. The Lie groups we care about are far from discrete, so there are topological constraints on morphisms between Lie groups. These constraints are strongest if we assume that our Lie groups are connected (since we are ignoring the discreteness that comes from the group of connected components). In fact, the following holds.
Theorem: Any connected topological group is generated by a neighborhood
of the identity. Hence any continuous homomorphism out of
is determined by its restriction to any such neighborhood
.
Proof. Let be a neighborhood of the identity and let
be the subgroup generated by
. Since
is a union of translations of
, it is open. If
has the property that any neighborhood
of
intersects
, then
intersects
, hence
, so
is closed. Since
is non-empty, it must be all of
.
So morphisms out of connected topological groups are determined by what’s happening in arbitrarily small neighborhoods of the identity: informally speaking, they are determined by “infinitesimal” elements of . The goal of the definition of a Lie algebra is to make this notion of infinitesimal symmetry precise.
Derivations
What is an infinitesimal symmetry? As a first approximation, we want to consider a continuous analogue of homomorphisms , which correspond to applying the element
over and over again. The appropriate continuous analogue is a one-parameter subgroup
(where for Lie groups we will assume the morphism is smooth, although this turns out to already be true). Just as
is determined by
, by the above Theorem
is determined by
for arbitrary
. Our first approximation of an infinitesimal symmetry is the thing which generates a one-parameter subgroup.
Here is a definition which does not require any analysis to state, but which requires a little analysis to motivate. Let be an algebra, not necessarily commutative (say over
), and let
be a one-parameter group of automorphisms of
. (For example, we can choose
for
a smooth manifold and
the map induced by a one-parameter group of automorphisms of
.) Let us suppose that the derivative
exists in some suitable sense (in the space of -vector endomorphisms of
). Since
, this is just
. The map
will be our model for the “infinitesimal generator” of
. Since
is linear, so is
. It’s also not hard to see that
fixes constant functions. Finally, since
, we see that
(the product rule). These properties define a(n -) derivation of
, which is a map
satisfying the following axioms:
,
.
,
Note that the second axiom implies that is
-linear because of the third axiom.
Example. Let and let
be the translation map
. Then
is the ordinary derivative
. We might write
.
This example also works if we replace with the algebra of polynomials
.
Example. Let and let
be the translation map
where
is fixed. Then
is the directional derivative
along
. If
is given coordinates
and
is the unit vector in the direction
, we might write
.
This example also works if we replace with the algebra of polynomials
.
Example. Let and let
be the rotation map
.
Then . Once more this example works if we replace
with the algebra of polynomials
, but we can also replace
with the algebra of holomorphic functions on an annulus about
restricted to
, whereupon
and
(and the same is true for the corresponding algebra of polynomials
).
Example. If is any algebra such that the exponential
makes sense (for example a Banach algebra), then let
. (We think of
as the algebra of observables of some system and
as changing coordinates via the symmetry generated by
; this is relevant to the Heisenberg picture of quantum mechanics). Then
.
This example is extremely important to keep in mind.
Generally speaking, if and
comes from a one-parameter group of automorphisms
(via the relation
), then
is nothing more than the derivative along the vector field defined by
, which corresponds to its flow. It turns out that every derivation of
comes from a vector field in this way, so we can think of derivations in general as an algebraic generalization of vector fields; that is, we should think of a derivation
as a “vector field on
.”
We are now ready to give a more-or-less precise definition of an infinitesimal symmetry. For an algebra over a field
, a (first-order) infinitesimal symmetry of
is a homomorphism
of
-algebras such that the composition of the above homomorphism with the natural quotient
is the identity. (If
is noncommutative, we require that
is central.) In other words, an infinitesimal symmetry a first-order deformation of the identity symmetry.
Explicitly, an infinitesimal symmetry is given by a homomorphism such that
, hence
is
-linear, and such that
, hence
.
Hence is precisely a
-derivation.
If is commutative and
is (the evaluation map at) an
-point
of
, then composing an infinitesimal endomorphism
with
gives a morphism
. We call this a derivation at
, and it is the appropriate algebraic generalization of a tangent vector at
; this is the algebraic sense in which a derivation gives rise to a vector field.
Indeed, this is one way to define the Zariski tangent space at a point of a variety. In this case we have , so a derivation at
is nothing more than a choice
such that
, or
hence is a vector orthogonal to the gradient of every
at
. More abstractly, an infinitesimal symmetry of a commutative
-algebra can be written as a morphism
, or in the other direction as a morphism
where is the fiber product over
. (The tensor product over
is the coproduct in the category of commutative
-algebras. It dualizes to give the fiber product, which is therefore the product in the category of schemes over
.) Here just as
is the universal
-point,
is the universal
-point with a tangent vector, so an infinitesimal symmetry is just a tangent direction in the automorphism group of
. Compare to the algebraic definition of a one-parameter subgroup, which is a morphism
or, equivalently, a morphism (with the same restriction as before on the quotient
). Given such a morphism we get an infinitesimal symmetry by composing with the quotient
. So we see that a tiny bit of algebraic geometry provides an elegant algebraic language for going from one-parameter subgroups to infinitesimal symmetries.
Local nilpotence
Given a derivation we would like to promote it to a one-parameter subgroup
of automorphisms such that
, hence such that
. This differential equation at least formally admits the solution
which is defined at various levels of rigor depending on how much structure has and how well-behaved
is on it. (It is, at the very least, a morphism
for every
(a higher-order infinitesimal symmetry) if
has the appropriate characteristic, and by taking the categorical limit it is a morphism
if
has characteristic zero.) For example, if
and
comes from a vector field on
which is Lipschitz, then the Picard-Lindelöf theorem guarantees the existence and uniqueness of
in a neighborhood of zero, which defines it uniquely everywhere.
For now I would like to make the point that for some it is possible to make sense of the exponential with no assumption of additional structure on
. Namely, suppose that
is locally nilpotent on
: that is, that for every
, the vector space
is finite, and
acts nilpotently on it. Then
is a sum of finitely many terms for any particular
, so it is perfectly well-defined without the need to take any limits, and moreover it is not hard to show that
is actually a family of automorphisms (each of which has inverse
).
This applies in particular to the case that acting on
, hence it is possible to give very concrete meaning to the intuitive idea that
is the infinitesimal generator of translation in the
direction in the sense that
for any . (This is just an abstract form of the Taylor expansion formula.) Since these derivations all commute, the exponential map gives a smooth homomorphism from the vector space of derivations
to the Lie group
of translations acting on
. Of course, a statement of this form can’t be true in general for non-abelian groups of automorphisms, but it is still nice to see how the picture works out in a nice abelian case.
Two formulas
I can’t resist pointing out two cute applications of this idea that is the translation by
operator when
. Since in particular
, it follows that the backward difference operator
can be written
. If we want to find sums of the function
, we want in some sense to be inverting
, so we want to investigate
.
The RHS is essentially the generating function for the Bernoulli numbers, which gives
.
And subtracting two copies of this gives a rough heuristic derivation of the Euler-Maclaurin formula (since is indefinite integration)! By local nilpotence, the above is almost completely rigorous when applied to polynomials, which gives Faulhaber’s formula.
The Lie bracket
Just as groups are an abstraction of the concrete phenomenon of symmetries (permutations of sets), Lie algebras can be thought of as an abstraction of the concrete phenomenon of infinitesimal symmetries (first-order deformations of the identity automorphism of an algebra). So what axioms should a Lie algebra satisfy?
First of all, if is a
-derivation, then so is
for every scalar
. When
this corresponds to performing the same infinitesimal symmetry, but
times faster.
Second of all, an infinitesimal symmetry can be extended to a homomorphism
by making it
-linear, and then we can compose a pair of infinitesimal symmetries
and
(hence
are derivations) to get a new infinitesimal symmetry
. Together with the above observation, we can now confidently state that whatever a Lie algebra over
is, it is at the very least a
-vector space.
As we saw above, in some basic abelian cases this -vector space structure is already enough to recover the structure of the corresponding group of symmetries. In the non-abelian case, however, this will fail. The problem is that we are only looking at first-order information, and infinitesimal symmetries commute to first order, but actual symmetries don’t. In order to fix this issue we need to look for second-order information.
Hence for a field of characteristic not equal to
, we make the following definition: a second-order infinitesimal symmetry of a
-algebra
is a homomorphism
such that composition with the quotient
gives the identity. Explicitly, this is a map
where are
-linear and
. Expanding out this condition shows that it is equivalent to the condition that
is a
-derivation and that
satisfies
.
Since we know that is always a second-order infinitesimal symmetry, we see that there is always a distinguished choice
. Indeed,
. It therefore follows that
.
In other words, can be specified by specifying two derivations: the first derivation
gives a first-order deformation of the identity, and the second derivation
gives a second-order deformation of the flow
.
Given two derivations , we can take their second-order flows and get second-order symmetries
, and then we can compose those flows to get
.
This second-order symmetry is the flow of if
commute, but otherwise the second-order deformation is
.
We know from the above that second-order deformations are derivations, so it follows that is a derivation, the Lie bracket of
and
. Thinking of
as vector fields, the Lie bracket, which is a special case of the Lie derivative, measures how
changes along the flow induced by
.
This can be made precise as follows. If is a one-parameter group of automorphisms with
, then
acts on derivations via conjugation
. Differentiating this action gives the bracket
. This is precisely analogous to how every element of a group
defines an inner automorphism
of
.)
The Lie bracket is obviously bilinear. If commutes with
, then
; in particular,
. Hence the Lie bracket is alternating. In characteristic
, this is equivalent to the condition that
, which is intuitive since the change in
along
ought to be the opposite of the change in
along
. Finally, since the Lie bracket is bilinear and
is obtained by differentiating a group symmetry, it follows that
ought to satisfy the product rule; that is,
which is equivalent, after a little rearrangement and application of alternativity, to the Jacobi identity
.
In other words, the Lie bracket defines an action of a derivation on the space of derivations by derivations! This is less confusing than it sounds at first. After all, both left multiplication and conjugation define an action of a permutation (that is, an element of a group ) on a set of permutations (that is, on
) by permutation. The Lie bracket is more closely analogous to conjugation, but it is also instructive to think of it as analogous to left multiplication, since the condition that left multiplication defines a group action on a group
is precisely the associativity axiom. So the Jacobi identity can roughly be thought of as “associativity of the Lie bracket.”
It turns out that second-order information about a Lie group is enough to completely reconstruct its multiplication in a neighborhood of the identity; we do not need to go further to find third-order analogues of the Lie bracket. So we have enough structure to feel comfortable making the following definition.
Definition: A Lie algebra over a field is a
-vector space
equipped with a bilinear map
which is alternating and satisfies the Jacobi identity.
The vector space of
-derivations of a
-algebra
, and any subspace of this closed under bracket (a Lie subalgebra), forms a Lie algebra, so we already have a large and rich class of examples; in particular, the vector fields on a smooth manifold
form a Lie algebra.
In the next post we will describe how to associate a Lie algebra to a Lie group using as little differential geometry as possible. Lie algebras form a category whose morphisms are
-linear maps preserving the bracket, and what we will be describing is a functor
which is surprisingly close to faithful.
The fact that the Lie Bracket is a derivation doesn’t really prove the Lie Bracket of two derivations in the Lie Algebra, though does it? I mean, one can construct automorphisms phi: A –> A that are not in the same subgroup of Aut(A). It seems that something more is needed to prove closure.
[…] claimed. There is a more natural way to see why this identity is the “right one”; see Qiaochu. However, with this proof I want to make the point that this Jacobi identity is not our decision: […]
Excellent post, Qiaochu! I ran into this while trying to recall a few basic things about Lie groups/Lie algebras. This is a good reference for the basics.
Thanks!
[…] https://qchu.wordpress.com/2011/02/26/the-quaternions-and-lie-algebras-i/#comment-3341 […]
[…] https://qchu.wordpress.com/2011/02/26/the-quaternions-and-lie-algebras-i/#comment-3341 […]
I see. We can always think of an abstract Lie Algebra in the derivation / infinitesimal one-parameter family of automorphisms sense at least formally. You can define formally the exp map as a map from the lie algebra into a formal power series algebra. Then the set of all exp( g ) can be seen to be a formal group, once you prove the BCH formula. Then I suppose we can think of the resulting formal group as the automorphism group we were looking for
thank you for the reply
Yes, and in fact in characteristic zero this defines an equivalence of categories between Lie algebras and formal groups.
Thanks, I will try to read more about this and understand it. However, I think I kind of messed it up above. It seems like you can only define the group exp( g ) for g in the Lie Algebra when the Lie Algebra is nilpotent ( so that the BCH formula will actually have finitely many terms ). Otherwise, the elements exp( g ) are not closed under multiplication, unless you extend the domain from just the lie algebra, to formal “lie power series ( power series in the bracket )”.
Sorry, and just one more question.. I think this seems like it only works between finite dimesional lie algebras and power series of finitely many variables. What happens in the infinite dimensional lie algebra case? Can we still think of them in terms of derivatives of formal power series groups?
thank you
Yes, one has to use formal power series. I don’t know if the correspondence holds in infinite dimensions.
Hmm.. too bad.. I was hoping to find a “unified” way of thinking about abstract lie algebras. at least over characteristic 0
thank you!
Hello, thanks for this post, it is great! Could you suggest any references for motivations behind studying “abstract Lie Algebras”, or anything related to what was on this post?
I am trying to understand better why people would study Lie Algebras “in their own right”. I am slowly starting to appreciate it better — but in the past, I’ve thought of lie algebras existing as a field of study only to serve lie group theory ( lie algebras encode information about the lie group ). To me, perhaps motivating lie algebras via derivations seems to be a step towards motivating lie algebras without necessarily lie groups.
Thank you
One motivation is that there are interesting infinite-dimensional Lie algebras, such as the Kac-Moody algebras, and not all of them come from corresponding infinite-dimensional Lie groups. These have various applications, e.g. affine Lie algebras are relevant to mathematical physics. In physics, Lie algebras are actually more fundamental than Lie groups, since they directly give rise to observables.
Thanks, I will take a look. Do these Kac-Moody algebras fit into the “Lie Algebras are first-order deformations of the identity automorphism of an algebra” idea? I like the description you gave with taking the differential at 0 of a varying 1-parameter family of automorphisms of an algebra. Is this general enough? I am not yet at the level of understanding completely, but Aut(A) does not always have to form a lie group, is this correct? For example. Diff( M ) = Aut ( C^infinity ) of a manifold does not have to form a lie group, when Diff( M ) is not a differential manifold ( in the standard locally euclidean sense )?
Thank you for the reply!
I don’t think so. I’m not really sure where they come from. Everything is more or less fine if you take one-parameter families in the formal sense (power series) but in general there won’t be any “actual” one-parameter families running around unless A is a finite-dimensional real algebra or a Banach algebra or similar.
[…] We now know what a Lie algebra is and we know they are abstractions of infinitesimal symmetries, which are given by derivations. Today we will see what we can say about associating infinitesimal symmetries to continuous symmetries: that is, given a matrix Lie group , we will describe its associated Lie algebra of infinitesimal elements and the exponential map which promotes infinitesimal symmetries to real ones. […]