The following two lemmas might be encountered in a basic course in complex analysis (the first in a basic course in group theory, even).
Lemma 1: Fix a field . The group of fractional linear transformations
acts triple transitively on
and the stabilizer of any triplet of distinct points is trivial.
Lemma 2: The group of fractional linear transformations on preserving the upper half plane
is
.
I used to only know extremely boring computational proofs of both of these statements. However, I now know better! Today I’d like to give shorter and conceptual proofs of both of these, and then briefly discuss how they come about in the study of elliptic curves (a subject I’d like to talk about in more detail once this semester is over).
Background
Fix a field . The vector space
has automorphism group the general linear group
, the group consisting of all
invertible matrices with coefficients in
. Linear transformations preserve lines through the origin, so
also acts on the set of such lines. (More generally, linear transformations act on Grassmannians.) The space of lines in
is denoted
, the projective line over
, because it is an essentially “one-dimensional” object. In some basis, it consists of the span of the vectors
and the span of the additional vector
; the former is referred to as an “affine slice”
of
, and the latter is referred to as the “point at infinity.” In algebraic geometry this construction is important for providing the extra points that make nice theorems such as Bezout’s theorem true; it can be thought of as a kind of compactification.
acts on lines as follows. A linear transformation
sends the line spanned by (hereafter denoted by projective coordinates
) to
, which, if
, is the same as
. (If
, then we are instead at the point at infinity.) Thus we can identify
with the group of fractional linear transformations acting on
, up to the fact that scalar multiples of the identity act trivially on lines. So the group actually acting is
, the projective general linear group.
When (the case that is particularly important to complex analysis), the projective general linear group coincides with the projective special linear group
, since the coefficients of any invertible matrix can be scaled until the matrix has determinant
. The projective line
admits a famous interpretation here: under stereographic projection it can be identified with the Riemann sphere, and
is often called the group of Mobius transformations in this context. As it so happens,
is precisely the group of biholomorphic automorphisms of the Riemann sphere, which follows from the fact that the only meromorphic functions on the Riemann sphere are the rational functions.
Lemma 1
The key to a conceptual proof of Lemma 1 is the following observation: in the action of on
, the stabilizer of the point at infinity is the group
of affine linear transformations. It follows that we may understand the action of
on triplets of points
by first sending the first point to infinity. The remaining two points can only be acted on by affine linear transformations, but the affine linear transformations clearly act transitively on pairs of distinct points, and do so with trivial stabilizer!
The previous proof I knew of this lemma involved showing that the particular triplet of points had trivial stabilizer and then showing that any triplet of points could be sent to this one by writing down the coefficients of a transformation doing the job explicitly.
This lemma appears when studying elliptic curves in the following way. One can classify smooth projective algebraic curves over by the number of “holes” they have topologically, or more precisely by their genus. An incredibly powerful tool to help us do this is the Riemann-Roch theorem, which I need to learn about at some point. Using Riemann-Roch one can show that the only smooth projective curve of genus zero is the projective line
(and in fact this holds over more general fields and has number-theoretic significance), so the next obvious thing to do is classify curves of genus one.
The Riemann-Roch theorem implies that any smooth projective curve of genus one can be written in the form
(where I’m using affine coordinates for the sake of familiarity.) After a change of coordinates, it’s possible to write any such curve in the form
for some distinct (the curve cannot be smooth at a repeated root of the RHS). In fact, any curve of the form
where are distinct points on the Riemann sphere is a smooth projective curve of genus one (where if one of the points is at infinity one should interpret
as
). As it turns out, two such curves are isomorphic if and only if the roots of the corresponding quartic polynomials can be sent to each other by a fractional linear transformation, so one can identify the space of isomorphism classes of smooth projective curves of genus one with the space of isomorphism classes of (unordered) 4-tuples of distinct points on
up to projective linear transformation.
An elliptic curve is a smooth projective curve of genus one equipped with a distinguished point. This point serves as the identity for a group law defined on any elliptic curve, which comes abstractly from an identification of an elliptic curve with its Jacobian variety. It turns out that in order for the group law to have the “usual” definition in terms of the chord-tangent construction, the distinguished point on needs to be one of the four points
.
Given an elliptic curve presented in the form in which
is the distinguished point, send
to infinity by some fractional linear transformation. After applying this fractional linear transformation, we now have an elliptic curve of the form
whose group law is given by the chord-tangent construction and whose identity is at infinity. Now the remaining roots
can be transformed further by fractional linear transformations, but in order for two elliptic curves written in this way to be isomorphic the group identity (infinity) needs to be preserved, so instead of fractional linear transformations we can only transform the roots via affine linear transformations. Sending
, we finally get an elliptic curve in Legendre normal form
where is the remaining root. This can be understood in terms of Lemma 1 as follows: every equivalence class of ordered 4-tuples of points on the Riemann sphere has a unique representative in the form
. However, since order is irrelevant, we could have chosen to send any of
to
to get
, so the Legendre normal form is unique only up to the action of
on the three roots of the RHS not at infinity. One can write down the fractional linear transformations giving these permutations explicitly, and this gives a natural identification
.
A generic orbit of has size six, but there is a special point with an orbit of size three and another special point with an order of size two. (If you don’t know what these points are, it’s a fun exercise to figure them out.) The corresponding elliptic curves are very special!
Lemma 2
The key to a conceptual proof of Lemma 2 is the following observation: any Mobius transformation which fixes the upper half plane (considered as a subset of
after identifying
with the one-point compactification
) also fixes the lower half plane, and therefore it also fixes
. In particular, its zeroes and poles must all lie on
, so it must have the form
. But from here it’s clear that
, so it only remains to decide the sign of the determinant. The sign of the determinant determines, in a straightforward manner, whether
is sent to itself or to
, hence whether the corresponding transformation fixes
or swaps it with the lower half plane.
The previous proof I knew of this lemma took over two pages; I don’t even want to describe how awful it was!
This lemma appears when studying elliptic curves in the following way. The “analytic” way to define an elliptic curve is as a quotient where
is a free abelian subgroup of
of rank
, i.e. a lattice. (The correspondence between this definition and the one I gave above is not at all obvious, but it is the analytic definition which figures prominently in the classical history of the subject and its relation to the arc-length of ellipses. In the modern treatment the connection between the two definitions goes through the Weierstrass elliptic functions. If you want to learn more about the relationship between the two, I recommend the beautiful exposition by Stevenhagen, which can be found on this page of algebra notes.) Topologically,
is a torus (a parallelogram with opposite sides identified) regardless of the particular choice of
is, but it also carries the structure of a Riemann surface, and as it turns out, two such quotients are isomorphic as Riemann surfaces if and only if
for some
.
Given an oriented basis for a lattice, i.e. one such that
, we may quotient by the above equivalence and set
. This identifies the space of isomorphism classes of elliptic curves
equipped with an oriented basis for the lattice
with the upper half plane
. Of course, we don’t want to be content with a choice of basis. Two oriented bases span the same lattice if and only if they can be sent to each other by an element of
(the positive determinant is necessary to preserve the orientation), so it follows that two points in the upper half plane correspond to the same elliptic curve if and only if one can be taken to the other under the action of
, which sits naturally inside
as a discrete subgroup. This gives a natural identification
.
What’s the relationship between this picture and the picture coming from Lemma 1? Well, as it turns out, the points on an elliptic curve in the form
are precisely the non-identity points of order two, which for the corresponding quotient
, where
is spanned by
, are precisely the points
. Now, since as an abstract group
is isomorphic to a product of two copies of the circle group, the subgroup consisting of the points of order dividing
(“
-torsion”), for any positive integer
, is isomorphic to
. When we pick two of the points of order two to send to
to write an elliptic curve in Legendre form, we are in essence picking an ordered basis for their
-torsion.
Now, there is a natural way to parameterize elliptic curves with a given ordered basis for their -torsion. As before, we start with the identification of
with oriented bases
. Given such an oriented basis we may choose the ordered basis
for the
-torsion, and now two elliptic curves with given ordered basis for their
-torsion are isomorphic to each other, with the isomorphism respecting the choice of basis, if and only if the corresponding element of
is an element of a smaller subgroup consisting of those matrices isomorphic to the identity matrix
. This group is denoted
, and the quotient
is isomorphic to
. When
, this group is isomorphic to
! This is because choosing a basis of
is equivalent to choosing two of its non-identity points, and it gives the connection between this picture of elliptic curves and the one we described above.
So the story is as follows. Given an elliptic curve with
spanned by
, the ordered basis
determines a Legendre form
, and
is invariant under the action of
. This gives a natural map
which, if you believe everything I’ve said so far, is a bijection. Quotienting further by the action of then gives the action of
on
we saw earlier. In addition, it turns out that
is meromorphic as a function on
and well-behaved in the technical sense necessary for it to be a modular function of weight
with respect to
, and in fact it generates all such modular functions.
An interesting consequence of everything I’ve said so far: the action of on
satisfies all of the technical conditions required for the above map to be a covering map. (This is not the case with the action of
.) Among Riemann surfaces,
is very special because it is one of the three simply connected Riemann surfaces, which then implies that it is the universal cover of
and that
is isomorphic to the fundamental group of
. But the fundamental group of the plane minus two points is none other than the free group on two generators! (In fact, if I’m not mistaken, these generators can be taken to be
and its transpose.)
I’ve been learning all of this stuff through 18.784, the number-theory seminar I’m currently taking with Scott Carnahan on modular forms. The sheer number of different mathematical disciplines related to the subject is mind-boggling; I haven’t even mentioned the hyperbolic metric on and how
also happens to be its group of hyperbolic isometries, or about how all this works over fields smaller than
and the relationship to number theory, or about the relationship to theta functions of lattices… if you’d like some references, I’ll be adding them to the “Bibliography” page on the left.
[…] Posts AboutBibliographyThe definition of a functionFractional linear transformations and elliptic curvesSuggestions and RequestsNationals 2010GILA I: Group actions and equivalence relationsGraded […]
Well, regardless, it’s very neat stuff.
This is very interesting. I’m currently involved in research relating to hyperbolic geometry, and isometries (formerly) in the upper-half plane (PSL(2,R)). I’ve started working in SU(1,1) since then, but it’s nice to see other people out there doing work here.
This isn’t “work” so much as “exposition of classical material,” but thanks!
It’s amazing how far you’ve related the stuff in 18.784 to. 🙂 I didn’t recognize anything at first glance.