Archive for November, 2009

The Jacobi-Trudi identities

November 20, 2009

Today I’d like to introduce a totally explicit combinatorial definition of the Schur functions. Let \lambda \vdash n be a partition. A semistandard Young tableau T of shape \lambda is a filling of the Young diagram of \lambda with positive integers that are weakly increasing along rows and strictly increasing along columns. The weight of a tableau T is defined as \mathbf{x}^T = x_1^{T_1} x_2^{T_2} ... where T_i is the total number of times i appears in the tableau.

Definition 4: \displaystyle s_{\lambda}(x_1, x_2, ...) = \sum_T \mathbf{x}^T

where the sum is taken over all semistandard Young tableaux of shape \lambda.

As before we can readily verify that s_{(k)} = h_k, s_{(1^k)} = e_k. This definition will allow us to deduce the Jacobi-Trudi identities for the Schur functions, which describe among other things the action of the fundamental involution \omega. Since I’m trying to emphasize how many different ways there are to define the Schur functions, I’ll call these definitions instead of propositions.

Definition 5: \displaystyle s_{\lambda}= \det(h_{\lambda_i+j-i})_{1 \le i, j \le n}.

Definition 6: \displaystyle s_{\lambda'} = \det(e_{\lambda_i+j-i})_{1 \le i, j \le n}.

Read the rest of this entry »

The Lindstrom-Gessel-Viennot lemma

November 17, 2009

One of my favorite results in algebraic combinatorics is a surprisingly useful lemma which allows a combinatorial interpretation of the determinant of certain integer matrices. One of its more popular uses is to prove an equivalence between three other definitions of the Schur functions (none of which I have given yet), but I find its other applications equally endearing.

Let G be a locally finite directed acyclic graph, i.e. it has a not necessarily finite vertex set V with finitely many edges between each pair of vertices such that no collection of edges forms a cycle. For example, G could be \mathbb{Z}^2 with edges (x, y) \to (x, y + 1) and (x, y) \to (x + 1, y ), which we’ll denote the acyclic plane. Assign a weight w(e) to each edge and assign to a path the product of the weights of its edges. Given two vertices a, b let e(a, b) denote the sum of the weights of the paths from a to b. Hence even if there are infinitely many such paths this sum is well-defined formally, and if there are only finitely many paths between two vertices then setting each weight to 1 gives a well-defined non-negative integer.

Let a_1, ... a_n and b_1, ... b_n be a collection of vertices called sources and vertices called sinks. We are interested in n-tuples of paths, hereafter to be referred to as n-paths, sending each source to a distinct sink. Let \mathbf{M} be the n \times n matrix such that \mathbf{M}_{ij} = e(a_i, b_j). Then the permanent of \mathbf{M} counts the number of n-paths, but this is not interesting as permanents are hard to compute.

A n-path is called non-intersecting if none of the paths that make it up share a vertex; in particular, each a_i is sent to distinct b_i. A non-intersecting path determines a permutation \pi of the vertices; let the sign of a non-intersecting n-path be the sign of this permutation.

Lemma (Lindstrom, Gessel-Viennot): \det \mathbf{M} is the signed sum of the weights of all non-intersecting n-paths.

Corollary: If the only possible permutation is \pi = 1 (i.e. G is non-permutable), then \det \mathbf{M} is the sum of the weights of all non-intersecting n-paths.

Read the rest of this entry »

Groups vs. abelian groups

November 16, 2009

A few weeks ago on MathOverflow Greg Muller asked, “why do groups and abelian groups feel so different?” The answers were very interesting and came from several different perspectives, but I still don’t feel as if the question was resolved. So I’ll try to synthesize and summarize some of the answers and hopefully something will be clearer in the end.

Read the rest of this entry »

The many faces of Schur functions

November 15, 2009

The last time we talked about symmetric functions, I asked whether the vector space \mathcal{R} could be turned into an algebra, i.e. equipped with a nice product. It turns out that the induced representation allows us to construct such a product as follows:

Given representations \rho_1, \rho_2 of S_n, S_m, their tensor product \rho_1 \otimes \rho_2 is a representation of the direct product S_n \times S_m in a natural way. Now, S_n \times S_m injects naturally into S_{n+m}, which gives a new representation

\rho = \text{Ind}_{S_n \times S_m}^{S_{n+m}} \rho_1 \otimes \rho_2.

The character of this representation is called the induction product \rho_1 * \rho_2 of the characters of \rho_1, \rho_2, and with this product \mathcal{R} becomes a graded commutative algebra. (Commutativity and associativity are fairly straightforward to verify.) It now remains to answer the first question: does there exist an algebra homomorphism \phi : \Lambda \to \mathcal{R}? And can we describe the inner product on \Lambda coming from the inner product on \mathcal{R}?

To answer this question we’ll introduce perhaps the most important class of symmetric functions, the Schur functions s_{\lambda}.

N.B. I’ll be skipping even more proofs than usual today, partly because they require the development of a lot of machinery I haven’t described and partly because I don’t understand them all that well. Again, good references are Sagan or EC2.

Read the rest of this entry »

Whoops!

November 8, 2009

I seem to have broken my MaBloWriMo streak. I hope you’ll believe me when I say it was impossible for me to get a post up yesterday. Unfortunately, the rest of this week looks just as hairy (for completely different reasons), so I’m going to have to take a break. Here’s where we’re headed once I have time again:

I’m going to skip a lot of background at some point and just introduce several equivalent definitions of the Schur functions. My hope is that stating some of the important results of the theory, even without proof, will be enough to get other people interested in symmetric function theory. I also get a lot of material to go back to and flesh out in subsequent posts, since I haven’t gone through most of the proofs of the basic results very thoroughly.

After that, I want to meander slowly through parts of basic algebraic number theory and algebraic geometry. My goal here is to thoroughly understand the classical analogy between rings of integers in number fields and nonsingular affine algebraic curves. Since several bloggers have covered much of this material in some form already, I’ll try to link to other posts I’m aware of, but I’ll have to repeat some things because I want to motivate every definition I need.

Since I don’t have anything else to say at the moment, let’s make this post an open thread and we’ll see if the Scott Aaronson style of blogging works for me. General comments, questions, suggestions, requests, etc. welcome!

Set-multiset duality and supervector spaces

November 6, 2009

Recall that the elementary symmetric functions e_{\lambda} generate the ring of symmetric functions \Lambda as a module over any commutative ring R. A corollary of this result, although I didn’t state it explicitly, is that the elementary symmetric functions e_1, e_2, ... are algebraically independent, hence any ring homomorphism from the symmetric functions is determined freely by the images of e_1, e_2, ... .

A particularly interesting choice of endomorphism \omega : \Lambda \to \Lambda occurs when we set \omega(e_n) = h_n. This endomorphism sends the generating function E(t) for the elementary symmetric polynomials to the generating function H(t) = \frac{1}{E(-t)}, and this is an involution – in other words, it follows that \omega(h_n) = e_n and \omega itself must also be an involution on the ring of symmetric functions. Thus the elementary symmetric functions and complete homogeneous symmetric functions are dual in a very strong sense. This is closely related to the identity

\displaystyle \left( {n \choose d} \right) = (-1)^d {-n \choose d}

and today I’d like to try to explain one interpretation of what’s going on that I learned from Todd Trimble, which is that “the exterior algebra is the symmetric algebra of a purely odd supervector space.”

Read the rest of this entry »

I don’t trust uncountable sets

November 5, 2009

I have a mathematical confession: I don’t trust uncountable sets.

Some time ago on MathOverflow somebody asked what a reasonable definition of “infinite permutation” would be. The first answer that comes to mind is a bijection \mathbb{N} \to \mathbb{N}. The set of all such bijections does form a group, but not only is it uncountably generated, it contains, as Darsh observes, a copy of every countably generated group (acting on itself by left multiplication). In particular it contains a copy of the free group on countably many generators. It also doesn’t seem to carry any natural kind of topology.

On the other hand, a much nicer candidate is the set of “compactly supported” permutations, i.e. those which fix all but finitely many elements. This countable group S_{\infty} is generated by transpositions and therefore has a neat presentation given by the usual relations. I believe it’s also the largest locally finite subgroup of the full group of bijections.

I find this group much more philosophically appealing than the full group of bijections, and the reason is simple: each element of the group is computable. On the other hand, only countably many elements of the full group of bijections \mathbb{N} \to \mathbb{N} are computable: the rest can’t be written down by a Turing machine. And I don’t trust anything that can’t be written down by a Turing machine.

Corollary: I don’t trust the real numbers.

Instead of explaining what I mean by this, which I don’t think I have time for today, I’ll just throw a question out to the audience: how do you feel about all this?

Newton’s sums, necklace congruences, and zeta functions II

November 4, 2009

In a previous post I gave essentially the following definition: given a discrete dynamical system, i.e. a space X and a function f : X \to X, and under the assumption that f^n has a finite number of fixed points for all n, we define the dynamical zeta function to be the formal power series

\displaystyle \zeta_X(t) = \exp \sum_{n \ge 1} (\text{Fix } f^n) \frac{x^n}{n}.

What I didn’t do was motivate this definition, mostly because I hadn’t really worked out the motivation myself. Now that we have an important special case worked out, we can discuss the general case, which will give a purely combinatorial proof of the second half of the Newton-Girard identities.

Read the rest of this entry »

The cyclotomic identity and Lyndon words

November 3, 2009

In number theory there is a certain philosophy that \mathbb{F}_q[x] is a good toy model for the integers \mathbb{Z}. The two rings share an important property: they are basically the canonical examples of Euclidean domains, hence PIDs, hence UFDs. However, many number-theoretic questions involving prime factorization over \mathbb{F}_q[x] are much easier than their corresponding questions over \mathbb{Z}. One way to see this is to look at their zeta functions.

The usual zeta function \zeta(s) = \sum_{n \ge 1} \frac{1}{n^s} reflects the structure of prime factorization through its Euler product

\displaystyle \zeta(s) = \prod_p \frac{1}{1 - p^{-s}}

where the product runs over all primes; this is essentially equivalent to unique factorization. Since we know that monic polynomials over \mathbb{F}_q[x] also enjoy unique factorization, it’s natural to ask whether there’s a sum over monic polynomials that would give a similar Euler product.

In fact, there is such a product, and investigating it leads naturally to a seemingly unrelated subject: the combinatorics of words.

Read the rest of this entry »

Young’s lattice

November 2, 2009

As we saw last time, a sequence of nested groups gives rise to a graded “poset” in which objects can be related by more than one arrow. The poset we want to construct now is given by the sequence S_0 \le S_1 < S_2 < S_3 of symmetric groups (where S_0 and S_1 are both the trivial group). Since irreducible representations of the symmetric groups are parameterized by Young diagrams, this gives the set of Young diagrams a graded structure where the elements of rank n are the Young diagrams with n boxes.

It turns out that this “poset” is a genuine poset; in other words, in the restriction of an irreducible representation of S_n to an irreducible representation of S_{n-1}, the resulting representation contains each irreducible representation at most once. In fact, much more is true. In a poset we write \lambda \triangleright \mu if \lambda > \mu and there is no \lambda' with \lambda > \lambda' > \mu; this relation is called covering. So elements of rank n cover elements of rank n-1.

Theorem (Branching Rule): \lambda \triangleright \mu if and only if \mu is obtained from \lambda by removing a box.

This is the remarkable theorem that really justifies the choice of Young diagrams as a natural description of the irreducible representations of the symmetric group. Today we’ll explore some of the consequences of this theorem.

Read the rest of this entry »