GILA V: The Polya enumeration theorem and applications

June 21, 2009

I ended the last post by asking whether the proof of baby Polya extends to the multi-parameter setting where we want to keep track of how many of each color we use. In fact, it does. First, we should specify what exactly we’re trying to compute. Recall the setup: we have colors 1, 2, ... n (represented by variables r_1, r_2, ... r_n), and we have a set of slots S with |S| = m acted on by a group G where each slot will be assigned a color. Define f_G(t_1, ... t_n) to be the number of orbits of functions S \to \{ 1, 2, ... n \} under the action of G where color i is used t_i times. (Since the action of G only permutes slots, it doesn’t change the multiset of colors used.) What we want to compute is the generating function

\displaystyle F_G(r_1, r_2, ... r_n) = \sum_{t_1 + ... + t_n = m} f_G(t_1, t_2, ... t_n) r_1^{t_1} r_2^{t_2} ... r_n^{t_n}.

Note that setting r_1 = r_2 = .... = r_n = 1 we recover F_G(n), which doesn’t contain information about particular color combinations. By the orbit-counting lemma, this is equivalent to computing

\displaystyle \frac{1}{|G|} \sum_{g \in G} | \text{Fix}(g) |(r_1, r_2, ... r_n)

where we must now count fixed points in a weighted manner, recording the multiset of colors in each fixed point. How do we go about doing this?

Again, the fixed points of a given permutation g acting on functions S \to \{ 1, 2, ... n \} are precisely the functions constant (that is, monochromatic) on each cycle of g. Writing down each such fixed point in terms of the number of colors used then requires that we know how many cycles of each size there are in the cycle decomposition of g. To that end, we’ll define the cycle index monomial

\displaystyle z_1^{c_1(g)} z_2^{c_2(g)} ...

where c_k(g) denotes the number of cycles of size k of g and z_k is the corresponding formal variable; note that we require the condition that \sum_k kc_k(g) = m. Now we can count fixed points as follows: the only ways to color a cycle of size k are to color each of the k slots the same color, hence what we need to do is to make the substitution z_k = r_1^k + r_2^k + ... + r_n^k. This is the important step! Each term represents a different color with which we can color every slot in the same cycle. We take to the power of c_k(g) and multiply because we can color each cycle independently. Again, note that when r_1 = r_2 = ... = r_n = 1 we recover the same argument we used to prove baby Polya.

And we’re done! What we’ve proven is the following. Define the cycle index polynomial of the group action of G on S to be the average of the cycle index monomials, or

\displaystyle Z_G(z_1, z_2, ... ) = \frac{1}{|G|} \sum_{g \in G} z_1^{c_1(g)} z_2^{c_2(g)} ... .

Then the following holds.

Polya’s enumeration theorem: The generating function F_G(r_1, r_2, ... r_n) is equal to
Z_G(r_1 + ... + r_n, r_1^2 + ... + r_n^2, r_1^3 + ... + r_n^3, ...).

Again, to appreciate this result we’ll work through what it says about cyclic groups and symmetric groups. It turns out the PET has quite a lot to say on these two subjects alone!

Balls and urns

As before, here the symmetry group is G = S_m. We want to count the number of ways to assign one of n urns to each of m indistinguishable balls indexed by the multiset of number of times each urn is used; however, since we’re using the full symmetry group, the multiset of times each urn is used uniquely determines the orbit we’re in. In other words, f_G(t_1, t_2, ... t_n) = 1 for every choice of t_1 + t_2 + ... + t_n = m. We already knew this, of course, but the really nice thing here is that the full generating function

\displaystyle F_{S_m}(r_1, r_2, ... r_n) = \sum_{t_1 + ... + t_n = m} r_1^{t_1} r_2^{t_2} ... r_n^{t_n}

is the complete homogeneous symmetric polynomial h_m(r_1, ... r_n). Summing over all m we obtain the identity mentioned in the exercises of the last post,

\displaystyle \sum_{m \ge 0} F_{S_m} t^m = \prod_{i=1}^{m} \frac{1}{1 - r_i t}.

This identity occurs because the choice of a term in each factor 1 + r_i t + r_i^2 t^2 + ... uniquely determines the number of times color i appears in the corresponding coloring. Note what happens, again, when r_1 = r_2 = ... = r_n = 1. What makes this result even nicer is that it implies that if we can express this generating function in terms of the power symmetric functions p_1, p_2, ... , we can compute the cycle index polynomials of the symmetric groups all at once!

Recall that the definition of the power symmetric functions is p_k = \sum_{i=1}^{n} r_i^k; then the PET implies that

\displaystyle \prod_{i=1}^{n} \frac{1}{1 - r_i t} = \sum_{m \ge 0} Z_{S_m}(p_1, p_2, ... ) t^m.

So how do we express this generating function in terms of the power symmetric functions? The answer is very elegant: we simply write

\displaystyle \frac{1}{1 - r_i t} = \exp \ln \frac{1}{1 - r_i t}  = \exp \left( r_i t + \frac{r_i^2 t^2}{2} + \frac{r_i^3 t^3}{3} + ... \right)

and, multiplying the factors together, this implies the beautiful identity

\displaystyle \sum_{m \ge 0} Z_{S_m} t^m = \exp \left( z_1 t + \frac{z_2 t^2}{2} + \frac{z_3 t^3}{3} + ... \right).

I have a lot to say about this identity, but it deserves its own separate post.

Necklaces

The necklace-counting result extends in a rather straightforward way to this new regime; the argument we used before immediately implies that

\displaystyle Z_{C_m} = \frac{1}{m} \sum_{d | m} z_{m/d}^d \, \varphi \left( \frac{m}{d} \right)

and therefore that

\displaystyle F_{C_m} = \frac{1}{m} \sum_{d | m} \left( r_1^{m/d} + ... + r_n^{m/d} \right)^d \, \varphi \left( \frac{m}{d} \right).

As promised, we’re going to use this to prove a special case of Lucas’ theorem. First, we’re going to let m = ap where p is a specified prime. Second, we’re going to restrict to a subgroup of C_m. This is the first time we’re going to need to distinguish between isomorphic groups and non-isomorphic group actions, so I’ll denote by C_{a, p} the action of the subgroup of C_m given by the a^{th} powers of the elements of C_m. This is a cyclic group of order p, so again it has the nice property that orbits are either fixed points or full. We’re also going to restrict to the case of two colors again and we’re going to set r_1 = 1, r_2 = q because we lose no information in this way. The above then reduces to

\displaystyle F_{C_{a, p}} = \frac{ (1 + q^p)^a (p-1) + (1 + q)^{ap} }{p}.

The coefficient of q^k in this generating function counts the number of orbits of necklaces that use k of one color and ap - k of the other. By the binomial theorem, this coefficient is

\displaystyle \frac{ {ap \choose k} q^k }{p}

if k isn’t a multiple of p – every element is in a “full” orbit in this case – or if k = bp, the coefficient is

\displaystyle \frac{ {a \choose b} q^{bp} (p-1) + {ap \choose bp} q^{bp} }{p}

hence {ap \choose bp} \equiv {a \choose b} \bmod p as desired. One can duplicate the above argument using the properties of the Frobenius map, but as we’ve discussed we already know that the properties of the Frobenius map are intimately related to the group actions of C_p.

Repeatedly applying the Frobenius map gives a proof of the full Lucas’ theorem, and there is a corresponding combinatorial proof given on the Wikipedia article that fits nicely into this framework; instead of considering a cyclic group of prime order we consider an appropriate direct product of cyclic groups. “Action on subsets” is just another way of saying “action on 2-colorings.”

Young diagrams in a box

This is an example I should’ve mentioned earlier, but the poset L(m, n) of Young diagrams fitting inside a m \times n box can be discussed in terms of Polya theory.

The set of slots S is an m \times n box, and the number of colors is two. One color will denote that a box is filled, while another color will denote that it’s empty. This gives a total of 2^{mn} functions, some of which are Young diagrams. What we want to show is that there exists a group G acting on an m \times n box such that each orbit contains exactly one Young diagram.

The group can be described as follows: it consists of m copies of S_n, each of which acts to permute the entries in each row, but we’re also allowed to permute the rows themselves. In other words, a copy of S_m is acting on the copies of S_n. This construction goes by the name of the wreath product S_n \wr S_m. It’s a rather large group, since by construction it has size m! (n!)^m. The orbits of its action on the functions S \to \{ \text{empty}, \text{filled} \} can be uniquely identified with Young diagrams as follows: first sort the entries in each row so that the filled entries are left-justified, then sort the rows by the number of filled entries they have. This gives a unique Young diagram for each orbit since the multiset of number of filled entries in each row is invariant under the action of S_n \wr S_m. The PET, together with the result we’ve already proved, implies that

\displaystyle Z_{S_n \wr S_m}(1 + q, 1 + q^2, ...) = {m+n \choose m}_q.

I don’t, however, have a good idea of how to compute the cycle index polynomial of S_n \wr S_m, and this result is clearly not good enough to do so, since the above identity cannot distinguish between S_n \wr S_m and S_m \wr S_n even though the two groups are very different.

If this isn’t a good way to actually compute {m+n \choose n}_q, then what can we do with this realization of Young diagrams via a group action? It turns out there are some rather interesting general theorems in this area; see Stanley’s Lectures in Algebraic Combinatorics section 5.

A generalization

The set of colors in the setup of Polya’s theorem can be succinctly described by the generating function r_1 + r_2 + ... + r_n, one term for each color. There is a generalization of Polya’s theorem that allows us to replace this generating function by an arbitrary generating function g(r_1, ... r_n), hence to consider objects more general than a set of distinct colors. Although the proof is essentially the same, I think the proper setting for discussing this generalization is the theory of species. Nevertheless, for the sake of completeness, the more general theorem states that F_G is equal to

\displaystyle Z_G(g(r_1, ... r_n), g(r_1^2, ... r_n^2), g(r_1^3, ... r_n^3), ...).

The value of the generalization is that we can fill the slots with something “fatter” than a single color: we could put, for example, a sequence of colors in, or a graph. That allows us to count more general objects and to use more general combinatorial parameters.

Suppose, for example, that we wanted to compute the generating function for “spindles” – necklaces with a path attached to every vertex – up to cyclic symmetry and indexed by the total number of vertices. The group action is C_m as before, but the “colors” are non-empty sequences, which have the generating function \frac{x}{1 - x}. The generalized PET then tells us that the generating function for spindles with fixed necklace size is

\displaystyle \frac{1}{m} \sum_{d | m} \varphi \left( \frac{m}{d} \right) \left( \frac{x^{m/d}}{1 - x^{m/d}} \right)^d.

This is by no means an obvious result. Applying the generalized PET to the symmetric groups gives the general form of the multiset construction, but again these issues will have to wait until another post.

Exercises

Compute the cycle index polynomials of the group acting on the complete binary tree of depth d generated by the symmetries that flip the two children of any vertex, at least for some small d. (I don’t know if there’s a nice answer in general; this is an example of an iterated wreath product.)

Let I_m denote the number of involutions on m elements. Show, using the cycle index polynomials of the symmetric groups, that

\displaystyle \sum_{m \ge 0} \frac{I_m}{m!} t^m = e^{t + \frac{t^2}{2} }.

This implies both a summation formula and a recurrence relation for I_m via differentiation. Prove both by a direct argument. Can you generalize?

Tags: , , , , , , ,

6 Responses to “GILA V: The Polya enumeration theorem and applications”


  1. [...] of symmetric functions, which generalizes some ideas that came up in the previous discussion of Polya theory, can be motivated by thinking about polynomial functions of the roots of a monic polynomial . [...]

  2. Ajat Says:

    Hi there, can you direct me to the reference of the generalization of Polya’s Enumeration Theorem?
    I need it for the verification of a paper..

    thanks

    best regards

    ajat

  3. Ajat Says:

    Hi again, Thanks for the pointer!
    I wonder that if there exists a cycle index in rational function form..
    take a look of this paper http://www.springerlink.com/index/X558J42T8K1615G6.pdf, page 344 to 345

    The two-connected components generating function has fraction on it!

    i wonder if it comes from the rational style of the cycle-index “polynomial”.


Leave a Reply