⤺ Back to Posts

Frobenius Theorem

Mounir Lbath, March 2024

Table of Contents

  1. The Frobenius Theorem
  2. Representations of a Finite Group
  3. Characters
  4. Induced Representations
  5. Proof of the Frobenius Theorem
  6. Appendix: Useful Prerequisites

Introduction

The theory of representations of finite groups, discovered by Frobenius toward the end of the 19th century, consists in analyzing the properties and structure of a finite group (G,)(G, \cdot) by making it act linearly on a vector space. In this post, we use representation theory to prove a theorem about Frobenius groups. In the appendix, we gather some prerequisites that we will use without further mention in this text. This work was originally a TIPE (Travail d'Initiative Personnelle Encadré) for the Ecole Normale Supérieure and was given the grade 18/20.


The Frobenius Theorem

Definition 1.
Let (G,)(G, \cdot) be a finite group with identity ee and HH a proper subgroup of GG (i.e. H{e}H \neq \{e\} and HGH \neq G). We say that GG is a Frobenius group, that HH is a Frobenius complement, and that (G,H)(G,H) is a Frobenius pair if

gGH,gHg1H={e}.\forall\, g\in G\setminus H,\quad gHg^{-1}\cap H=\{e\}.

Note that this is equivalent to

(x,y)G2,y1xH    xHx1yHy1={e}.\forall\, (x,y)\in G^2,\quad y^{-1}x\notin H\implies xHx^{-1}\cap yHy^{-1}=\{e\}.

Intuitively, this means that the conjugates of HH cover GG "as much as possible."

Proposition 1.
If (G,H)(G,H) is a Frobenius pair, then

gGgHg1=G+1GH.\Bigl|\bigcup_{g\in G} gHg^{-1}\Bigr| = |G|+1-\frac{|G|}{|H|}.

Proof.
We have

gGgHg1={e}gHG/H(gHg1{e})=1+G/H(H1).\Bigl|\bigcup_{g\in G} gHg^{-1}\Bigr| = \Bigl|\{e\}\,\sqcup\,\bigsqcup_{gH\in G/H}\Bigl(gHg^{-1}\setminus\{e\}\Bigr)\Bigr| = 1 + |G/H|(|H|-1). \square

One can show that for any subgroup HH the above equality is in fact an inequality ()(\leq). It also shows that the conjugates of a proper subgroup never cover GG (recall that GG is finite, otherwise G=GLn(C)G=GL_n(\mathbb{C}) and H=Tn(C)GLn(C)H=T_n(\mathbb{C})\cap GL_n(\mathbb{C}) the group of invertible triangular matrices produce a counter-example).

Example 1.
Let GG be a finite group acting transitively on a set XX such that for every gG{e}g\in G\setminus\{e\}, gg fixes at most one point of XX. Then, if αX\alpha\in X is such that Stab(α){e}\operatorname{Stab}(\alpha)\neq\{e\}, the pair (G,Stab(α))(G,\operatorname{Stab}(\alpha)) is a Frobenius pair.

Proof.
We have Stab(α)G\operatorname{Stab}(\alpha)\neq G (by transitivity). If gGStab(α)g\in G\setminus\operatorname{Stab}(\alpha), then gααg\cdot\alpha\neq\alpha; hence gαg\cdot\alpha and α\alpha are simultaneously fixed only by ee. Therefore, Stab(gα)Stab(α)={e}\operatorname{Stab}(g\cdot\alpha)\cap \operatorname{Stab}(\alpha)=\{e\}, i.e. gStab(α)g1Stab(α)={e}g\,\operatorname{Stab}(\alpha)\,g^{-1}\cap\operatorname{Stab}(\alpha)=\{e\} \square.

One can in fact show that every Frobenius group is of this form. Here are some examples:

Example 2.

  • If KK is a finite field and G=Aff(1,K)={τa,b:xKax+b, (a,b)K×K}G=\operatorname{Aff}(1,K)=\{\tau_{a,b}: x\in K\mapsto ax+b,\ (a,b)\in K^*\times K\} is the group of affine bijections of KK onto itself, and H(K)={τa,0:xKax, aK}H(K)=\{\tau_{a,0}: x\in K\mapsto ax,\ a\in K^*\} (the subgroup of those that fix 00), then (G,H)(G,H) is a Frobenius pair. (Note: τa,b\tau_{a,b} fixes exactly b/(1a)b/(1-a) for a1a\neq1; fixes no point of KK for a=1a=1 and b0b\neq0; and fixes all of KK for a=1a=1 and b=0b=0 — i.e. for τa,b=id\tau_{a,b}=\mathrm{id}; finally, H(K)=Stab(0)H(K)=\operatorname{Stab}(0).)
  • If G=A4G=A_4 (the alternating group) and γ\gamma is a 3-cycle, then (G,γ)(G,\langle \gamma\rangle) is a Frobenius pair.
  • If nNn\in\mathbb{N}^* is odd, DnD_n is the dihedral group (the group of affine transformations of the plane preserving the regular nn-gon centered at the origin), and ss is an orthogonal symmetry in DnD_n, then (Dn,s)(D_n,\langle s\rangle) is a Frobenius pair.

Theorem 1 (Frobenius).
Let (G,H)(G,H) be a Frobenius pair. Define

N=(G(gGgHg1)){e},N=\Bigl(G\setminus\bigl(\bigcup_{g\in G}gHg^{-1}\bigr)\Bigr)\cup\{e\},

called the Frobenius kernel. Then NN is a normal subgroup of GG, and NH={e}N\cap H=\{e\}, and G=NHG=NH (we say that GG is the semidirect product of NN by HH).

The fact that NH={e}N\cap H=\{e\} is evident; moreover, by Proposition 1,

N=GgGgHg1+1=GH.|N|=|G|-\Bigl|\bigcup_{g\in G}gHg^{-1}\Bigr|+1=\frac{|G|}{|H|}.

Assuming that NN is a subgroup of GG, the map

φ:(n,h)N×Hnh\varphi: (n,h)\in N\times H\mapsto nh

is injective (if nh=nhnh=n'h' then n1n=hh1NH={e}n'^{-1}n=h'h^{-1}\in N\cap H=\{e\}, hence (n,h)=(n,h)(n,h)=(n',h')) and, by cardinality, surjective; thus, G=NHG=NH. Moreover, if gGg\in G, then gNg1NgNg^{-1}\subseteq N (otherwise an element of N{e}N\setminus\{e\} would be conjugate to an element of HH, contradicting the definition of NN); hence, NN is normal. The difficult part of the theorem lies in proving that NN is a subgroup. This will be demonstrated in Section 5.

Example 3.
Revisiting the pairs from Example 2 in the “group action” formulation of Example 1, the Frobenius kernel is the set of elements of GG acting without fixed points, to which the identity ee is added. Thus:

  • The kernel of (A4,γ)(A_4,\langle \gamma\rangle) is the Klein group V={id,(12)(34),(13)(24),(14)(23)}.V=\{\mathrm{id},\,(1\,2)(3\,4),\,(1\,3)(2\,4),\,(1\,4)(2\,3)\}.
  • The kernel of (Aff(1,K),H(K))(\operatorname{Aff}(1,K),H(K)) is the subgroup {τ1,b:bK}\{\tau_{1,b}:b\in K\} of translations in Aff(1,K)\operatorname{Aff}(1,K).
  • The Frobenius kernel of DnD_n is the group of rotations centered at the origin by an angle in 2πnZ\frac{2\pi}{n}\mathbb{Z} (naturally isomorphic to UnU_n).

Representations of a Finite Group

To prove the Frobenius theorem, we will try to express NN as the kernel of a representation.

Definition 2.
Let (G,)(G,\cdot) be a finite group. A representation of GG is a pair (ρ,V)(\rho,V) where VV is a finite-dimensional C\mathbb{C}-vector space and ρ\rho is a homomorphism from GG to GL(V)\operatorname{GL}(V). We will sometimes denote it simply by ρ\rho. The degree of ρ\rho is the integer dim(V)\dim(V).

Thus, we view the elements of GG as linear transformations on VV.

Example 4.

  • Representations of degree 1 correspond to homomorphisms from GG to C\mathbb{C}^*. In particular, the trivial representation is the one corresponding to the trivial homomorphism from GG to C\mathbb{C}^*.
  • A more elaborate example: if (fg)gG(f_g)_{g\in G} is a basis of a vector space VV of dimension G|G| indexed by GG, and for each gGg\in G we define ρ(g)\rho(g) as the unique element of GL(V)\operatorname{GL}(V) such that hG,ρ(g)(fh)=fgh,\forall\, h\in G,\quad \rho(g)(f_h)=f_{gh}, then we have defined a representation of GG called the regular representation. Moreover, it is said to be faithful because ker(ρ)={e}\ker(\rho)=\{e\}.
  • If (ρ,V)(\rho,V) and (ρ,V)(\rho',V') are two representations of GG, then by setting, for gGg\in G, one defines a representation on the space L(V,V)L(V,V') (the space of linear maps from VV to VV') by uL(V,V),(ρ,ρ~)(g)(u)=ρ(g)uρ(g1).\forall\, u\in L(V,V'),\quad (\widetilde{\rho,\rho'})(g)(u)=\rho'(g)\circ u\circ \rho(g^{-1}).
  • Let (ρ,V)(\rho,V) and (ρ,V)(\rho',V') be as above. Then one defines a representation (ρρ,VV)(\rho\oplus\rho',\,V\oplus V') by (g,v,v)G×V×V,(ρρ)(g)((v,v))=(ρ(g)(v),ρ(g)(v)).\forall\,(g,v,v')\in G\times V\times V',\quad (\rho\oplus\rho')(g)((v,v'))=(\rho(g)(v),\,\rho'(g)(v')). This construction also works for direct sum decompositions in which ρ\rho and ρ\rho' are defined: in a basis adapted to this decomposition, the matrices of (ρρ)(g)(\rho\oplus\rho')(g) will consist of two square diagonal blocks of sizes dim(V)\dim(V) and dim(V)\dim(V').

Definition 3.
If (ρ,V)(\rho,V) and (ρ,V)(\rho',V') are two representations of GG, we say that ρ\rho is equivalent to ρ\rho' (and write ρρ\rho\sim\rho') if dim(V)=dim(V)\dim(V)=\dim(V') and if there exist bases ee of VV and ee' of VV' such that

gG,Mat(ρ(g))=Mat(ρ(g)).\forall\, g\in G,\quad \operatorname{Mat}(\rho(g))=\operatorname{Mat}(\rho'(g)).

In other words, if there exists an isomorphism uL(V,V)u\in L(V,V') such that

gG,uρ(g)=ρ(g)u.\forall\, g\in G,\quad u\circ \rho(g)=\rho'(g)\circ u.

Moreover, for any ρ,ρ\rho,\rho' we denote

Hom(ρ,ρ)={uL(V,V)gG,  uρ(g)=ρ(g)u},\operatorname{Hom}(\rho,\rho')=\{u\in L(V,V')\mid \forall\, g\in G,\; u\circ \rho(g)=\rho'(g)\circ u\},

which is clearly a subspace of L(V,V)L(V,V').

Definition 4.
A representation ρ\rho of GG is said to be irreducible if the only subspaces of VV that are stable under all ρ(g)\rho(g) for gGg\in G are {0}\{0\} and VV.

These representations cannot be “broken” into subrepresentations; this is the case, for example, for representations of degree 1. In fact, we have the following proposition:

Proposition 2 (Maschke).
Let (ρ,V)(\rho,V) be a representation of GG and UU a subspace of VV that is stable under all ρ(g)\rho(g) for gGg\in G. Then there exists a supplementary subspace SS of UU that is also stable under all ρ(g)\rho(g).

Proof.
It suffices to construct a projector with image UU that commutes with all ρ(g)\rho(g); its kernel will then be the desired supplementary subspace. Let pp be any projector with image UU. Modify it by averaging over GG:

q=1GgGρ(g)pρ(g1).q=\frac{1}{|G|}\sum_{g\in G}\rho(g)\circ p\circ \rho(g^{-1}).

This is an endomorphism whose image is contained in UU and such that qU=idUq|_U=\operatorname{id}_U (since the ρ(g)\rho(g) stabilize UU) and which commutes with all ρ(g)\rho(g) (because ρ(g)qρ(g1)=q\rho(g)\circ q\circ \rho(g^{-1})=q, as the map ghgg\mapsto hg is bijective). \square

By induction on the dimension of VV, one shows the following reduction result:

Proposition 3.
If (ρ,V)(\rho,V) is a representation, then there exists kNk\in\mathbb{N}^* and subspaces V1,V2,,VkV_1,V_2,\dots,V_k of VV such that

V=i=1kVi,V=\bigoplus_{i=1}^k V_i,

and there exist irreducible representations (ρ1,V1),,(ρk,Vk)(\rho_1,V_1),\dots,(\rho_k,V_k) such that

ρ=ρ1ρ2ρk.\rho=\rho_1\oplus\rho_2\oplus\cdots\oplus\rho_k.

Finally, here is one last proposition that will prove very useful:

Proposition 4 (Schur).
Let (ρ,V)(\rho,V) and (ρ,V)(\rho',V') be two irreducible representations of GG. Then dimHom(ρ,ρ)={1if ρρ,0if ρ≁ρ.\dim\operatorname{Hom}(\rho,\rho')= \begin{cases} 1 & \text{if } \rho\sim\rho', \\[1mm] 0 & \text{if } \rho\not\sim\rho'. \end{cases}

Proof.

  • Suppose ρρ\rho\sim\rho'. Up to transporting ρ\rho' via the isomorphism given by the equivalence, assume V=VV=V' and ρ=ρ\rho=\rho'. Let uHom(ρ,ρ)u\in\operatorname{Hom}(\rho,\rho). Since C\mathbb{C} is algebraically closed, uu has an eigenvalue λ\lambda. Then for all gGg\in G, uu and ρ(g)\rho(g) commute, hence the eigenspace Eλ(u)=ker(uλid)E_\lambda(u)=\ker(u-\lambda\,\operatorname{id}) is stable under ρ(g)\rho(g). By the irreducibility of ρ\rho, since Eλ(u)E_\lambda(u) is nonzero, it must equal VV, so that u=λidu=\lambda\,\operatorname{id}. Conversely, every map of the form λid\lambda\,\operatorname{id} (with λC\lambda\in\mathbb{C}) belongs to Hom(ρ,ρ)\operatorname{Hom}(\rho,\rho). Hence, Hom(ρ,ρ)=Cid\operatorname{Hom}(\rho,\rho)=\mathbb{C}\,\operatorname{id} has dimension 1.
  • Suppose ρ≁ρ\rho\not\sim\rho'. Let uHom(ρ,ρ)u\in\operatorname{Hom}(\rho,\rho'). For any gGg\in G, we have ρ(g)u=uρ(g)\rho'(g)\circ u=u\circ \rho(g). Thus, the image im(u)\operatorname{im}(u) is stable under ρ(g)\rho'(g). By irreducibility, im(u)\operatorname{im}(u) is either VV' or {0}\{0\}. Similarly, ker(u)\ker(u) is either VV or {0}\{0\}. Therefore, if uu is not an isomorphism, then ker(u)=V\ker(u)=V and im(u)={0}\operatorname{im}(u)=\{0\}, so that u=0u=0. This completes the proof. \square

Characters

A large part of the information contained in a representation is actually stored in its character:

Definition 5.
Let (ρ,V)(\rho,V) be a representation of GG. The character of ρ\rho is the function

χρ:GC,gTr(ρ(g)).\chi_\rho: G\to\mathbb{C},\quad g\mapsto \operatorname{Tr}(\rho(g)).

A character is any function in CG\mathbb{C}^G that is the character of some representation. We say that a character is irreducible if it comes from an irreducible representation.

Example 5.

  • χ(e)=Tr(id)\chi(e)=\operatorname{Tr}(\operatorname{id}) is the degree of the representation.
  • Two equivalent representations have the same character (i.e. the same matrices in well-chosen bases).
  • If g,hGg,h\in G and χ\chi is a character, then χ(ghg1)=Tr(ρ(g)ρ(h)ρ(g)1)=Tr(ρ(h))=χ(h).\chi(ghg^{-1})=\operatorname{Tr}\bigl(\rho(g)\rho(h)\rho(g)^{-1}\bigr)=\operatorname{Tr}(\rho(h))=\chi(h). We denote by Cent(G)\operatorname{Cent}(G) the subspace of central functions on GG (i.e. functions constant on the conjugacy classes of GG; its dimension is the number of such classes). Then χCent(G)\chi\in\operatorname{Cent}(G).
  • Trivially, χρ1ρ2=χρ1+χρ2\chi_{\rho_1\oplus\rho_2}=\chi_{\rho_1}+\chi_{\rho_2}.
  • For any gGg\in G, since gG=eg^{|G|}=e, the polynomial XG1=ωUG(Xω)X^{|G|}-1=\prod_{\omega\in U_{|G|}}(X-\omega) annihilates ρ(g)\rho(g) and splits with simple roots. Hence, ρ(g)\rho(g) is diagonalizable with spectrum contained in UGU_{|G|}. If {λ1,,λd}\{\lambda_1,\dots,\lambda_d\} is the spectrum of ρ(g)\rho(g), then {λ11,,λd1}\{\lambda_1^{-1},\dots,\lambda_d^{-1}\} is the spectrum of ρ(g)1=ρ(g1),\rho(g)^{-1}=\rho(g^{-1}), so that, since the eigenvalues lie in UGU_{|G|}, χ(g1)=λ11++λd1=λ1++λd=χ(g).\chi(g^{-1})=\lambda_1^{-1}+\cdots+\lambda_d^{-1}=\overline{\lambda_1+\cdots+\lambda_d}=\overline{\chi(g)}.
  • Using the notations of Example 4, one has χρ,ρ~=χρχρ,\chi_{\widetilde{\rho,\rho'}}=\chi_\rho\,\chi_{\rho'}, which is a consequence (via the previous point and the definition of (ρ,ρ~)(\widetilde{\rho,\rho'})) of the fact that for A,BMn(C)A,B\in M_n(\mathbb{C}), the linear map MMn(C)AMBMn(C)M\in M_n(\mathbb{C})\mapsto AMB\in M_n(\mathbb{C}) has trace Tr(A)Tr(B)\operatorname{Tr}(A)\operatorname{Tr}(B).

Definition/Proposition 1.
We endow CG\mathbb{C}^G with the Hermitian inner product (see Appendix) defined by

(u,v)CG×CG,u,v=1GgGu(g)v(g).\forall\,(u,v)\in\mathbb{C}^G\times\mathbb{C}^G,\quad \langle u,v\rangle=\frac{1}{|G|}\sum_{g\in G} u(g)v(g).

To compute the Hermitian inner product of two characters, we again use the “averaging technique”:

Lemma 1.
Let ρ\rho be a representation. Then

1GgGρ(g)\frac{1}{|G|}\sum_{g\in G}\rho(g)

is a projector onto

VG:=gGker(ρ(g)id).V_G:=\bigcap_{g\in G}\ker\Bigl(\rho(g)-\operatorname{id}\Bigr).

Proof.
If xVGx\in V_G, then

1GgGρ(g)(x)=1GgGx=x(1).\frac{1}{|G|}\sum_{g\in G}\rho(g)(x)=\frac{1}{|G|}\sum_{g\in G}x=x\quad (1).

Moreover, if xVx\in V and

y=1GgGρ(g)(x),y=\frac{1}{|G|}\sum_{g\in G}\rho(g)(x),

then for any hGh\in G

ρ(h)(y)=1GgGρ(hg)(x)=y,\rho(h)(y)=\frac{1}{|G|}\sum_{g\in G}\rho(hg)(x)=y,

(since the map ghgg\mapsto hg is bijective) hence yVGy\in V_G (2).

With (1) and (2), the result follows. \square

Proposition 5.
If (ρ,V)(\rho,V) and (ρ,V)(\rho',V') are two representations of GG, then

χρ,χρ=dimHom(ρ,ρ).\langle\chi_\rho,\chi_{\rho'}\rangle=\dim\operatorname{Hom}(\rho,\rho').

Proof.
Define (ρ,ρ~)(\widetilde{\rho,\rho'}) as in Example 4. Apply Lemma 1 to this representation.

On one hand,

gGker((ρ,ρ~)(g)idL(V,V))=Hom(ρ,ρ),\bigcap_{g\in G}\ker\Bigl((\widetilde{\rho,\rho'})(g)-\operatorname{id}_{L(V,V')}\Bigr)=\operatorname{Hom}(\rho,\rho'),

so that

Tr(1GgG(ρ,ρ~)(g))=dimHom(ρ,ρ),\operatorname{Tr}\Bigl(\frac{1}{|G|}\sum_{g\in G}(\widetilde{\rho,\rho'})(g)\Bigr)=\dim\operatorname{Hom}(\rho,\rho'),

(since for a projector its trace equals its rank). On the other hand, by linearity and Example 5,

Tr(1GgG(ρ,ρ~)(g))=1GgGχρ,ρ~(g)=1GgGχρ(g)χρ(g)=χρ,χρ.\operatorname{Tr}\Bigl(\frac{1}{|G|}\sum_{g\in G}(\widetilde{\rho,\rho'})(g)\Bigr)=\frac{1}{|G|}\sum_{g\in G}\chi_{\widetilde{\rho,\rho'}}(g) =\frac{1}{|G|}\sum_{g\in G}\chi_\rho(g)\chi_{\rho'}(g)=\langle\chi_\rho,\chi_{\rho'}\rangle.

Combining with Schur’s lemma (Proposition 4) completes the proof. \square

Corollary 1.
If (ρ,V)(\rho,V) and (ρ,V)(\rho',V') are two irreducible representations of GG, then

χρ,χρ={1if ρρ,0if ρ≁ρ.\langle\chi_\rho,\chi_{\rho'}\rangle= \begin{cases} 1 & \text{if } \rho\sim\rho',\\[1mm] 0 & \text{if } \rho\not\sim\rho'. \end{cases}

Thus, the family of irreducible characters is orthonormal and finite (since CG\mathbb{C}^G is finite-dimensional). Moreover, two non-equivalent irreducible representations have distinct characters; hence there are as many irreducible representations (up to equivalence) as there are irreducible characters.

Denote by ρ1,,ρr\rho_1,\dots,\rho_r a complete set of irreducible representations and by χ1,,χr\chi_1,\dots,\chi_r the associated characters. Combining with Proposition 3, we obtain:

Corollary 2.
If ρ\rho is a representation, then there exist (n1,,nr)Nr(n_1,\dots,n_r)\in\mathbb{N}^r such that

ρn1ρ1nrρr(where niρi denotes ρiρi (ni times)).\rho\sim n_1\rho_1\oplus\cdots\oplus n_r\rho_r\quad (\text{where } n_i\rho_i \text{ denotes } \rho_i\oplus\cdots\oplus\rho_i \text{ ($n_i$ times)}).

Then the character of ρ\rho is given by

χ=i=1rniχi.\chi=\sum_{i=1}^r n_i\chi_i.

Proposition 6.
If ρ\rho and ρ\rho' are two representations, then

ρρ    χρ=χρ.\rho\sim\rho'\iff\chi_\rho=\chi_{\rho'}.

Proof.
The direct implication is clear. Suppose χρ=χρ=:χ\chi_\rho=\chi_{\rho'}=: \chi. Write

ρn1ρ1nrρrandρn1ρ1nrρr.\rho\sim n_1\rho_1\oplus\cdots\oplus n_r\rho_r\quad \text{and}\quad \rho'\sim n'_1\rho_1\oplus\cdots\oplus n'_r\rho_r.

By orthonormality, χi,χρ=ni\langle\chi_i,\chi_\rho\rangle=n_i and similarly χi,χρ=ni\langle\chi_i,\chi_{\rho'}\rangle=n'_i. Thus, ni=nin_i=n'_i for each ii, and so ρρ\rho\sim\rho'. \square

One can in fact show a stronger result than the orthonormality of characters (we will not prove it here):

Proposition 7.
The irreducible characters χ1,,χr\chi_1,\dots,\chi_r form an orthonormal basis of Cent(G)\operatorname{Cent}(G). Thus, rr is the number of conjugacy classes in GG and if fCent(G)f\in\operatorname{Cent}(G),

f=i=1rχi,fχi.f=\sum_{i=1}^r\langle\chi_i,f\rangle\chi_i.

Induced Representations

Let HH be a subgroup of GG. Starting from a representation (ρ,V)(\rho,V) of GG, one can easily obtain the representation (ρH,V)(\rho|_H,V) of HH, denoted ρH\rho|_H. Conversely, the purpose of this section is to give a natural way to extend a representation (ρ0,W)(\rho_0,W) of HH to a representation (ρ,V)(\rho,V) of GG (with WVW\subseteq V). Denote

G=i=1lgiH,G=\bigsqcup_{i=1}^l g_iH,

the partition of GG into left cosets modulo HH (with g1H=Hg_1H=H).

Definition 6.
Let (ρ,V)(\rho,V) be a representation of GG and let WW be a subspace of VV that is stable under ρ(h)\rho(h) for all hHh\in H. Let ρ0=ρH,W\rho_0=\rho|_{H,W} be the restriction of ρ\rho to HH and the subspace WW. We say that ρ\rho is induced from ρ0\rho_0 and write

ρ=IndHG(ρ0)\rho=\operatorname{Ind}^G_H(\rho_0)

if

V=i=1lρ(gi)(W).(*)V=\bigoplus_{i=1}^l \rho(g_i)(W).\tag{*}

Note that under the assumptions of Definition 6, if gGg\in G can be written as g=gihg=g_i h with hHh\in H, then

ρ(g)(W)=ρ(gih)(W)=ρ(gi)ρ(h)(W)=ρ(gi)(W).\rho(g)(W)=\rho(g_i h)(W)=\rho(g_i)\circ \rho(h)(W)=\rho(g_i)(W).

Thus, the subspaces ρ(gi)(W)\rho(g_i)(W) do not depend on the choice of representatives of the left cosets modulo HH. We have the following proposition:

Proposition 8.
If ρ\rho' is a representation of HH, there exists (up to equivalence) a unique representation ρ\rho of GG such that

ρ=IndHG(ρ).\rho=\operatorname{Ind}^G_H(\rho').

We do not prove this (although the proof is quite strait-forward once you read the following): let's just observe that if (ρ0,W)(\rho_0,W) and the action of the ρ(gi)\rho(g_i) on WW are fixed, then the induced representation is uniquely determined. Indeed, it is enough that it be so on the subspaces ρ(gi)(W)\rho(g_i)(W) (by direct sum) and if gGg\in G, 1il1\le i\le l, wWw\in W, one has

ρ(g)(ρ(gi)(w))=ρ(gj)(ρ0(h)(w))\rho(g)(\rho(g_i)(w))=\rho(g_j)(\rho_0(h)(w))

where ggi=gjhg\,g_i=g_jh (we take the representative of the class of ggig\,g_i modulo HH).

Proposition 9.
Let (ρ,V)(\rho,V) and (ρ,V)(\rho',V') be two representations of GG and let (ρ0,W)(\rho_0,W) be a representation of HH. Suppose that ρ=IndHG(ρ0)\rho=\operatorname{Ind}^G_H(\rho_0) (in particular, ρ0=ρH,W\rho_0=\rho|_{H,W}). Then

Hom(ρ,ρ)Hom(ρ0,ρH).\operatorname{Hom}(\rho,\rho')\simeq \operatorname{Hom}(\rho_0,\rho'|_H).

Proof.
The map

uHom(ρ,ρ)L(V,V)uWHom(ρ0,ρH)L(W,V)u\in\operatorname{Hom}(\rho,\rho')\subset L(V,V')\longmapsto u|_W\in \operatorname{Hom}(\rho_0,\rho'|_H)\subset L(W,V')

is linear. For bijectivity, one shows that any element vHom(ρ0,ρH)v\in \operatorname{Hom}(\rho_0,\rho'|_H) extends uniquely to an element uHom(ρ,ρ)u\in\operatorname{Hom}(\rho,\rho') on VV via the direct sum decomposition (*). \square

Translating Proposition 9 into an equality of dimensions (using Proposition 5), we have:

Corollary 3.
Using the notations of Proposition 9 and denoting IndHG(χρ0)=χρ\operatorname{Ind}^G_H(\chi_{\rho_0})=\chi_\rho, and letting ,G\langle\,\cdot\,,\cdot\,\rangle_G (resp. ,H\langle\,\cdot\,,\cdot\,\rangle_H) be the canonical inner product on Cent(G)\operatorname{Cent}(G) (resp. Cent(H)\operatorname{Cent}(H)), we have

IndHG(χρ0),χρG=χρ0,χρHH.\langle\operatorname{Ind}^G_H(\chi_{\rho_0}),\chi_{\rho'}\rangle_G=\langle\chi_{\rho_0},\chi'_{\rho}|_H\rangle_H.

Thus, by Proposition 6, one may speak of the "induced character" of χρ0\chi_{\rho_0} for IndHG(χρ0)\operatorname{Ind}^G_H(\chi_{\rho_0}).

Proposition 10.
If χ\chi is a character of HH, then for every gGg\in G,

IndHG(χ)(g)=1HkGkgk1Hχ(kgk1).\operatorname{Ind}^G_H(\chi)(g)=\frac{1}{|H|}\sum_{\substack{k\in G\\kgk^{-1}\in H}} \chi(kgk^{-1}).

Proof.
Let (ρ,V)(\rho,V) and (ρ,W)(\rho',W) be such that χ=χρ\chi=\chi_{\rho'} and ρ=IndHG(ρ)\rho=\operatorname{Ind}^G_H(\rho'). For gGg\in G, ρ(g)\rho(g) permutes the subspaces ρ(gi)(W)\rho(g_i)(W). In a basis adapted to the decomposition

V=i=1lρ(gi)(W),V=\bigoplus_{i=1}^l \rho(g_i)(W),

to compute the trace of ρ(g)\rho(g) it suffices to consider those indices ii such that

ρ(g)(ρ(gi)(W))=ρ(gi)(W),\rho(g)(\rho(g_i)(W))=\rho(g_i)(W),

i.e. such that gi1ggiHg_i^{-1}gg_i\in H. Thus, by the invariance of trace under change of basis (transporting ρ(gi)(W)\rho(g_i)(W) to WW via the isomorphism ρ(gi)\rho(g_i)),

χ(g)=1ilgi1ggiHTr(ρ(g)ρ(gi)(W))=1ilgi1ggiHTr(ρ(gi)1ggiW)=1ilgi1ggiHχ(gi1ggi).\chi(g)=\sum_{\substack{1\le i\le l\\ g_i^{-1}gg_i\in H}} \operatorname{Tr}(\rho(g)|_{\rho(g_i)(W)}) =\sum_{\substack{1\le i\le l\\ g_i^{-1}gg_i\in H}} \operatorname{Tr}(\rho(g_i)^{-1}gg_i|_W) =\sum_{\substack{1\le i\le l\\ g_i^{-1}gg_i\in H}} \chi(g_i^{-1}gg_i).

Since χ\chi is central on HH and each right coset of HH has cardinality H|H|, the result follows. \square

This formula allows us to define IndHG(f)\operatorname{Ind}^G_H(f) for any central function ff (replacing χ\chi by ff in the formula). As the characters linearly generate the central functions, we deduce from Corollary 3:

Proposition 11.
If f1Cent(H)f_1\in\operatorname{Cent}(H) and f2Cent(G)f_2\in\operatorname{Cent}(G), then

IndHG(f1),f2G=f1,f2HH.\langle \operatorname{Ind}^G_H(f_1), f_2\rangle_G=\langle f_1, f_2|_H\rangle_H.

Proof of the Frobenius Theorem

Now we present the proof of Theorem 1. Fix a Frobenius pair (G,H)(G,H). Let NN be the corresponding Frobenius kernel. The central result in the proof is the following proposition:

Proposition 12.
Let χ\chi be an irreducible character of HH. There exists an irreducible character χ~\widetilde{\chi} of GG such that χ~H=χ\widetilde{\chi}|_H=\chi and χ~\widetilde{\chi} is constant on NN.

Remark 1.
It is easy to see that any central function on HH extends in a unique way to a central function on GG which is constant on NN. What matters here is that this extension is a (irreducible) character.

Here is a lemma that will allow us to produce characters constant on NN:

Lemma 2.
Let fCent(H)f\in\operatorname{Cent}(H) such that f(e)=0f(e)=0 and let f~=IndHG(f)\widetilde{f}=\operatorname{Ind}^G_H(f). Then f~H=f\widetilde{f}|_H=f and f~N=0\widetilde{f}|_N=0.

Proof.
It is immediate from the definition of the induced function that f~(e)=0=f(e)\widetilde{f}(e)=0=f(e).
Let gH{e}g\in H\setminus\{e\}. Note that if kGk\in G then kgk1Hkgk^{-1}\in H if and only if gk1HkHg\in k^{-1}Hk\cap H. By the definition of a Frobenius complement, kHk\in H. Hence,

f~(g)=1HkGkgk1Hf(kgk1)=1HkHf(kgk1)=1HHf(g)=f(g).\widetilde{f}(g)=\frac{1}{|H|}\sum_{\substack{k\in G\\kgk^{-1}\in H}} f(kgk^{-1}) =\frac{1}{|H|}\sum_{k\in H} f(kgk^{-1}) =\frac{1}{|H|}\cdot|H|\cdot f(g)=f(g).

Now, let gN{e}g\in N\setminus\{e\}. For any kGk\in G, kgk1Hkgk^{-1}\notin H (otherwise gk1Hkg\in k^{-1}Hk, contradicting the definition of NN). Thus, f(kgk1)=0f(kgk^{-1})=0 and so f~(g)=0\widetilde{f}(g)=0. \square

Proof of Proposition 12.
Let d=χ(e)d=\chi(e) be the degree of χ\chi, l=G/HNl=|G|/|H|\in\mathbb{N}, θH\theta_H the trivial character of HH and θG\theta_G the trivial character of GG. If χ\chi is the trivial character, simply take χ~\widetilde{\chi} as the trivial character of GG, which is irreducible.

Otherwise, to apply Lemma 2, consider χdθHCent(H)\chi-d\cdot \theta_H\in\operatorname{Cent}(H) which vanishes at ee.

Then, set

ψ~:=IndHG(χdθH).\widetilde{\psi}:=\operatorname{Ind}^G_H(\chi-d\cdot\theta_H).

By Lemma 2, ψ~\widetilde{\psi} vanishes on NN and restricts to χdθH\chi-d\cdot\theta_H on HH (denoted by ()(*)). However, ψ~\widetilde{\psi} is not a character (since ψ~(e)=0\widetilde{\psi}(e)=0). In order to obtain an irreducible character, we need a central function of Hermitian norm 1. Compute, using Proposition 11,

ψ~,ψ~G=IndHG(χdθH),ψ~G=χdθH,ψ~HH=χdθH,χdθHH=1+d2,\langle\widetilde{\psi},\widetilde{\psi}\rangle_G=\langle \operatorname{Ind}^G_H(\chi-d\cdot\theta_H),\widetilde{\psi}\rangle_G =\langle\chi-d\cdot\theta_H,\widetilde{\psi}|_H\rangle_H =\langle\chi-d\cdot\theta_H,\chi-d\cdot\theta_H\rangle_H=1+d^2,

(using the orthonormality of irreducible characters and the fact that χθH\chi\neq\theta_H). Also, by Proposition 11,

ψ~,θGG=IndHG(χdθH),θGG=χdθH,θGHH=χdθH,θHH=d.\langle\widetilde{\psi},\theta_G\rangle_G=\langle \operatorname{Ind}^G_H(\chi-d\cdot\theta_H),\theta_G\rangle_G =\langle\chi-d\cdot\theta_H,\theta_G|_H\rangle_H=\langle\chi-d\cdot\theta_H,\theta_H\rangle_H=-d.

Thus, we set

χ~=ψ~+dθG.\widetilde{\chi}=\widetilde{\psi}+d\cdot\theta_G.

Then, using the inner products in Cent(G)\operatorname{Cent}(G),

χ~,χ~=ψ~,ψ~+2dψ~,θG+d2θG,θG=1+d22d2+d2=1.\langle\widetilde{\chi},\widetilde{\chi}\rangle=\langle\widetilde{\psi},\widetilde{\psi}\rangle+2d\langle\widetilde{\psi},\theta_G\rangle+d^2\langle\theta_G,\theta_G\rangle = 1+d^2-2d^2+d^2=1.

Moreover, χ~\widetilde{\chi} is clearly a Z\mathbb{Z}-linear combination of characters. Hence, there exist integers n1,,nrn_1,\dots,n_r such that

χ~=i=1rniχi.\widetilde{\chi}=\sum_{i=1}^r n_i\chi_i.

Then

1=χ~,χ~=n12++nr2,1=\langle\widetilde{\chi},\widetilde{\chi}\rangle=n_1^2+\cdots+n_r^2,

so that, as the nin_i are integers, only one of them is nonzero and equals ±1\pm1. Since χ~(e)=d>0\widetilde{\chi}(e)=d>0, that nonzero nin_i equals 1 and χ~\widetilde{\chi} is an irreducible character.

Finally, χ~\widetilde{\chi} remains constant on NN (because we added dθGd\cdot\theta_G to ψ~\widetilde{\psi}, which was constant on NN), and for hHh\in H,

χ~(h)=ψ~(h)+dθG(h)=(χdθH)(h)+d=χ(h).\widetilde{\chi}(h)=\widetilde{\psi}(h)+d\cdot\theta_G(h) =(\chi-d\cdot\theta_H)(h)+d =\chi(h).

Thus, χ~H=χ\widetilde{\chi}|_H=\chi, which completes the proof. \square

Remark 2.
We actually know all the values taken by χ~\widetilde{\chi}. Indeed, one has

G=N(gGgHg1),G=N\cup \Bigl(\bigcup_{g\in G}gHg^{-1}\Bigr),

and no element of NN is conjugate to an element of gGgHg1{e}\bigcup_{g\in G}gHg^{-1}\setminus\{e\} (otherwise it would be conjugate to an element of H{e}H\setminus\{e\}). Moreover, every element of gGgHg1{e}\bigcup_{g\in G}gHg^{-1}\setminus\{e\} is conjugate to an element of HH. If we denote by A1,,AmA_1,\dots,A_m the conjugacy classes in GG of the elements of gGgHg1{e}\bigcup_{g\in G}gHg^{-1}\setminus\{e\}, then

gGgHg1{e}=i=1mAi,\bigcup_{g\in G}gHg^{-1}\setminus\{e\}=\bigsqcup_{i=1}^m A_i,

and we choose h1A1H,,hmAmHh_1\in A_1\cap H,\dots, h_m\in A_m\cap H. Then, since characters are central, one obtains the following table:

eeA1A_1\dotsAmA_mN{e}N\setminus\{e\}
ψ~\widetilde{\psi}00χ(h1)d\chi(h_1)-d\dotsχ(hm)d\chi(h_m)-d00
θG\theta_G1111\dots1111
χ~=ψ~+dθG\widetilde{\chi}=\widetilde{\psi}+d\theta_Gddχ(h1)\chi(h_1)\dotsχ(hm)\chi(h_m)dd

(with d=χ(e)=χ~(e)d=\chi(e)=\widetilde{\chi}(e)).

The decomposition of a character into irreducible characters yields the following corollary:

Corollary 4.
Let χ\chi be a character of HH. There exists a character χ~\widetilde{\chi} of GG such that χ~H=χ\widetilde{\chi}|_H=\chi and χ~\widetilde{\chi} is constant on NN. The table above remains valid.

One last lemma will allow us to conclude.

Lemma 3.
Let χ\chi be a character of GG coming from a representation ρ\rho. Then

χ(g)=χ(e)if and only ifgker(ρ).\chi(g)=\chi(e)\quad\text{if and only if}\quad g\in\ker(\rho).

Proof.
Let d=χ(e)d=\chi(e) be the degree of ρ\rho. For gGg\in G, as seen in Example 5, ρ(g)\rho(g) is diagonalizable with spectrum {λ1,,λd}\{\lambda_1,\dots,\lambda_d\} in UU. Then by the triangle inequality,

χ(g)=λ1++λdλ1++λd=d=χ(e),|\chi(g)|=|\lambda_1+\cdots+\lambda_d|\le|\lambda_1|+\cdots+|\lambda_d|=d=\chi(e),

with equality if and only if λ1,,λd\lambda_1,\dots,\lambda_d are positively collinear, i.e. if they are all equal (since they are of modulus 1). Therefore, χ(g)=χ(e)\chi(g)=\chi(e) if and only if λ1==λd=1\lambda_1=\cdots=\lambda_d=1, that is, if and only if ρ(g)=id\rho(g)=\operatorname{id}. \square

At this point, all the ingredients are in place to conclude.

Conclusion of the Proof of Theorem 1.
Take ρ\rho a faithful representation of HH (for example, the regular representation) and χ\chi its character. Let χ~\widetilde{\chi} be as in Corollary 4 and ρ~\widetilde{\rho} the associated representation. We now show that

N=ker(ρ~).N=\ker(\widetilde{\rho}).

From the table in Remark 2 and Lemma 3, we have

ker(ρ~)={xGχ~(x)=χ~(e)}=N,\ker(\widetilde{\rho})=\{x\in G\mid \widetilde{\chi}(x)=\widetilde{\chi}(e)\}=N,

(where the last equality uses the fact that for the chosen h1,,hmh_1,\dots,h_m we have χ(hi)χ(e)\chi(h_i)\neq\chi(e) by Lemma 3 and the faithfulness of ρ\rho).

Thus, NN is indeed a (normal) subgroup of GG, which completes the proof.

\huge\square


Appendix: Useful Prerequisites

In this section, let (G,)(G,\cdot) be a finite group.

Definition A.
Let XX be a set. An action of the group GG on XX is any homomorphism φ\varphi from GG to S(X)S(X). We denote

gx:=φ(g)(x)for (g,x)G×X.g\cdot x:=\varphi(g)(x) \quad\text{for } (g,x)\in G\times X.

If xXx\in X, the stabilizer of xx is the set

Stab(x)={gGgx=x}.\operatorname{Stab}(x)=\{g\in G\mid g\cdot x=x\}.

It is straightforward to verify that this is a subgroup of GG. The action is said to be faithful if φ\varphi is injective, and transitive if for all xXx\in X,

{gx,  gG}=X.\{g\cdot x,\;g\in G\}=X.

Proposition A.
Suppose that GG acts on XX. Then for any gGg\in G and xXx\in X,

Stab(gx)=gStab(x)g1.\operatorname{Stab}(g\cdot x)=g\,\operatorname{Stab}(x)\,g^{-1}.

Proof.
If hStab(gx)h\in \operatorname{Stab}(g\cdot x), then

h(gx)=gx(g1hg)x=xg1hgStab(x),h\cdot (g\cdot x)=g\cdot x\quad\Longleftrightarrow\quad (g^{-1}hg)\cdot x=x\quad\Longleftrightarrow\quad g^{-1}hg\in \operatorname{Stab}(x),

i.e. hgStab(x)g1h\in g\,\operatorname{Stab}(x)\,g^{-1}. \square

Definition B.
Let (G,)(G,\cdot) be a group.

  • For gGg\in G, the map φg:xGgxg1G\varphi_g: x\in G\mapsto gxg^{-1}\in G is an automorphism of the group, called the conjugation automorphism; and the map gφgAut(G)g\mapsto \varphi_g\in\operatorname{Aut}(G) defines an action of GG on itself.
  • Two elements x,yGx,y\in G are said to be conjugate if there exists gGg\in G such that x=φg(y)x=\varphi_g(y). This defines an equivalence relation on GG whose equivalence classes are called conjugacy classes. Similarly, one defines the conjugates of a subset of GG. If HH is a subgroup of GG, then for every gGg\in G, φg(H)\varphi_g(H) is a subgroup of GG (being the image of a subgroup by a homomorphism) of cardinality H|H| (since φg\varphi_g is injective).

Definition C.
Let HH be a subgroup of GG. We say that HH is normal if

gG,  hH,ghg1H.\forall\, g\in G,\;\forall\, h\in H,\quad ghg^{-1}\in H.

Definition D.
Let HH be a subgroup of GG. Two elements x,yGx,y\in G are said to be congruent modulo HH if y1xHy^{-1}x\in H. This defines an equivalence relation on GG whose equivalence classes (the left cosets) are the sets

gH,gG.gH,\quad g\in G.

We denote

G/H={gHgG}.G/H=\{gH\mid g\in G\}.

Since for any gGg\in G the map τg:hgh\tau_g: h\mapsto gh is injective, every coset has cardinality H|H| and since their union is GG,

G/H=GH,|G/H|=\frac{|G|}{|H|},

which in particular shows that H|H| divides G|G| (this is Lagrange's Theorem).

Finally, we define the notion of a Hermitian space, which generalizes that of a Euclidean space:

Definition E.
Let VV be a C\mathbb{C}-vector space. A Hermitian inner product on VV is a function

,:V×VC,\langle\,\cdot\,,\,\cdot\,\rangle: V\times V\to\mathbb{C},

which is linear in the second variable (i.e. for all x,y,zVx,y,z\in V and λC\lambda\in\mathbb{C}, x,y+λz=x,y+λx,z\langle x,y+\lambda z\rangle=\langle x,y\rangle+\lambda\langle x,z\rangle), Hermitian symmetric (i.e. x,y=y,x\langle x,y\rangle=\overline{\langle y,x\rangle}), positive (i.e. x,xR+\langle x,x\rangle\in\mathbb{R}_+ for all xVx\in V), and definite (i.e. x,x=0\langle x,x\rangle=0 if and only if x=0x=0). As in the Euclidean case, one defines the notion of orthogonality and orthonormal families.

Here is the translated bibliography from the French PDF document, formatted in clean Markdown:


📚 Bibliography

  1. Gaëtan Chenevier
    Algebra I Course (Chapter 7, Linear Representations of Finite Groups)

  2. William Fulton and Joe Harris
    Representation Theory: A First Course.
    Springer, 2004.

  3. Gérard Rauch
    Finite Groups and Their Representations.
    Ellipses, 2000.
    ISBN: 978-2-7298-0180-9

  4. Jean-Pierre Serre
    Finite Groups: An Introduction.
    International Press, 2016.
    ISBN: 978-1-57146-327-2


Back to top

Mounir Lbath © 2025