In my previous post, I presented a proof of the existence portion of the structure theorem for finitely generated modules over a PID based on the Smith Normal Form of a matrix. In this post, I’d like to explain how the uniqueness portion of that theorem is actually a special case of a more general result, called Fitting’s Lemma, which holds for arbitrary commutative rings.
We begin by proving that one can characterize the diagonal entries in the Smith Normal Form of a matrix over a PID in an intrinsic way by relating them to the GCD of the
minors of
for all
. Actually, since the GCD isn’t defined for general rings, we will instead consider the ideal generated by the
minors (which makes sense for any ring, and is the ideal generated by the GCD in the case of a PID).
Throughout this post, will be a non-zero commutative ring with identity.
Determinental ideals
Definition: Let be an
matrix with entries in
. For
, we define the
determinental ideal
of
to be the ideal generated by all of the
minors of
. By convention, we set
if
and
if
.
By the Laplace expansion for determinants, the ideals form a decreasing nested sequence:
As mentioned above, we have:
Proposition 1: Suppose is a PID and
. Then
for all
, where
are the invariant factors of
(the diagonal entries in the Smith Normal Form).
Thus, when is a PID, knowledge of the invariant factors of
is equivalent to knowledge of the determinental ideals.
Proposition 1 is a consequence of Lemma 2 below, which itself is a consequence of:
Lemma 1: Let be an
matrix with entries in a ring
, and suppose
(resp.
) is an
(resp.
) matrix over
. Then
and
for all
.
Proof: The columns of are linear combinations of the columns of
. Since the determinant is multilinear, it follows that each
minor of
is an
-linear combination of
minors of
. Therefore
for all
. By a similar consideration involving rows,
. Q.E.D.
Lemma 2: Let be an
matrix with entries in a ring
, and suppose
(resp.
) is an invertible
(resp.
) matrix over
. Let
. Then
for all
.
Proof: By Lemma 1, . Since
and
are invertible, we can write
and apply the same argument to obtain
. Thus
. Q.E.D.
In particular, performing elementary row operations on a matrix (adding a multiple of one row to another, permuting the rows, or multiplying some row by a unit) does not change the ideals
, and the same holds for elementary column operations.
Fitting ideals of modules
The theory of Fitting ideals can be developed in the context of finitely generated modules, but it’s slightly simpler to explain in the context of finitely presented modules, so I’ll restrict my discussion to that case. (Note that if is noetherian then every finitely generated
-module is finitely presented.)
Let be a finitely presented
-module with presentation
, and let
be the
matrix with entries in
encoding the map
. (In down-to-earth terms,
is generated by the image of the standard basis vectors
under the map
, and the columns of
encode the
-linear relations between these generators.)
Definition: For , define the
Fitting ideal of
to be the ideal generated by the
minors of
, i.e.,
.
The key result in the theory is the following:
Fitting’s Lemma: The ideal is independent of the choice of presentation.
It therefore makes sense to talk about the Fitting ideals of a module without reference to any particular presentation.
Note that, by construction, the ideals form an increasing sequence:
When is a PID and
is a finitely generated torsion module, Fitting ideals encode the same information as the invariant factors (or elementary divisors) of
, because if
has invariant factors
with
for all
, we have
(cf. Proposition 1 and the previous post). Fitting’s theorem therefore generalizes the uniqueness of the invariant factors in the structure theorem for finitely generated modules over a PID.
Proof of Fitting’s Lemma
Our proof will follow this short note by Mel Hochster. Some readers may prefer the exposition given on this page of The Stacks Project.
Proof: We first show that the ideals depend only on the kernel of the surjection
, and not on the choice of a particular set of relations generating this kernel (which correspond to the columns of the matrix
). Given two finite sets of vectors in
generating
, we can compare each with the union. Therefore, it suffices to consider the case where one set of relations is included in the other. In terms of matrices, this means that we have two presentations for
, one generated by an
matrix
and one generated by an
matrix
whose first
columns are the same as those of
and whose last
columns are linear combinations of the first
. By subtracting linear combinations of the first
columns from the last
(which does not change the determinental ideals), we may assume that the last
columns are all zero, in which case the result is clear.
It remains to show that the ideals are independent of the choice of generators for
, i.e., of the choice of a surjection
. Once again, we can compare each of two different sets of generators with their union, and so we may assume that one set of generators is contained in the other. By induction, it suffices to consider the case where there is just one additional generator. By the previous paragraph, we may assume that, included among the list of relations for the second set of generators, there is a relation expressing the additional generator as a linear combination of the others. By relabeling the generators and relations (i.e., permuting the rows and columns of the presentation matrix), we may assume that the matrix
with the additional generators present has a 1 in the last row and column. By performing elementary column operations (subtracting multiples of the last column from the others), we can assume that all other entries in the last row of
are zero. In other words,
where is
and the last row of
is
. Note that
is a relations matrix for the presentation using the first
generators.
We now show that for all , we have
, which will finish the proof. Let
. Each
minor of
involving the 1 in the lower right-hand corner is the same, up to sign, as a
minor of
, all of which occur. It remains to check that the other
minors of
also belong to
. If such a minor involves the last row of
, it is zero. Otherwise, it has at least
columns in
, and thus its expansion by minors with respect to the remaining column belongs to
. Q.E.D
A generalization of the Cayley-Hamilton theorem
The ideal is called the initial Fitting ideal of
. If
is a PID and
is a torsion
-module,
is the product of all the invariant factors, which when
is the order of
and when
for some field
is the characteristic polynomial of
(if
corresponds to
in the usual way).
Proposition 2: The initial Fitting ideal of annihilates
.
This is just the Cayley-Hamilton theorem when is a finitely generated torsion module over
, and of Lagrange’s theorem when
.
Proof: Suppose is an
matrix representing some presentation
for
. Let
be the corresponding generators of
, where
is the given surjection. Let
be any
minor of
; in particular, the columns of
represent certain
-linear relations between the generators. We want to show that
annihilates
. This is a simple consequence of the identity (which holds for any square matrix over any commutative ring)
, where
is the adjugate matrix of
. Indeed, we have
, since
by assumption. Since the
generate
, it follows that
. Q.E.D.
Additional properties of Fitting ideals
We mention, without proof, some additional properties of Fitting ideals. Proofs can be found, for example, in David Eisenbud’s book “Commutative Algebra with a View Toward Algebraic Geometry”, D.G. Northcott’s monograph “Finite Free Resolutions”, Antoine Chambert-Loir’s “(Mostly) Commutative Algebra”, or the Stacks Project page mentioned above.
(1) If can be generated by
elements then
(this is clear from the definitions), and if
is a local ring then the converse holds as well. We can therefore view the
Fitting ideal as measuring, in a certain precise sense, the obstruction to a module being generated by
elements.
(2) Fitting ideals commute with localization: if is multiplicative then
.
(3) More generally, Fitting ideals commute with base change: if is a ring homomorphism then
is the ideal generated by the image of
.
(4) If is a short exact sequence of
-modules then
for all
. If the sequence is split, so that
, then
is the ideal generated by all products
with
.
(5) As discussed above, if is a PID then two finitely generated
-modules
are isomorphic if and only if they have the same Fitting ideals. If we consider only torsion modules, this remains true for Dedekind domains. However, for non-torsion modules the result fails: if
is a non-principal ideal in Dedekind domain
then
and
are not isomorphic as
-modules but they have the same Fitting ideals (namely, the zeroth Fitting ideal is
and all higher Fitting ideals are equal to
). If
is not a Dedekind ring, it is possible for two non-isomorphic torsion
-modules to have the same Fitting ideals. For example (see pp. 40-42 in this thesis for details), if
and
is the ideal generated by
and
, the torsion
-modules
and
have the same Fitting ideals but are not isomorphic. As another example, let
be the Unique Factorization Domain
and let
and
. Then the torsion
-modules
and
have the same Fitting ideals but are not isomorphic.
A glimpse of Iwasawa theory
I first heard about Fitting ideals in the context of Iwasawa theory, a rich area of study within modern number theory. Iwasawa was interested in studying the behavior of the -power torsion in the ideal class group of the cyclotomic field
, where
is a prime number, because Kummer had established a close connection between this problem and Fermat’s Last Theorem. Iwasawa’s audacious and perspicacious idea was that it is in fact easier, in many ways, to study the ideal class groups in the entire tower of number fields
all at once, where
. Each
is a Galois extension with Galois group
isomorphic to
, and the
-part of the ideal class group of each
is naturally a
-module. Iwasawa considered the inverse limit of these groups as a module over the Iwasawa ring
, where
is the inverse limit of the
. Using class field theory, Iwasawa constructed a closely related module
which is a finitely generated torsion module over
, and he used the structure of such modules to draw conclusions about the entire tower, including eventually the class group of
itself!
The point is that is one of the simplest kinds of rings that isn’t a PID, namely it’s a complete 2-dimensional regular local ring. A version of the structure theorem for finitely generated torsion modules over
can be stated as follows. We say that two
-modules
are pseudo-isomorphic if there is a homomorphism
with finite kernel and finite cokernel.
Theorem (Iwasawa, Serre): The Fitting ideals of a finitely generated torsion -module determine the module up to pseudo-isomorphism.
The initial Fitting ideal is called the characteristic ideal of
. The Main Conjecture of Iwasawa Theory, which was proved by Mazur and Wiles many years before Wiles’ revolutionary work on Fermat’s Last Theorem, relates the characteristic ideal of
(or, more precisely, its eigenspaces under the action of
) to
-adic L-functions, yielding a far-reaching generalization of the work of Kummer. See this survey paper by Romyar Sharifi for further details.
Concluding remarks
(1) One can also prove Lemma 1 using exterior algebra. Let denote the
exterior power of a matrix
over a commutative ring
, i.e., the matrix whose
-entry is the determinant of the
minor
, where
range over all
-element subsets of
and
, respectively. If
represents a homomorphism
of free
-modules, then
represents the induced map
on exterior powers. Since exterior powers are functorial, one has
(a generalization of the multiplicativity of the determinant), from which Lemma 1 follows easily.
(2) The initial Fitting ideal can be used to give a definition for the image of a morphism of schemes which behaves well in families. The idea is as follows: by the naturality of the construction of Fitting ideals of a module, it makes sense to attach a Fitting ideal sheaf to any sufficiently nice sheaf of modules on a scheme. Accordingly, the Fitting image of a morphism is defined to be the closed subscheme of
associated to the sheaf of ideals
. This point of view is explored in detail in the book “The Geometry of Syzygies” by Eisenbud.
(3) The Alexander polynomial of a knot can be defined as the initial Fitting ideal of the first homology (with integer coefficients) of the infinite cyclic cover of the complement of the knot, considered as a module over .
(4) For more information on Hans Fitting (1906-1938), who was a student of Emmy Noether and died of bone cancer at age 31 see this biographical page. Among other things, he introduced the Fitting decomposition of a vector space with respect to an endomorphism, which I discussed in this post on the Jordan Canonical Form. His father Friedrich Fitting, who was also a mathematician, is best known today for his 1931 proof that there are exactly 880 magic squares of order 4.
Pingback: Finitely generated modules over a P.I.D. and the Smith Normal Form | Matt Baker's Math Blog
Pingback: Linear algebra over rings | Matt Baker's Math Blog