In honor of Pi Day 2023, I’d like to discuss Hilbert’s 7th Problem, which in an oversimplified (and rather vague) form asks: under what circumstances can a transcendental function take algebraic values at algebraic points?
The connection with is that Lindemann proved in 1882 that the transcendental function
takes transcendental values at every nonzero algebraic number. Since
by Euler’s formula, this proves that
, and hence
itself, is transcendental. In light of this theorem, it is natural to wonder what if anything is special here about the function
and the point
.
One thing that’s special about is that if
is algebraic and
is also algebraic, then both
and
are algebraic for all
, and these numbers are all distinct. So one might be led to speculate that if
is a transcendental entire function then there are only finitely many algebraic numbers
for which
is also algebraic.
Unfortunately, as Hilbert knew, this is completely false. For example, the function is transcendental but it takes the rational value 1 at every integer. In 1886, Weierstrass had given an example of a transcendental entire function that takes rational values at all rational numbers; later, in 1895, Stäckel showed that there is a transcendental entire function that takes rational values at all algebraic points. However, the functions of Weierstrass and Stäckel, are in some sense “pathological”; they have large growth rates and do not occur “in nature”. The challenge is to make this intuitive feeling more precise, and also to distinguish
from
.
One thing that is special about , which is not shared by any of the other functions mentioned in the previous paragraph, is that it satisfies a linear differential equation with rational coefficients (namely
). The existence of such a (not necessarily linear) differential equation turns out to be the key idea needed to generalize Lindemann’s theorem in a substantial way.
Another fruitful generalization is to rephrase our original question as an unlikely intersection problem: given two algebraically independent entire functions and
satisfying suitable hypotheses, can we conclude that there are only finitely many complex numbers
such that
and
are simultaneously algebraic? This generalizes our original question by letting
and
.
One reason this is a fruitful generalization is that it contains, as a special case, the main question that Hilbert chose to focus on in his 7th problem:
Question (Hilbert, 1900): If are algebraic numbers with
and
irrational, is it true that
is transcendental?
To see the connection, let and
. If
is algebraic, then
and
are both algebraic for the infinitely many complex numbers
(For the record, Hilbert also formulated his question as the following equivalent problem in Euclidean geometry: In an isosceles triangle, if the ratio of the base angle to the angle at the vertex is algebraic but not rational, does this imply that the ratio between the base and side is transcendental? Also, according to Siegel, Hilbert often speculated that a solution to this problem would materialize later than a proof of the Riemann Hypothesis or Fermat’s Last Theorem. It was solved by Gelfond in 1934.)
It remains to find suitable hypotheses on and
under which we can actually prove such a finiteness theorem. This is the content of the main result we’ll be discussing in this blog, the Schneider-Lang theorem.
The Schneider-Lang theorem
The Schneider-Lang theorem not only has an elegant formulation, its proof is — at least to my taste — more motivated and intuitive than many other approaches to the transcendence of . It also contains ideas, such as the construction of a suitable auxiliary polynomial using Siegel’s Lemma, which play a very important role in modern transcendence theory.
Note: I will assume some familiarity with complex analysis in the remainder of this post, as well as some basic facts from algebraic number theory. I have benefitted particularly in the exposition which follows from the book “Transcendental Numbers” by M.R. Murty and P. Rath. Other exposition of the proof can be found in the Appendix 1 to S. Lang’s book “Algebra” and in this Master’s thesis.
In order to state the result, we first need a definition from complex analysis.
Definition: An entire function is said to be of finite order if there exists
such that
whenever
. The infimum of all such
is called the order of
. A meromorphic function is said to have order at most
if it is the quotient of two entire functions of order at most
.
Example: The function has order 1.
Theorem (Schneider-Lang): Let be meromorphic functions of finite order, and assume that
and
are algebraically independent over
. Suppose furthermore that the derivatives
belong to the ring
for some number field
. Then there are at most finitely many
such that
for all
.
More precisely, the proof can be optimized to show that if and
have order
and
, respectively, then the number of such
is bounded by
. This bound is sharp in the case where
,
, and
.
To see that the Schneider-Lang theorem implies the transcendence of , we deduce from it the following:
Corollary (Hermite-Lindemann): Let be a nonzero algebraic number. Then
is transcendental.
Proof: Suppose not, and let . Let
and
. The conditions of the Schneider-Lang theorem are satisfied, since growth rate considerations show that
and
are algebraically independent. However,
for all natural numbers
, a contradiction. Q.E.D.
The Schneider-Lang theorem also implies the Gelfond-Schneider theorem, which provides a positive answer to Hilbert’s 7th problem:
Corollary (Gelfond-Schneider): Let be algebraic numbers with
and
irrational. Then
is transcendental.
Proof: Suppose not, and let . Let
and
. Because
and
are algebraically independent, the conditions of the Schneider-Lang theorem are satisfied. However,
for all
, a contradiction. Q.E.D.
This shows, for example, that and
, and
are all transcendental.
Example: The applications of Schneider-Lang we’ve given so far involve two functions that satisfy a first-order linear differential equations. For an application of the more general formulation of the theorem, consider the Weierstrass -function associated to a complex lattice
. It is a meromorphic function on
with poles only at the points of
, and it satisfies a non-linear second-order differential equation of the form
for certain complex numbers
associated to
. The Schneider-Lang theorem can be used to prove that if
are algebraic then
is transcendental for all algebraic numbers
. To see this, one supposes that
is algebraic and takes the number field
and the functions
, and
which can be shown to be have order at most 3. Moreover,
and
are algebraically independent. If
, the addition formula for the Weierstrass
-function shows that
whenever
and
. This contradicts the Schneider-Lang theorem (see Chapter 10 of Murty-Rath for details).
Sketch of the proof
Although the general version of the Schneider-Lang theorem given above is useful for applications such as transcendence of special values of the Weierstrass -function, for the applications to Hermite-Lindemann and Gelfond-Schneider we only needed the following simplified version:
Theorem (Schneider-Lang; simplified form): Let be algebraically independent entire functions of order at most 1. Suppose furthermore that the derivatives
belong to the ring
for some number field
.Then there are at most finitely many
such that both
.
We focus on the proof of this statement, which contains all of the most important ideas from the general case. Our proof will show that if and
for
, then
.
We denote by the ring of algebraic integers belonging to the number field
.
Step 1: (Auxiliary function) Construct a polynomial in
and
with
not all zero such that
vanishes to order at least
at each
, where
is some large integer. With a judicious choice of
, we can ensure that the algebraic integers
are not too “large” by using a famous consequence of the Pigeonhole Principle known as Siegel’s Lemma. (Specifically, we choose
.)
Step 2: (Extrapolation) Assuming that and that
vanishes to order at least
at each
, show that it in fact
vanishes to order
at each
.
Since an entire function function which vanishes to infinite order at some must be identically zero, we conclude from Step 2 that
. By the choice of
in Step 1, this shows that
and
are algebraically dependent, a contradiction.
Admittedly, our description of Step 2 was quite vague, so let’s break it into a couple of more detailed sub-steps:
Step 2a (Liouville inequality): If vanishes to order exactly
at some
, say
, then
is a non-zero algebraic number. Moreover, because of the way we constructed
and the differential equations satisfied by the
, we can give upper bounds for the degree of
, the “denominator” of
(the least positive integer
such that
is an algebraic integer), and the absolute values of all the complex conjugates of
. Since the norm of a nonzero algebraic integer is a nonzero rational integer, the absolute value of the norm must be at least 1. This gives a nontrivial lower bound for
in terms of the above data.
Step 2b (Maximum modulus principle): Since vanishes to order at least
at each
, the function
is entire. The maximum modulus principle thus implies that
is bounded above by its maximum value on any circle of radius
around
. Choosing
carefully (specifically, we choose
) and using our assumption that
and
have order at most 1, we obtain an upper bound for
which contradicts the lower bound from Step 2a if
and
is sufficiently large.
A more detailed sketch
Here is a more quantitative sketch of the proof.
Suppose is a nonzero algebraic number of degree
, and let
be its conjugates, i.e., the roots of the minimal polynomial
of
over
. Let
be the leading coefficient of
. Define the denominator
of
to be
; this is the least positive integer
such that
. Define the size
of
to be
.
If is a polynomial with algebraic coefficients, we write
(resp.
) for the LCM of the denominators of the coefficients of
(resp. the maximum of the sizes of the coefficients of
).
We will need the very simple:
Lemma (Liouville Inequality): If is a nonzero algebraic number of degree
, then
Proof: Since , the product of all conjugates of
is a nonzero integer, hence it is at least 1. This easily yields the desired inequality. Q.E.D.
We will also need the following two results. We write for the usual differentiation operator.
Lemma (Derivative Lemma): With and
as in the statement of the Schneider-Lang theorem, there exists a constant
with the following property. If
is a polynomial of degree
with coefficients in
and
, then for all positive integers
we have
and
for all
.
For a proof, see Appendix 1, Lemma 3 in S. Lang’s “Algebra”. Lang’s argument uses the notion of a derivation on a commutative ring, together with induction on . The details are a bit too cumbersome for this post, but it should seem plausible that we can obtain bounds for
and
using the fact that the ring
is closed under taking derivatives, together with the generalized Leibniz formula
Lemma (Siegel’s Lemma): Let be a number field. There exists a constant
depending only on
with the following property. For
and
, with
, let
be algebraic numbers of size at most
and denominator at most
. Then the system of
homogeneous linear equations
has a nonzero solution
satisfying
for all
.
For a proof, see Appendix 1, Lemma 2 in S. Lang’s “Algebra”. (Lang assumes that the are algebraic integers, but the general case follows easily by clearing denominators in each of the linear equations.) Here we explain the proof of Siegel’s Lemma in the special case where the coefficients of the linear forms in question belong to
. The general case can be reduced to this one by writing everything in terms of an integral basis for
.
Proof over : Let
be the associated matrix, thought of as a map from
into
which takes the lattice
into
. Let
and let
be the set of vectors in
with coordinates of size at most
in absolute value. Then
maps
, which has size
, into
, which has size
. If
then by the Pigeonhole Principle there will be two distinct vectors in mapping to the same point. The difference of these two vectors gives a solution in
to the homogeneous system
. Choosing
gives the desired result in a sharpened form. Q.E.D.
We now return to our quantitative sketch of the simplified version of Schneider-Lang.
For notational convenience, we define if
and
.
We wish to find , not all zero, such that
satisfies
for
and
, where
are parameters to be determined later. This amounts to solving the following linear system of
equations in
unknowns:
Using the generalized Leibniz formula, one sees that the numbers all belong to
, and the Derivative Lemma gives
for all . If we choose
such that
(i.e., we have roughly twice as many equations as unknowns), Siegel’s Lemma shows that we can find such
with
.
Since and
are algebraically independent over
, and hence over
, the function
is not identically zero. Let
be the smallest integer such that all derivatives of
up to order
vanish at the points
but
does not vanish at some
, say
. Then
by construction, and the Derivative Lemma gives
Since , the Liouville inequality gives
However, the assumption that and
have order at most allow us to give a lower bound for this quantity:
Claim: If then
Since and
, we obtain a contradiction if
and
This finishes the proof of the Schneider-Lang theorem, modulo the claim.
To prove the claim, note that since vanishes to order at least
at each
, the function
is entire. Using the fact that
for some constant
by hypothesis, the Maximum Modulus Principle applied to
along the disc of radius
around
implies that
But a simple computation shows that , and the result follows. Q.E.D.
Concluding Remarks
(1) For the quantitative version of the Schneider-Lang theorem mentioned above, giving the sharp bound of for the number of exceptional points, see e.g. M. Waldschmidt’s book “Nombres Transcendents”.
(2) The modular -function, which is holomorphic on the complex upper-half plane
, takes algebraic values at all imaginary quadratic numbers. Conversely, Schneider showed that if
is not imaginary quadratic then
is transcendental. This can be deduced from the Schneider-Lang theorem by way of the Weierstrass
-function, see Chapter 15 of Murty-Rath.
(3) It is interesting and useful to replace algebraic points by algebraic subvarieties in many of the above considerations. For example, given a transcendental map between complex algebraic varieties defined over
, one can study the algebraic subvarieties
of
defined over
, such that
is also an algebraic subvariety defined over
. Theorems along such lines are known as Ax-Schanuel type theorems, and they have played influential role in recent developments in Diophantine geometry. We mention, for example, the proof of the André-Oort conjecture, a uniform version of the Mordell-Lang conjecture, and the relative Manin-Mumford conjecture.
(4) [Note added June 11, 2023.] Inspired by this post, David Savitt wrote up a lucid and detailed version of the above sketch geared toward Canada/USA Mathcamp students. David writes, “Our goal is to present this proof at a level approachable by advanced high school students with background in linear algebra and multivariable calculus.” David’s write-up can be accessed here.
Happy Pi Day!


Nice post, Matt! A small correction: you repeatedly write “Gelfand” when you mean “Gelfond.”
Thanks, Sam! This has been corrected now. 🙂
There’s a funny story about this: Endre Szemerédi intended to under Alexander Gelfond for his PhD from Moscow State, but was assigned Israel Gelfand, precisely because of this common typo.
Ha, wow.