In honor of Pi Day 2023, I’d like to discuss Hilbert’s 7th Problem, which in an oversimplified (and rather vague) form asks: under what circumstances can a transcendental function take algebraic values at algebraic points?
The connection with is that Lindemann proved in 1882 that the transcendental function takes transcendental values at every nonzero algebraic number. Since by Euler’s formula, this proves that , and hence itself, is transcendental. In light of this theorem, it is natural to wonder what if anything is special here about the function and the point .
One thing that’s special about is that if is algebraic and is also algebraic, then both and are algebraic for all , and these numbers are all distinct. So one might be led to speculate that if is a transcendental entire function then there are only finitely many algebraic numbers for which is also algebraic.
Unfortunately, as Hilbert knew, this is completely false. For example, the function is transcendental but it takes the rational value 1 at every integer. In 1886, Weierstrass had given an example of a transcendental entire function that takes rational values at all rational numbers; later, in 1895, Stäckel showed that there is a transcendental entire function that takes rational values at all algebraic points. However, the functions of Weierstrass and Stäckel, are in some sense “pathological”; they have large growth rates and do not occur “in nature”. The challenge is to make this intuitive feeling more precise, and also to distinguish from .
One thing that is special about , which is not shared by any of the other functions mentioned in the previous paragraph, is that it satisfies a linear differential equation with rational coefficients (namely ). The existence of such a (not necessarily linear) differential equation turns out to be the key idea needed to generalize Lindemann’s theorem in a substantial way.
Another fruitful generalization is to rephrase our original question as an unlikely intersection problem: given two algebraically independent entire functions and satisfying suitable hypotheses, can we conclude that there are only finitely many complex numbers such that and are simultaneously algebraic? This generalizes our original question by letting and .
One reason this is a fruitful generalization is that it contains, as a special case, the main question that Hilbert chose to focus on in his 7th problem:
Question (Hilbert, 1900): If are algebraic numbers with and irrational, is it true that is transcendental?
To see the connection, let and . If is algebraic, then and are both algebraic for the infinitely many complex numbers
(For the record, Hilbert also formulated his question as the following equivalent problem in Euclidean geometry: In an isosceles triangle, if the ratio of the base angle to the angle at the vertex is algebraic but not rational, does this imply that the ratio between the base and side is transcendental? Also, according to Siegel, Hilbert often speculated that a solution to this problem would materialize later than a proof of the Riemann Hypothesis or Fermat’s Last Theorem. It was solved by Gelfand in 1934.)
It remains to find suitable hypotheses on and under which we can actually prove such a finiteness theorem. This is the content of the main result we’ll be discussing in this blog, the Schneider-Lang theorem.
The Schneider-Lang theorem
The Schneider-Lang theorem not only has an elegant formulation, its proof is — at least to my taste — more motivated and intuitive than many other approaches to the transcendence of . It also contains ideas, such as the construction of a suitable auxiliary polynomial using Siegel’s Lemma, which play a very important role in modern transcendence theory.
Note: I will assume some familiarity with complex analysis in the remainder of this post, as well as some basic facts from algebraic number theory. I have benefitted particularly in the exposition which follows from the book “Transcendental Numbers” by M.R. Murty and P. Rath. Other exposition of the proof can be found in the Appendix 1 to S. Lang’s book “Algebra” and in this Master’s thesis.
In order to state the result, we first need a definition from complex analysis.
Definition: An entire function is said to be of finite order if there exists such that whenever . The infimum of all such is called the order of . A meromorphic function is said to have order at most if it is the quotient of two entire functions of order at most .
Example: The function has order 1.
Theorem (Schneider-Lang): Let be meromorphic functions of finite order, and assume that and are algebraically independent over . Suppose furthermore that the derivatives belong to the ring for some number field . Then there are at most finitely many such that for all .
More precisely, the proof can be optimized to show that if and have order and , respectively, then the number of such is bounded by . This bound is sharp in the case where , , and .
To see that the Schneider-Lang theorem implies the transcendence of , we deduce from it the following:
Corollary (Hermite-Lindemann): Let be a nonzero algebraic number. Then is transcendental.
Proof: Suppose not, and let . Let and . The conditions of the Schneider-Lang theorem are satisfied, since growth rate considerations show that and are algebraically independent. However, for all natural numbers , a contradiction. Q.E.D.
The Schneider-Lang theorem also implies the Gelfand-Schneider theorem, which provides a positive answer to Hilbert’s 7th problem:
Corollary (Gelfand-Schneider): Let be algebraic numbers with and irrational. Then is transcendental.
Proof: Suppose not, and let . Let and . Because and are algebraically independent, the conditions of the Schneider-Lang theorem are satisfied. However, for all , a contradiction. Q.E.D.
This shows, for example, that and , and are all transcendental.
Example: The applications of Schneider-Lang we’ve given so far involve two functions that satisfy a first-order linear differential equations. For an application of the more general formulation of the theorem, consider the Weierstrass -function associated to a complex lattice . It is a meromorphic function on with poles only at the points of , and it satisfies a non-linear second-order differential equation of the form for certain complex numbers associated to . The Schneider-Lang theorem can be used to prove that if are algebraic then is transcendental for all algebraic numbers . To see this, one supposes that is algebraic and takes the number field and the functions , and which can be shown to be have order at most 3. Moreover, and are algebraically independent. If , the addition formula for the Weierstrass -function shows that whenever and . This contradicts the Schneider-Lang theorem (see Chapter 10 of Murty-Rath for details).
Sketch of the proof
Although the general version of the Schneider-Lang theorem given above is useful for applications such as transcendence of special values of the Weierstrass -function, for the applications to Hermite-Lindemann and Gelfand-Schneider we only needed the following simplified version:
Theorem (Schneider-Lang; simplified form): Let be algebraically independent entire functions of order at most 1. Suppose furthermore that the derivatives belong to the ring for some number field .Then there are at most finitely many such that both .
We focus on the proof of this statement, which contains all of the most important ideas from the general case. Our proof will show that if and for , then .
We denote by the ring of algebraic integers belonging to the number field .
Step 1: (Auxiliary function) Construct a polynomial in and with not all zero such that vanishes to order at least at each , where is some large integer. With a judicious choice of , we can ensure that the algebraic integers are not too “large” by using a famous consequence of the Pigeonhole Principle known as Siegel’s Lemma. (Specifically, we choose .)
Step 2: (Extrapolation) Assuming that and that vanishes to order at least at each , show that it in fact vanishes to order at each .
Since an entire function function which vanishes to infinite order at some must be identically zero, we conclude from Step 2 that . By the choice of in Step 1, this shows that and are algebraically dependent, a contradiction.
Admittedly, our description of Step 2 was quite vague, so let’s break it into a couple of more detailed sub-steps:
Step 2a (Liouville inequality): If vanishes to order exactly at some , say , then is a non-zero algebraic number. Moreover, because of the way we constructed and the differential equations satisfied by the , we can give upper bounds for the degree of , the “denominator” of (the least positive integer such that is an algebraic integer), and the absolute values of all the complex conjugates of . Since the norm of a nonzero algebraic integer is a nonzero rational integer, the absolute value of the norm must be at least 1. This gives a nontrivial lower bound for in terms of the above data.
Step 2b (Maximum modulus principle): Since vanishes to order at least at each , the function is entire. The maximum modulus principle thus implies that is bounded above by its maximum value on any circle of radius around . Choosing carefully (specifically, we choose ) and using our assumption that and have order at most 1, we obtain an upper bound for which contradicts the lower bound from Step 2a if and is sufficiently large.
A more detailed sketch
Here is a more quantitative sketch of the proof.
Suppose is a nonzero algebraic number of degree , and let be its conjugates, i.e., the roots of the minimal polynomial of over . Let be the leading coefficient of . Define the denominator of to be ; this is the least positive integer such that . Define the size of to be .
If is a polynomial with algebraic coefficients, we write (resp. ) for the LCM of the denominators of the coefficients of (resp. the maximum of the sizes of the coefficients of ).
We will need the very simple:
Lemma (Liouville Inequality): If is a nonzero algebraic number of degree , then
Proof: Since , the product of all conjugates of is a nonzero integer, hence it is at least 1. This easily yields the desired inequality. Q.E.D.
We will also need the following two results. We write for the usual differentiation operator.
Lemma (Derivative Lemma): With and as in the statement of the Schneider-Lang theorem, there exists a constant with the following property. If is a polynomial of degree with coefficients in and , then for all positive integers we have and for all .
For a proof, see Appendix 1, Lemma 3 in S. Lang’s “Algebra”. Lang’s argument uses the notion of a derivation on a commutative ring, together with induction on . The details are a bit too cumbersome for this post, but it should seem plausible that we can obtain bounds for and using the fact that the ring is closed under taking derivatives, together with the generalized Leibniz formula
Lemma (Siegel’s Lemma): Let be a number field. There exists a constant depending only on with the following property. For and , with , let be algebraic numbers of size at most and denominator at most . Then the system of homogeneous linear equations has a nonzero solution satisfying for all .
For a proof, see Appendix 1, Lemma 2 in S. Lang’s “Algebra”. (Lang assumes that the are algebraic integers, but the general case follows easily by clearing denominators in each of the linear equations.) Here we explain the proof of Siegel’s Lemma in the special case where the coefficients of the linear forms in question belong to . The general case can be reduced to this one by writing everything in terms of an integral basis for .
Proof over : Let be the associated matrix, thought of as a map from into which takes the lattice into . Let and let be the set of vectors in with coordinates of size at most in absolute value. Then maps , which has size , into , which has size . If
then by the Pigeonhole Principle there will be two distinct vectors in mapping to the same point. The difference of these two vectors gives a solution in to the homogeneous system . Choosing gives the desired result in a sharpened form. Q.E.D.
We now return to our quantitative sketch of the simplified version of Schneider-Lang.
For notational convenience, we define if and .
We wish to find , not all zero, such that satisfies for and , where are parameters to be determined later. This amounts to solving the following linear system of equations in unknowns:
Using the generalized Leibniz formula, one sees that the numbers all belong to , and the Derivative Lemma gives
for all . If we choose such that (i.e., we have roughly twice as many equations as unknowns), Siegel’s Lemma shows that we can find such with .
Since and are algebraically independent over , and hence over , the function is not identically zero. Let be the smallest integer such that all derivatives of up to order vanish at the points but does not vanish at some , say . Then by construction, and the Derivative Lemma gives
Since , the Liouville inequality gives
However, the assumption that and have order at most allow us to give a lower bound for this quantity:
Claim: If then
Since and , we obtain a contradiction if and This finishes the proof of the Schneider-Lang theorem, modulo the claim.
To prove the claim, note that since vanishes to order at least at each , the function is entire. Using the fact that for some constant by hypothesis, the Maximum Modulus Principle applied to along the disc of radius around implies that
But a simple computation shows that , and the result follows. Q.E.D.
(1) For the quantitative version of the Schneider-Lang theorem mentioned above, giving the sharp bound of for the number of exceptional points, see e.g. M. Waldschmidt’s book “Nombres Transcendents”.
(2) The modular -function, which is holomorphic on the complex upper-half plane , takes algebraic values at all imaginary quadratic numbers. Conversely, Schneider showed that if is not imaginary quadratic then is transcendental. This can be deduced from the Schneider-Lang theorem by way of the Weierstrass -function, see Chapter 15 of Murty-Rath.
(3) It is interesting and useful to replace algebraic points by algebraic subvarieties in many of the above considerations. For example, given a transcendental map between complex algebraic varieties defined over , one can study the algebraic subvarieties of defined over , such that is also an algebraic subvariety defined over . Theorems along such lines are known as Ax-Schanuel type theorems, and they have played influential role in recent developments in Diophantine geometry. We mention, for example, the proof of the André-Oort conjecture, a uniform version of the Mordell-Lang conjecture, and the relative Manin-Mumford conjecture.
Happy Pi Day!