Auxiliary function


Auxiliary function

In mathematics, auxiliary functions are an important construction in transcendental number theory. They are functions which appear in most proofs in this area of mathematics and that have specific, desirable properties, such as taking the value zero for many arguments, or having a zero of high order at some point [M. Waldschmidt, "An Introduction to Irrationality and Transcendence Methods", Fields Institute, (2008).] .

Definition

Auxiliary functions are not a rigorously defined kind of function, rather they are functions which are either explicitly constructed or at least shown to exist and which provide a contradiction to some assumed hypothesis, or otherwise prove the result in question. Creating a function during the course of a proof in order to prove the result is not a technique exclusive to transcendence theory, but the term "auxiliary function" usually refers to the functions created in this area.

Explicit Functions

Liouville's Transcendence Criterion

Because of the naming convention mentioned above, auxiliary functions can be dated back to their source simply by looking at the earliest results in transcendence theory. One of these first results was Lioville's proof that transcendental numbers exist when he showed that the so called Liouville numbers were transcendental [J. Liouville, "Sur des classes très étendues de quantités dont la valeur n'est ni algébrique, ni même réductible à des irrationnelles algébriques"," J. Math. Pures et Appl. 18, 883-885, and 910-911, (1844).] . He did this by coming up with a transcendence criterion which these numbers satisfied. To derive this criterion he started with a general algebraic number α and found some property that this number would necessarily satisfy. The auxiliary function he used in the course of proving this criterion was simply the minimal polynomial of α. Specifically the irreducible polynomial "f" with integer coefficients such that "f"(α)=0. This function can be used to estimate how well the algebraic number α can be estimated by rational numbers "p"/"q". Specifically if α has degree "d" at least two then he showed that:left|f(p/q) ight|geqfrac{1}{q^d},and also, using the mean value theorem, that there is some constant depending on α, say "c"(α), such that:left|f(p/q) ight| leq c(alpha)left|alpha-frac{p}{q} ight|.Combining these results gives a property that the algebraic number must satisfy, and thus by negating this property gives a criterion for transcendental numbers.

The auxiliary function in Lioville's work is very simple, merely a polynomial which vanishes at a given algebraic number. This kind of property is usually the one that auxiliary functions satisfy. They either vanish or become very small at particular points, which is usually combined with the assumption that they don't vanish or can't be too small to derive a result.

Fourier's Proof of the Irrationality of e

Another simple, early occurrence is in Fourier's proof of the irrationality of "e" [C. Hermite, "Sur la fonction exponentielle"," C. R. Acad. Sci. Paris 77, (1873).] , though the notation used usually disguises this fact. Fourier's proof used the power series of the exponential function::e^x=sum_{n=0}^{infty} frac{x^n}{n!}.By truncating this power series after, say, "N"+1 terms we get a polynomial with rational coefficients of degree "N" which is in some sense "close" to the function "e""x". Specifically if we look at the auxiliary function defined by the remainder::R(x)=e^x-sum_{n=0}^{N} frac{x^n}{n!}then this function should take small values for "x" close to zero. If "e" is a rational number then by letting "x"=1 in the above formula we see that "R"(1) is also a rational number. However, Fourier proved that "R"(1) could not be rational by eliminating every possible denominator. Thus "e" cannot be rational.

Hermite's Proof of the Irrationality of er

Hermite extended the work of Fourier by approximating the function "e""x" not with a polynomial but with a rational function, that is a quotient of two polynomials. In particular he chose polynomials "A"("x") and "B"("x") such that the auxiliary function "R" defined by:R(x)=B(x)e^x-A(x),could be made as small as he wanted around "x"=0. But if "e""r" were rational then "R"("r") would have to be rational with a particular denominator, yet Hermite could make "R"("r") too small to have such a denominator, hence a contradiction.

Hermite's Proof of the Transcendence of e

To prove that "e" was in fact transcendental, Hermite took his work one step further by approximating not just the function "e""x", but also the functions "e""kx" for integers "k"=1,...,"m", where he assumed "e" was algebraic with degree "m". By approximating "e""kx" with rational functions with integer coefficients and with the same denominator, say "A""k"("x") / "B"("x"), he could define auxiliary functions "R""k"("x") by:R_k(x)=B(x)e^{kx}-A_k(x).,For his contradiction Hermite supposed that "e" satisfied the polynomial with integer coefficients "a"0+"a"1e+...+"a""m""e""m"=0. Multiplying this expression through by "B"(1) he noticed that it implied:R=a_0+a_1 R_1(1) +{}ldots{}+a_m R_m(1)=a_1 A_1(1)+{}ldots{}+a_m A_m(1).,The right hand side is an integer and so, by estimating the auxiliary functions and proving that 0<|"R"|<1 he derived the necessary contradiction.

Auxiliary Functions from the Pigeonhole Principle

The auxiliary functions sketched above can all be explicitly calculated and worked with. A breakthrough by Axel Thue and Carl Ludwig Siegel in the twentieth century was the realisation that these functions don't necessarily need to be explicitly known - it can be enough to know they exist and have certain properties. Using the Pigeonhole Principle Thue, and later Siegel, managed to prove the existence of auxiliary functions which, for example, took the value zero at many different points, or took high order zeros at a smalller collection of points. Moreover they proved it was possible to construct such functions without making the functions too large [A. Thue, "Selected Mathematical Papers"," Universitetsforlaget, Oslo, (1977).] [C.L. Siegel, "Über einige Anwendungen diophantischer Approximationen"," Abhandlungen Akad. Berlin 1, p.70, (1929).] . Their auxiliary functions were not explicit functions, then, but by knowing that a certain function with certain properties existed, they used its properties to simplify the transcendence proofs of the nineteenth century and give several new results [C.L. Siegel, "Über die Perioden elliptischer Funktionen"," J. f. M. 167, pp.62-69, (1932).] .

This method was picked up on and used by several other mathematicians, including Alexander Gelfond and Theodor Schneider who used it independently to prove the Gelfond-Schneider theorem [A. O. Gel'fond, "Sur le septième Problème de D. Hilbert"," Izv. Akad. Nauk SSSR 7 pp.623-630, (1934).] [T. Schneider, "Transzendenzuntersuchungen periodischer Funktionen. I. Transzendend von Potenzen"," J. reine angew. Math. 172, pp.65-69, (1934).] . Alan Baker also used the method in the 1960s for his work on linear forms in logarithms [A. Baker and G. Wüstholz, "Logarithmic forms and Diophantine geometry"," New Mathematical Monographs 9, Cambridge University Press, p.198, (2007).] . Another example of the use of this method from the 1960s is outlined below.

A theorem of Lang

In the 1960s Serge Lang proved a result using this non-explicit form of auxiliary functions. The theorem implies both the Hermite-Lindemann and Gelfond-Schneider theorems [S. Lang, "Introduction to Transcendental Numbers"," Addison-Wesley Publishing Company, (1966).] . The theorem deals with a number field "K" and meromorphic functions "f"1,...,"f""N" of order at most "ρ", at least two of which are algebraically independent, and such that if we differentiate any of these functions then the result is a polynomial in all of the functions. Under these hypotheses the theorem states that if there are "m" distinct complex numbers ω1,...,ωm such that "f""i""j" ) is in "K" for all combinations of "i" and "j", then "m" is bounded by:mleq 20 ho [K:mathbb{Q}] .

To prove the result Lang took two algebraically independent functions from "f"1,...,"f""N", say "f" and "g", and then created an auxiliary function which was simply a polynomial "F" in "f" and "g". This auxiliary function could not be explicitly stated since "f" and "g" are not explicitly known. But using Siegel's lemma Lang showed how to make "F" in such a way that it vanished to a high order at the "m" complex numbers ω1,...,ωm. Because of this high order vanishing it can be shown that a high-order derivative of "F" takes a value of small size one of the ω"i"s, "size" here referring to an algebraic property of a number. Using the maximum modulus principle Lang also found a separate way to estimate the absolute values of derivatives of "F", and using standard results comparing the size of a number and its absolute value he showed that these estimates were contradicted unless the claimed bound on "m" holds.

Interpolation Determinants

After the myriad of successes gleaned from using existent but not explicit auxiliary functions, in the 1990s Michel Laurent introduced the idea of interpolation determinants [M. Laurent, "Sur quelques résultats récents de transcendance"," Astérisque 198-200, pp.209-230, (1991).] . These are alternants - determinants of matrices of the form:mathcal{M}=left(varphi_i(zeta_j) ight)_{1leq i,jleq N}where φ"i" are a set of functions interpolated at a set of points ζ"j". Since a determinant is just a polynomial in the entries of a matrix, these auxiliary functions succumb to study by analytic means. A problem with the method was the need to choose a basis before the matrix could be worked with. A development by Jean-Benoît Bost removed this problem with the use of Arakelov theory [J.-B. Bost, "Périodes et isogénies des variétés abéliennes sur les corps de nombres (d'après D. Masser et G. Wüstholz)"," Astérisque 237, p. 795, (1996).] , and research in this area is ongoing. The example below gives an idea of the flavour of this approach.

A proof of the Hermite-Lindemann theorem

One of the simpler applications of this method is a proof of the real version of the Hermite-Lindemann theorem. That is, if α is a non-zero, real algebraic number, then "e"α is transcendental. First we let "k" be some natural number and "n" be a large multiple of "k". The interpolation determinant considered is the determinant Δ of the "n"4&times;"n"4 matrix:left({exp(j_2x)x^{j_1-1}}^{(i_1-1)}Big|_{x=(i_2-1)alpha} ight).The rows of this matrix are indexed by 1 &le; "i"1 &le; "n"4/"k" and 1 &le; "i"2 &le; "k", while the columns are indexed by 1 &le; "j"1 &le; "n"3 and 1 &le; "j"2 &le; "n". So the functions in our matrix are monomials in "x" and "e""x" and their derivatives, and we are interpolating at the "k" points 0,α,2α,...,("k"-1)α. Assuming that "e"α is algebraic we can form the number field Q(α,"e"α) of degree "m" over Q, and then multiply Δ by a suitable denominator as well as all its images under the embeddings of the field Q(α,"e"α) into C. For algebraic reasons this product is necessarily an integer, and using arguments relating to Wronskians it can be shown that it is non-zero, so its absolute value is an integer Ω &ge; 1.

Using a version of the mean value theorem for matrices it is possible to get an analytic bound on Ω as well, and in fact using big-O notation we have:Omega=mathcal{O}left(expleft(left(frac{m+1}{k}-frac{3}{2} ight)n^8log n ight) ight).The number "m" is fixed by the degree of the field Q(α,"e"α), but "k" is the number of points we are interpolating at, and so we can increase it at will. And once "k" &gt; 2("m"+1)/3 we will have Ω&rarr;0, eventually contradicting the established condition Ω &ge; 1. Thus "e"α cannot be algebraic after all [Adapted from: J. Pila, "Geometric and arithmetic postulation of the exponential function"," J. Australian Math. Soc. A 54 pp. 111-127, (1993).] .

References


Wikimedia Foundation. 2010.

Look at other dictionaries:

  • Auxiliary polynomial — is a term in mathematics which may refer to:* The auxiliary function argument in transcendence theory * The characteristic polynomial of a recurrence relation …   Wikipedia

  • Auxiliary Bishop — • A bishop deputed to a diocesan who, capable of governing and administering his diocese, is unable to perform the pontifical functions; or whose diocese is so extensive that it requires the labors of more than one; or whose episcopal see has… …   Catholic encyclopedia

  • Auxiliary field Monte Carlo — is a method that allows the calculation, by use of Monte Carlo techniques, of averages of operators in many body quantum mechanical (Blankenbecler 1981, Ceperley 1977) or classical problems (Baeurle 2004, Baeurle 2003, Baeurle 2002a).The… …   Wikipedia

  • Function word — Function words (or grammatical words) are words that have little lexical meaning or have ambiguous meaning, but instead serve to express grammatical relationships with other words within a sentence, or specify the attitude or mood of the speaker …   Wikipedia

  • Auxiliary police — (also called special police or special constables) are usually the part time reserves of a regular police force. They may be armed or unarmed. They may be unpaid volunteers or paid members of the police service with which they are affiliated.In… …   Wikipedia

  • auxiliary verbs — An auxiliary verb is one that is used with another verb to form a particular tense or mood, for example be in We were pleased, have in They have gone, and do in Do you mind? Sometimes more than one auxiliary verb is used to form a tense, as in We …   Modern English usage

  • Auxiliary verb — In linguistics, an auxiliary verb is a verb that gives further semantic or syntactic information about a main or full verb. In English, the extra meaning provided by an auxiliary verb alters the basic meaning of the main verb to make it have one… …   Wikipedia

  • Auxiliary power unit — An auxiliary power unit (APU) is a device on a vehicle whose purpose is to provide energy for functions other than propulsion. Different types of APU are found on aircraft, as well as on some large ground vehicles.AircraftFunctions of APU The… …   Wikipedia

  • Function object — A function object, also called a functor or functional, is a computer programming construct allowing an object to be invoked or called as if it were an ordinary function, usually with the same syntax.Function objects are unrelated to functors in… …   Wikipedia

  • function word — noun Date: 1940 a word (as a preposition, auxiliary verb, or conjunction) expressing primarily grammatical relationship …   New Collegiate Dictionary