GENERALIZING THE ARITHMETIC GEOMETRIC MEAN
A HAPLESS
COMPUTER
EXPERIMENT
JAAK PEETRE Matematiskainstitutionen
Box
6701S-113 85 Stockholm Sweden
(Received
May 1, 1988 andinrevised formSept.20,1988)
ABSTRACT. The paper discusses the asymptoticbehavior ofgeneralizations of theGauss’s
arithmetic-geometricmean, associated with the names Meissel
(1875)
and Borchardt(1876).
The"hapless computer experiment"inthetitlerefers to the fact that the author atanearlierstagethought that onehad genuine asymptotic formulae butit isnow shown that ingeneral
"fluctuations"arepresent.
However,
novery conclusiveresultsareobtainedsothe paper endsin a conjecture concerning the special rSle of the algorithms ofGauss and Borchardt. The paperdiscussesthe asymptoticbehaviorofgeneralizationsof the
Gauss’s
arithmetic-geometricmean,associated withthenamesMeissel
(1875)
and Borchardt(1876).
The"hapless computer experiment"inthetitlerefers tothe fact that the author at anearlierstage thought that onehad genuine asymptotic formulae butit is nowshown thatingeneral"fluctuations"arepresent.
However,
novery conclusiveresultsareobtainedsothe paper endsinaconjecture concerningthespecialrSleof thealgorithmsofGaussandBorchardt.
KEY
WORDS AND PHRASES.Arithmetic-geometric mean, iteration,algorithm,asymptotic formula1980
AMS
SUBJECTCLASSICATION CODES.
26E99, 26-03, 01A550.
INTRODUCTION.
Ihavenowworkedon algorithmsof thetype ofGauss’sarithmetic-geometricmean (agM.)
foraperiod of nearly4 years(startingaround the turn of the year
83]84).
Strangely enoughsome of the impetus for getting interested in this field came from the theory of
(abtract)
interpolation. This connection is described in my talk to theVarna conference inMay/June 1984[P1].
ThesameyearI
also preparedanoverallsurveyof"means
andtheiriterations" for the XIXth Nordic MathematicalCongress
in Reykjav[ACJP]
(with
J.Arazy,
T. Claesson andS. Janson ascoauthors).
Itookupthesamesubject the yearafter formyaddresstothe A.Haar MemorialConferenceinBudapest[P2],
whichisa collection of"unsolved problems",someofthempertainingtothe
agM.
Inparticular, suggestedthere acertain approximationfora2-dimensional algorithmderivedfrom the
"agM."
correspondingtothe cyclicgroupC3
withthreeelements analogoustoanasymptoticformulain anotebyE.Meissel
[M]
andalsonumerical data. Hence the second link ofthe title.
(I
believe now that the claim in[P1]
concerningtheasymptoticbehaviorof iterated powermeansexperiencesthesamefate.)
Thusthe question offinding good (asymptotic) approximations for the algorithm ofthe type in question isby andlargeopen.
The main purpose of this paper is to provide the reader a general background for this
problem, and to outline themeagerprogress myself havemade onit. Maybe, canthereby inspireotherpeopleto continuewhere stopped
(failed)...
As
Grunert’s
Arehiv isnowadaysvirtuallyinaeeesibleformostreadersandasthecommonofknowledgeofWestern languages,Englishexcluded,amongmathematicians is in suchasorrow
state, haveincluded atranslationof Meissel’s note
[M]
in eztenso(see
Appendix).1.
THE
agM.
G.F. Gauss
[1777-1855]
studiedtheagM.
from anearly age on(some
say14).
Let a,bbe tworealnumbers, 0<
b_<
a<
cz. Taking succesivelyarithmeticand geometricmeans wegeta",
andb,b’,
b" with two sequences a,a’,
a+b
b’
2a’
b’a"-
+
b"-2
Itis easy toseethattheyconvergetoa commonlimit,
M(a,
b)tlim
a,, limb,,,called the
agM.
ofa and b.(We
have b_<
b’_<
b"_<
_< a"
_<
a’ _<
a and.,_
e
(/z-
)
2
5
(
v-v
-
)=
<5
l(a-)
1
([G],
p.28).
The
agM.
and other related"means"
arediscussed inthe survey[ACJP]. (For
a more ex-tensivetreatment seee.g. the book[BB].)
Fromtherewerecallonlythe following.Basic properties of theGauss
agM.:
1)
Extremelyrapid convergence("quadratic" inthetechrficalsense).
2)
Integralrepresentation. As Gauss discovered,onehas-
[,,/2
M(a,b)
aode
v/a2
cos+
b2sin2’
where the integral to the left is a complete elliptic integral of the first kind. In standard
notation, writing k
b/a
("modulus"),
k’y/1- b2/a
2("complementary modulus"),
thelatterisjust
g(k’)/a
(cf. [BB],
theorem1.1).
Thusthe formulacanalsobe writtenM(a,b)=a
1
3) Differential
equation. The function y satisfies a differential equation vithM(1, k)
algebraiccoefficients
(known
as Legendre’sdifferentialequation)k(k
1)y"
+
(3k
1)y’
+
ky O,whichafterachangeofvariablebecomesaspecialcaseof thehypergeometricequation,likewise consideredbyGauss.
4) Uniformization.
Writinga and binthe forma
Mp2(x),b= Mq2(x)
(M
M(a,b)),
wherep and q are, what Gauss
calls,
summatoricfunctions(=
thetavalues; in conventional notationp(x)
Ooo(O,t),
q(x)
Ool(O,t)
wherexeit),
the algorithm reducestoz x2.
5)
Complexvalues. Using theuniformization it ispossibletoextend the algoritmtothecase of complexvaluesofa and b. Then thelimitM(a, b)
is not any longerunique and the valuescorresponding to different"determinations" ofit are related by a modular transformation. This isalsooneofthehistoric rootsofthetheory of modularfunctions.
6)
Asymptoticformula.
CentralinGauss’streatmentof theagM.
isthe asymptotic formulaM(1,
k)
(k
0).
log
I
canbereadilyderivedfrom thefollowingformulaforthe uniformizing parameter, likewiseduetoGauss
(see
again thediscussion in[nn]):
2. BORCHARDT’S GENERALIZATION OF
THE agM.
WITH FOUR "ELEMENTS". C.W.Borchardt[1817-1880]
was astudent and closefriendof Jacobi’s. Healso editedvolumeoneof Jacobi’s collected works and became theeditorof"Crelle’sJournal" after Crellc’sdeath;
therefore thisjournal wasfor sometimeknown as "Borchardt’s Journal". He proposed as a generalization of the
agM.
thefollowingschemebasedontheiterationof the transformationbf
C
a+b+c+e
4 2 2Itturnsout thatthisalgorithmhasatheory entirelyparallelto
Gauss’s.
Inparticular,onehas propercounterpartsof all the properties1)-6)
mentionedinSec. 1,with thepossible exception of6).
The "uniformization" isnowobtained by thetafunctions in two variables andinplaceof the elliptic curve now enters
Kummer’s
quartic surface. Borchardt also briefly indicates a2’*-dimensionalgeneralization, but forvarious reasons thisalgorithmis far more defective,largelybecause thetafunctions innvariablesdo notsuffice forpurposes ofuniformization. We quote
([Bore],
p.621):
"... but of which kind thesetranscendentalfunctions are, in terms of which thelimitcanberepresented,isaquestion whoseanswermustbeleft for thefuture".3. THE
"MONSTER"
ALGORITHM.
In
[ACJP]
it ispointed out that bothGauss’s
andBorchardt’s algorithm are specialcasesofthefollowingvery generalconstruction. Let Gbeany compact group. Let
f
be apositivetype
(in
fact, inL’(G)).
Similarly,f"
v/-f
v/
is a continuous function(in
C(G)).
Continuingweobtainasequence of functionswhich,as is provedin[ACJP],
tends pointwise,in fact uniformly, toa constant functions, denoted
M(f)
orMa(f) (and
identified with the correspondingnumber).
Properties:
O) M(1)=
1.1) M(Af)
AM(f) (A
>
0)
(homogeneity).2)
inff
< M(f) <
supf.
3) M(f)
<
M(9)
iff
<
9 a.e. (monotonicity).4)
M(f
+
9) >
M(f)
+
M(9)
(concavity).Itislikewise mentioned in
[ACJP]
that ifwepass tocomplex valuedfunctionsf, assuming thatRef >
0,thesameconvergence holdstrue, butthis ismuch harderto prove.EXAMPLE. Forinstance, ifG
ZI"
andiff
isanalyticinsidetheunit diskD
(we
make theidentification
OD
Z/’)
thenM(f)= f(0).
Wearehowevermostlyinterested inthe caseoffinitegroups.
EXAMPLE. IfG
C2 (the
cyclic groupwithtwoelements)
weget backGauss’s algorithmandifG
C2
x6’2
weget Borchardt’s algorithm. Borchardt’s generalizationwith 2"elements similarlycorrespondstoGC’.
Thefirst non-classicalcaseisthusGC3 (the
cyclic groupwiththree
elements).
Spelledoutexplicitlyit isthus questionofiterating themapa+b+c
a
3
2v+c
3
c’=
2/"+
b3
4.
ALGORITHMS
DERIVED
FROMTHE
"MONSTER"
ALGORITHM.Let usreturn to the case ofageneralgroup G. For simplicitywe take Gfinite. Let
H
beanysubgroup of
G.
Werestrict attentiontofunctionsf
which areconstant onH
andonthecomplement HC
G\H,
i.e.{a
/-/f=
bH
Then theiteratesareof thesametype:f,=
{
a
+
(1
-)b
fe=d
a’
’
+
(1
-)b
fe_d
b’f’=
7+
(1
)b’
whereN
la"
HI
IGI/IHI
isthe indexofH
in G.(By
Lagrange’s
theoremwe knowthat Nisaninteger.) Thuswe arelead toconsider, quitegenerally,the2-dimensional algorithm(1)
a’=
0(1
a)
+
0bb’
20V/’
+
(1
20)b
0(1
a)
+
Ob0(
v/)
2’ where0is anynumberin the interval(0,
1/2].
Restricting attention to functions
f
which are constant on the setsG,\G,-1
(i
1,...,r),
that is,
f[al
al,f[a,\al
a2,f[a.ka.-I
ar
with[GI[--
o1,[G2\GI[
a:,IG\G-ll
,
1+-.-+r
Ial,
we refacedwiththe r-dimensional gorithm(0,
,/Ial)
a
01
a+
+
Ora
201+
(1’)
a
201 + 202 +
(03
01
02)
3+
04a
4+
+
Or
a=
20a+202+...+20_a_aa+(O-Oa-...-O_a)a
Noticethat one cMsowrite
=’
011
+
+
0=
0
(
)=
0=(
)=
-0_,(-
)
PROOF OF
(1’).
Indeedconsidertheequation jk.Assumefirstthat
G1.
Thenwehave thefollowingpossibilities:j
Ga,
kG1
coesponds toateal, jG2G1,
kG2G
correspondstoate2a jGG-I,
kGG_i
corresponds toaterma.
This accountsfor the foula fora.
Next smethat
G2G.
Thenwehave thefollowingpossibilities: j GG2,kG2
G
orjj
=,
a=ka
opona
tore= (= -)==
j
GsG2,
kGzG
correspondstoatesas
j
GG_x,
kGG_a
corresponds toatermaa.
Thisaccountsforthe foula fora.
In the se way one treats the ce
GsG2.
The proofis concluded by inductionargument.#
5. MEISSEL.
In
a short note[M]
(of.
Appendix to thispaper)
published in 1874 in Grunert’s Archiv(D.E.)F.
Meissel[1826-1895],
whowas headmaster of a secondary schoolin Kiel, Germany,toward the end of the last century, considerstheiteration a+b+c
+ac
+
bcv/ab
3Inparticular,he states withoutproof the followingasymptotic formulafor thecorresponding
limit
M(a,
b,c):
(2)
M(1
1c)(
A
B
)x
c-Olog
3
3I+
V3and
A
andB
arecertainsconstantsA
0.3951642...,B
10’299z49"’’.
(The
exactmeaningof isnot clearfrom thecontext; actually,Meissel himself writessimplyinformula
(2).)
In
Sec. 8 willgiveanattempttojustifyMeissel’s formula(1).
REMARK. In
the excellent book[BB],
p. 268-269, this algorithm is called Schl6milch’salgorithm and in this context reference is made to a paper by Schoenberg’s
[Scho],
whichhowever has been inaccessible to me. have checked with the index of Jahrbuch up toand
includingtheyear1903
(the
yearofmymother’sbirth)
andfoundonlyonepaperof SchlSmilch’sdealingwith iterationof means,namely
[SchlS],
butitdoesnot seemtobeveryrelevantinthe present context.6. DISCUSSION
(THE
COMPUTER
EXPERIMENT).
Guidedbythe asymptoticformulae by Gaussand Meissel suggestedin
[P2]
the followingapproximtion ofthelimit
M(a,
b)
of theiterationsof thetransformation(1)
inSec. 4" 1(3)
M(1,b)A
4(b-,O)
(og
)
made also numerical experiments on a minicomputer, which to some extent seemed to
support myguess. Herearesomevalues for the constant
A
obtained: N=2A
1.570796327...N=3
A
2.3410... N=4A
3.289868133...N=5
A
4.420...N=6
A
5.738...N=7
A
7.251...N=8
A
8.96...I thought at thetime that had arigorous proofof
(3)
interpreted as agenuine asymptotic formulaso didnotpaymuchattention tothefluctuations inthenumericaldata,whichseemed to increase withN. Of course, as(3)
reduces to Gauss’sformulaifN 2thereare nosuchfluctuations in thiscase and also not in Borchardt’scase
(N
4).
ThereforeBorwein’s letter[Borw]
cameas asurprise, if notashock.However,
nowquicklyrealized that the fluctuationsreallywerepart of the picture
(in
all othercasesbutN 2,4)
and not something connectingwiththe insufficiency of the numerical device available tome. This will be explained in the next Section.
Thenumerical experiment thussuggests only the following: There is an exact asymptotic
formula
onlyif
N 2,4 and there can be an integralformula .for
the limitof
theGauss-Borchardt type only in these two cases.
But,
emphasize, this is somethingthat have notprovedsoitistherefore just question of another conjecture.
In
the formercaseA
r/2
exactly and the lattercaseprobablyA
2/3.
7. EXPLANATION OF
THE
FLUCTUATION.Ininhomogeneousnotation
(M(b)
M(1, b))
thefunctionalequationfor thelimit/llof the(4)
M(b)-g(b)M(b’),
wherewefor conveniencehave put
()
+
(
).
It is easy to seethat the only solution ofthe functional equation
(4)
whichis analyticnear b 1 normalizedby the requirementM(1)
1comes as aninfiniteproduct(5)
M(b)
g(b)g(b’)g(b")...
This is connected with the fact that b 1 is anattractive
(even
hyperattractive) fixed pointofthe map
20vf
+
(1
20)
b-b’
+
(
)
andthat
g(1)-
1,whichguarantees the convergence of the productin(5). (In (5)
b’,b",...,
ofcourse,aretheiteratesofbunderthis
map.)
Next,
consider insteadthefunctionalequation(6)
Z(b)
h(b)Z(b’),
wherenow
h(b)
g(b)
0+
(1
0)b
b 0 is arepulsive fixedpoint for themapb
-
b’,
hence an attractive (againevenhyperat-tractive)
onefor the inverse map bH’b.(If
b issmall then b’2v/
so that fortheinverseholdsbH ’b
.
b2/4.)
Alsoh(0)
1. It follows, by thesamereason asbefore, thatthere is auniquesolutionof
(5),
analyticat b 0 and normalizedbyZ(0)
1. OnecanthuswriteZ(b)
1+
Zlb
+
z2b
2+
andit ispossible to write down arecursion forthe coefficients zn,whichgeneralizes the one
usedby Gauss
[G].
Finally,considerN()
(N(’))
.
This isessentially Bhttcher’ equation forthe inverse map
(see
e.g.[ACJP]).
It follows that there existsaunique solution which admits the expansion4
N(b)
-
+
no
+
nb
+
n2b
:+
thusinpartcular satisfying
4
N(b)
-
(b
0).
REMARK.
ThelettersZ
andNarepicked in honor ofGauss
(for
GermanZhhler,
nominator, andNenner,
denominator, respectively). Do not confuse the function NN(b)
and the previous integer valued parameter N(=
theindexof thesubgroup H thefinitegroupG);
seeSec. 4.
Nowit isclear that thefunction
satisfiesthe samefunctionalequation
(4)
as our meanM. The quotient isafunction which is invariantunderb b’. It follows thatwehavewhereOisan"oscillatory" function, thatis, invariant under the transformation b b’
(O(b)
O(b’)).
Nowthe presenceof
thefluctuations
is completely explained. Ouraboveconjecture(end
ofSee.
6)
amountstherefore toO constiffN 2,4.REMARK. A
naturalparameternearb 0seemstobe1/N(b)
or evenbettert=2
1 log log
N(b)
log2(If
b b then+
2r.)
Therefore one can go a step further an ask for the Fourierdevelopment of thefunctionO:
O
A0
+
A1
cost+ Ba
sin+
A2
cos2t+
B2
sin2t+
Weseethusthat, inaway,what wehave observedinthenumericalexperimentisjust the 0-th
coefficient
A
A0
inthis expansion.PROBLEM: To account forthe"higher"
terms!
8. ATTEMPTS TO JUSTIFY MEISSEL’S FORMULA.
Wereturn to Meissel’s algorithm
(See.
5),
denoting itslimit byM(a,b,c).
The following propertiesofM(a,
b,c)
areobvious:1) M(Aa,
Ab,c)
,kM(a,
b,c)
(homogeneity),2) M(1,1,1)--
1(normalization),
3) M(a,
b,c)
M(a’,
b’,
c’)
(invariance).
Nowwemake the"Ansatz":
M(a,b,c)
A(log
B---)
-’x,
where
A
A(a, b)
issupposedtobehomogeneousofdegree 1andB
B(a, b)
homogeneousofdegree0. Pluggingthis intothefunctionalequationwefind
B
_
A(a.+b
b
3B(a+b
b
3"-’
A(log
-)
3
)-(log
))-Write
A’
A(a’, b’),
B’
B(a’, b’).
Thelog-factorstotheleft andtothe right canbe writtenas
(log
1+
logB)
-respectivelyThis gives
1 1 1 1
B_a
3a(
1 1log
-c
+
log-
+
log log-c
+
log-
+
3logB’
A
A3
].
and
logB=log-+31og
or B= aba’=
a+b 3b’=
--b
3 xvhich inthe base
(t
a/b)
inducesthemapt+l
havingonerepulsivefixedpoint k-1 determined
(as
in[M])
by the equation
k(1
+
k)
3. Theb-equationcanbe writtenHencethe A-equationissatisfiedby
vith
whence
(as
in[M])
TheB-equationcanbewritten
or, ininhomogeneousnotation,
Inparticular,
Thatis,
oriterating
3
3lk
V3’
log(1
+
t)
log3
(b’)
a’
bB(
o’
1)
--g-
(B( b-7,
1))3
tl/2
(t)
--.
((t’))
.
B(t)
B(k_l)
V,B(k_I))
1 1 1
B()
B(
-)
()
.(.
,)
.(.
,,)r
ThusAand
B
do"exist". Itishowevernot atall cleartomein whatsenseM
isapproximatedintermsofthe functions
A
andB.NOTE
(added
Sept.1988).Recently
J. and P. Borwein sent me a truly marvelous paper a+3b)"
whereinparticular theconjecture entitled"Onthemeaniteration(a,
b)
--4 2
aboutthe asymptoticbehaviorofthe BorchardtmeanattheendofSec. 6 isfullyestablished. Even more, the authors show that the mean in question shares with the
agM.
practicallyREFERENCES
ACJP. Arazy,
J., CLAESSON, T., JANSON, S.
andPEETRE,
J.,
Meansandtheir iterations.In:
STEF/NSON,
JSn R.(ed.),
Proc.of
the Nineteenth Nordic Congressof
MathemMicians,Reykjavlk,1984,pp. 191-212. IcelandicMath.
Soc.,
ScienceInst.,
Univ. ofIceland, Reykjavlk,1985.
Bore.
BORCHARDT, C.-W.,
Ueberdasarithmetisch-geometrischeMittelausvierElemcnten.Monatshefte
Akad. Wiss. Berlin,Nov. 1876, pp. 611-621.Borw.
BORWEIN,
P.,
personalcommunication,Jan. 1986.BB. BORWEIN, J. and
BORWEIN,
P.,
Piand theA
GM.A
study in analytic numbertheory and computational complexity. Wiley,NewYork Chichester-Brisbane-Toronto-Singapore, 1987.G.
GAUSS, C.F.,
Bestimmung der Anziehungeines elliptischenRin.fles.
Nachlass zur Theoriedes arithmetisch-geometrischen Mittcls und der
Modulfunktion.
Ubersetz yon Dr. HaraldGeppert.
Ostwald’s Klassikerder exakten Wissenschaften 225. A-kademische Verlagsgesell-schaft,Leipzig, 1927.M.
MEISSEL,
E.,
Bemerkungen zur hypergeometrischen Reihe. Arch. Math. Phys.(R.
Hoppe)
57(1875),
446-448.(Translated
inthe Appendixbelow.)
P1.
PEETRE,
J.,
On Asplund’s averaging method- the interpolation(function)
way. In:Constructive Theory
of
Functions, Proc.of
the Internat.Conf.
on Constructive Theoryof
Functions,Varna,
May
27 June 2, 1984,pp. 664-671. Publishing Houseof theBulg. Acad. Sci., Sofia, 1984.P2.
PEETRE,
J.,
Someunsolvedproblems. In: Colloquie Math. Soc. Jdnos Bolyai49.
Alfred
HaarMemorial Conference,Budapest, 1985,pp. 711-735. NorthHolland,
Amsterdam,
19S6. Schl.SCHLMILCH,
O.,
Hyperarithmetischeundhypergeometrische Mittel nebst geometri-scherAnwendungen. Schlmilch Z. 34(1889),
59-63.Scho.
SCHOENBERG,
I.J.,
Onthearithmetic-geometricmean. Delta 7(1977),
4-67.APPENDIX.
TRANSLATION OFMEISSELS’S
NOTE[M].
REMARKS ON
THE
HYPERGEOMETRIC SERIES.Recently communicatedtoProfessorRfimkerinHamburgthe followingrelation:
de
f0
de
which I incidentally found as a byproduct in investigations concerning the Gauss hyper-geometricseries. Anotherrelation is
"
de
cos(
7rcf0
"
(1
k:sin)+’r
-’)
V/i-
k sin do notremember havingseenitanywhere.What
I
reallyaimedatwasreallyanextensionof the investigation, originatingfromLagrange
and further amplifiedby
Gauss,
of thelimit to whichthesequence of(successive)
arithmeticand geometricmeansof twonumbers aandbconverges, and consideredtheequations
3an+
=an
q-bn
h-Cn3b+a
=a,b,+
anCn+
bncn
writing
aoo
boo
coo
M(a0,
b0,c0).
Inparticular, wasinterestedintheextremalcase
a0=l; b0=l;
co
verysmall;
and found then as alimitofaoo
boo
coo(1)
M(1,1,
w)=(
A
B
),
log
where
A, B, A
areconstantsandinfact*
1log(1
+
k)
log3 where kisthe root of the equation(
+ )’
.
The valueofAis
A
0, 43331485.Forsimplicity wetake thelogtobeBriggian and then the values of
A
andB
becomeapprox-imtely
A
0,3951642log briggB 0,2997049
havecarriedoutthe computationsforvery smallwand, forexample, found directlythat 1
forw
1--0-,
M 0, 167832571
forw
106561
M
0,01483609