DEPENDING ON OTHER FUNCTIONS
ION CHIT¸ ESCU
For functions f : I → R, g : I → Ror C, f injective, one computes the n-th derivative ofF :f(I) →R or C, F(x) = g(f−1(x)). To this end, one uses the formula of Fa`a di Bruno and a formula for higher order derivatives of inverses by the present author. Three different formulae are given.
AMS 2010 Subject Classification: 26A04, 26A24.
Key words: generalized composition, generalized inverse, higher order derivatives.
1. INTRODUCTION
For an intervalI and functionsf :I →R,g:I →C, the injectivity off implies functional dependence, i.e. one can consider the functionF :f(I)→C, F(x) =g(f−1(x)), the inverse of f being defined on the image f(I).
Alternatively, writing, fort∈I:
x=f(t), y=g(t) one can eliminate t, obtaining y=F(x).
The aim of this paper is to give formulae for then-th derivativeDnF(x) in terms of the derivatives Dpf(t) and Dpg(t), p ≤ n. In case g(t) = t, one obtains formulae for the n-th derivative off−1 in terms of Dpf.
Besides the purely theoretical interest, the solution of this problem can be useful for the study of parametrical representations of the form x =f(t), y =g(t), in order to draw the representative images and to put into evidence the underlying properties.
The present paper relies heavily on our previous paper [4] and can be viewed as a continuation of it. The major result used in both papers is the fa- mous formula of Francesco Fa`a di Bruno exibiting then-th derivativeDn(g◦f) in terms of Dpf and Dpg.
Using this formula and the formula for Dn(f−1) from [4] we give three formulae for DnF.
The first formula is the major result of the paper. The second formula is a more complicated version of the first formula. The third formula is, as a
REV. ROUMAINE MATH. PURES APPL.,57(2012),2, 105-115
matter of fact, an inductive method, not using the formula of Fa`a di Bruno.
It uses polynomials in several variables and can be easily applied.
2. PRELIMINARY FACTS
Throughout the paper, Nwill be the set of natural numbers with N∗ = N\ {0},K =R orCwill be the set of scalars (real or complex) andI,J will be (non degenerate) intervals of real numbers.
For non empty setsX, Y,Z,A⊂X, B ⊂Y and functions f :X →Y such thatf(A)⊂B,g:B →Z, one can construct the generalized composition g◦f :A→Z, given via (g◦f)(x) =g(f(x)).
For non empty sets X, Y and injective f : X → Y, one can construct the generalized inverse f−1 of f which is the function h :f(X) → X, given via h(y) =x, wherex∈X is uniquely determined by the conditionf(x) =y.
Let 1≤k≤nbe natural numbers. An element (k1, k2, . . . , kn)∈N will be called (n, k)-multiindex if it has the following properties:
k1+k2+· · ·+kn=k, k1+ 2k2+ 3k3+· · ·+nkn=n.
We shall write
M(n, k) = the set of all (n, k)-multiindexes.
For instance
M(1,1) ={(1)}, M(2,1) ={(0,1)}, M(2,2) ={(2,0)},
M(4,2) ={(1,0,1,0),(0,2,0,0)}.
For any 1≤k≤n, the setM(n, k) is non empty. The (n, k)-multiindexes have combinatorial interpretation.
For p ∈ N∗, the p-derivative of a function u at x will be denoted by Dpu(x). Incidentally we shall also write
D1u(x) =f0(x), D2u(x) =f00(x), D3u(x) =f000(x).
Now, we are ready to present the famous formula of Fa`a di Bruno (ori- ginally in [2] and [3]). See also [4], [5] and [7].
Theorem 1(The formula of Fa`a di Bruno). Let I, J be intervals, t∈I and n∈N∗. Let f :I → R be n times differentiable at t, such that f(I) = J and let g:J →K be n times differentiable at f(t). Then g◦f :I →K isn
times differentiable at tand one has Dn(g◦f)(f) =
n
X
k=1
Dkg(f(t))·A(n, k)(t), where
A(n, k)(t) = X
(k1,k2,...,kn)∈M(n,k)
n!
k1!k2!. . . kn!
D1f(t) 1!
k1
·
D2f(t) 2!
k2
·. . .·
Dnf(t) n!
kn
.
In [4], we proved
Theorem 2 (The Formula for Higher Order Derivatives of Inverses).
Let I be an interval, f : I → R be continuous and strictly monotone and let J = f(I). Let t ∈ I be such that f is n times differentiable at t, n ≥ 2 and f0(t)6= 0.
Thenh=f−1 :J →I is ntimes differentiable at x=f(t) and one has, for any 1≤m≤n
Dmh(x) = (−1)m+1 Dm(t) (f0(t))m(m+1)2
. Here (see Theorem 1)
D1(t) = 1,
D2(t) =A(2,1)(t), D3(t) =
A(2,1)(t) A(2,2)(t) A(3,1)(t) A(3,2)(t)
, and for m≥4
Dm(t) =
A(2,1)(t) A(2,2)(t) 0 0 . . . . . .0
A(3,1)(t) A(3,2)(t) A(3,3)(t) 0 . . . . . .0 . . . .
A(m−1,1)(t) A(m−1,2)(t) A(m−1,3)(t) . . . A(m−1, m−1)(t)
A(m,1)(t) A(m,2)(t) A(m,3)(t) . . . A(m, m−1)(t) .
3. RESULTS
Troughout this paragraph, we shall work within the following
Framework. AssumeI ⊂Ris an interval, f :I →R is continuous and strictly monotone and g : I → K. One can consider the generalized inverse h=f−1 :J =f(I)→I and the functionF =g◦h:J →R.
We write, fort∈I:
f(t) =x, g(t) =y,
hence, for x ∈ f(I), we can write h(x) = t and F(x) = y (and even worse:
F(x) =y(x)).
To complete the framework, we shall consider n ∈ N∗ and to ∈ I such that f and g arentimes differentiable att0 and f0(t0)6= 0.
Accepting this framework, we shall give formulae for DnF(x0), where x0=f(t0).
We shall accept the well-known fact that, within the above mentioned framework, the function h isntimes differentiable atx0.
Now we are in position to prove (induction onn)
Theorem 3. The function F isn times differentiable at x0. For the first derivatives, one has
F0(x0) = g0(t0) f0(t0), (1)
F00(x0) = g00(t0)f0(t0)−g0(t0)f00(t0) f0(t0)3 . (2)
Proof. The result is valid forn= 1. Indeed, let (xn)n⊂f(I)\ {x0}be a sequence such that xn−→
n x0, wherexn=f(tn). Because h is continuous, one has tn−→
n t0, hence
F(xn)−F(x0)
xn−x0 = g(tn)−g(t0) f(tn)−f(t0) −→
n
g0(t0) f0(t0) proving (1).
Accept the result for n and let us prove it for n+ 1. Because f and g are n+ 1 times differentiable at t0, they are n times differentiable in a neighbourhood U of t0 and f0(t) 6= 0 in U, because f0 is continuous at t0. Using stepn= 1,F is differentiable in the neighbourhoodf(U) ofx0 and, for any x∈f(U), one has (write x=f(t),t∈U)
(3) F0(x) = g0(t)
f0(t) = g0(h(x)) f0(h(x)) =
g0 f0 ◦h
(x) (generalized composition).
Using (3) and the fact thathisntimes differentiable inf(U) (according to the fact that f is n times differentiable in U and f0(t) 6= 0 for t∈ U), we see that F0 isn times differentiable atx0.
Equality (2) is obtained using (3).
First Formula for DnF(x0)
We have F(x) = g(h(x)) in a neighbourhood of x0. Using the Formula of Fa`a di Bruno, we get (forn∈N∗)
DnF(x0) =
n
X
k=1
Dkg(h(x0))·b(n, k)(x0), where
b(n, k)(x0) = X
(k1,k2,...,kn)∈M(n,k)
n!
k1!k2!. . . kn!·
·
D1h(x0) 1!
k1
D2h(x0) 2!
k2
·. . .·
Dnh(x0) n!
kn
. Because (for 1≤p≤n) one has (see Theorem 2)
Dph(x0) = (−1)p+1 Dp(t0) f0(t0)p(p+1)2 we get
DnF(x0) =n!
n
X
k=1
Dkg(t0)B(n, k)(t0), where
B(n, k)(t0) = X
(k1,k2,...,kn)∈M(n,k)
1
k1!. . . kn!·(−1)(1+1)k1
(1!)k1 · D1(t0) f0(t0)1·22
!k1
·
·(−1)(2+1)k2
(2!)k2 · D2(t0) f0(t0)2·32
!k2
·. . .·(−1)(n+1)kn
(n!)kn · Dn(t0) f0(t0)n(n+1)2
!kn
. Because
(1 + 1)k1+ (2 + 1)k2+· · ·+ (n+ 1)kn=n+k we obtain finally our
FIRST FORMULA
(∗) DnF(x0) =n!(−1)n
n
X
k=1
Dkg(t0)C(n, k)(t0), where
C(n, k)(t0) = (−1)k X
(k1,k2,...,kn)∈M(n,k)
1
k1!(1!)k1 ·k2!(2!)k2 ·. . .·kn!(n!)kn·
· D1(t0) f0(t0)1·22
!k1
· D2(t0) f0(t0)2·32
!k2
·. . .· Dn(t0) f0(t0)n(n+1)2
!kn
.
PARTICULAR CASE
When g(t) = t for all t in I, one gets F(x) = h(x) and (∗) becomes (because Dkg(t)6= 0⇒k= 1 and M(n,1) ={(0,0, . . . ,0,1)})
Dnh(x0) =n!(−1)n·(−1)1· 1
n!· Dn(t0) f0(t0)n(n+1)2
!1
= (−1)n+1 Dn(t0) f0(t0)n(n+1)2 confirming the previous formula.
Second Formula for DnF(x0)
A.We start by considering the same intervalI and the pointt0 ∈I. Let u : I → R, v :I → Rbe two functions such that v(t) 6= 0 for any t ∈I and assume u and v areq times differentiable att0,q ∈N∗.
We shall compute Dq uv (t0).
Using the formula of Leibniz we obtain
(4) Dq
u v
(t0) =
q
X
p=0
q!
p!(q−p)!Dp 1
v
(t0)·Dq−p(u)(t0).
We have
1
v =ϕ◦v, where
ϕ:R\ {0} →R, ϕ(x) = 1 x and, for any k∈N∗
Dkϕ(x) = (−1)k·k!·x−k−1.
Hence, using Fa`a di Bruno’s Formula, we get for any 1≤p≤q Dp
1 v
(t0) =
p
X
k=1
(−1)k·k!·v(t)−k−1· (5)
· X
(k1,k2,...,kp)∈M(p,k)
p!
k1!·k2!·. . .·kp!·
D1v(t) 1!
k1
·. . .·
Dpv(t) p!
kp
and (4) gives
(6) Dqu
v
(t0) = q!
0!q!· 1
v(t0)Dqu(t0)+
+
q
X
p=1
q!
p!(q−p)!Dp 1
v
(t0)·Dq−p(u)(t0) =
=q!
Dqu(t0) q!
1 v(t0)+
q
X
p=1
Dq−pu(t0) (q−p)! · 1
p!Dp 1
v
(t0)
. Using(5) and (6), we get
(7) Dq u
v
(t0) =q!
Dqu(t0) q! · 1
v(t0) +
q
X
p=1
Dq−pu(t0)
(q−p)! ·Wp(t0)
, where
Wp(t0) =
p
X
k=1
(−1)k·k!· 1 v(t)k+1·
· X
(k1,k2,...,kp)∈M(p,k)
1
k1!·k2!·. . .·kp!·
D1v(t0) 1!
k1
·. . .·
Dpv(t0) p!
kp
. We think formula (7) can be of independent interest.
B. Let us come back to the initial framework. We have seen (formula (3)) that, for x in a neighourhood ofx0, one has
D1F(x) =F0(x) = g0
f0 ◦h
(x) hence (again Fa`a di Bruno’s Formula)
DnF(x0) =Dn−1(D1F)(x0) =Dn−1 g0
f0 ◦h
(x0) = (8)
=
n−1
X
q=1
Dq g0
f0
(h(x0) =t0) ·b(n−1, q)(x0),
where (practically, we repeat the notation) b(n−1, q)(x0) =
= X
(a1,...,an−1)∈M(n−1,q)
(n−1)!
a1!·. . .·an−1!·
D1h(x0) 1!
a1
·. . .·
Dn−1h(x0) (n−1)!
an−1
. We lay stress upon the fact that formula (8) is valid forn≥2.
In order to write explicitely (8), we use (7) with u = g0, v = f0, the formula
Dmh(x0) = (−1)m+1 Dm(t0) f0(t0)m(m+1)2
and the fact that, for (a1, a2, . . . , an−1)∈M(n−1, q) one has 2a1+ 3a2+· · ·+nan−1 =
=a1+ 2a2+· · ·+ (n−1)an−1+a1+a2+· · ·+an−1 = (n−1) +q.
We get, finally, our
SECOND FORMULA (valid forn≥2)
(∗∗) DnF(x0) = (n−1)!(−1)n−1
n−1
X
q=1
q!(−1)qAq(t0), where
Aq(t0) =Bq(t0) X
(a1,...,an−1)∈M(n−1,q)
1
a1!(1!)a1 ·. . .·an−1!((n−1)!)an−1·
· D1(t0) f0(t0)1·22
!a1
·. . .· Dn−1(t0) f0(t0)(n−1)n2
!an−1
and
Bq(t0) = Dq+1g(t0)
q! · 1
f0(t0) +
q
X
p=1
Dq−p+1g(t0) (q−p)! ·
p
X
k=1
(−1)k·k! 1 f0(t0)k+1 ·
·
X
(k1,...,kp)∈M(p,k)
1 k1!·. . .·kp!
D2f(t0) 1!
k1
·. . .·
Dp+1f(t0) p!
kp
. Comment.Of course, formula (∗∗) is much more complicated than for- mula (∗) and, from practical point of view, less useful. We think this formula can be used to obtain some new identities.
Third Formula for DnF(x0)
In order to present our third formula, we introduce the sequence (Pn)n≥1
of uniquely determined real polynomialsPn(X1, X2, . . . , Xn;Y1, Y2, . . . , Yn) in 2nvariables, given as follows:
(i) P1(X1;Y1) =Y1,
Pn+1(X1, X2, . . . , Xn+1;Y1, Y2, . . . , Yn+1) = (ii)
=
n
X
i=1
∂Pn(X1, X2, . . . , Xn;Y1, Y2, . . . , Yn)
∂Xi ·X1Xi+1+
+
n
X
j=1
∂Pn(X1, X2, . . . , Xn;Y1, Y2, . . . , Yn)
∂Yj ·X1Yj+1−
−(2n−1)Pn(X1, X2, . . . , Xn;Y1, Y2, . . . , Yn)·X2. The first three polynomials:
P1(X1;Y1) =Y1⇒ ∂P1(X1;Y1)
∂X1
= 0 and ∂P1(X1;Y1)
∂Y1
= 1⇒
⇒P2(X1, X2;Y1, Y2) =X1Y2−X2Y1
P3(X1, X2, X3;Y1, Y2, Y3) =X12Y3+ 3X22Y1−X1X3Y1−3X1X2Y2. THIRD FORMULA
DnF(x0) = (∗ ∗ ∗)
= Pn(D1f(t0), D2f(t0), . . . , Dnf(t0);D1g(t0), D2g(t0), . . . , Dng(t0))
(D1f(t0))2n−1 .
The proof will be performed via induction onn.
Forn= 1, our assertion is true (see (1)):
D1F(x0) = D1g(t0) D1f(t0).
Let us accept the assertion for n and let us prove it for n+ 1. So, our hypothesis is thatfandgaren+1 times differentiable att0, withD1f(t0)6= 0.
It follows that f and g aren times differentiable in a neighbourhoodU of t0, with D1f(t)6= 0 for allt∈U (continuity ofD1f).
The induction hypothesis exhibits a n-degree homogeneous polynomial Pn(X1, X2, . . . , Xn;Y1, Y2, . . . , Yn) such that, for anyx=f(t)∈f(U), one has
DnF(x) = Pn(An(t)) f0(t)2n−1 ,
where An(t) = (D1f(t), D2f(t), . . . , Dnf(t);D1g(t), D2g(t), . . . , Dng(t)).
Writingu:U →K,u(t) =Pn(An(t)), we get DnF(x) = u(t)
f0(t)2n−1. It follows (becauseD1h(x0) = (f0(t0))−1) that
Dn+1F(x0) =D1
u (f0)2n−1
(t0)·D1h(x0) =
= u0(t0)f0(t0)2n−1−(2n−1)u(t0)f0(t0)2n−2f00(t0)
f0(t0)4n−2 · 1
f0(t0) =
= u0(t0)·f0(t0)−(2n−1)u(t0)f00(t0)
f0(t0)2n+1 = W(t0) f0(t0)2(n+1)−1. Here
W(t0) =
= n
X
i=1
∂Pn
∂Xi
(An(t0))Di+1f(t0) +
n
X
j=1
∂Pn
∂Yj
(An(t0))Dj+1g(t0)
D1f(t0)−
−(2n−1)Pn(An(t0))D2f(t0) =Pn+1(An+1(t0)).
We succeeded in proving the implication DnF(x0) = Pn(An(t0))
f0(t0)2n−1 ⇒Dn+1F(x0) =Pn+1(An+1(t0)) f0(t0)2(n+1)−1 and the induction proof is complete.
Verification. Using the previous formulae forP1,P2,P3 we get D1F(x0) = g0(t0)
f0(t0),
D2F(x0) = f0(t0)g00(t0)−f00(t0)g0(t0) f0(t0)3 , D3F(x0) =
= f0(t0)2g000(t0) + 3f00(t0)2g0(t0)−f0(t0)f000(t0)g0(t0)−3f0(t0)f00(t0)g00(t0)
f0(t0)5 .
Remarks. 1. We have
∂
∂Xi(X1α1. . . XnαnY1β1. . . Ynβn) = either 0 (in caseαi= 0) or αiX1α1. . . Xiαi−1. . . XnαnY1β1. . . Ynβn (in caseαi6= 0)
∂
∂Yj(X1α1. . . XnαnY1β1. . . Ynβn) = either 0 (in caseβj = 0) or βjX1α1. . . XnαnY1β1. . . Yjβj−1. . . Ynβn (in caseβj 6= 0).
We have P1(X1, Y1) =Y1, henceP1 is 1-degree homogeneous.
These facts, toghether with the recurrence formula (ii), imply that either Pn = 0 for some n (hence for all m ≥n) or Pn is n-degree homogeneous for all n. But Pn 6= 0 for any n, because one can take f :R → R,f(t) = t and g:R→R,g(t) =tn,n∈N∗henceF(x) =xnandDFn(x)6= 0, which implies (see formula (∗ ∗ ∗))Pn is not null.
Conclusion: all Pn aren-degree homogeneous polynomials.
2. The third formula is easily used in order to compute succesively D1F(x0),D2F(x0), . . . , DnF(x0), becauseP1, P2, . . . , Pnare easily computed.
Unfortunately, we have not a general formula forPn.
3. In order to obtain a recurrence for computing Dnh(x0), we have to work in the particular case g(t) ≡ t. Hence, one can take in the recurrence formula (ii): Y1= 1 andYk= 0, fork≥2 (becauseD1g(t) = 1 andDkg(t) = 0 for k≥2).
We get
Dnh(x0) = Pn(D1f(t0), D2f(t0), . . . , Dnf(t0); 1,0,0, . . . ,0)
f0(t0)2n−1 .
REFERENCES
[1] T.M. Apostol, Calculating higher derivatives of the inverses. Amer. Math. Month.107 (2000), 738–741.
[2] Cavaliere Francesco Fa`a di Bruno,Sullo svillupo delle funzioni. Annali di Scienze Mate- matiche e Fisiche6(1885), 479–480.
[3] Cavaliere Francesco Fa`a di Bruno,Note sur une nouvelle formule du calcul diff´erentiel.
Quarterly J. Pure Appl. Math.1(1857), 359–360.
[4] I. Chit¸escu, The Formula of Fa`a di Bruno and higher order derivatives of inverses.
Analele Univ. Buc. Mat.57(2008), 269–284.
[5] W.P. Johnson,The curious history of Fa`a di Bruno’s Formula. Amer. Math. Month.109 (2002), 217–234.
[6] W.P. Johnson,Combinatorics of higher derivatives of inverses. Amer. Math. Month.109 (2002), 273–277.
[7] S. Roman,The Formula of Fa`a di Bruno. Amer. Math. Month.87(1980), 805–809.
Received 26 April 2012 University of Bucharest
Faculty of Mathematics and Computer Science Academiei Str. 14
010014 Bucharest, Romania [email protected]