A NNALES DE L ’I. H. P., SECTION C
R. T. R OCKAFELLAR
Proto-differentiability of set-valued mappings and its applications in optimization
Annales de l’I. H. P., section C, tome S6 (1989), p. 449-482
<
http://www.numdam.org/item?id=AIHPC_1989__S6__449_0>
© Gauthier-Villars, 1989, tous droits réservés.
L’accès aux archives de la revue « Annales de l’I. H. P., section C » (
http://www.elsevier.com/locate/anihpc) implique l’accord avec les condi- tions générales d’utilisation (
http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
MAPPINGS AND ITS APPLICATIONS IN OPTIMIZATION
R.T. ROCKAFELLAR
_ - . _ . _ - -- ---- -- ---- -- -
Department of Mathematics, University of washington, Seattle,
USAAbstract.
Proto-dinerentiability
of a set-valuedmapping (multifunction)
G fromone Euclidean space to another is defined in terms of
graphical
convergence of associated differencequotient
multifunctions. The nature and consequences of theproperty
areinvestigated
in considerable detail toprovide
a basis forapplica- tions. Applications
are demonstrated for thetheory
ofoptimization by verifying
theproto-differentiability
of some of the mostimportant
multifunctions in thattheory, specifically
multifunctionsgiving
the set of solutions to aparameterized system
of constraints or to aparameterized
variationalinequality
or collectionof
optimality
conditions. The fact that such multifunctions areactually
differen- tiabie in this ageneralized
sense has notpreviously
been detected.Keywords: Multifunctions, proto-differentiability, semi-differentiability. Lipschitz properties, generalized equations,
variationalinequalities, perturbations
of cons-traints,
set convergence, nonsmoothanalysis, optimization.
* This research was
supported
in pdrtby
grants from the Xátiolial Science Foundation and the Air Force Office of Scientific Research at theUniversity
ofWashington.
Seattle.1. Introduction
’
Set-valued
mappings,
or multifunctions as we shall call themhere,
arise in several ways in connection withproblems
ofoptimization.
In atypical
situation one has aproblem (P(u))
in
Rn
thatdepends
on a parameter vector u E multifunction G :Rd =4
R" can bedefined
b; letting G(u)
denote the set of allpoints z satisfying
the constraints of(P(u));
which could be
equations
andinequalities involving
certain functions with both x and u asarguments. Alternatively G(u)
could be the set ofoptimal
solutions to(P(u))
or theset of
points
xsatisfying
a collection of necessary conditions foroptimality,
and so forth.Multiplier
vectors could also be involved:G(u)
could consist of thepairs (x, ~/)
E Rn xsatisfying something
like the Kuhn-Tucker conditions forproblem (P(u~~,
for instance.In all these cases the multivaluedness of G is an inherent
feature.
or at least astrong possibility.
Even whenG(u)
is theoptimal
solution set, it may contain more than oneelement for certain choices of u that cannot
realistically
be left out of consideration.Such circumstances raise serious difficulties for the
study
of howG(u)
canchange
relative to
changes
in M. Classical notions ofcontinuity
anddifferentiability, developed
forfunctions rather than
multifunctions; obviously
do notapply.
The
goal
of this paper is to demonstrate that a wide class of multifunctionsimpor-
tant in
optimization
nonethelessenjoys
a property that we callproto-differentiability.
Tohelp
withunderstanding
the consequences of thisfact,
it is essential that the nature ofproto-difierentiability
be illuminated at the same time. This we do from severalangles, exploring
inparticular
therelationship
ofproto-differentiability
to truedifferentiability
and an attractive concept of
semi-differentiability.
From a
geometric point
ofview, proto-differentiation
of a multifunctioncorresponds
to
looking
at certain tangent cones to thegraph
of the multifunction. Such a pattern ofanalysis
has beenpioneered by
Aubin~11. [2]. f31.,
It hasalready
been shown toyield
much information of use in connection withoptimization
and alliedsubjects.
Aubinhas focused
chiefly
on thecontingent
cone and the Clarke tangent cone, while Frankowska~5~, ~7~. ~8~
has made potent use of an intermediate type of tangent cone(first
treatedby Ursescu [20]).
which in this paper is called the derivative cone. Thedistinguishing
featureof
proto-differentiability
is itsrequirement
that thecontingent
cone and derivative conecoincide. In this it may be
compared,
as far as the geometry ofgraphs
of multifunctions isconcerned,
with thestudy
oftangential regularity
in the sense of Clarke[9]. Tangential
regularity requires
thecontingent
cone to coincide with the Clarke tangent cone rather than’merely
with tbe derivative cone. This is a stronger property whichcorresponds,
in the caseof
multifunctions,
to a concept we refer to as strictproto-differentiability.
AU this tangent cone
terminology
could bebewildering
to someone not accustomedto
it,
so the reader may beglad
to know thatproto-differentiability
can be defined in arelatively simple
and natural way without it. The fact that this property iscommonly present
for the multifnnctions of interest inoptimization
has notpreviously
beenapplied
or even
recognized. Roughly speaking,
multifunctionsexpressing feasibility
turn out tobe proto-differentiable
when anappropriate
constraintqualification
issatisfied,
whereas thoseexpressing optimality
or thelike,
such assubgradient multifunctions,
are proto- differentiable when theparameterization
issufficiently
rich. The firstfact, although
notpreviously
demonstrated in thegenerality
furnishedhere,
is not verysurprising
in view of the studies in the framework of nonsmoothanalysis
that havealready
been made oftangential regularity
of sets definedby
constraints[9,
pp.55-57~. [10, Prop. 4.4j.
Thesecond fact is less
expected.
Multifunctions of the second
type
have indeed beeninvestigated previously
for certain differentialproperties
connected withtangent
cones to theirgraphs,
but theresults.
inconcentrating
on the Clarke tangent cone, haveprimarily
been somewhatnegative.
Thecase of
subgradient
multifunctions illustrates this well. Results in Rockafellar[ll]
establishthat for multifunctions
d~
associated with convex functions and saddlefunctions,
or moregenerally
for any maximal monotonemultifunction;
the Clarke tangent cone to thegraph
at any
point
isalways
asubspace
and thus isincapable
ofreflecting anything
other than"two-sided" aspects of differentiation. To the extent that a "corner" of the
graph
ofc3 f
may be involved, the Clarke tangent cone has to be
degenerate.
For suchmultifunctions, therefore;
strictproto-differentiability
is a property that has verypowerful
consequences when it ispresent-and
theorems can be stated about itbeing present
almosteverywhere
on the
graph (cf.
Rockafellar which is unusable incharacterizing
local one-sided behavior. It cannot be invoked at everypoint
of thegraph
of9/
unless thegraph happens
to be smooth and
f
itself iscorrespondingly
ageneralized
.sort ofC2
function.Sensitivity analysis
of the kind carried outby
Aubin[2]
in convexprogramming,
whichin effect assumes strict
proto-differentiability
at thepoint
underscrutiny;
aspointed
out in Rockafellar~11
suffers a serious limitation therefore in itsapplicability.
This limitation is removed if strictproto-differentiability
can bereplaced by proto-differentiability.
Although
we leave to another paper[12]
the fullstudy
ofproto-differentiability
int’e case of
subgradient
multifunctions in convexanalysis,
we do cover in§6
of rhe present paper a relared case withequal
claim toimportance
insensitivity analysis.
This concerns the multifunction which the Kuhn-Tuckerpuints
ill a smooth (not necessatilyconvex)
problem.
2.
Proto-differentiability.
The convergence of sets in
Rn will
be akey ingredient
in our definitions ofgeneralized differentiability
of multifunctions. Afamily
of setsSt
~ Rnparameterized by
t > 0 is said to converge to a set S C R" ast j O~
writtenif 5’ is closed and
where "dist" denotes Euclidean distance. It is often convenient to view this property as the
equation
Note that
the points
trbelonging
to the ’’lim inf" in(2.4)
are the onesexpressible
as thelimit of a
family
of elements n’; E?:
defined for all t in some interval(0, i’" L
whereas theones
belonging
to the ‘limsup"
in(2.5)
needonly
beexpressible
as the limit of somesequence u,w E
St03BD corresponding
to asequence t03BD ~
0.(We
usesuperscript
v in this paperas the universal index for sequences: 03BD =
1,2,...).
Both of the sets(2.4)
and(2.5)
arenecessarily
closed.Yet another way of
characterizing
theconcept
of set convergence is thefollowing: (2.1)
holds if and
only
if S is a closed set such that for,arbitrarily large
p > 0 andarbitrarily
small e >
0,
there exists r > 0 for whichHere B denotes the closed ball in the Euclidean norm but could be
replaced by
anv boundedneighborhood
of theorigin.
This characterization results from the fact that thf distance functions in(2.2)
areuniformly Lipschitzian
in w(with
modulus1).
Therefore.if
they
convergepointwise
on R~ as asserted in(2.2), they actually
convergeuniformly
onall bounded subsets of R".
The sets to which we shall want to
apply
such convergence in order to d7fine proto-diiferentiability
are thegraphs
of multifunctions. Recall that thegraph
of a multifunction G :R d =4
R" is the setThe effective domain of
G,
on the otherhand,
iswhile the effective range of G is
In terms of the inverse
G-I
ofG,
definedby
one
obviously
hasOne says that G has closed
graph
if thegph G
is closed as a subset ofR~
x R~ .Clearly
G is of closed
graph
if andonly
if G-l is of closedgraph.
Definition 2.1. Let G :
Rd
~ R~ be anymultifunction.
notnecessarily
of ciosedgraph,
and let u E dom G and ? E
G(u ).
LetDt : ~d ;
R~ be the differencequotient
multifunc-tion at u relative to z, defined
by
Bre shall say that G is
proto-differentiable
at u relative to z if there is a multifunction D :R~
= R" such thatDt
converges ingraph
toD,
i.e. the setSi
=gph D:
converges inx R" to the set S =
gph D
ast !
0. In :his event we shall call D theprito-derivative
of G at u relative to z and
employ
the notation D =Gi
~.The
relationship’
betweenproto-differentiability
and other ofdifferentiability
will be examined in§3.
For now wedevelop
the conceptalong
its own ratural lines.beginning
with thegeometry of epigraphs
and the connection between Definition 2.1 and the definitions ofgeneralized differentiability proposed by
Aubin[1], [2].
It will
help
us if we returntemporarily
to thestudy
of a set C c R~ and apoint
z E C. The set
is knou,n as the
contingent
cone to C at z,having
first beengiven
that nameby Bouligand
[13]
in 1932. It isgenerated by
all the directions from which x can beapproached by
asequence in C.
Specifically,
avector f belongs
to the "limsup"
in(2.13)
if andonly if
thereexists a sequence of
points
C andscalars t03BD ~
0 such that(x03BD - x)/t03BD - 03BE.
The wordcone in this context refers to a set that can be
expressed
as a union of raysemanating
fromthe
origin,
i.e. a set that is closed under theoperation
ofnonnegative
scalarmultiplication.
The
contingent
cone(2.13)
is characterized as the smallest conecontaining
0 and all the(direction) vectors ~
withj~
= 1expressible
as in the formwhere x03BD ~
C,
and the limit inquestion
exists.(The
sequenceis
said toconverge to x in the direction
of ~
in thiscase.)
The set
will be called here the derivative cone of C at z. It is less known but has been
employed by
a number ofauthors, expecially by
Frankowska[5], [6], [7],
who refers toit as
the
"intermediate cone" because it lies between thecontingent
cone and the Clarketangent
cone to C at z. Ourcalling
it the derivative cone issuggested by
thefollowing characterization,
which ties in with along
tradition in mathematicalprogramming.
Let us say
that y : [0, r)
- Rn is anemanating
arc in C at z ify{ t)
E C for allt E
JO, r), y(0)
= .r.y(t)
2014 ~ ast 1 0,
and the limitexists. Then
y’+ (0J
is the(right)
of y at l. The set(2.14)
turns out to consistof all the
vectors 03B6
E R11expressible y’+(0).
for the variousemanating
arcs y in C at r. This is apparent from thedescription given
earlier to the "lim inf ofSt
ast j 0
when
applied to St
=x].
The idea offorming
a cone that consists of derivative vectorslike ~
has been followed in mathematicalprogramming
since theearly days
in thedevelopment
ofoptimaliry conditions, except
that the arcs y haveusually
been considered to be differentiable on an interval[0,r)
rather thanjust "right
differentiable" at t = 0.Both the
contingent
cone(2.13)
and the derivative cone(2.14)
arealways
closed conescontaining
theorigin.
The second isobviously
contained within the first. We shall say that C isapproximate
at x if the two conescoincide,
i.e. if the limit setexists. This
property dictates
that the functionsconverge as
t 1
0 to a functionwhere K is a certain closed set,
necessarily containing
theorigin. Shapiro
has calledthe set
K,
when itexists,
theapproximating
cone to C at z. He did not connect it up with thetheory
of set convergence, however.Proposition
2.2. The multifunction G : Rn isproto-differentiable
at u relativeto a
point
z EG(u)
if andonly
if the setgphG
isa-pproximable
at(u; z).
Thegraph
ofthe
proto-derivative
multifunctionG’u,x
thenequals
theapproximating
cone togph
G at(u, z),
which issimultaneously
the derivative cone and thecontingent
cone.Proof. To establish this on the basis of what has
just
beenexplained,
all one needs is the observation that for C =gph G
the setf-1~C - (u, x)]
isjust gph Di.
C Someelemental
consequences of the definitionof proto-differentiability will
berecorded
next.
Proposition
2.3.Suppose
that theproto-derivative G’u,x
exists. Then for every z ER‘
one has .
and at the same time
(2.20) (w)
={~ ~ for
some arc t. :[0, r)
-Rd
witht.(0)
= u.t’i (0) =
j, one can select Efor aI~ so that
~(0) =
z,~(0) =~}.
Conversely,
if for ERd
the set definedby
theright
side of(2.19)
coincides with the set definedby
theright
side of(2.20),
then theproto-derivative G’u,z
exists.Proof. The
right
side of(2.19)
defines the setD~ (;~; ),
whereD+
is themultifunction
whosegraph
is thecontingent
cone togph G
at(z, u),
The
right
side of(2.20),
on the otherhand;
defines the stD~ (5J),
where D- is the multi- function whosegraph
is the derivative cone togph G
at(u, z),
The
proposition
comes down thenagain
10 the definition ofproto-differentiability: G’z,u
exists if andonly
ifD+
=D-,
in which event G’z,u =D+
= D". DProposition
2.4. Let G : Rn beproto-differentiable
at u relative to z, u-herex 6
G(u).
Then the derivative multifunctionG’z,u : Rd
R" has closedgraph
andsatisfies
Moreover G’u,z(0) is a closed cone winch includes the
contingent
cone toG(ui
at z andtherefore contains more than
just
0 when z is not an isolatedpoint
ofG(u).
Proof. From
Proposition
2.2 we know that thegraph
of is a certain closed conecontaining (0.0).
Inparticular
itequals
thecontingent
cone togph
G at(u..r). Everything
follows at once from this. Q.
Next ;e obtain a
simple
characterization ofproto-differentiability by elaborating
themeaning
ofgraphical
convergence for the differencequotient
multifunctionsZ~.
We use the notation that theimage
of a set ~l under a multifunction G is the ~PtProposition
2.5. Let G :Rd =t
R~ be any multifunction and !et u Edom G,
x EG(u).
In order G be
proto-differentiable
at u relative to x, jr is necessary. and sufficient that there exist aclosed-graph
multifunction D :R~
= be for which thefollowing
holds. For > 0(no
matter howsmall )
and p > 0(no
matterlarge )
one can find r > 0 such that
(2.25)
c D(03C9 + ~B) + ~Bfor all 03C9 ~ 03C1B, t ~ (0,),
(2.26) D(w)
npB
cDt(03C9
+:B)
+ ~B for E03C1B,
t E(0, r).
Proof.
For St
=gph Dt
and S =gph D
we invoke the characterizationthat St ~ S
if andonly
if S is closed and for every s > 0 and p > 0 there exists r > 0 such that(2.27) St
np(B
xB)
c S +ê(B
xB)
and S np(B
xB) c St
+:(B
xB).
This is
just
a restatement of the property described in(?.6)
in a form suitable for theproduct
spaceRd
xRn.
The two inclusions in(2.27)
areequivalent
to the ones in(2.25)
and
(2.26).
DFor the sake of
maintaining
ties with other areas of nonsmoothanalysis,
where theCiarke tangent cone is
fundamental,
thefollowing
concept needs 10 be mentioned in com-parison
withproto-differentiability.
Definition 2.6. Let G :
Rd 4
Rn be anymultifunction.
and let u 6 dom? and x EG(u).
We shall say that G isstrictly proto-differentiable
at u relative to x if it is proto- differentiable in the sensealready
defined andactually
has thefollowing;
strongerproperty
inplace
of formula(2.20)
ofProposition
2.3. Consider any -’ E domG~.1
E(:...:).
Then there exist : > 0 and r > 0 such that for each u’ E with
Bu’ - uJ
£and x’ E
G(u’)
with!x’ - xl
s(if
and for each t E[0,r).
it ispossible
to selectt’(~ tf~, j’~)
~ dom G andy(t, u’, x’)
6u~. ~~))
in such a way thaiThis concept
certainly
is much morecomplicated
thanplain proto-differentiability
and indeed is noi as easy to motivate in the present context. Fur reasons
álung
withof space, we shall not devote attention 10 it in
manner
the nextProposition
2.7. The mulrifunction G : Rn isstrict- proto-differentiable
at urelative to x, u-here x E
G(u),
if andonly
if thegraph
of G istangentially regular
at(u, z)
in the sense of nonsmooth
anah’sis.
i.e. thecontingent
cone at(t/, .r)
coincides with t~le Clarke tangent cone at(u, i).
In this e,-ent hasgraph,
hence is a convexprocess.
Proof. The Clarke
tangent
cone togph G
at(u; z)
is the set;here the "lim inf is restricted to elements
(u’. i’)
ofgph
G..4pair (;,~, ~) belongs
tothis set if and
only
if it can beexpressed
in the manner described in Definition2.6,
as the reader caneasily verify.
Strictproto-differentiahilitv
thusrequires
the Clarke tangent cone, rather thanjust
the derivative coneto agree with the
contingent
coneThe Clarke tangent cone is
always
to be convex. DProposition
2.7 does make itpossible
toverify
the strictproto-differentiability
ofvarious multifunctions
(and
thus theirproto-differentiability) by applying
known criteria fortangential regularity
to theirgraphs.
3.
Semi-differentiability
andDifferentiability.
To elucidate, further the
meaning
ofproto-differentiability,
weexplore
connections with anotherconcept
thatbridges
the way toward the classical pattern ofdifferentiability.
Definition 3.1. Let G : Rn be any multifunction, and let u E
dom G,
z EG(u).
We shall say that G is semi-differentiable at u relative to s if there is a multifunction D :
R =
Rn such that the differencequotients
in(2.12) satisfy
shall say that G is differentiable at u if. in
addition, G(u)
={L},
i.e. G Lsingle-valued
at u itself. and D is a linear transformation.
When G is
single-valued everywhere,
i.e. a function.differentiability
in the sense ofDefinition 3.1 reduces to the classical notion.
Theorem 3.2. If G is semidifferentiable at u relative to x, then in
particular G
isproto-differentiable af
u relative to ?, and rhe mulrifunction D in the definition of semi-differentiability
coincides theproto-derivative G’u,z.
Proof. We stand on
Proposition
2.3 as the characterization ofproto-differentiability
and also on the notation introduced in theproof
ofProposition 2.3, namely
the multifunctionsD+
and D". Since denotes theright
side of(2.19),
it is clear from(3.1)
thatD+
= D.We
needonly
-show that also D- =D,
where isgiven by
theright
sideof
(2.20).
Inasmuch asD-(~;)
it isonly
the inclusionthat needs
justifica.tion.
Consider E dom Dand ~
E in the casewhere D(w)
is
given by (3.1).
Inparticular
we haveby (3.1),
so there mustexist ~t
ED~’)
for all t in some interval~0, ~ )
such that~t - ~
as
t ~.
0. Let = u+tw. y(t)
= z + Thentv(0~
= 0 and = w, whiley(0)
= xand
y~.(0) = ~.
Thecondition ft
E can be written asor
equivalently
asy(t)
EG(v(t)). Therefore 03B6
ED-(:..!L
and ourgoal
has been achieved.[]A condition under which
proto-differentiability conversely implies
thestronger
prop-erty
ofsemi-differentiability will
bepresented
in Theorem 4.3. Our attention at the moment turns instead toward the characteristics ofsemi-differentiability
itself. We aim atidentify- ing
the domain in which this type ofgeneralization might
be notion ofcontinuity
will be used.Definition
3.3. A multifunction G :Rn
is continuous at u if(3.2)
limGlu’)
=G(u).
It is
locally
bounded at u if there exist p > 0 and ð > 0 such that(3.3) G(u’)
CpB
for allu’ satisfying ~u’ - u~
f_I This concept of
continuity
reduces to theordinary
one in the case where Ghappens
to besingle-valued,
i.e.- a function rather thanmerely
a multifunction. One must be cautious about itsinterpretation, however,
in the multivalued case. Thefollowing example,
where~c is
just
a realvariable, points
out thepitfail:
i ~_ _ , , : 6 -
In this instance G is continuous at u = 0 and has
G(0) = {0}.
As a matter offact,
G isproto-differentiable
at u = 0 relative to x = 0 and has 0. But G is not evenlocally
bounded at tr = 0. Thediscrepancy
comes. of course, from the fact that set convergenceonly
makes demands relative to anarbitrarily large
boundedregion
at any one time. In theexample
one does have the property that for every p > 0 and : > 0 there exists 6 > 0 such thatwhen
jul 6. (Here
B =[-1,1].)
Theorem 3.4. If the multifunction G :
Rd =+
is semi-dilfierenriable at u relative to z, thenG’u,z
is continuouseverywhere
onRd Rd.
Furthermoreu E int dom G and
Proof. Fix any w E
Rd.
Let D =C~ y. Property (3.1)
in the definition of semi-differentiability requires
that for every p > 0 and : > 0 there exist 6 > 0 and r > 0 such thatwhen 5 and t E
(0, r).
It followsthen,
as seenthrough
consideration of whathappens
ast ~ ( 0
with w’fixed,
that for every p > 0 and E > 0 there exists 5 > 0 such thatwhen ~ 2014 ~
Ð. This means that D is continuous at .Applying
the secondinequality
in
(3.7)
to the case where w =0,
we deduce thatD(./) ~ 0
for all ~/satisfying
6.(Recall
here that 0 ED(0),
soD(0)
n0.)
Thepositive homogeneity
of D =in
(2.23)
thengiyes
usD(~’~) 7~ 0
for all ~. Thus dom D =Rd.
The second inclusion in(3.6)
whenapplied
to 03C9 = 0 tells us in like manner that for any :- > 0there
exist 03B4 > 0 and r > 0 such thatWe can write this
equivalent ly
asTherefore u is an interior
point
of dom G and(3.5)
is correct. DThe conclusions of Theorem 3.4 may be
interpreted negatively
as well aspositively.
They
say that the concept ofsemi-differentiability, despite
its naturalappeal,
is not suit-able for the treatment for a multifunction G at a
boundary point
of the effective domain of G. Inasmuch asboundary points
doplay a
crucial role in the case of some of the im-portant
multifunctions associated withproblems
ofoptimization,
this observationprovides
motivation for
why
the moregeneral
concept ofproto-differentiability
isdefinitely
needed.Theorem 3.4 informs us likewise that
semi-differentiability
isinadequate
forhandling
,
pairs (u, z)
Egph G
for which(3.5)
fails.However,
in cases such as G =a f. where f
is aclosed proper convex
function, (3.5)
fails for every z EG(u)
unlessG(u) happens
to be asingleton ! (See [15,
Thm.24.6].)
True
differentiabilityr
is aspecial
type ofsemi-differentiability by-
Definition 3.1. There-fore, according
to Theorem3.2,
it falls within thelarger
realm ofproto-dinerentiability.
When G is
actually
afunction,
one cansensibly inquire
further about the circumstances in whichproto-differentiability
will be the same asdifferentiability.
An immediateconjecture
is that this holds whenever the
proto-derivative
is a linear transformation. Theconjecture
is
false,
however.The kind of situation to be wary of is demonstrated
by
anexample closely
related tothe one in
(3.4).
Let u be a real variable and define.
(3.8) G(u) = irrational.
0
if u is rational.There is no multivaluedness
here,
and G is not even continuous at u =0,
much less differentiable there. Nonetheless G isproto-differentiable
at u = 0 relative to z = 0 =G(0),
and the
proto-derivative
is linear(the
constant0).
When discontinuities such as seen in this
example
areexcluded, e;er3.thing
does fallinto
place.
however.Theorem 3.5.
Suppose
G :Rd - R"
isactually
a function (single-valued). Then G is differentiable at u if andonly
if G is continuous ar u and at the sarne time proro- differentiable at u restive to r =G(u),
with G’u,z linear.: The
proof
of Theorem 3.5 ispostponed
until§4. just
after theproof
of Theorem4.1,
because it will then be much easier to carry out..x comment at this
stage
may head off somepossible
confssion in the treatment of thespecial
case of a function g :Rd
-= R. One could consider such a function as amultifunction that
happens
to besingle-yalued:
in. present notation withBut one could also handle it in terms of
(3.10) G(u) = {x
E> g(u~~ (epigraphical framework)
or instead
(3.11) G(u) = (z
Eg(u)~ (hypographical framework)
All three choices lead to useful concepts of
generalized
differentiation. In the case of(3.10),
for
instance,
one canspeak
ofepigraphical proto-derivatives
in order tokeep
mattersstraight.
Animportant advantage
of(3.10)
and(3.11)
is thatthey
are not limited to real-yalued functions.
They
furnish viableapproaches
evenwhen g
can take on asvalues,
as often turns out to be convenient in
optimization theory.
There is a natural tie-in with first-order(epigraphical) epi-differentiation
of extended-real-valued functions asdeveloped
in Rockafellar
[10],
but we shall not look at this further here.4. Derivative Bounds and
Lipschitz Properties.
The
proto-derivative
multifunctionG~ ~,
when itexists,
isalways positively homogeneous
in the sense of
(2.23).
One can therefore define its(outer )
normby
with the convention that = oc if no
such
EiO, oo)
exists. The first result in this section is a characterization of the case where oc in terms of a kind ofpointwise Lipschitz growth property holding
for G at u relative to :r. Theproperty
inquestion
willbe used
subsequently
in the verification of Theorem 3.5.Theorem 4.1.
Suppose
G :Rd =t
Rn isproto-differentiable
at u relative to z. Then rhefollowing
conditions areequivalent:
l~a~ ~
(b )
the cone~ ~(0)
consistsonly of 0j (c)
thereexist ~
>0, p
> 0 and r > 0 such r~atIf these
properties
are present, z must be an isolatedpoint of G( u).
AforeorerI
is then the infimum of the
values
forR-bich ; c)
holds.Proof. We
begin
with condition(c)
andby ~akin~
~’ = 0 in(4.2)
that itimplies
G(u)
n(z
+pB) = {.r}.
Then .? is an isolatedpoint
ofG(u).
If we write(4.2)
next in theequivalent
formwhere
Dt
is as before the differencequotient
multifunction in(2.12),
we see thatThis follows from the formula
in
(2.19).
We deduce from(4.4)
and thepositive homogeneity
of in(2.23)
that~t. In
particular, (a)
holds.We assume next that
(a)
holds andthat
is a numbersatisfying
We need to show that
(c) holds,
i.e. that(4.3)
is validfor p = p
and some choice of p > 0 and r > 0. This will also establish that is the infimum of the ;alues for which(c)
holds.
.
. In the
contrary
case. where(4.6)
is satisfied and(4.3)
fails to hold for any choiceof p > 0 or >
0,
;e can takearbitrary.
sequencesp" 1 0 and 03BD ~
0 and somehow select~(0,~),
andThen
~v ~
0. If we setso that
= 7t, get
where |03B603BD|
== 1 and03C103BD.
Thent03BD -
0 and|03B603BD|/ = 1/ . Passing
tosubsequences
if necessary we can supposethat 03B603BD
converges tosome $
and 03C903BD to some (J.Then 03B6
EG’u,z(03C9) by (4.5),
andyet |03B6|
= 1.|03C9| 1/ ,
so that|03C9| ~ |03B6| ~
0. Thiscontradicts the strict
inequality
in(4.6):
there could not be anumber
E(0, )
for whichSo far we have verified the
equivalence
between(a)
and(c)
as well as thecorrespond- ing
assertions about r and To finish theproof
of Theorem 4.1 it will suffice Todemonstrate the
equivalence
between(a)
and(b).
Theimplication
from(a)
to(b)
isquite
trivial: under
(4.7),
the set(0)
cannot contain 0. Theimplication
from(b)
to(a)
is not much harder. If(a)
is untrue, we can take any sequencep" i
oo and select vec-tors E and
f"
E such that >By setting 03B603BD
=and = can transform this
(in
view of(2.23))
intoPassing
tosubsequences
if necessary, we canobtain ~ 2014 ~
and 0.Then (
E(0) by
the closedgraph property
inProposition 2.4,
butj~~
=1, so ~ ~
0. This contradicts(b).
0Proof of Theorem 3.5. If
G,
which is nowjust
afunction,
is differentiable at u, then from Definition 3.1 it is inparticular
semi-differentiable and we may concludeusing
Theorem3.2 that G is
proto-differentiable
with linear. Thecontinuity
of G at u follows then fromproperty (3.5)
in Theorem 3.4: of course this is also knownclassically.
Assume next, on the other
hand,
that G is continuous at u andproto-differentiable
relative
G( u)
with linear. Then for D =G:z
we haveI
oc, so theproperties
in Theorem 4.1 hold. Inparticular
there exist po > 0and ro
> 0 such that(j and t E
(U, foj such
that +!J5.
ESince
G(u’) - G(u)
= x as u’ - u, ;e canreplace ro by
a smaller value if necessary and write this asLet p
=(1 +
Then inparticular
Because of
proto-differentiability, we
can obtain fromProposition
2.5 for any > 0 a ; > v such thatBut
Thus we are able to find for any E > 0 a ; > 0 such that
This means that G is differentiable at u with derivative D. D .4.. sufficient condition for
proto-differentiability
toimply semi-differentiability
will bedeveloped
next in terms of ageneralized Lipschitz property
that was first introducedby
Aubin[2].
Definition 4.2. A multifunction G : Rn with closed
graph
is said to bepseudo- Lipschitzian
at u relative to z EG(u)
if there exist c >0; b
> 0 and p > 0 such thatSufficient conditions for G to be
pseudo-Lipschitz.ian
in this sense have beenprovided
in many forms in Rockafellar[16] through
the apparatus of subdifferential calculus.Theorem 4.3.
Suppose
G : R’‘ has closedgraph
and ispseudo-Lipschitzian
at urelatiie to z, u-here x E
G(u).
Then G isproto-differentiable
at u relative to x if andonly
if the limit
exists for every o, in which event the multifunction D that is defined in this manner is
G’u,x. Then,
moreover, G turns out to be semi-differentiable at u relatiie to x, and is itselfglobally Lipschitzian
in the sense that twhere
is the modulus ofpseudo-Lipschitz continuity
for G in Definition 4.2.Proof. Let
~, b and J1
be as in Definition 4.2. For t > 0 we haveas
long
as u+ t5J’
and u + both lie in the ball u 7 68. In other words.Suppose
G isproto-differentiable
at u relative to z. For any p > 0 and E’ > 0 there existsby Proposition
2.5 a r > 0 such that for p =2p(1
+~~
one hasRequire
£’ p, so that p + a’p
inparticular.
Take i’ E(0, r)
such thatEft
and> p 6
(0, r’).
Then forw’.
w" epB
and t E(0, r’)
one hasby (4.13)
thatBut also for
w’,
w" EpB
and t E(0, r’ )
one hasby (4.11)
thatbecause + -s and ~" +
c’s belong
to(p
+c’)B;
andThus for
6’, u"
EpB
and t E(0, r’)
one hasand therefore
Here p -
III
+ c’.j" +p
+~(2p~
+ p =p by
the definition ofp.
so thatby (4.12 J.
It follows that for E03C1B
one hasWe have demonstrated this for
arbitrary 03C1
> 0 and e’ E(0, p),
and ;e so may conclude(be-
cause has closed
graph according
toProposition 2.4)
that isglobally Lipschitzian
with modulus ~~ in the sense of
(4.10).
Using
this fact we argueby. (4.12) that
w-hen , EpB, r
E(0, r’),
one hasby (4.10)
and
consequently
alsoby (4.10)
forarbitrary ~
ERd
that(4.14)
npB
c +(p=’
+e’ -~ p;.~’ - :~: (jB
for allw’
EpB,
E(0. :’j.
By (4.13),
on the otherhand,
one has(since
pp - E’j
thatHere
pB
C whenactually
t E(0. :--’),
and in that caseby (4.11)
one hasChoose r" E
(~, r’ ~
smallenough
thatWe then obtain from the combination of
(4.15)
and(4.16)
the result that aslong
as p,one has
In summary, for any fixed w we can take
arbitwry p
> and ~’ 6(0,03C1)
and then have both(4.14)
and(4.17) holding
oversufficiently-
small intervals(0. r’)
and(0, r "i.
Thismeans that
t