A NNALES DE L ’I. H. P., SECTION C
H. A TTOUCH R. J.-B. W ETS
Epigraphical analysis
Annales de l’I. H. P., section C, tome S6 (1989), p. 73-100
<
http://www.numdam.org/item?id=AIHPC_1989__S6__73_0>
© Gauthier-Villars, 1989, tous droits réservés.
L’accès aux archives de la revue « Annales de l’I. H. P., section C » (
http://www.elsevier.com/locate/anihpc) implique l’accord avec les condi- tions générales d’utilisation (
http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
H. ATTOUCH
R. J.-B. WETS*
Université de
Perpignan,
66025Perpignan-Cedex,
FranceUniversity of California-Davis,
CA 95616Abstract. We
provide
a succinct overview of the basic tools ofepigraphical analysis,
and of theaccompanying
calculus. We show that itprovides
a very rich and unified tool tostudy
alarge
class ofproblems
that includes variationalproblems, generalized equations,
differential inclusions and limitproblems.
* This research was
supported
in partby
the National Science Foundation.1. Introduction
The
study
of variationalproblems
has been much aidedby casting
them in thefollowing (simple)
framework:find x that minimizes
f ( x)
onX,
where X is the
underlying
space and f takes its values in(-00,00];
the value oobeing
used to model the constraints that may be
imposed
on the choice of x.Probably starting
with the work of Fenchel
(on
convexfunctions),
it has become more and more obviousthat the
properties
of suchproblems (stability, dualizability,...)
and theproperties
of theirsolutions, depend intimately
on theproperties
of theepigraph
off.
This is in contrastto the "classical"
approach
that would bemostly
concerned with theproperties
of thegraph
off.
Theanalysis
viaepigraphs requires
a number of(nonclassical) tools,
some of which have beenbrought
to the foreduring
the lastcouple
of decades and others that arewaiting
furtherdevelopment.
It is the purpose of this article to introduce the basic tools ofepigraphical analysis.
Acomplete
treatment of thesubject
is farbeyond
the scope of thiscontribution,
it wouldrequire
amonograph
of somelength.
Indeed one would need to broach suchtopics
as:epigraphical
calculus(algebraic operations), epi-convergence (and
associated
topological questions), epi-derivatives
andepi-integration,
as well as discussexisting
andpotential applications
in avariety
of areas of pure andapplied analysis.
We shall limit ourselves to a brief survey
concentrating
onelementary
calculus rules and definitions of limits. We also conclude this introductionby giving
a series ofexamples
that suggest the wideapplicability
of(and
the needfor) epigraphical analysis.
We consider
problems
of the type(for
which Rockafellar and Wets[39]
use the termvariational
system):
minimize for x E
X,
where
(x, B)
’2014~f(x,8) :
X x 0 2014~ IR. The variable xrepresenting
the decision variables(state, control, etc.)
with X of finite or infinite(distributed
systems, stochasticprob- lems,
forexample)
dimensions. The space of parameters e could arise fromperturbations (parametric optimization), approximations (numerical
solutionsprocedures, etc.),
or couldcorrespond
tophysical
characteristics that are intrinsic to theproblem
at hand. To each value of 8correspond:
the value of theinfimum,
i
and the set of
(optimal)
solutions of theproblem,
and the multifunction(set-valued map)
of solutions of the
problem,
We are interested in the
continuity
anddifferentiability properties
of theinfimal func- tion, 8
inff (8),
also called themarginal
function or the valuefunction, and
of theargmin-multifunction (set-valued map) 8
-argmin f (9), and,
when differentiable or sub-differentiable,
incomputing
their derivatives.As indicated
earlier,
theepigraphical approach
consists inderiving
theproperties
of those maps from those of the multifunctioni.e., an
epigraph-valued multifunction,
called theepigraphical multifunction.
Epigraphical analysis
can beregarded
as aspecialization
of results about multifunc- tions to themap 8
’2014~epi f(., 8).
And in many ways this is anappropriate viewpoint,
notwithstanding
the fact that thestudy of epigraphical
multifunctionsactually provides
much of the motivation for thestudy
of multifunctions.However,
because of thespecial
natureof
epigraphs,
of thequestions
raised in a functionalsetting,
and inparticular
because ofthe framework forced upon us
by
theapplications,
it is useful and instructive to have atheory specifically developed for,
anddirectly applicable
to,epigraphical
multifunctions.In this we are
helped by
the fact that thesubspace
ofepigraphical
multifunctions is closed with respect to the addition of theirdefining functions, epi-addition (Minkowski-addition
of the
epigraphs), differentiation, taking (epi-)limits, integration,
etc.,i.e.,
all the basicanalytical operations.
To conclude this
introduction,
let us consider a fewtypical
situations. Wegive
acouple examples
of each one of thefollowing
cases: 8 is anapproximation
parameter(A.),
B is a
perturbation parameter(B.),
and 8 represents somephysical
characteristics of theproblem(C.).
Example
A.I Stochasticoptimization.
Theproblem
is to minimize/(~c) =
where P is a
probability
measure on 3i cIRN,
and h is an extendedreal-valued function defined on X x ~.. For
example,
~(.r,~)
=D(~~ ~)~ ~
where
ç)
represents the constraints asthey depend
on x and on the random elements of theproblem ~.
This is a difficultproblem
to solvenumerically
because of the lack ofregularity of f (in general, f
takes on the value +00, and is not smooth on its effective do-main),
and the lack oftechniques
and calculus fordealing effectively
with multi-dimensionalintegration.
One is lead to considerapproximations
of the type:with the h8
converging
to h and theP8 (usually discrete)
measuresapproximating
P.The parameter space e = IN. We want to know:
(i)
if the solutions of theapproximat- ing problems
converge to the solution of thegiven problem,
and(ii)
at what rate(error bounds).
Example
A.2 Finite elementsapproximations.
Here X is a Sobolev space of typeH1(S2)
or andwhere a is a
symmetric, continuous,
coercive bilinear form on X x X. Let u. Eargmin f.
The numerical
procedures
forcalculating
u*rely
on thefollowing approximations.
Thespace X
isreplaced by
a finite dimensionalsubspace XB
obtainedby taking
linear combi-nations of a finite number of "elements". The
approximating problems
take the form:We are interested in the convergence of the
approximating
solutions U8 to u* , and inestimating (Note that,
this latter estimate willdepend
on theregularity of u*.) Example
B.I Mathematicalprogramming.
Ourpoint
ofdeparture
is theoptimiza-
tion
problem:
A very useful characterization
of optimality
isprovided by
the differential inclusion:where the
(yi,
==1, ... , m)
are(Lagrange) multipliers
associated with the constraints.The existence of such
multiplers,
and thus thepossibility
ofmaking
use of thepreced- ing
relation, isintimately
connected with theproperties
of the infimal function of the(variational)
system obtainedby parametrizing
theconstraints, namely:
In
particular,
thosemultipliers
can often be identified with thesubgradients
of the infimalfunction at 8 = 0. Constraint
qualifications
can be viewed asproviding
sufficient(some-
times
necessary)
conditions to guaranteecontinuity,
orsubdifferentiability
of 8 ~inf f(8)
at 0, or in a
neighborhood
of 0.Example
B.2Duality
in convexoptimization.
Theproblem
is as inExample assuming
now that the functionsf;
are convex for i - 0,..., s, and affine for i =s + 1, ... , m, and X is any normed linear space. As in
Example B.I,
theproblem
isembedded in a
family
ofparametrized problems,
with the same type ofperturbations
or
by perturbing
otherquantities,
but in such way that the function(x,8)
-,f (~, 8)
isconvex on X x e with e another normed linear space. The dual
problem
is theconjugate (Legendre-Fenchel transform)
of the infimalfunction, i.e.,
The bilinear form
~-, -) brings
the spaces V and 0 induality.
Let us consider thefollowing simple
calculus of variationsproblem:
for Sl a subset ofIR." ,
One,
of a number ofpossible
dualproblems
is obtainedby perturbing
theintegrand by
anL2
function p:The dual
problem
is then:This
approach
toduality
can be combined with such results as thecontinuity (with
respectto the
epi-topology)
of theLegendre-Fenchel
transform to derivestability
and convergenceproperties
for dualvariables,
cf. Back[14], [15], Attouch,
Aze and Wets[4].
Example
B.3Dynamic programming.
We consider thefollowing
system: for t ET],
where u is the control function and x is the
trajectory
of the system. We are interested inchoosing
u so as toOne can reformulate
(B .3)
as a differential inclusion:The
dynamic programming
method associates to thepreceding dynamical
system the value functionIf v can be calculated it is
relatively
easy to derive theoptimal
control u* thatyields
theoptimal trajectory.
When v issufficiently smooth,
one way ofcomputing v
is to solve the Hamilton-Jacobiequation
If v is not
sufficiently smooth,
we need torely
on nonsmoothanalysis
togive
aninterpre-
tation to the
preceding equation
as has been doneby introducing viscosity solutions ,
cf.Crandall,
Evans and Lions[22],
orcontingent derivatives,
cf. Frankowska[27].
Example
C.IHomogenization.
We aredealing
withcomposite materials,
or materi- als with many smallholes,
fibered or stratified materials. In suchmaterials,
thephysical
parameters(such
asconductivity, elasticity coefficients, etc.)
arediscontinuous,
and os- cillaterapidly
between the different values that characterize the various components. A variational formulation of suchproblems
takes thefollowing
form:where j
isY-periodic,
Y =b=~,
and 8 is a parameter near 0. The"homogenized"
problem
is obtained
by letting
the parameter tend to zero andtaking
a variational limit(an epi- limit,
cf. Section3)
of the energyfunctionals,
inparticular,
The calculation of
jhom again requires solving
a variational system(that depends
on a parameterz).
Example
C.2Optimum design.
Consider thefollowing
situation: two mediaS21
andD2
areseparated by
anisolating
screen E(a
smooth manifold of co-dimension1)
with"holes" D = UDi that allows for the transfer of heat. With heat source
h,
the stateequations
of the system are:with the
boundary
conditions determinedby
the temperature g,i.e.,
y = g on theboundary
of n =
S21
US~2
U E. Theproblem
is to"design"
D so that theresulting
solution yD of thepreceding
system is as close aspossible
to a desired statez(.).
Theperformance
criterion is then:A more
sophisticated
version of thisproblem
would allow for a control mechanism toregulate
theexchanges
between111
and~2,
cf. Attouch and Picard[5].
Ingeneral,
suchproblems
do not have anoptimal
solution. This comes from the an"homogenization"
phenomenon:
forgiven
J and a class of admissibledesigns,
the sets{Dv v
E of aminimizing
sequences could tend to be more and morefragmented (clouds
oftiny holes).
Epi-convergence
is used tostudy
thelimiting
behavior and to characterize thelimiting
"problems" .
2.
Epigraphical Operations.
Let X be a vector space. To any extended real-valued function
f :
X ---7 IR =[-00,00],
we can associate
the
epigraph
off,
andthe strict
epigraph of f;
these sets are empty if oo. The classical notions of(MinkoBvski)
sum of sets
and scalar
multiplication
of sets, for A EIR,
when
applied
toepigraphs
generate theepigraphs
of new functions that are called theepi-sum
and the(A-)epi-multiple.
Inparticular,
if we have two functionsf
and g, definedon X and with values in the extended
reals,
the(Minkowski)
additiondefine the strict
epigraph
of a function that we denoteby f + g
and call the epi-sum off
and g. The
subscript
"e"referring
to the fact that theoperation
takesplace
onepigraphs.
A functional definition is
given by
the next relation:we use here the
(epigraphical)
convention that ~ - oo = oo.If,
instead of strictepigraphs,
we want to relate the
epi-sum
off and g
to the sum of theepigraphs,
we have thefollowing identity:
where vcl means vertical closure: for a set C C X x
1R,
Similarly,
we define the(scalar) epi-m ultiplication e *
as follows: for x inX, 03BB
> 0since then
i.e.,
theepigraph
of A *f
is obtained as the03BB-multiple
of theepigraph
off.
For A == 0,e
the function
0+* f
e is the recession function off:
The function
0+* e f
is to be viewed as a limit of thecollection {03BB*ef, 03BB
>0}
when A goesto 0; more
precisely
a lowerepi-limit,
cf. Section 4.Remark 2.1. The
terminology
and the notations are different from that used in the past.In the literature one finds the
epi-sum f +
g, denotedby f og (or
and called the inf- convolution off
and g. The reference to "convolution" isformal,
whereas theepigraphical terminology
refers to thegeometric interpretation
of theseoperations.
The need forchange
became
imperative
as these notions came toplay
the central role inepigraphical analysis.
This
viewpoint
isvividly
illustratedby
the next statement.Theorem 2.2. For all
~, ~,c
>0, f , g
an d h exten ded real valued functionswhere the indicator function of
~0}.Thus,
the space ofextended real-valued functions with theepi-addition
is analgebraic semi-group. Also,
an d
if f
is convex,Finally,
iff and 9
are convex, so is( f + e g).
Proof. The argument is
straightforward.
For illustration purpose, we derive thefollowing identity.
When A >0,
it follows from the definitions thatMoreover, convexity
ispreserved by
theseoperations
since the sum of two convex sets isconvex and so is the scalar
multiple
of a convex set. 0.
The next two theorems are concerned with the basic variational
properties
of theseoperations. For f :
X -1R,
letTheorem 2.3. Given A >
0, f and g
two extended real valued functions defined on X.Then
(i) argmin f
+argmin g
Cargmin( f + e g) (2.9)
i.e.,
if x minimizesf
on X and y minimizes g, then x + y minimizesf +
g;(ii) A(argmin f )
=argmin(03BB*e f ),
e
(2.10) i.e.,
if x minimizesf
onX,
then(Az)
minimizes(A*/).
Proof. If x E
argmin f, and y
Eargmin g,
then for all u, v in XHence,
for all z in Xand this means that
from which
(2.9)
follows. To prove(ii),
if x Eargmin f,
for all A > 0 and v inX,
we haveAnd thus
or
equivalently (2.10).
0Theorem 2.4.
Suppose
A >0,
cl >0,
~2 >0,
andf, g
proper extended real valued functions defined on X withinf f
> 2014oo,inf g
> 2014oo. Then(i)
ci-argmin f
+ ~2-argmin g
C(£1
+6:2)- argmin( f + g) (2.11)
where
(it ) 03BB(~1- argmin f )
=(03BB~1)- argmin(03BB e * f ) (2.12) (iii)
for all £ >0, argmin( f
+g)
C £-argmin f
+ £-argmin g
e
Proof. For
(i)
and(ii),
theproof
of Theorem 2.3 also works here. If x Eargmin( f + g),
there
exist, by definition,
u~ , vF such that uE + v~ = x, andwhere the last
inequality
follows from the fact that the infimum off
+ g is the sum of the infima. And from this the assertionreadily
follows. D Before we continue with ourdevelopment,
we describe two classes ofproblems
where theseepigraphical operations play
animportant
and natural role.The
Average
Problem. Let/i,/2?--~/n : ~ 2014~
IRcorrespond
to the realizations of randomoptimization problems. Typically,
these come fromuncertainty
in the data due to a randomenvironment;
forexample: conductivity
coefficients inheterogeneous material, spatial porosity
parameters forsoils,
data measurement errors. For i = 1,... n, let ~; Eargmin /,.
The average of the solutionsis the solution of a new minimization
problem
called the average problem. We would be interested in
knowing:
(i)
the form of the limit averageproblem
as n oo;(ii) if ~-(~i + - -’ + xn) provides
a reasonableapproximation
to theoptimal
solution of thelimit
problem.
DAsymptotic Expansion
of Solutions. Let us consider thefollowing family
of(para- metric) optimization problems
/8 :
.X 2014)-(201400, oo],
oo, and 0 is a subset of a linear space that contains 0. Let us assume that for all-ine,
theproblems
have all the desirableproperties,
inparticular
that there exists
We are interested in the
asymptotic
behavior of uj as 8 goes to 0.One could
just
be interested in thetopological
aspects of theproblem. Namely,
does the sequence(uo , 8
-0} converge,
for whichtopologies,
and can one characterize this limit as the solution of a(new) optimization problem?
Thesequestions
are answered inthe framework
provided by
thetheory
ofepi-convergence
that we reviewquickly
in thenext section.
For numerical reasons, as well as for
calculating
error bounds andestimates,
one oftenneeds more information about how
(at
which rate,e.g.) the u03B8 approach
their limit. Thiscan be done either
by introducing epi-metrics,
inparticular
theepi-distance,
to measure the rate at which the functions18
converge tofo.
This leads toquantitative stability results,
cf. Section 4.Or,
in certainparticular
cases one mayhope
for anasymptotic expansion
of the solution Us which respect to 03B8The
question
is to determine uo, ul, u2, ... and to characterize them as solutions of newvariational
problems.
Let usproceed
here at a formal level and assume that such a devel- opment exists. From Theorem 2.3,assuming
thatwe would have
This
brings
us to consider anexpansion
forfe
of thefollowing
type:or
equivalently
We refer to these as
asymptotic epigraphical expansions.
D For the same reasons as those for which theequation: given f, fo,
find11
such thathas in
general
nosolution,
one cannot, ingeneral,
findepigraphical expansions.
The space of extended real-valued functions with theepi-addition
is an additivesemigroup,
not agroup, cf. Theorem 2.2. An
interesting question
is to characterize the class of funtcions for which thepreceding equation
has a solution. It follows from the recent work of Hiriart-Urruty
and Mazure[29],
and M. Volle[40],
that a solutionfl,
if itexists,
isgiven by
the formula:A
promising approach
to the existencequestion,
as well as the actualcalculation,
of asymp- toticexpansions
for the solution relies on the second orderepigraphical
differentialcalculus,
cf. Section 5.We shall return to this after we have introduced the
appropriate topological
concepts that will allow us to understand in which sense we mustinterpret limits, approximations,
etc.
Many
of the classicaloperations
on functions find their naturalinterpretation
in theepigraphical setting.
Inparticular, let fi
and12
be two extended real-valued functions defined on X. Thenwhere for all x,
Also,
let g ? J~ x ~’ ~ IR with Y another vector space and define the infim aJ functionThis function, also called the
marginal
value function or theperturbation function, plays
animportant
role inoptimization (mathematical programming,
calculus ofvariation, dynamic programming, optimal control, etc.).
Theoperation,
identifiedby (2.18),
is calledepi- projection,
in view of thefollowing identity
where
prjy x
R is theprojection
operator definedby
To establish
(2.19), simply
note thator
equivalently
The
epi-addition
and theepi-multiplication
have been introducedhistorically
as "dual"operations corresponding respectively
to the classical sum of(convex)
functions and scalarmultiplication
of functions.Suppose
that X ispaired
with X* *through
the bilinear form~-, ), and f
is an extended real-valued function defined on X. TheLegendre-Fenchel
transform
f ~ f * ,
wheref * :
X * ~ IR is definedby
This
function,
called theconjugate
off,
isclearly
a convexfunction,
since in view of(2.16),
and for all x,
f (x))
is a(convex) half-space.
Proposition
2.5. Letf, g :
X ~ IR, be any twofunctions,
and 03BB apositive
scalar. Thenajjd
Proof. From the
definitions,
wereadily
haveAlso,
when A > 0.
From this
proposition,
and the fact thatf**
*f,
we also have thatand
To obtain
equalities
one needsf** = f, g**
= g. Thisrequires f and g
to be proper, lower semiontinuous and convex, oridentically
+00 or 2014oo.If,
moreover( f* + e g* )** = f* +~’,
then
(2.22)
becomes3.
Epigraphical regularization
The
epi-addition
has asmoothing
effect.By this,
one means that a function obtained as anepi-sum
is at least as "smooth" as any of itselements,
up toCl regularity;
this isa
general phenomenon
whendealing
with unilateral variationalproblem.
Thisprovides
anatural
bridge
betweenepigraphical analysis
and classicalanalysis.
As we shall see,by choosing appropriately
theregularizing
term in anepi-sum,
one can endow theresulting
function with the
appropriate differentiability properties;
thissmoothing
effect could insome way,
justify calling
theepi-addition
an "inf-convolution" .To
begin with,
let us consider theepi-sum
of anarbitrary
function and a "kernel" of the type(~p)- I ~~ ~
Definition 3.1. Let
~)
be a normed linear space. Given p E and ~1 > 0, theepi-regularization
of index A of a functionf :
X ~ IR is defined asExplicitly
This
terminology
isjustified by
thefollowing
result.Proposition
3.2. Let(X, 11.11)
be a normed linearspace A
> 0, p E[1, oo),
andf :
X - iR.Suppose
that for some ao E1R,
al Ef
majorizes ao -i.e.,
Then,
provided
0 A2~(o’ip)-B
fx
islipschitz
continuousif p
= 1fa is locally lipschitz
continuous if p > 1,and the
lipschitz
constantsdepend only
on the value of the function at onepoint,
and on03B10,03B11 and
A;
moreover thisdependence
is continuous.Proof. A number of related results have been obtained
recently,
see Attouch[3], Fougeres
and Truffert
[26], Attouch,
Aze and Wets[4].
Thisparticular
theorem can be found inAttouch and Wets
[9,
Lemma3.2].
0Remark 3.3.
Geometrically,
thepreceding
result takes thefollowing
form:Now observe that
epi
is a "smooth" setifp>
1. The statement would thus follow from thegeneral
fact: "if D is a smooth set, then C + D inherits theregularity
ofD". We prove next a result of this type that has various
implications.
Proposition
3.4. Let H be a hilbert space and p : ~f 2014~ a potential, i.e., ;,~ iscontinuously differentiable, strictly
con veY and coercive,ct,(0)
= 0 and for any u and any sequence EIN}
one hasThen, for any function
f :
H --~ con irex, proper and lower semicontinuous; its03C6-epi-regularization
is contin
uously
differen tiable.Moreover,
for any non empty closed con vex set C C H one has :where
and
$c-
js the indicator function of the set C. This means thatC + B03C6
js a smooth set in thefollowing
sense: jt isequal
to the level setlev03B1 g
with x >inf g,
of a convexcontinuously
differentiable functjon g.
Proof. From convex calculus
(recall that 03C6
iscoercive),
we know that the infimum in theexpression (2.2)
that defines(/+ ~)(~)
is achieved at aunique point
that we denoteby
J~ (~).
and
see, e.g., Laurent
[31],
Aubin and Ekeland[12].
Thus, 8( f + 03C6)(x)
is reduced to asingle element,
and hencef +
p isgateaux
differen-tiable. Frechet
differentiability
off +
p will follow if we prove that thegateaux
derivativeis continuous. In view of the
preceding relations,
this boils down toshowing
thatusing
here the fact that~y~
is continuous.Let
{x03BD, 03BD
E be a sequenceconverging strongly
to x. Thecoercivity of 03C6
guaran- tees the boundedness of thecorresponding
sequence~J~ (x" ), v
E For all u inH,
wehave
and after
extracting
aweakly
convergentsubsequence
such that J03C6(x03BD’)-z,
we pass tow
the limit to obtain
Note that we
only
used theconvexity
and lowersemicontinuity
off.
Thepreceding
in-equality implies
that2- =J~(3-),
and that the whole sequence E
IN}
converge toJ,~ (~).
As a consequence ofthis,
we obtain the convergence of the infimal values:Since
we have that
which with the
preceding inequalities yields
Hence
Now,
use the fact that p is apotential
to conclude that And this com-$
pletes
theproof
of the frechetdifferentiability
We now turn to
(3.3).
Note thatThe assertion now follows from the first part of the
proof
withf
= 0 Let us consider aspecial
case of this result. Forany A
> 0, define(~) _ (2,À)-1I1xIl2.
Clearly,
px iscontinuously
differentiable on a hilbert space. For any proper lower semi- continuous convex functionsf :
H ~(-00, ~]
and 03BB > 0, we set:In view of the
previous proposition,
the functionf a
iscontinuously
differentiable. Thisparticular
result was first obtainedby
Brezis[18].
Thesefunctions {f03BB, 03BB
>0} play a
fundamental role inoptimization theory
and are called Moreau- Yosidaapproximates.
The reference to Yosidaapproximation (which usually
refers toapproximating operators)
isjustified by
thefollowing
property:where
(8,f )a
is the Yosidaapproximate
of the maximal monotone operatoraf .
4.
Epigraphical
LimitsWe
only
review here a few of the main features of thistheory
which has received a lot of attentionduring
the last decade. For more aboutepi-convergence,
the reader could refer to the book on "VariationalConvergence
for Functions andOperators",
Attouch[3].
Let be a normed linear space,
{/*’ :
X ~IR,
v EIN}
a sequence of extended real-valued functions defined on X. We say that thefll
epi-converge tof
at x, if(i)
for any sequence EIN } converging
to x,lim infv /"(3’")
>f (x),
and(ii)
thereconverging
to x such thatlim sup" f (x).
Note that these conditions
imply
thatf
is lower semicontinuous. We then say thatf
isthe
epi-limit
of thef L ,
and writef
=epi-lim f" .
We refer to this type of convergenceas
epi-convergence,
since it isequivalent
to the set-convergence of theepigraphs.
Interestin
epi-convergence
stems from the fact that from a variationalviewpoint
it is the weakest type of convergence that possesses thefollowing properties:
Proposition
4.1.(Rockafellar
and Wets[39]) Suppose {/;/~ :
X 2014~ =1, ...}
isa collection of functions such that
f
=epi-lim f".
Thenv- 00
and,
if~’~
Eargniin ,fYk
for somesubsequence {vk,
k =1, ...~
and x = limk~~ xk, it follows that
and
so in
particular
if there exists a compact set D C X such that for somesubsequence 1,...},
then the minimum
of f
is attained at somepoint
in D.Moreover,
if argmin f ~ 0,
then(inf IV)
= inff
if andonly
if ac Eargmin f implies
the existence of sequences{E" > 0,
v =1,...}
and~~" E X,
v =1,...}
withsuch that for all v = 1,...
We will not
provide
an overview ofepi-convergence (sometimes
calledr-convergence, DeGiorgi [24]).
We shall limit ourselves to a brief introduction to therecently developed quantitative theory
andstating
one result that illustrates therelationship
betweentaking epigraphical
limits andoperations.
Webegin
with this latter.Proposition
4.2.Suppose (X, BI
.II)
is a normed linear space, andf",
1/ E- {g; g1/, 1/
EIN}
two collections of extended real-valued functions defined onX,
such thatSuppose
also that EIN},
areequi-inf-com pact, i.e.,
for all a the level setsare contained in a compact set, for v
sufficiently large;
and suppose that{inf g",
v EIN}
are(equi-)bounded below, i.e.,
there exists/3 E
IR such that for vsufficiently large
infg"
>~3.
Then
Proof. It is immediate that
+~") ~ /+~-
The converseinequality
e t~
f + g +~)
is derived from theargument
that follows. Whenevere e
03B3 > lim inf03BD~~
(/" + g03BD)(z),
there exits a sequence{z03BD,03BD
~passing
to asubsequence
if necessary, such that
By
definition ofepi-sum,
it follows that there exist v E v E such that x"+ y" = z",
andSince the
g"
areequi-bounded below,
for vsufficiently large,
we have thatf " (z" )
Î -{3.
This means that from some v on, all x"
belong
to a compact set contained inlev~_~ f’ . Again passing
to asubsequence,
if necessary, let x denote the limit of the sequenceIN}.
Thecorresponding sequence {y",
v EIN)
then also converges to z - x := y. From theseobservations,
it follows thatThe first
inequality
follows from the definition ofepi-sum,
and the second one from theepi-convergence
of thef "
andg"
tof
and g. 0 We now turn to the definition of a distance between functions that would becompatible
with the
topology
ofepi-convergence.
Let d the distance functiongenerated by
the normon
X,
and letd(x, C)
denote the distancefrom ~
toC;
if C =0,
setC)
= For anyp > 0,
pB
denotes the ball of radius p and for any setC,
For the "ezcess"
function
of C on D is defined as,with the convention that For any p >
0,
thep-(Hausdorff-)distance
betweenC and D is
given by
Definition 4.3.
(Attouch
and Wets[9])
For p >0,
the03C1-(Hausdorff-)epi-distance
be-tween two extended real valued functions
f,g
defined onX,
iswhere the unit ball of X x IR is the set B :=
BX {(~, a~ :
1,1~.
Convergence
with respect to theepi-distances
is somewhat stronger thanepi-conver-
gence, at least in the infinite dimensional case, when X is a reflexive Banach space
and
epi-limits
are defined in terms ofMosco-convergence,
i.e.epi-convergence
with respect to both the strong and the weaktopology
on X. Let{~ :
X - =1,...}
be a sequenceof functions. We say that
f
is theMosco-epi-limit
of this sequence, if for all x in X:for any sequence v =
1,...} converging weakly
to z, >f(x),
and
there exists =
1,...} converging strongly
to x such that lim sup f03BD (03BD) ~f(x).
v
Proposition
4.4.Suppose
X is a reflexive Banach =1, ...~ a
collectionof proper, extended real
valued,
lowersemicontinuous,
convex functions defined on X.Then,
for all p
sufficiently large implies
Moreover, jf X (== IR")
is finitedimensional,
theepi-distance topology
is theepi-topology,
i.e., thetopology
ofepi-convergence (without
anyconvexity assum ptions ).
The
proof
can be found in Attouch and Wets[9],
where one can also find anexample
ofa sequence of functions defined on a hilbert space that
epi-converges
but has no limit with respect to theepi-distance. This, however,
does notpaint
the fullpicture.
It is also shown in[9,
Theorem4.7]
that most, if notall,
of the knownapplications
ofepi-convergence
ininfinite dimensions are such that
Mosco-epi-convergence
and convergence with respect to theepi-distance
coincide.These
pseudo-distances
are not theonly
ones that could be used toquantify epi-
convergence. In Attouch and Wets
[7],
we introduce a notion of distance based onepi- graphical regularization.
For fixed p E(1, oo)
and A >0,
we work with thefollowing epigraphical regularization: := f +
This leads to thefollowing
notion ofdistance between two functions
f
and g:Assuming
that f and g are proper, thisquantity
is well defined since bothfx and ga
arethen bounded on bounded sets. The next theorem puts forward the
relationship
betweenthese
pseudo-distances
and theepi-distance.
Theorem 4.5.
(Attouch
and Wets[9])
L etf an d g
be t wo extended real valued functions defined on a normed linear space(~~ ~ ’ ~),
such that for some 0 and al ER,
for 1 ~ p ~.
Then for 0 À(03B10p)-121-p, and p >
0with the constants 7 and
~3 depending only
on the norm of apoint
at whichf and g
are finite and the
corresponding
values off
and g, on 03B10, 03B11 and of course, on p and .1.Similarly,
for all 0 À(aQp)-121-p,
andwe have
We are thus
dealing
with the same uniformities. Theepi-distance
isrelatively
easy to calculate or toestimate,
one can use the Kenmochi conditions[9,
Section2]
or the results of Aze and Penot[13]
who derive bounds for thep-Hausdorff
distance between sets obtainedas the results of various
operations (union, intersection, addition, etc.).
On the otherhand,
the distances~jB ~
are better suited for theoreticalinvestigations;
forexample,
onecan demonstrate that the
Legendre-Fenchel
transform is anisometry
for those distancesf7).
We have used these distances to derive holderian and
lipschitzian properties
for theoptimal
and c-solutions ofoptimization problems,
cf. Attouch and Wets[8], [10].
Forexample,
in the normalized case(xl ==
0 and =0),
whenf
isquadratically
"con- ditioned" at 0,i.e., f (x) >
=cp(x)
for 1,and g
is someapproximation
orperturbation of f,
with x9 acorresponding minimizer,
Theorem 4.1 of[8]
asserts thatprovided
that theepi-distance (of
parameterp) hausp ( f, g)
issufficiently
small. We alsoshowed that this holderian
stability
result isoptimal.
These,
and relatedconsiderations,
seem to suggest that theepi-distance topology (or
equivalently,
thetopology generated by
thepseudo-distance plays
a central role in thetheory of
epi-convergence.
It allows for thedevelopment
of aquantitative theory
ofepi-
convergence
(used
to derive error bounds for the solution ofapproximating optimization problems),
theLegendre-Fenchel
transform is bicontinuous with respect to thistopology (in fact,
isometricproperties
can bederived), and,
convergence with respect to theepi-
distanc.. is sufficientgeneral
so that allsignificant applications
are covered. We know a few of itsproperties,
forexample,
the space of extended real-valued functionsequipped
withthe uniform structure
generated by
theepi-distances
p >0~
iscomplete
when Xis finite dimensional or if X is a reflexive Banach space and the functions are proper, lower semi continuous and convex.
Certainly
a much morethorough study
of thistopology
iswarranted.
5.
Epigraphical
Differential Calculus.The concept
of epi-derivative
hasemerged
from the need topush
the sub differential calculusbeyond
that for first orderderivatives; consult,
forexample,
the work of Aubin[11]
onthe
differentiability
of multifunctions(as
itapplies
tosubgradient multifunctions),
and ofRockafellar
[37], [38]
on the second ordergeneralized
derivatives.Definition 5.1.
Suppose f
is finite at x withf
an extended real-valued function definedon a normed linear space X. If
is a well-defined function
(possibly
with valuesthen f
is said to beepi-differen tiable
at x. The
function y - Dt y)
is the directionalepi-derivative of f
at ~.The difference with the classical definition of directional derivatives is the use of
epi-
limits instead of
pointwise
limits. We are lead to this in the most natural way if weapproach
theproblem geometrically.
For {C03BD
CX,
v EN}
be a filteredfamily ( N, H),
letwhere