Introduction to
Chance-Constrained Programming
René Henrion
1 Introduction
2 An Example
2.1 Deterministic
Version
2.2 Stochastic
Version: individual chance constraints
2.3 Stochastic
Version: joint chance constraints
3 Algorithmic Challenges and Theory
3.1 Structural
Aspects
3.2 Random
right-hand side with nondegenerate multivariate normal distribution
3.3 Convexity
3.4 Stability
3.5 Beyond the
classical setting
4 Concluding remarks
5 Links
1 Introduction
Chance Constrained Programming belongs to the major approaches for
dealing with random parameters in optimization problems. Typical areas
of
application are engineering and finance, where uncertainties like
product
demand, meteorological or demographic conditions, currency exchange
rates
etc. enter the inequalities describing the proper working of a system
under
consideration. The main difficulty of such models is due to (optimal)
decisions that have to be taken prior to the observation of random
parameters. In this situation, one can hardly find any decision which
would
definitely exclude later constraint violation caused by unexpected
random
effects. Sometimes, such constraint violation can be balanced
afterwards by
some compensating decisions taken in a second stage. For instance,
making
recourse to pumped storage plants or buying energy on the liberalized
market
is an option for power generating companies that are faced with
unforeseen
peaks of electrical load. As long as the costs of compensating
decisions are
known, these may be considered as a penalization for constraint
violation.
This idea leads to the important class of twostage or multistage
stochastic programs [2,16,28].
In many applications, however, compensations simply do not exist (e.g.,
for
safety relevant restrictions like levels of a water reservoir) or
cannot be
modeled by costs in any reasonable way. In such circumstances, one
would
rather insist on decisions guaranteeing feasibility 'as much as
possible'.
This loose term refers once more to the fact that constraint violation
can
almost never be avoided because of unexpected extreme events. On the
other
hand, when knowing or approximating the distribution of the random
parameter, it makes sense to call decisions feasible (in a stochastic
meaning) whenever they are feasible with high probability, i.e., only a
low
percentage of realizations of the random parameter leads to constraint
violation under this fixed decision. A generic way to express such a probabilistic
or chance constraint as an inequality is
[Switch to alternative lay-out]
Here, x and x are decision
and random vectors, respectively,
"h(x,x) ³
0" refers to a finite system of
inequalities and P is a probability measure. The value p
Î [0,1] is called the
probability level, and it is chosen by the
decision maker in order to model the safety requirements. Sometimes,
the
probability level is strictly fixed from the very beginning (e.g., p=0.95,0.99
etc.). In other situations, the decision maker may only
have a vague idea of a properly chosen level. Of course, he is aware
that higher values of p lead to fewer feasible decisions x
in
(1), hence to optimal solutions at higher costs.
Fortunately, it turns
out that usually p can be increased over quite a wide range
without
affecting too much the optimal value of some problem, until it closely
approaches 1 and then a strong increase of costs becomes evident. In
this way,
models with chance constraints can also give a hint to a good
compromise
between costs and safety.
Among the numerous applications of chance
constrained programming one may find areas like water resource
management,
energy production, circuit manufacturing, chemical engineering,
telecommunications, and finance. Rather than listing all possible work
in this context, we refer to the overview contained in [23]. The
present document is devoted to a brief introduction to models,
algorithmic
aspects and
theory of chance constrained programming and to motivate the reader to
further consult deeper treatments of this subject area such as
[21,28].
[Top of page]
2 An Example
2.1 Deterministic Version
As an illustrating example, we consider the following cash
matching problem taken from [8]
and [26]: the pension fund of
a company has to make certain payments for the next 15 years which
shall be
financed by buying three types of bonds. The assumed data are collected
in
Table 1. An initial capital K=250,000 is
given. The amount of cash
which is available at the end of the considered time horizon is to be
maximized under the condition of meeting the payment restrictions in
each
year. For i=1,¼,n:=3
and j=1,¼,m:=15, we
put:
|
|
|
| yield per bond of type i in year j; |
|
|
|
|
|
| payment required in year j; |
|
|
|
|
|
|
|
|
|
|
| number of bonds of type i to be bought. |
|
|
|
The amount of cash available in the fund at the end of year j
is
|
|
|
|
|
|
|
|
|
|
| cumulative yields of bonds |
|
|
|
|
|
|
Introducing the quantities aij:=åjk=1aik-gi
and bj:=åjk=1bk-K,
the optimization problem may be
written as
|
max
|
|
n
å
i=1
|
aimxi
subject to |
n
å
i=1
|
aijxi
³ bj
( j=1,¼,m) , |
|
(2) |
where the system of inequality constraints models the condition of
positive
cash in year j and the objective function corresponds to the
final cash
(up to a constant).
Table 1: Data for the cash matching problem
|
|
Yields per bond of type: |
| Year |
Payments |
1 |
2 |
3 |
|
| 1 |
11,000 |
0 |
0 |
0 |
| 2 |
12,000 |
60 |
65 |
75 |
| 3 |
14,000 |
60 |
65 |
75 |
| 4 |
15,000 |
60 |
65 |
75 |
| 5 |
16,000 |
60 |
65 |
75 |
| 6 |
18,000 |
1060 |
65 |
75 |
| 7 |
20,000 |
0 |
65 |
75 |
| 8 |
21,000 |
0 |
65 |
75 |
| 9 |
22,000 |
0 |
65 |
75 |
| 10 |
24,000 |
0 |
65 |
75 |
| 11 |
25,000 |
0 |
65 |
75 |
| 12 |
30,000 |
0 |
1060 |
75 |
| 13 |
31,000 |
0 |
0 |
75 |
| 14 |
31,000 |
0 |
0 |
75 |
| 15 |
31,000 |
0 |
0 |
1075 |
|
|
|
cost per bond: |
|
|
980 |
970 |
1050 |
Allowing for fractional numbers of bonds to be bought, the
solution of the linear programming problem (2) with
the given data
is
| x1=31.1, x2=55.5, x3=147.3, final amount of cash: 127,332. |
|
The amount of cash as a function of time is illustrated by the thick
line in
Figure 1 a.
2.2 Stochastic Version:
individual chance constraints
As one can see from Figure 1 a (thick line),
the cash
profile of the fund reaches zero several times. Formally, this does not
contradict feasibility of the solution, but it strongly depends on the
exactness of payment data from Table 1. It is not
hard to believe
that slight changes of these data would result in infeasible cash
profiles
under the solution found above. Indeed, there is good reason to assume
random payment data in our problem due to demographic uncertainty in a
future time period. Therefore, let us assume now, that the bj
introduced above are not the (deterministic) payments themselves but
rather
expected values of random payments ~bj.
Figure 1: Illustration of the cash matching
problem (deterministic version versus individual chance constraints).
For details see text.
To be more precise,
we suppose that the ~bj
are independently normally distributed
with expectation bj and
standard deviation sj:=500·j
for j=1,¼,m (growing
uncertainty over time). When simulating 100
payment profiles according to these assumptions and applying the
deterministic solution from above, one arrives at 100 cash profiles
illustrated in Figure 1 a (thin lines). As is
to be expected,
many of the cash profiles fall below zero in particular at the 'sharp
times'
when the deterministic profile reaches zero. Evidently, the
deterministic
solution is no longer satisfactory as it violates the constraint of
positive
cash with high risk. A clearer view is obtained when plotting for each
year
the percentage of simulated cash profiles yielding constraint violation
(Fig. 1 c (thick line)). Several times, this
value approaches
50%. In order to arrive at a more robust solution, one could impose the
restriction that with each year fixed, the probability of having
positive
cash exceeds say p=0.95. Passing to the stochastic counterpart
xj:=åjk=1~bk-K of the deterministic quantity bj
introduced above, problem (2) then turns
into an
optimization problem with individual chance constraints:
|
max
|
|
n
å
i=1
|
aimxi
subject to P |
æ
è
|
n
å
i=1
|
aijxi
³ xj |
ö
ø
|
³ p ( j=1,¼,m) . |
|
(3) |
The term 'individual' relates to the fact that each of the (stochastic)
constraints åni=1aijxi
³ xj
is transformed into a
chance constraint individually. A different - and more realistic -
version
will be presented later. Before solving (3), we
note that,
due to the distribution assumptions on ~bj, the random vector
x has a multivariate normal
distribution with mean vector b and
with covariance matrix
In particular, xj has
variance
~s2j=åk=1jsk2. Now, we pass to
the
standardized random variables
which are normally distributed with zero mean and unit standard
deviation.
Then,
| P |
æ
è
|
n
å
i=1
|
aijxi
³ xj |
ö
ø
|
=P |
æ
è
|
~
s
|
-1
j
|
|
æ
è
|
n
å
i=1
|
aijxi-bj |
ö
ø
|
³ hj |
ö
ø
|
. |
|
Denoting by qp the p-quantile of the
standard normal distribution
(i.e., q0.95=1.65), we have the following
equivalence:
| P |
æ
è
|
n
å
i=1
|
aijxi
³ xj |
ö
ø
|
³ pÛ
|
n
å
i=1
|
aijxi
³ |
~
b
|
j:=bj+ |
~
s
|
j
|
qp. |
|
In other words, the chance constrained problem (3)
becomes a
simple linear programming problem again, but now with inequality
constraints
having higher right hand side values than in (2).
The difference
~sjqp
may be interpreted as a safety term. The solution of
(3) with p=0.95 is
| x1=62.8, x2=72.6, x3=101.1, final amount of cash: 103,925. |
|
This solution is evidently more in favour of short term bonds and it
realizes a smaller
final amount of cash. On the other hand, applying this solution to the
same 100 payment profiles used above, it leads to cash profiles with
quite rare
constraint violations (see Fig. 1 b). The thin
line in Fig. 1 c shows that the maximum
percentage of negative cash is around 5% in good coincidence with the
chosen probability level. The gain in
reliability may be considered as substantial when compared to the loss
of
final cash. More insight is provided here by Fig. 1
d, where
the optimal value of problem (3) is plotted as a
function of the
probability level p. The decrease of the optimal value is quite
moderate
over a wide range of p, and it is only beyond 0.95 that a
further gain
in safety is bought by drastic losses in final cash.
When dealing with stochastic constraints, one could be led to the
simple
idea of replacing the random parameter by its expectation. In our
example,
this approach would reduce to the deterministic problem (since the
deterministic payments were taken as expectations for the stochastic
payments). Therefore, the solution based on chance constraints
illustrates
at the same time the superiority with respect to the reliability/costs
ratio over the expected value solution.
2.3 Stochastic Version:
joint chance constraints
The model with individual chance constraints is appealing for its
simple solvability. It has, however, the substantial drawback of not
reflecting the proper safety requirements. It guarantees that, for each
year fixed, the probability of negative cash is small. Yet, the
probability
of having negative cash at least once in the considered period may
remain
high. To illustrate this argument, Figure 2
plots again the 100 simulated cash profiles with those profiles being
emphasized as dark lines which become negative at least one time. In
case of the
expected value solution (Figure 2 a) there are
82 profiles with
occasional negative cash. This is an even more striking argument
against
the use of expected value solutions than the one of the preceding
section. For
the solution of individual chance constraints (Figure 2 b) it
turns out that 16 profiles fall below zero at certain times. In other
terms:
the probability of maintaining positive cash over the whole period is
around
0.84 and certainly significantly lower than the level of 0.95 chosen
for the
indivdual chance constraints. Obviously different subsets of profiles
violate
the constraint at different times. Of course, in practice, one has to
make sure
that - with high probability - cash remains positive over the whole
time
interval.
Figure 2: Illustration of the cash matching problem
(deterministic version
versus individual and joint chance constraints). For details see text.
In order to
take care of what one could call 'uniform safety', one would have to
replace
the m individual chance constraints in (3)
by one single joint chance constraint of the form
| P |
æ
è
|
n
å
i=1
|
aijxi
³ xj
( j=1,¼,m) |
ö
ø
|
³ p. |
|
(5) |
Observe the difference between (5) and the
constraint in
(3) which is given by the position of the the
counter
j=1,¼,m.
At first glance, dealing with a single rather than many inequalities,
seems
to be a simpler task. However, the opposite is true as the calculation
of
(5) requires dealing with multidimensional
distributions (see Section
3 below). The solution of the cash matching
problem with joint chance
constraints is
| x1=65.8, x2=83.7, x3=86.2, final amount of cash: 98,160. |
|
Comparing this with the previous solutions, once again there is some
shift
from long term to short term bonds, and the final amount of cash has
slightly
decreased again. On the other hand, robustness is significantly
improved:
Just 3 out of 100 cash profiles fall below zero, i.e. 97% of the
profiles
stay positive over the whole period (see Figure 2
c). This is in
good coincidence with the chosen probability level p=0.95 and
illustrates the
difference from individual chance constraints.
Although the cash matching problem is a prototype example for models
with
joint chance constraints, there are certain situations where the use of
individual chance constraints is appropriate. This applies, in
particular, when
proper working of some system (power generation, telecommunication) is
not
affected as long as only a sufficiently small number of components of
that
system fails. Then it does not make much sense to insist on proper
working of
a large majority of components over the whole period considered (joint
chance constraints). Rather, at each time, a possibly varying large
subset
of components should be out of failure.
Finally, we emphasize that our presentation of the cash matching
problem is just
a first approximation to a real life model. Better models would include
the
possibility of short term borrowing in case of constraint violation.
This leads to a mixture of probabilistic constraints and multistage
programs which is very challenging but goes beyond the purpose of
illustration
here. Also, to justify the simplified setting, one could imagine that,
for some
reason, borrowing is excluded or future conditions of borrowing are
extremely
uncertain. It has to be mentioned that our model has a similar
mathematical structure as many problems in engineering dealing with so
called storage level constraints [14,21,24].
[Top of page]
3 Algorithmic Challenges
and Theory
Not surprisingly, there does not exist a general solution method for
chance constrained
programs. The choice strongly depends on how random and decision
variables
interact in the constraint model. Sometimes a linear programming solver
will do
the job (e.g., the cash matching problem with individual chance
constraints, see
Section 2.2). In other models, one has to have
access to values and
gradients of multidimensional distribution functions (e.g., cash
matching
problem with joint chance constraints, see Section 2.3).
Of
particular interest is the application of algorithms from convex
optimization.
Convexity of chance constraints, however, does not only depend on
convexity
properties of the constraint function h in (1)
but also of the
distribution of the random parameter x. The
question of whether this
distribution is continuous or discrete is another crucial aspect for
algorithmic
treatment. The biggest challenges from the algorithmic and theoretical
points
of view arise in chance constraints where random and decision variables
cannot
be decoupled.
In contrast to conventional optimization problems, those including
chance
constraints introduce the presence of two kinds of approximations:
first, the
distribution of the random parameter is almost never known exactly and
has to
be estimated from historical data. Secondly, even a given multivariate
distribution (such as multivariate normal) cannot be calculated exactly
in
general but has to be approximated by simulations or bounding
arguments. Both
types of imprecision motivate the discussion of stability in programs
with chance constraints. All issues discussed up to now illustrate
the close tie
between algorithmic, structural and stability aspects. Some of these
shall
be briefly presented in the following sections. For the numerical
solution of
problems including chance constraints with random right hand side,
we refer to the SLP-IOR model management system (see [18] and
Section 5on web links). If more general models
are
considered, specially designed algorithms
are required.
3.1 Structural Aspects
Starting from a system of stochastic inequalities hj(x,x) ³ 0,
we have introduced before individual chance constraints as
| aj(x)
³ p
(j=1,¼,m), where aj(x):=P(hj(x,x) ³
0) (j=1,¼,m) |
|
and joint chance constraints as
| a(x)
³ p, where a(x):=P(hj(x,x) ³
0 (j=1,¼,m)). |
|
Formally, in both cases, one arrives at constraints on the decision
variables
as in usual optimization problems. However, a
or the aj are
not given by an explicit formula but rather defined as probabilities of
some
implicitly defined regions in the space of the random parameter x.
Calculating values, gradients and possibly Hessians of these functions
remains
the main challenge in chance-constrained programming. The degree of
difficulty
in doing so, will mainly depend on the following items:
- Model of chance constraints (individual or joint)
- Assumptions on the random vector (e.g., continuous or
discrete
distribution,
independent components)
- Type of stochastic inequalities (e.g., linear, convex,
random right
hand side)
The most favorable situation arises when
hj(x,x)=gj(x)-xj
(j=1,¼,m) for some
functions gj. In
that case, the stochastic inequalities take the form gj(x)
³ xj
so that
the random parameter appears on the right hand side. Using the
distribution
function Fh (z):=P(h £ z)
for some random variable h, we may
then write the individual chance constraints as
| aj(x)=P(gj(x)
³ xj)=Fxj(gj(x))
(j=1,¼,m). |
|
We recall that, for a one-dimensional random variable h, the following
relation holds true with respect to the p-
quantile qp of Fh:
| Fh (t) ³
pÛ t ³ qp: = |
inf
|
{t|Fh (t) ³ p}. |
|
Consequently, the individual chance constraints can be rewritten as
| aj(x)
³ pÛ gj(x) ³ q(j)p
(j=1,¼,m), |
|
where q(j)p is the p- quantile of Fxj.
In other words:
individual chance constraints with random right hand side inherit their
structure from the underlying stochastic constraint. If the latter was
linear
(as in the cash matching problem) then the induced individual chance
constraints
will be linear too. Unfortunately, this observation does not generalize
to
joint chance constraints. For random right hand side, the constraint
function
may still be written as a composition
| a(x)=P(gj(x)
³ xj
(j=1,¼,m))=Fx(g(x)), |
|
(6) |
where g=(g1,¼,gm)
and Fx is the
distribution function of
the random vector x. However, now the
distribution function is
multidimensional and simple quantile arguments can no longer be
applied. Evidently,
in order to calculate values and derivatives of a
one has to be able to
do so for multidimensional distribution functions (as g is
given by an
analytic formula in general). From the structural point of view, the
composition formula a = Fx°g
allows one to transfer analytic
properties like continuity, (local or global) Lipschitz continuity or
smoothness from Fx and g
to a. As far as convexity is
concerned, we refer to Section 3.3.
If the random vector x has independent
components, then the calculation
of a breaks down to one dimensional
distribution values again:
| a(x)=Fx1(g1(x))¼Fxm(gm(x)). |
|
Although the constraint a(x) ³ p cannot be further simplified
to an explicit constraint involving just the gj
(as was the case for
individual chance constraints), one may benefit from the fact that one
dimensional distributions are usually easy to calculate. On the other
hand, the
assumption of independent components is not justified in many
applications.
For instance, in the cash matching problem, the original random data
(payments)
were assumed to be independent, yet the cumulative data - which
appeared in the
chance constraint models - are definitely not independent as is
evident from the nondiagonal covariance matrix in (4).
3.2 Random right-hand
side with nondegenerate multivariate normal
distribution
Perhaps the most important special case in practical applications
arises from
joint chance constraints with random right-hand side having a
nondegenerate
multivariate normal distribution. This assumption was made for
instance, in the
cash matching problem of Section 2.3.
According to (6) the
constraint takes the form Fx(g(x))
³ p then. As g is
explicitly
given by a formula, in general, the evaluation of such constraints by
optimization algorithms requires the calculation of
Fx, ÑFx.
¼, i.e., of values and (higher)
derivatives of a nondegenerate multivariate normal distribution
function.
Fortunately, gradients of such distribution functions can be reduced
analytically to some lower dimensional multivariate normal distribution
functions (see [21], p. 204). Thus,
proceeding by induction
for higher order
derivatives, the whole optimization issue hinges upon the evaluation of
nondegenerate normal distribution functions in this situation.
Much progress has been made here when the dimension of x is
moderate. Two basic approaches
can be distinguished: the
first one is a combination of simulation and bounding techniques.
Efficient bounds on probabilities of the intersection of events
(special case: distribution functions)
are based on improvements of classical Bonferroni bounds and on
sophisticated graph theoretical derivations (e.g.,
[3,4,19]).
Starting from simulated samples,
these bounds may be used for constructing a variety of unbiased
estimators for the probability to be approximated. An optimal convex
combination of these estimators produces a final estimator having much
less
variance and, hence, yielding much more precise approximations than the
original ones (e.g., [31]). A
simulation method
specially designed for multivariate normal distributions has been
proposed
in [6]. The second approach, which
seems to be more precise
in low dimensions, relies on numerical integration (e.g.,
[11,29]).
A Fortran code is available at [10].
A hybrid method combining
the two approaches is discussed in [9].
3.3 Convexity
Convexity is a basic issue for theory (structure, stability) and
algorithms (convergence towards global solutions) in any optimization
problem. In chance constrained programming, the first question one
could
deal with is convexity of the feasible set defined say by a very simple
probabilistic constraint of the type
| {x | P(x £ x) ³ p}={x | Fx(x) ³
p}. |
|
(7) |
It is well known that such a set is convex if Fx is a quasiconcave
function. Although distribution functions can never be concave or
convex (due to being bounded by zero and one) it turns out that many of
them
are quasiconcave. The left plot of Figure 3
shows the graph of
the bivariate normal distribution function with independent components.
It is neither concave nor convex, but all of its upper level sets are
convex
(the boundary of the upper level set corresponding to the level p=0.5
is
depicted by a curve on the graph).
Figure 3: Bivariate normal distribution function (left)
and standard normal
distribution and its logarithm (right).
For algorithmic purposes it is often desirable to know that the
function
defining an inequality constraint of type ' ³
' is not just quasiconcave
but actually concave. As mentioned above, this cannot hold for
inequalities of type (7). However, a suitable
transformation
might do the job. Indeed, it turns out
that most of the prominent multivariate distribution functions (e.g.,
multivariate normal, uniform distribution on convex compact sets,
Dirichlet,
Pareto, etc.) share the property of being log-concave, i.e.,
log Fx is concave (an
illustration for the one-dimensional normal
distribution and its log is given in the right plot of Figure 3).
The key
for verifying such a nontrivial property for the distribution function
is to
check the same property of log-concavity for the density of Fx, if it exists. The latter task is easy in
general. For instance, a
nondegenerate normal density is proportional to the exponential of a
concave function, hence multivariate normal distributions are
logarithmically concave. The mentioned result is a consequence of a
famous
theorem due to Prékopa [21].
Now, when Fx is
log-concave, (7) may be equivalently rewritten as a
concave inequality constraint {x | log Fx(x) ³
log p}.
Nothing changes of course for more general constraints of the type
Fx(Ax) ³ p where A is a matrix.
This allows us, for instance, to solve the
cash matching
problem with joint chance constraints by means of convex optimization
algorithms, such as the supporting hyperplane method. Care has to be
taken
here, however, of the imprecise nature of Fx which is calculated by
the methods mentioned in Section 3.2. For
details of adapted
algorithms, we refer to [18,21,30].
A challenging question with importance both for algorithms and
stability
(see Section 3.4) is to characterize strong
concavity of the
log of distribution functions. Examples of distributions sharing this
property are the uniform distribution on rectangles [13] and
the multivariate normal distribution with independent components [15].
It is interesting to observe that uniform distributions on arbitrary
polytopes may lack strong log-concavity (e.g., on
conv {(0,1),(1,1)(1,0)}). The
problem of normal distributions with
correlated components is open.
3.4 Stability
In stochastic programming models, it is usually assumed that the
probability distribution of the random parameter is known. In practice,
however, this is rarely the case and one is faced with the need to
approximate this distribution by parameter estimation or
empirically. Solving the problem with approximating distributions
yields solutions and optimal values that differ from those of the
"true" problem. The
main concern then is whether small approximation errors may lead
to large deviations between solutions, or, expressed the other way
around,
whether it pays to spend large efforts in obtaining good
approximations in order to
arrive at solutions of high precision. The example of the
'value-at-risk'
(see below) confirms that even the most simple chance
constrained problems may fail to have stable solutions. In multistage
stochastic programming this issue is central, for instance, in the
context
of scenario reduction/construction. [25]
offers a general introduction to stability of stochastic programming
problems. Recent results on stability in chance constrained programming
can
be found in [13,15,17]. To give
a (strongly
simplified) idea of these results, consider a program of type
In particular the cash matching problem fits this setting upon
rewriting
(5) in terms of the distribution function Fx.
We denote by v(x) the optimal value
associated with (8),
where x is considered as a varying random
parameter close to some fixed
c (the theoretical random vector). The
following result holds:
Theorem 1
In (8), let f be convex and
let c have a log-concave
distribution function Fc.
If there exists some ¾x
such that Fc(A¾x)
> p , and if
the solution set for (8) at x
= c is
nonempty and bounded, then one has a local Lipschitz estimate of the
type
| | v(x)-v(c)| £ L |
sup
z Î Rs
|
|Fx(z)-Fc(z)| . |
|
(9) |
At the same time, the solution set mapping is upper semicontinuous at
c.
To achieve quantitative results (Hölder- or
Lipschitz stability) for solution sets, one has to impose further
structural
conditions (strong log-concavity, strict complementarity,
C1,1-differentiability, see [15]).
Let us apply the theorem above to the well-known
p-value-at-risk (p Î
[ 0,1] ) which is defined for a one
dimensional random variable x as the
quantity
| VaRp(x)= |
inf
|
{x | P(x ³ x) ³ p}. |
|
This is clearly a special instance of (8) with f(x)=x,
A=1,
so VaRp is the optimal value of a special
chance constrained program.
In particular, if c has a log-concave
distribution and
p Î ( 0,1) , then the
assumptions of the stability result in
Theorem 1, and we may derive from (9) the
following Lipschitz continuity result for VaRp:
| | VaRp(x)-VaRp(c)| £ L |
sup
z Î Rs
|
| Fx(z)-Fc(z)| . |
|
(10) |
Note that, in general, the value-at-risk does not depend continuously
on
x (e.g., for discrete distributions).
Hence, (10) strongly depends
on the assumption of c having a log-concave
distribution.
3.5 Beyond the classical
setting
The setting of joint chance constraints with random right-hand side and
nondegenerate multivariate normal distribution enjoys many desirable
features such as differentiability or convexity (via log-concavity). Of
course,
other settings may have practical importance too. For instance, the
distribution of the random right-hand side could be other than normal.
The cases of multivariate Gamma or Dirichlet distributions are
discussed
in [21], Section 6.6. Here,
log-concavity remains an important tool.
Things become different, however, when passing to discrete
distributions. These are of interest for at least
two reasons: first, the problem to be solved could have been
directly modeled by discrete
random variables (see, e.g., [1]).
Second,
there may be a need to approximate continuous distributions (e.g.,
multivariate normal) by discrete ones, for instance when treating
probabilistic constraints in two stage models with scenario
formulations
[27]. The key issue in discrete
chance constrained programming is
finding the so called p-efficient points (introduced in [20]) of
the distribution function Fx
of x. These are points z such
that Fx(z) ³ p and the relations Fx(y) ³
p, y £ z
(partial order of vectors) imply that y=z. One easily
observes that all
the information about the p-level set of Fx is contained in these
points because
| {y | Fx(y) ³
p}= |
È
z Î
E
|
(z+R+s), |
|
where E is the set of p-efficient points and R+s
is the
positive orthant in the space of the random vector. In the case of x
having integer-valued components and p Î
(0,1), P is a finite set (see
Theorem 1 in [7]). Algorithms for
enumerating or generating p-efficient points are described, for
instance, in
[1,7,22,23].
It is interesting to note that the log-concavity
concept, even if not directly applicable, can be adapted with useful
consequences to discrete distributions as well [7].
When aiming at the more general case of probabilistic constraints,
where the
random parameter x does not necessarily
appear on the right hand side
of the inequalities, it is mainly the convex or polyhedral case under
normal distribution which has some good chance of efficient treatment.
More
precisely, the function h(x,x)
is assumed to be quasiconcave in x
so that the probability of the convex set {x
| h(x,x) ³
0} is to be calculated
then. This can be done, for each given x, by the already
mentioned method
proposed in [6] which is designed
to calculate normal probabilities of
convex sets. However, unlike the case of normal distribution functions,
the derivative of P(h(·,x)
³ 0) no longer reduces
analytically to
values of the distribution function itself. A general derivative
formula for
probabilities of parameter
dependent inequalities has been established in [32], but its practical
use may be limited.
There is more hope in the specific polyhedral case h(x,x)=A(x)x-b(x), where A(x)
and b(x) are matrix and vector
functions, respectively. For each x fixed, (1)
amounts to the
calculation of the probability of some polyhedron. Apart from treating
polyhedra as special convex sets and applying [6] again, one could
alternatively pass to the transformed random vector hx
:=-A(x)x
so that
(1) can be equivalently written in terms of the
distribution function
Then, apparently, one seems to be back to the classical setting
discussed in
Section 3.2. For instance, if x had a multivariate normal
distribution, then so does hx
(as a linear transform). However, in many
important applications (like capacity expansion of networks with
stochastic
demands, see [21], section 14.3),
the number of inequalities
in the system
A(x)x ³ b(x) will typically exceed
the dimension of x even after
elimination of redundance. As a consequence, Fhx will have a
singular normal distribution which can no longer be
calculated by the
methods mentioned in Section 3.2. An
algorithm for calculating
singular normal distributions is proposed in [12]. A different
approach, which is more efficient in moderate dimension, relies on
directly
calculating (regular) normal distributions of polyhedra. Based on a
specific
inclusion-exclusion formula for polyhedra proved in [5],
the problem can be reduced to the calculation of a sum of regular
normal distribution
functions. At the same time, this offers the possibility of obtaining
gradients
too.
[Top of page]
4 Concluding remarks
Chance constraints offer a way to model reliability in optimization
problems.
This brief introduction could neither provide a survey on the whole
subject
nor give a representative list of references. There are many topics of
interest not covered by this paper. Research on theory and algorithms
of
chance constraints is quickly progressing with a focus on risk aversion
(e.g., integrated chance constraints or stochastic dominance)
which is important in finance applications. This in turn introduces
additional
interesting structures like "semi-infinite chance constraints"
(infinite
number of continuously indexed inequalities).
5 Links
References
- [1]
- P. Beraldi and A. Ruszczynski. A branch and bound
method for stochastic integer problems under probabilistic constraints.
Optimization Methods and Software, 17:359-382, 2002.
- [2]
- J.R. Birge and F. Louveaux. Introduction to
Stochastic Programming. Springer, New York, 1997.
- [3]
- E. Boros and A. Prékopa. Closed form two-sided
bounds
for probabilities that exactly r and at least r out of n
events occur.
Mathematics of Operations Research, 14:317-342, 1989.
- [4]
- J. Bukszár and T. Szántai. Probability
bounds
given by hypercherry trees. Optimization Methods and Software,
17:409-422, 2002.
- [5]
- J. Bukszár, R. Henrion, M. Hujter and T.
Szántai. Polyhedral inclusion-exclusion. Preprint No. 913,
Weierstrass
Institute Berlin, 2004, submitted.
- [6]
- I. Deák. Computing probabilities of rectangles in
case
of multidimensional distribution. Journal of Statistical
Computation
and Simulation, 26:101-114, 1986.
- [7]
- D. Dentcheva, A. Prékopa and A. Ruszczynski.
Concavity and efficient points for discrete distributions in stochastic
programming, Mathematical Programming, 89:55-79, 2000.
- [8]
- D. Dentcheva, B. Lai and A. Ruszczynski. Efficient point
methods for probabilistic optimization problems.
Stochastic Programming
E-Print Series (SPEPS),
No. 25, 2003.
- [9]
- H.I. Gassmann, I. Deák and T. Szántai.
Computing
multivariate normal probabilities: A new look. Journal of
Computational and Graphical Statistics, 11:920-949, 2002.
- [10]
- A. Genz.
http://www.sci.wsu.edu/math/faculty/genz/homepage
- [11]
- A. Genz. Numerical computation of the multivariate normal
probabilities. Journal of Computational and Graphical Statistics,
1:141-150, 1992.
- [12]
- A. Genz and K. Kwong. Numerical evaluation of Singular
Multivariate Normal Distributions, Journal of Statistical
Computation and Simulation, 68:1-21, 2000.
- [13]
- R. Henrion and W. Römisch. Metric regularity and
quantitative stability in stochastic programs with probabilistic
constraints. Mathematical Programming, 84:55-88, 1999.
- [14]
- R. Henrion and A. Möller. Optimization of a
continuous
distillation process under random inflow rate. Computers&Mathematics
with Applications, 45:247-262, 2003.
- [15]
- R. Henrion and W. Römisch. Hölder and Lipschitz
stability of solution sets in programs with probabilistic constraints.
Mathematical Programming, 100:589-611, 2004.
- [16]
- P. Kall and S. Wallace. Stochastic Programming.
Wiley, New York, 1994. Online available, see
SP
Resources at stoprog.org
- [17]
- V. Kanková. A note on multifunctions in stochastic
programming. Lecture Notes in Economics and Mathematical Systems,
458:154-168, 1998.
- [18]
- J. Mayer. Stochastic Linear Programming Algorithms.
Gordon and Breach, Amsterdam, 1998.
- [19]
- A. Prékopa. Boole-Bonferroni inequalities and
linear
programing. Operations Research, 36:145-162, 1988.
- [20]
- A. Prékopa. Sharp bound on probabilities using
linear
programming. Operations Research, 38:227-239, 1990.
- [21]
- A. Prékopa. Stochastic Programming. Kluwer,
Dordrecht, 1995.
- [22]
- A. Prékopa, B. Vizvári and T. Badics.
Programming
under probabilistic constraint with discrete random variable. In (F.
Giannessi et al. eds.): New Trends in Mathematical Programming,
pp.
235-255, Kluwer, Dordrecht, 1998.
- [23]
- A. Prékopa. Probabilistic programming. In: [28]
(Chapter 5).
- [24]
- A. Prékopa and T. Szántai. Flood control
reservoir
system design using stochastic programming. Mathematical
Programming
Study, 9:138-151, 1978.
- [25]
- W. Römisch. Stability of Stochastic Programming
Problems. In: [28] (Chapter 8).
- [26]
- A. Ruszczynski.
http://www.rusz.rutgers.edu
- [27]
- A. Ruszczynski. Probabilistic programming with discrete
distributions and precedence constrained knapsack polyhedra. Mathematical
Programming, 93:195-215, 2002.
- [28]
- A. Ruszczynski and A. Shapiro. Stochastic
Programming. Handbooks in Operations Research and Management
Science, Vol.
10. Elsevier, Amsterdam, 2003.
- [29]
- M. Schervish. Multivariate normal probabilities with error
bound. Applied Statistics, 33:81-87, 1984.
- [30]
- T. Szántai. A computer code for solution of
probabilistic-constrained stochastic programming problems. In (Y.
Ermoliev
and R.J.-B. Wets eds.): Numerical Techniques for Stochastic
Optimization, pp. 229-235, Springer, 1988.
- [31]
- T. Szántai and A. Habib. On the k-out-of-r-from-n:F
System with Unequal Element Probabilities. In (F. Giannessi et al.
eds.): New Trends in Mathematical Programming, pp. 289-303,
Kluwer,
Dordrecht, 1998.
- [32]
- S. Uryasev. Derivatives of probability functions and
integrals over sets given by inequalities. Journal of
Computational and Applied Mathematics 56:197-223, 1994.
|