1 of 12
http://en.wikipedia.org/wiki/Lagrange_multiplier
Lagrange multiplier
From Wikipedia, the free encyclopedia
Contents
1 Introduction
1.1 Not necessarily extrema
2 Handling multiple constraints
2.1 Single constraint revisited
2.2 Multiple constraints
3 Interpretation of the Lagrange multipliers
4 Sucient conditions
5 Examples
5.1 Example 1
5.2 Example 2
5.3 Example: entropy
5.4 Example: numerical optimization
6 Applications
6.1 Economics
6.2 Control theory
7 See also
8 References
9 External links
Introduction
One of the most common problems in calculus is that of nding maxima or minima (in general,
10/24/2012 02:56 PM
2 of 12
http://en.wikipedia.org/wiki/Lagrange_multiplier
"extrema") of a function, but it is often dicult to nd a closed form for the function being extremized.
Such diculties often arise when one wishes to maximize or minimize a function subject to xed outside
conditions or constraints. The method of Lagrange multipliers is a powerful tool for solving this class of
problems without the need to explicitly solve the conditions and use them to eliminate extra variables.
Consider the two-dimensional problem introduced above:
maximize
subject to
We can visualize contours of f given by
given by
and
are the respective gradients. The constant is required because although the two gradient vectors are
parallel, the magnitudes of the gradient vectors are generally not equal.
To incorporate these conditions into one equation, we introduce an auxiliary function
and solve
implies
One may reformulate the Lagrangian as a Hamiltonian, in which case the solutions are local minima for
the Hamiltonian. This is done in optimal control theory, in the form of Pontryagin's minimum principle.
10/24/2012 02:56 PM
3 of 12
http://en.wikipedia.org/wiki/Lagrange_multiplier
The fact that solutions of the Lagrangian are not necessarily extrema also poses diculties for numerical
optimization. This can be addressed by computing the magnitude of the gradient, as the zeros of the
magnitude are necessarily local minima, as illustrated in the numerical optimization example.
10/24/2012 02:56 PM
4 of 12
http://en.wikipedia.org/wiki/Lagrange_multiplier
This makes it clear that if we are at p, then all directions from this point that do not change the value of f
must be perpendicular to
(the gradient of f at p).
Now let us consider the eect of the constraints. Each constraint limits the directions that we can move
from a particular point and still satisfy the constraint. We can use the same procedure, to look for the set
containing the directions in which we can move and still satisfy the constraint. As above,
of vectors
, the following relation must hold:
for every vector v in
From this, we see that at point p, all directions from this point that will still satisfy this constraint must be
perpendicular to
.
Now we are ready to rene our idea further and complete the method: a point on f is a constrained
stationary point if and only if the direction that changes f violates at least one of the constraints. (We
can see that this is true because if a direction that changes f did not violate any constraints, then there
would a legal point nearby with a higher or lower value for f and the current point would then not be a
stationary point.)
If we now add another simultaneous equation to guarantee that we only perform this test when we are
at a point that satises the constraint, we end up with 2 simultaneous equations that when solved,
identify all constrained stationary points:
Note that the above is a succinct way of writing the equations. Fully expanded, there are
simultaneous equations that need to be solved for the
variables which are and
10/24/2012 02:56 PM
5 of 12
http://en.wikipedia.org/wiki/Lagrange_multiplier
Multiple constraints
For more than one constraint, the same reasoning applies. If two or more constraints are active
together, each constraint contributes a direction that will violate it. Together, these violation directions
form a violation space, where innitesimal movement in any direction within the space will violate one
or more constraints. Thus, to satisfy multiple constraints we can state (using this new terminology) that
at the stationary points, the direction that changes f is in the violation space created by the
constraints acting jointly.
The violation space created by the constraints consists of all points that can be reached by adding any
linear combination of violation direction vectorsin other words, all the points that are reachable
when we use the individual violation directions as the basis of the space. Thus, we can succinctly state
that v is in the space dened by
if and only if there exists a set of multipliers
such that:
which for our purposes, translates to stating that the direction that changes f at p is in the violation
space dened by the constraints
if and only if:
As before, we now add simultaneous equation to guarantee that we only perform this test when we are
at a point that satises every constraint, we end up with simultaneous equations that when solved,
identify all constrained stationary points:
The method is complete now (from the standpoint of solving the problem of nding stationary points)
but as mathematicians delight in doing, these equations can be further condensed into an even more
elegant and succinct form. Lagrange must have cleverly noticed that the equations above look like
partial derivatives of some larger scalar function L that takes all the
and all the
as inputs. Next, he might then have noticed that setting every equation equal to zero is
10/24/2012 02:56 PM
6 of 12
http://en.wikipedia.org/wiki/Lagrange_multiplier
exactly what one would have to do to solve for the unconstrained stationary points of that larger
function. Finally, he showed that a larger function L with partial derivatives that are exactly the ones we
require can be constructed very simply as below:
Solving the equation above for its unconstrained stationary points generates exactly the same
stationary points as solving for the constrained stationary points of f under the constraints
.
In Lagranges honor, the function above is called a Lagrangian, the scalars
are called
Lagrange Multipliers and this optimization method itself is called The Method of Lagrange Multipliers.
The method of Lagrange multipliers is generalized by the KarushKuhnTucker conditions, which can also
take into account inequality constraints of the form h(x) c.
then
So, k is the rate of change of the quantity being optimized as a function of the constraint variable. As
examples, in Lagrangian mechanics the equations of motion are derived by nding stationary points of
the action, the time integral of the dierence between kinetic and potential energy. Thus, the force on a
particle due to a scalar potential,
, can be interpreted as a Lagrange multiplier determining
the change in action (transfer of potential to kinetic energy) following a variation in the particle's
constrained trajectory. In control theory this is formulated instead as costate equations.
Moreover, by the envelope theorem the optimal value of a Lagrange multiplier has an interpretation as
the marginal eect of the corresponding constraint constant upon the optimal attainable value of the
original objective function: if we denote values at the optimum with an asterisk, then it can be shown
that
For example, in economics the optimal prot to a player is calculated subject to a constrained space of
actions, where a Lagrange multiplier is the change in the optimal value of the objective function due to
the relaxation of a given constraint (e.g. through a change in income); in such a context
is the
marginal cost of the constraint, and is referred to as the shadow price.
Sucient conditions
Main article: Hessian matrix#Bordered Hessian
Sucient conditions for a constrained local maximum or minimum can be stated in terms of a sequence
of principal minors (determinants of upper-left-justied sub-matrices) of the bordered Hessian matrix of
10/24/2012 02:56 PM
7 of 12
http://en.wikipedia.org/wiki/Lagrange_multiplier
[6]
Examples
Example 1
Suppose we wish to maximize
subject to the
constraint
. The feasible set is the unit circle, and
the level sets of f are diagonal lines (with slope -1), so we can
see graphically that the maximum occurs at
,
and that the minimum occurs at
.
Setting the gradient
and
, so
and
, where
. Substituting into the last
, which implies that the stationary points are
, which is attained at
, which is
Example 2
Suppose we want to nd the maximum values of
with the condition that the x and y coordinates lie on the circle around the origin with radius 3, that is,
subject to the constraint
10/24/2012 02:56 PM
8 of 12
http://en.wikipedia.org/wiki/Lagrange_multiplier
Then x2 = 2y2. Substituting into equation (iii) and solving for y gives this value of y:
Therefore, the objective function attains the global maximum (subject to the constraints) at
and the global minimum at
The point
is a local minimum and
maximum, as may be determined by consideration of the Hessian matrix of
Note that while
small positive
is a local
.
Example: entropy
10/24/2012 02:56 PM
9 of 12
http://en.wikipedia.org/wiki/Lagrange_multiplier
at each point
must equal 1, so
, such that:
Hence, the uniform distribution is the distribution with the greatest entropy, among distributions on n
points.
10/24/2012 02:56 PM
http://en.wikipedia.org/wiki/Lagrange_multiplier
and
,
where
is a small value.
Applications
Economics
Constrained optimization plays a central role in economics. For example, the choice problem for a
consumer is represented as one of maximizing a utility function subject to a budget constraint. The
Lagrange multiplier has an economic interpretation as the shadow price associated with the constraint,
in this example the marginal utility of income. Other examples include prot maximization for a rm,
10 of 12
10/24/2012 02:56 PM
http://en.wikipedia.org/wiki/Lagrange_multiplier
Control theory
In optimal control theory, the Lagrange multipliers are interpreted as costate variables, and Lagrange
multipliers are reformulated as the minimization of the Hamiltonian, in Pontryagin's minimum principle.
See also
KarushKuhnTucker conditions: generalization of the method of Lagrange multipliers.
Lagrange multipliers on Banach spaces: another generalization of the method of Lagrange
multipliers.
Dual problem
Lagrangian relaxation
References
1. ^ Bertsekas, Dimitri P. (1999). Nonlinear Programming (Second ed.). Cambridge, MA.: Athena Scientic.
ISBN 1-886529-00-0.
2. ^ Vapnyarskii, I.B. (2001), "Lagrange multipliers" (http://www.encyclopediaofmath.org
/index.php?title=Lagrange_multipliers) , in Hazewinkel, Michiel, Encyclopedia of Mathematics, Springer,
ISBN 978-1-55608-010-4, http://www.encyclopediaofmath.org/index.php?title=Lagrange_multipliers.
3. ^
Lasdon, Leon S. (1970). Optimization theory for large systems. Macmillan series in operations research.
New York: The Macmillan Company. pp. xi+523. MR 337317 (http://www.ams.org/mathscinetgetitem?mr=337317) .
Lasdon, Leon S. (2002). Optimization theory for large systems (reprint of the 1970 Macmillan ed.).
Mineola, New York: Dover Publications, Inc.. pp. xiii+523. MR 1888251 (http://www.ams.org/mathscinetgetitem?mr=1888251) .
4. ^ Hiriart-Urruty, Jean-Baptiste; Lemarchal, Claude (1993). "XII Abstract duality for practitioners". Convex
analysis and minimization algorithms, Volume II: Advanced theory and bundle methods. Grundlehren der
Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. 306. Berlin: SpringerVerlag. pp. 136193 (and Bibliographical comments on pp. 334335). ISBN 3-540-56852-2. MR 1295240
(http://www.ams.org/mathscinet-getitem?mr=1295240) .
5. ^ Lemarchal, Claude (2001). "Lagrangian relaxation". In Michael Jnger and Denis Naddef. Computational
combinatorial optimization: Papers from the Spring School held in Schlo Dagstuhl, May 1519, 2000.
Lecture Notes in Computer Science. 2241. Berlin: Springer-Verlag. pp. 112156.
doi:10.1007/3-540-45586-8_4 (http://dx.doi.org/10.1007%2F3-540-45586-8_4) . ISBN 3-540-42877-1.
MR 1900016 (http://www.ams.org/mathscinet-getitem?mr=1900016) .
6. ^ Chiang, Alpha C., Fundamental Methods of Mathematical Economics, McGraw-Hill, third edition, 1984: p.
386.
External links
Exposition
Conceptual introduction (http://www.slimy.com/~steuard/teaching/tutorials/Lagrange.html) (plus a
brief discussion of Lagrange multipliers in the calculus of variations as used in physics)
Lagrange Multipliers for Quadratic Forms With Linear Constraints (http://www.eece.ksu.edu/~bala
/notes/lagrange.pdf) by Kenneth H. Carpenter
For additional text and interactive applets
11 of 12
http://en.wikipedia.org/wiki/Lagrange_multiplier
12 of 12
10/24/2012 02:56 PM