Solving Optimization Problems with Equality Constraints
Basic Concepts
This article discusses optimization problems of the following form
where , and we assume the function is continuously differentiable, i.e. . Let us introduce a few basic concepts:
Regular point: For a point satisfying the constraints , if the gradient vectors are linearly independent, then is said to be a regular point of the constraints.
Tangent space: The tangent space at a point on the surface is the set . We can see that the tangent space T(x^{_}) is the null space of the matrix Dh(x^{_}), i.e. T(x^{_})=N(Dh(x^{_})).
Normal space: The normal space at a point on the surface is the set . We can see that the normal space N(x^{_}) is the null space of the matrix Dh(x^{_}), i.e. N(x^{_})=R(Dh(x^{_})^{T}).
The Lagrange Condition
First consider an optimization problem with only two decision variables and one equality constraint. Let be the constraint function. We know that at a point in the domain of the function, the gradient is orthogonal to the level set of passing through that point. Choose a point such that and . The level set passing through the point is the set . We can parameterize it within a neighborhood of using a curve , where is a continuously differentiable vector function :
Next, we can show that is orthogonal to . Since is the constant 0 along the curve , i.e. for all we have
therefore for any we have
Using the chain rule we obtain
Therefore and are orthogonal. When is a minimizer of subject to , we can show that is orthogonal to . Construct the composite function of :
It attains its minimum when . By the first-order necessary condition for unconstrained extremum problems, we know that
Using the chain rule we obtain
Therefore and are orthogonal. We have already shown above that is orthogonal to , so the vectors and are parallel, and we can derive the Lagrange theorem for this case:
Lagrange theorem for n=2, m=3: Let the point be a minimizer of the function subject to the constraint . Then and are parallel, i.e. if , then there exists a scalar such that
where is the Lagrange multiplier. Generalizing this theorem to the general case, i.e. when , we obtain: Lagrange theorem: Let be a local minimizer (or maximizer) of subject to the constraint . If is a regular point, then there exists such that
Second-Order Conditions
Second-order necessary condition: Let be a local minimizer of subject to the constraint . If is a regular point, then there exists such that
- 2. For all , we have
Second-order sufficient condition: Suppose the functions . If there exist a point and such that
- 2. For all , we have
then is a strict local minimizer of subject to the constraint .
This article introduced the method of Lagrange multipliers under equality constraints. Later we will also cover the method of Lagrange multipliers under inequality constraints, the KKT conditions, and more. To be continued…