OR-Notes

J E Beasley

OR-Notes are a series of introductory notes on topics that fall under the broad heading of the field of operations research (OR). They were originally used by me in an introductory OR course I give at Imperial College. They are now available for use by any students and teachers interested in OR subject to the following conditions.

A full list of the topics available in OR-Notes can be found here.

Separable programming

We examine one special kind of heuristic algorithm (called separable programming) that can be applied to certain types of nonlinear program's. (A heuristic algorithm is an algorithm that does not guarantee to find an optimal solution). We will illustrate separable programming by applying it to an example.

Example

Consider the following NLP (nonlinear program)

maximise (x₁)² + x₂subject to
x₁ + x₂ <= 7 
x₁ <= 5 
x₂ <= 3
x₁,x₂ >= 0

Here we have only one nonlinear term (x₁)² which is in the objective function which we are trying to maximise.

Consider the example NLP. Then the only nonlinear term is (x₁)² and this nonlinear term involves just the single variable x₁. We can deduce from the NLP that this single variable x₁ must lie between zero and five (0 <= x₁ <= 5). Hence (x₁)² - the nonlinear term - must lie between zero and 25 and so we have the graph shown below.

Now one obvious way of solving (heuristically, that is approximately) the NLP is to approximate the nonlinear term (x₁)² by some linear term since then we would then have an LP which we could solve to give an approximate (heuristic) solution to the original NLP. To see how this can be done choose some x₁ values between 0 and 5 (e.g. x₁ = 2, x₁ = 3) and for these (two) x₁ values define (three) straight lines to give a linear approximation of (x₁)² in the range (0 <= x₁ <= 5) we are interested in (as shown above).

Over the entire range 0 <= x₁ <= 5 with the breakpoints defined at x₁ = 0, x₁ = 2, x₁ = 3 and x₁ = 5 (the two breakpoints we had before and the two endpoints) we claim that introducing variables A₁, A₂, A₃, A₄ (>= 0), 4 breakpoints in all so 4 variables, one associated with each breakpoint, i.e.

A₁ associated with the first breakpoint (at x₁ = 0),
A₂ associated with the second breakpoint (at x₁ = 2),
A₃ associated with the third breakpoint (at x₁ = 3),
A₄ associated with the fourth breakpoint (at x₁ = 5)

where

A₁ + A₂ + A₃ + A₄ = 1    (1)

and

x₁ = A₁(0) + A₂(2) + A₃(3) + A₄(5)    (2)

and

either only one A_i (some i <= 4) non-zero    (3)

or only two adjacent A_i, A_i+1 (some i <= 3) non-zero    (4)

enables us to approximate (x₁)² by the linear expression

A₁(0)² + A₂(2)² + A₃(3)² + A₄(5)²

To see whether this claim is valid consider a few examples:

x₁ = 3

Then the appropriate A_i (i=1,2,3,4) values are clearly A₁ = A₂ = A₄ = 0 and A₃ = 1 with the approximation of (x₁)² being given by

A₁(0)² + A₂(2)² + A₃(3)² + A₄(5)² = 0 + 0 + 1(3)² + 0 = 9

So in this case the approximation of (x₁)² is exact.

Note here that the values of x₁ and A_i (i=1,2,3,4) satisfy equations (1) and (2) and also that equation (3) is satisfied (but not equation (4)).

Note also that the A_i (i=1,2,3,4) values are uniquely defined (i.e. given x₁ there is only one set of A_i values satisfying equations (1), (2), (3) and (4)).

x₁ = 2.25

Then the appropriate A_i (i=1,2,3,4) values can be deduced as follows (with the aid of the diagram above).

x₁ = 2.25 lies between the second and third breakpoints so that A₂ > 0 and A₃ > 0 (implying A₁ = A₄ = 0). Also x₁ = 2.25 lies 0.25 (A₃) of the way from the second breakpoint to the third breakpoint. Hence A₃ = 0.25, A₂ = 1 - A₃ = 0.75 and A₁ = A₄ = 0.

Note that equations (1) and (2) are satisfied as is equation (4) (but not equation (3)).

The approximation of (x₁)² is given by

A₁(0)² + A₂(2)² + A₃(3)² + A₄(5)² = 0 + 0.75(2)² + 0.25(3)² + 0 = 5.25

Since x₁ = 2.25 we have (x₁)² = 5.0625 and so we have a reasonable approximation.

x₁ = 1.0

Then the appropriate A_i (i=1,2,3,4) values can be deduced as follows. x₁ = 1.0 lies between the first and second breakpoints so that A₁ >= 0 and A₂ >= 0 (implying A₃ = A₄ = 0). Also x₁ = 1.0 lies 0.5 (A₂) of the way from the first breakpoint to the second breakpoint. Hence A₂ = 0.5, A₁ = 1 - A₂ = 0.5 and A₃ = A₄ = 0.

Note again that equations (1) and (2) are satisfied as is equation (4) but not equation (3)).

The approximation of (x₁)² is given by

A₁(0)² + A₂(2)² + A₃(3)² + A₄(5)² = 0 + 0.5(2)² + 0 + 0 = 2

Since x₁ = 1.0 we have (x₁)² = 1.0 so in this case the approximation is not very near to the true value. Plainly over the entire range for x₁ (0 <= x₁ <= 5) the approximation will be better at some points than at others.

Note here that it is important to ensure that the A_i (i=1,2,3,4) are such that either equation (3) or equation (4) is satisfied. If this is not the case then the approximation will be very far from the true value e.g. consider A₁ = 0.55, A₂ = A₃ = 0 and A₄ = 0.45 which do not satisfy either equation (3) or equation (4) but do satisfy equation (1) and equation (2) (with x₁ = 2.25). Then with these A_i values (i=1,2,3,4) the approximation of (x₁)² is given by

A₁(0)² + A₂(2)² + A₃(3)² + A₄(5)² = 0 + 0 + 0 + 0.45(5)² = 11.25

and compare this with the true value of 5.0625 and the previous approximation of 5.25 which we obtained with A₁ = 0, A₂ = 0.75, A₃ = 0.25 and A₄ = 0.

Hence we have shown how a nonlinear term like (x₁)² can be replaced by a linear approximation. So consider the original NLP which we were trying to solve:

maximise (x₁)² + x₂ 
subject to
x₁ + x₂ <= 7 
x₁ <= 5 
x₂ <= 3
x₁,x₂ >= 0

Introducing the linear approximation for (x₁)² we get the LP:

maximise [A₁(0)² + A₂(2)² + A₃(3)² + A₄(5)²] + x₂ 
subject to
x₁ + x₂ <= 7 
x₁ <= 5 
x₂ <= 3 
x₁ = A₁(0) + A₂(2) + A₃(3) + A₄(5) 
A₁ + A₂+ A₃ + A₄ = 1
and either only one A_i (some i <= 4) non-zero
or only two adjacent A_i, A_i+1 (some i <= 3) non-zero
x₁,x₂>=0 and A_i>=0 i=1,2,3,4

This linear program is known as a linear approximation of the original NLP and can be solved heuristically by a variant of the simplex algorithm for LP.

Note here that we cannot solve this program optimally because of the either/or restriction relating to the A_i's. Whilst we could resort to mixed-integer programming to deal with these either/or restrictions this is seldom done, given the computational cost of solving an MIP optimally and given that we are approximating the original NLP by the MIP.

Note also that other linear approximations exist - for example we could have split the (x₁)² nonlinear term differently by defining more breakpoints. This would lead to a more accurate linear approximation of (x₁)² but at the expense of a larger program (more variables and more constraints) for the variant of the simplex algorithm to solve (i.e. we are trading off accuracy of approximation against computer time).

In fact it is usual to vary the spacing of the breakpoints so that we have more breakpoints in areas of the nonlinear function we think are of special interest (e.g. an area likely to be involved in the solution or an area where the value of the nonlinear function changes rapidly/significantly) and less breakpoints in other areas.

So far we have only considered nonlinear terms in the objective function but it is plain that we can also deal with nonlinear terms in the constraints in a similar manner to that used above.

Note here that the condition for us to be able to take any NLP and convert it into a new, linear, program is that each nonlinear term is a function of just a single variable and does not involve more than one variable (i.e. the NLP contains terms like (x₁)² and (x₂)³ but no terms like (x₁)²(x₂)⁵ which involve two variables).

Summary

Let us then summarise the steps we need to go through to generate a linear approximation of an NLP:

Step 1

ensure that each nonlinear term is just a function of a single variable

Step 2

for each nonlinear term establish a range for the variable used in the nonlinear term (e.g. in our example above the nonlinear term was (x₁)² and the range for x₁ was 0 <= x₁ <= 5)

Step 3

introduce (arbitrary) breakpoints in the range of each variable, for each nonlinear term (e.g. in our example above we had one nonlinear term involving x₁ and breakpoints at x₁=2 and x₁=3 (plus the endpoints of x₁=0 and x₁=5))

Step 4

for each nonlinear term construct the linear approximation in the same manner as given in our example. To clarify the construction of the linear approximation let f(x) be the nonlinear function we are approximating with A_i (i=1,...,n) the variables associated with the breakpoints p_i (i=1,...,n) then f(x) is approximated by the linear expression

A₁(f(p₁)) + A₂(f(p₂)) + ... + A_n(f(p_n))
where x = A₁(p₁) + A₂(p₂) + ... + A_n(p_n)
A₁ + A₂ + ... + A_n = 1
and either only one A_i (some i <= n) non-zero or only two adjacent A_i, A_i+1 (some i <= n-1) non-zero.

Step 5

write out in full the complete linear approximation of the original NLP.

Any NLP in which all the nonlinear terms are just functions of single variables is called a separable program and the above method of linear approximation is sometimes called separable programming.

This restriction on the nature of nonlinear terms would seem to limit the applicability of the method but in fact we can deal with some product terms, like x₁x₂, by using the fact that

x₁x₂ = [(x₁ + x₂)/2]² - [(x₁ - x₂)/2]²

Again we shall illustrate by an example.

Example

maximise 2x₁ + x₂ 
subject to
x₁x₂ <= 10 
x₁ <= 5 
x₂ <= 3
x₁,x₂ >= 0

We approach the problem via the five steps given above.

Step 1

we need first to remove the x₁x₂ term (which is a function of two variables). Substituting from above for x₁x₂ the nonlinear constraint

x₁x₂ <= 10

becomes

[(x₁ + x₂)/2]² - [(x₁ - x₂)/2]² <= 10

We now introduce two new variables y₁ and y₂ where y₁ = (x₁ + x₂)/2 and y₂ = (x₁ - x₂)/2

Note here that y₁ >= 0 but y₂ can be positive or negative.

Then our nonlinear constraint becomes (y₁)² - (y₂)² <= 10 which is in a form we can deal with (e.g. consider the example given above where we approximated a square term).

Step 2

establish a range for both y₁ and y₂. To establish a valid range for y₁ we use the definition of y₁ given above. Since y₁ = (x₁ + x₂)/2 it is clear that

max value of y₁ = ([max value of x₁] + [max value of x₂])/2

and

min value of y₁ = ([min value of x₁] + [min value of x₂])/2

Now from the original NLP valid ranges for x₁ and x₂ are 0 <= x₁ <= 5 and 0 <= x₂ <= 3

Hence

max value of y₁ = (5 + 3)/2 = 4

min value of y₁ = (0 + 0)/2 = 0

so that a valid range for y₁ is 0 <= y₁ <= 4.

To establish a valid range for y₂ we proceed in a similar manner. Since y₂ = (x₁ - x₂)/2 it is clear that

max value of y₂ = ([max value of x₁] - [min value of x₂])/2 and

min value of y₂ = ([min value of x₁] - [max value of x₂])/2

So again using 0 <= x₁ <= 5 and 0 <= x₂ <= 3 we get that

max value of y₂ = (5 - 0)/2 = 2.5

min value of y₂ = (0 - 3)/2 = -1.5

Hence a valid range for y₂ is -1.5 <= y₂ <= 2.5.

Step 3

introduce (arbitrary) breakpoints in the range of each variable, for each nonlinear term, as below:

Term   Breakpoints                 Variables 
(y₁)²  y₁ = 2, y₁ = 3               A_i (i=1,2,3,4) 
(y₂)²  y₂ = -1, y₂ = 0, y₂ = 2      B_i (i=1,2,3,4,5)

Step 4

construct the linear approximation for each nonlinear term. Hence we have (y₁)² approximated by the linear expression

A₁(0)² + A₂(2)² + A₃(3)² + A₄(4)²

where

y₁ = A₁(0) + A₂(2) + A₃(3) + A₄(4)

A₁ + A₂ + A₃ + A₄ = 1

and either only one A_i (some i <= 4) non-zero or only two adjacent A_i, A_i+1 (some i <= 3) non-zero

(y₂)² approximated by the linear expression

B₁(-1.5)² + B₂(-1)² + B₃(0)² + B₄(2)² + B₅(2.5)²

where

y₂ = B₁(-1.5) + B₂(-1) + B₃(0) + B₄(2) + B₅(2.5)

B₁ + B₂ + B₃ + B₄ + B₅ = 1

and either only one B_i (some i <= 5) non-zero or only two adjacent B_i, B_i+1 (some i <= 4) non-zero

Step 5

the complete linear approximation of the original NLP is therefore given by:

maximise 2x₁ + x₂ 
subject to
y₁ = (x₁ + x₂)/2 
y₂ = (x₁ - x₂)/2 
y₁ = A₁(0) + A₂(2) + A₃(3) + A₄(4) 
y₂= B₁(-1.5) + B₂(-1) + B₃(0) + B₄(2) + B₅(2.5)
A₁ + A₂ + A₃ + A₄ = 1 
B₁ + B₂ + B₃ + B₄ + B₅ = 1 
[A₁(0)² + A₂(2)² + A₃(3)² + A₄(4)²] - 
[B₁(-1.5)² + B₂(-1)² + B₃(0)² + B₄(2)² + B₅(2.5)²] <= 10 
x₁ <= 5
x₂ <= 3 
x₁, x₂, y₁ >= 0 y₂ can be positive or negative 
A_i >= 0 i=1,2,3,4 
B_i >= 0 i=1,2,3,4,5
and either only one A_i (some i <= 4) non-zero
or only two adjacent A_i, A_i+1 (some i <= 3) non-zero
and either only one B_i (some i <= 5) non-zero
or only two adjacent B_i, B_i+1 (some i <= 4) non-zero

Note here that the y variables can be eliminated from the above program and so do not need to be explicitly considered.

Nonlinear programming example

Convert the following NLP into an appropriate linear approximation. Discuss the trade-off that occurs between the size of the resulting linear program and the accuracy of the approximation.

maximise (x₁)⁵ + x₂ 
subject to
x₁x₂ <= 17 
x₁ <= 3 
x₂ <= 4
x₁,x₂ >= 0

We again approach the problem via the five steps given above.

Step 1

in this NLP we have two nonlinear terms (x₁)⁵ and x₁x₂. Both of these nonlinear terms are functions of single variables if we use the expansion of x₁x₂ that we gave above, namely:

x₁x₂ = [(x₁ + x₂)/2]² - [(x₁ - x₂)/2]²

By putting

y₁ = (x₁ + x₂)/2

and y₂ = (x₁ - x₂)/2

where y₁ >= 0 but y₂ can be positive or negative we have that the nonlinear constraint

x₁x₂ <= 17

becomes (y₁)² - (y₂)² <= 17

which is in a form we can deal with as it contains two nonlinear terms, each a function of a single variable.

Step 2

establish a valid range for each of the variables involved in a nonlinear term. We have three nonlinear terms (x₁)⁵, (y₁)² and (y₂)² so we need to establish a valid range for x₁, y₁ and y₂.

Looking back at the original NLP it is clear that a valid range for x₁ is given by 0 <= x₁ <= 3.

To establish a valid range for y₁ we use the definition of y₁ given above.

Since y₁ = (x₁ + x₂)/2 it is clear that

max value of y₁ = ([max value of x₁] + [max value of x₂])/2 and

min value of y₁ = ([min value of x₁] + [min value of x₂])/2

Now from the original NLP valid ranges for x₁ and x₂ are given by 0 <= x₁ <= 3 and 0 <= x₂ <= 4 so that

max value of y₁ = (3 + 4)/2 = 7/2

min value of y₁ = (0 + 0)/2 = 0

Hence a valid range for y₁ is 0 <= y₁ <= 7/2.

We can establish a valid range for y₂ in a similar manner.

Since y₂ = (x₁ - x₂)/2 it is clear that

max value of y₂ = ([max value of x₁] - [min value of x₂])/2 and

min value of y₂ = ([min value of x₁] - [max value of x₂])/2

and again using 0 <= x₁ <= 3 and 0 <= x₂ <= 4

we get that

max value of y₂ = (3 - 0)/2 = 3/2

min value of y₂ = (0 - 4)/2 = -2

Hence a valid range for y₂ is -2 <= y₂ <= 3/2.

Step 3

introduce (arbitrary) breakpoints in the range of each variable, for each nonlinear term, as below:

Term    Breakpoints                  Variables 
(x₁)⁵   x₁ = 1, x₁ = 2                A_i (i=1,2,3,4) 
(y₁)²   y₁ = 1, y₁ = 2                B_i (i=1,2,3,4) 
(y₂)²   y₂ = -1, y₂ = 0, y₂ = 1       C_i (i=1,2,3,4,5)

Step 4

construct the linear approximation for each nonlinear term. Hence we have (x₁)⁵ approximated by the linear expression

A₁(0)⁵ + A₂(1)⁵ + A₃(2)⁵ + A₄(3)⁵

where

x₁ = A₁(0) + A₂(1) + A₃(2) + A₄(3)

A₁ + A₂ + A₃ + A₄ = 1

and either only one A_i (some i <= 4) non-zero or only two adjacent A_i, A_i+1 (some i <= 3) non-zero

(y₁)² approximated by the linear expression

B₁(0)² + B₂(1)² + B₃(2)² + B₄(7/2)²

where

y₁ = B₁(0) + B₂(1) + B₃(2) + B₄(7/2)

B₁ + B₂ + B₃ + B₄ = 1

and either only one B_i (some i <= 4) non-zero or only two adjacent B_i, B_i+1 (some i <= 3) non-zero

(y₂)² approximated by the linear expression

C₁(-2)² + C₂(-1)² + C₃(0)² + C₄(1)² + C₅(3/2)²

where

y₂ = C₁(-2) + C₂(-1) + C₃(0) + C₄(1) + C₅(3/2)

C₁ + C₂ + C₃ + C₄ + C₅ = 1

and either only one C_i (some i <= 5) non-zero or only two adjacent C_i, C_i+1 (some i <= 4) non-zero

Step 5

the complete linear programming approximation of the original NLP is therefore given by

maximise [A₁(0)⁵ + A₂(1)⁵ + A₃(2)⁵ +A₄(3)⁵] + x₂ 
subject to
x₁ = A₁(0) + A₂(1) + A₃(2) + A₄(3) 
y₁= (x₁ + x₂)/2 
y₂ = (x₁ - x₂)/2
y₁ = B₁(0) + B₂(1) + B₃(2) + B₄(7/2) 
y₂ =C₁(-2) + C₂(-1) + C₃(0) + C₄(1) + C₅(3/2)
A₁ + A₂ + A₃ + A₄ = 1 
B₁ + B₂ + B₃ + B₄ = 1 
C₁ + C₂ + C₃ + C₄ + C₅ = 1 
[B₁(0)² + B₂(1)² + B₃(2)² + B₄(7/2)²] - 
[C₁(-2)² + C₂(-1)² + C₃(0)² + C₄(1)² + C₅(3/2)²] <= 17 
x₁ <= 3 
x₂ <= 4 
All variables except y₂ >= 0, y₂ can be positive or negative 
and either only one A_i (some i <= 4) non-zero or only two adjacent A_i,
A_i+1 (some i <= 3) non-zero
and either only one B_i (some i <= 4) non-zero 
or only two adjacent B_i, B_i+1 (some i <= 3) non-zero
and either only one C_i (some i <= 5) non-zero 
or only two adjacent C_i, C_i+1 (some i <= 4) non-zero

Note here that the y variables (y₁,y₂) can be eliminated from the above program and do not need to be explicitly considered.

The trade-off between the size of the linear program and the accuracy of the approximation occurs because the more breakpoints we introduce for any nonlinear term the better we approximate the nonlinear term, but the larger the resulting linear program.

Nonlinear programming example

Convert the following NLP into an appropriate linear approximation.

maximise (x₁)² + 2x₂ + 3x₃ 
subject to
x₂ + log_e(x₁) >= 2 
x₂x₃ <= 20 
2 <= x₁ <= 3 
x₂ <= 5 
x₃ <= 20
x₁,x₂,x₃ >= 0

We again approach the problem via the five steps given above.

Step 1

in this NLP we have three nonlinear terms (x₁)², log_e(x₁) and x₂x₃. All of these nonlinear terms are functions of single variables if we use the expansion of x₂x₃ that we gave before, namely:

x₂x₃ = [(x₂ + x₃)/2]² - [(x₂ - x₃)/2]²

and putting

y₁ = (x₂ + x₃)/2

y₂ = (x₂ - x₃)/2

where y₁ >= 0 but y₂ could be positive or negative we have that the nonlinear constraint

x₂x₃ <= 20

becomes (y₁)² - (y₂)² <= 20

which is in a form we can deal with (as it contains two nonlinear terms, each a function of a single variable).

Step 2

establish a valid range for each of the variables involved in a nonlinear term - these ranges are

Term      Variable    Range 
(x₁²      x₁           2 <= x₁ <= 3 
log_e(x₁)  x₁           2 <= x₁ <= 3 
(y₁)²     y₁           0 <= y₁ <= 25/2
(y₂)²     y₂           -10 <= y₂ <= 5/2

Step 3

introduce (arbitrary) breakpoints in the range of each variable, for each nonlinear term as below:

Term      Breakpoints              Variables 
(x₁)²     None                     A_i (i=1,2) 
log_e(x₁)  x₁ = 2.5                  B_i (i=1,2,3) 
(y₁)²     y₁ = 4, y₁ = 8            C_i (i=1,2,3,4) 
(y₂)²     y₂ = -2, y₂ = 0, y₂ = 1   D_i (i=1,2,3,4,5)

Note here that we have different variables defined for the two nonlinear terms that involve x₁. This is essential if (as above) we have chosen different breakpoints for the two nonlinear terms. If we have the same breakpoints then the variables should be the same.

Step 4

construct the linear approximation for each nonlinear term. Hence we have (x₁)² approximated by the linear expression

A₁(2)² + A₂(3)²

where

x₁ = A₁(2) + A₂(3)

A₁ + A₂ = 1

log_e(x₁) approximated by the linear expression

B₁(log_e(2)) + B₂(log_e(2.5)) + B₃(log_e(3))

where

x₁ = B₁(2) + B₂(2.5) + B₃(3)

B₁ + B₂ + B₃ = 1

and either only one B_i (some i <= 3) non-zero or only two adjacent B_i, B_i+1 (some i <= 2) non-zero

Note especially here how the log_e(x₁) term has been approximated.

(y₁)² approximated by the linear expression

C₁(0)² + C₂(4)² + C₃(8)² + C₄(25/2)²

where

y₁ = C₁(0) + C₂(4) + C₃(8) + C₄(25/2)

C₁ + C₂ + C₃ + C₄ = 1

and either only one C_i (some i <= 4) non-zero or only two adjacent C_i, C_i+1 (some i <= 3) non-zero

(y₂)² approximated by the linear expression

D₁(-10)² + D₂(-2)² + D₃(0)² + D₄(1)² + D₅(5/2)²

where

y₂ = D₁(-10) + D₂(-2) + D₃(0) + D₄(1) + D₅(5/2)

D₁ + D₂ + D₃ + D₄ + D₅ = 1

and either only one D_i (some i <= 5) non-zero or only two adjacent D_i, D_i+1 (some i <= 4) non-zero

Step 5

the complete linear programming approximation of the original NLP is therefore given by

maximise [A₁(2)² + A₂(3)²] + 2x₂ + 3x₃ 
subject to
x₁ = A₁(2) + A₂(3) 
x₁ = B₁(2) + B₂(2.5) + B₃(3) 
y₁ = (x₂ + x₃)/2 
y₂ = (x₂ - x₃)/2 
y₁ = C₁(0) + C₂(4) + C₃(8)+ C₄(25/2) 
y₂ = D₁(-10) + D₂(-2) + D₃(0) + D₄(1) + D₅(5/2) 
A₁ + A₂ = 1 
B₁ + B₂ + B₃ = 1 
C₁ + C₂ + C₃ + C₄ = 1 
D₁ + D₂ + D₃ + D₄ + D₅ = 1 
x₂ + [B₁(log_e(2)) + B₂(log_e(2.5)) + B₃(log_e(3))] >= 2 
[C₁(0)² + C₂(4)² + C₃(8)² + C₄(25/2)²] - [D₁(-10)² +
D₂(-2)² + D₃(0)² + D₄(1)² + D₅(5/2)²] <= 20 
2 <= x₁ <= 3 
x₂ <= 5 
x₃ <= 20 
All variables except y₂ >= 0, y₂ can be positive or negative
and either only one B_i (some i <= 3) non-zero
or only two adjacent B_i, B_i+1 (some i <= 2) non-zero
and either only one C_i (some i <= 4) non-zero
or only two adjacent C_i, C_i+1 (some i <= 3) non-zero
and either only one D_i (some i <= 5) non-zero
or only two adjacent D_i, D_i+1 (some i <= 4) non-zero

Note here that we have no either/or condition relating to A_i since as we have only A₁ and A₂ the either/or condition that we would normally include is automatically satisfied. Note also that the y variables (y₁,y₂) can be eliminated from the above program and so do not need to be explicitly considered.

Nonlinear programming example 1986 UG exam

Transform the nonlinear program shown below to a linear program.

maximise (x₁)³ + x₁ + x₃ 
subject to 
x₃ + log_e(x₁) <= 7 
x₁ + x₂ + x₃ <= 9 
5 <= x₁ <= 10
0 <= x₂ <= 7 
4 <= x₃ <= 15

Solution

Step 1

each nonlinear term is already a function of a single variable.

Step 2

the ranges are

Nonlinear term   Variable    Range 
(x₁)³            x₁           5 <= x₁ <= 10 
log_e(x₁)         x₁           5 <= x₁ <= 10

Step 3

we now introduce (arbitrary) breakpoints in the range of each variable, for each nonlinear term, as below

Term     Breakpoints      Variables 
(x₁)³     x₁ = 7           A_i (i=1,2,3) 
log_e(x₁)  x₁ = 6, x₁ = 8    B_i (i=1,2,3,4)

Note here that although both nonlinear terms are functions of the same variable (x₁) we have different breakpoints and hence different variables (A_i and B_i).

Step 4

Construct the linear approximation for each nonlinear term.

Hence we have (x₁)³ approximated by the linear expression

A₁(5)³ + A₂(7)³ + A₃(10)³

where

x₁ = A₁(5) + A₂(7) + A₃(10)

A₁ + A₂ + A₃ = 1

and either only one A_i (some i <= 3) non-zero or only two adjacent A_i, A_i+1 (some i <= 2) non-zero

log_e(x₁) approximated by the linear expression

B₁(log_e(5)) + B₂(log_e(6)) + B₃(log_e(8)) + B₄(log_e(10))

where

x₁ = B₁(5) + B₂(6) + B₃(8) + B₄(10)

B₁ + B₂ + B₃ + B₄ = 1

and either only one B_i (some i <= 4) non-zero or only two adjacent B_i, B_i+1 (some i <= 3) non-zero

Step 5

the complete linear approximation of the original NLP is therefore given by

maximise A₁(5)³ + A₂(7)³ + A₃(10)³ + x₁+ x₃ 
subject to
x₃ + B₁(log_e(5)) + B₂(log_e(6)) + B₃(log_e(8)) + B₄(log_e(10)) <= 7 
x₁ = A₁(5) + A₂(7) + A₃(10) 
x₁ = B₁(5) + B₂(6) + B₃(8) + B₄(10)
A₁ + A₂ + A₃ = 1 
B₁ + B₂ + B₃ + B₄ = 1 
x₁ + x₂ + x₃ <= 9 
5 <= x₁ <= 10
0 <= x₂ <= 7 
4 <= x₃ <= 15
all variables >= 0
and either only one A_i (some i <= 3) non-zero
or only two adjacent A_i, A_i+1 (some i <= 2) non-zero
and either only one B_i (some i <= 4) non-zero
or only two adjacent B_i, B_i+1 (some i <= 3) non-zero