Full article: Using projected cutting planes in the extended cutting plane method

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

In this paper we show that simple projections can improve the algorithmic performance of cutting plane-based optimization methods. Projected cutting planes can, for example, be used as alternatives to standard cutting planes or supporting hyperplanes in the extended cutting plane (ECP) method. In the paper we analyse the properties of such an algorithm and prove that it will converge to a global optimum for smooth and nonsmooth convex mixed integer nonlinear programming problems. Additionally, we show that we are able to solve two old but very difficult facility layout problems (FLP), with previously unknown optimal solutions, to verified global optimum by using projected cutting planes in the algorithm. These solution results are also given in the paper.

Keywords:

2010 Mathematics Subject Classifications:

1. Introduction

A convergence proof for the outer approximation method, able to solve smooth (differentiable) convex mixed integer nonlinear programming (MINLP) problems to an optimal solution, was given in [Citation1]. The paper gave a starting point for a massive number of publications on MINLP applications, new methods and algorithms, many of them for the same class of problems but also some with proven convergence properties for more general classes of MINLP problems. Efforts were, later on, also put on collecting example problems in extensive problem libraries, for example the MINLP Library 2 [Citation2] on which MINLP algorithms and software have been and can be evaluated. Several overviews and performance studies on state of the art algorithms and software for MINLP problems have also been published [Citation3,Citation4].

In this paper we will analyse a modified version of one of these methods, the extended cutting plane (ECP) method [Citation5], which has its origin in the method of Kelley [Citation6] from 1960. In Kelley's method cutting planes are used to overestimate the feasible region and make the relaxation tighter in order to solve smooth convex nonlinear programming (NLP) problems to an optimal solution by a sequence of linear programming (LP) problems. The method was extended to smooth convex MINLP problems in [Citation5] where a convergence proof for the extended cutting plane (ECP) method was given. In a series of papers the ECP method has been further extended to smooth pseudoconvex MINLP problems in [Citation7,Citation8] and nonsmooth convex MINLP problems in [Citation9]. The method was further proven to solve generalized convex MINLP problems using cutting planes and supporting hyperplanes to a global optimal solution in [Citation10–12]. Note that Horst and Tuy [Citation13] have considered nonsmooth problems with linear outer approximations similar to cutting planes and supporting hyperplanes, as well. Standard cutting planes and supporting hyperplanes can also be replaced by (or used together with) projected cutting planes in the ECP algorithm [Citation14]. A projected cutting plane is a cutting plane generated at a MILP solution point or at the projected point. The projected point is obtained by projecting the MILP solution to the kernel of the linearization of the most violated constraint at the MILP solution point. This procedure can be continued to find different projected points. Similar projections has been used in the projection method [Citation15] to find a feasible point from the intersection of convex sets. In [Citation16,Citation17] it is shown that the projection method converges to a feasible point. In [Citation18] certain projections in nonsmooth NLP optimization was considered, as well. In this paper we consider more closely the convergence properties of the algorithm when projected cutting planes based on repeated projection steps are used to solve smooth and nonsmooth convex MINLP problems.

In Kelley's method a sequence of LP problems are solved to obtain points where cuts are to be generated. As LP problems in Kelley's method are replaced by MILP problems in the ECP method, the computational load is, generally, much heavier in the mixed integer case. However, cuts need not be generated at optimal solution points of the MILP subproblems. In proving that the solution sequence of the MILP subproblems converges to a global optimum of the MINLP problem, only the final MILP subproblem needs to be solved to optimality. When considering the ECP method, this observation is very essential since solving a single MILP problem to optimality, by a branch and bound method, may require solving millions of LP subproblems. This issue was discussed in [Citation8], where it was shown that the efficiency of the ECP method can be clearly improved when the cuts are generated mainly at integer feasible and not only at optimal MILP solution points. Such a procedure is easily implementable in an algorithm solving MILP subproblems with solvers like CPLEX [Citation19] or Gurobi [Citation20]. In these solvers the solution procedure can be interrupted after an integer feasible solution has been found and the procedure can therafter be continued utilizing the whole old branch and bound tree. In the ECP procedure, as described in [Citation8], the above property was utilized by using the ‘mip solutions limit’ (msl) parameter, when solving the MILP subproblems. After each iteration, either new cuts are added or the mip solutions limit is increased.

Since the MILP subproblem solutions are integer feasible it should be noted that two distinct sequences of points are obtained wherefrom upper and a lower bounds on the objective function can be calculated. Integer feasible MILP points being feasible in the MINLP problem give upper bounds while optimal MILP points being infeasible in the MINLP problem give lower bounds (assuming the objective function is minimized). Termination of the algorithm occurs when the gap between the bounds is closed or is small enough. In case all MILP subproblems are solved to optimality only one sequence of underestimating lower bounds of the objective function is obtained and an ε-termination criteria on the most violating constraint function is used.

A notable property of the algorithm is also that cuts are generated at integer feasible points. This is sometimes an advantage since there exist applied problems where nonlinear functions are not evaluatable at real values on integer variables [Citation21]. This property does, however, not generally retain when cuts are replaced by supporting hyperplanes. Supports are generated on the boundary of the relaxed feasible region. In addition to an (integer feasible) MILP point, being infeasible in the MINLP problem, a feasible MINLP point and a line search is usually needed to obtain a point on the boundary of the relaxed feasible region of the MINLP problem, where a supporting hyperplane is generated. When projected cuts are used, we will later show, that it is still possible to retain the same property for projected cuts as for standard ones, i.e. that the projected points may remain integer feasible.

When comparing cuts and supporting hyperplanes, one advantage of a supporting hyperplane is that it, generally, results in a tighter relaxation of a specific constraint, than a standard cutting plane. On the other hand, cutting planes are easier to calculate and several cuts (i.e. for different constraints) can be generated at the same MILP solution, while a point on the boundary of the feasible region of the MINLP problem rarely is valid to generate supports for several constraints. A higher number of cuts gives a tighter relaxation of the entire feasible region of the MINLP problem, resulting, often, in that fewer subproblems need be solved to obtain the optimal solution of the MINLP problem. On the other hand, a higher number of cuts, usually, also reduces the computational efficiency to solve the MILP subproblems and although fewer subproblems need be solved, the computational load to solve the entire MINLP problem may be higher.

Projected cutting planes have, in principle, the same advantages as standard cutting planes when comparing them to supporting hyperplanes. However, a projected cutting plane has an advantage when comparing it to a standard cutting plane. A projection moves the infeasible point towards the boundary of the feasible region and makes the projected cutting plane tighter. If a projection would move the point exactly to the boundary, then a projected cut would be a supporting hyperplane at this point. This is, however, not always desirable as the number of supports, that can be generated at a point on the boundary, is generally lower than the number of cuts that can be generated at a point in the infeasible region of the MINLP problem. It should also be mentioned that while a projection moves a point closer to the boundary of the feasible region it is not always possible to reach the boundary if some or all integer variables are fixed in the projection. This can, though, be avoided when relaxed values are allowed also for integer variables.

We will in the following show how projected points are calculated. The procedure can be applied in several steps, reprojecting already calculated projected points. In this case gradients or subgradients of the considered constraint need be calculated in every step of the reprojection. This issue and the convergence properties of the algorithm are considered in the next sections. In the final numerical section we will illustrate the computational procedure first using a simple example. Thereafter, we show that we were able to solve some very difficult facility layout problems (FLP), with formerly unknown optimal solutions, to verified global optimal solutions when using projected cutting planes instead of standard cutting planes in the ECP algorithm.

2. The MINLP problem

The MINLP problem to be solved is formulated as follows, (P) $\begin{aligned} x^{*} \in {a r g m i n}_{x \in L \cap C \cap Y} c^{T} x \\ L = {x \in X ∣ A x \leq a} \\ C = {x \in X ∣ g (x) \leq 0} \\ Y = {x \in X ∣ x_{i} \in Z, i \in I_{Z}, | I_{Z} | \leq n} \\ X = {x \in R^{n} ∣ x_{L} \leq x \leq x_{H}} \end{aligned}$ (P) i.e. find a vector $x^{*}$ of bounded real and integer variables, minimizing a linear function $c^{T} x$ in $L \cap C \cap Y$ . The continuous relaxation $L \cap C$ of $L \cap C \cap Y$ is supposed to be convex. We assume there exists an optimal solution $x^{*}$ and that the relaxed exterior L\C is nonempty and in case supporting hyperplanes (needing a line search procedure) are used that the relaxed interior ${x \in L ∣ g (x) < 0}$ is nonempty as well. The sets L and C are defined by linear and nonlinear inequality constraints, respectively. The integer variables in $x$ are defined by the index set $I_{Z}$ in Y and the bounds of all variables are defined in X. The nonlinear constraint functions $g$ in C are in this paper assumed to be convex, smooth or nonsmooth. If a smooth or nonsmooth convex objective function f is to be minimized an additional variable $x_{n + 1}$ can be minimized and the objective function be written as an additive constraint in C, as $f (x) - x_{n + 1} \leq 0$ . If the objective function would be a convex quadratic function $f (x) = x^{T} Q x$ then it can replace the linear objective function $c^{T} x$ in case the MILP subsolver in the ECP algorithm, is replaced by a MIQP subsolver. This can be done, for example, using the MILP/MIQP solvers in CPLEX and Gurobi.

3. Supporting hyperplanes, standard cutting planes or projected cutting planes

In a recent paper of Serrano et al. [Citation22] it was shown that the supporting hyperplane algorithm [Citation23] is equivalent to Kelley's cutting plane algorithm when reformulating the feasible region of the problem using its (sublinear) gauge function. This is theoretically interesting and in principle the case (neclecting the numerical differences) when comparing the smooth convex NLP algorithms of Kelley [Citation6] and Veinott [Citation24]. This also applies in the mixed integer case when considering the equivalence between the extended cutting plane method for smooth convex MINLP problems in Westerlund and Pettersson [Citation5] and the supporting hyperplane method for smooth convex MINLP problems in Kronqvist et al. [Citation23].

Supporting hyperplanes are halfspaces in the same way as cutting planes, cutting off a part of the infeasible region of a convex MINLP problem, while no part of the feasible region of the problem is cut off. A cutting plane has the property that it cuts off the MILP solution point where it is generated, as well. This is obvious as standard cutting planes are generated at points in the infeasible region of the MINLP problem. A supporting hyperplane is, on the other hand, generated at a point on the boundary of the feasible region (of the MINLP problem). It is, thus, obvious that such a point should not be cut off. To find a point on the boundary, a line search between two points is usually applied. One of the points is within the feasible region (of the MINLP problem) and the other outside it (this is the solution point from solving the MILP subproblem). Observe, however, that a line (in this case, between a feasible and an infeasible MINLP point) represents a convex set which can be divided into two line segments representing two convex subsets between which the supporting hyperplane acts as a separating hyperplane as well. No part of the feasible region (of the MINLP problem) is cut off by a supporting hyperplane. Obviously, no points on the line segment between the feasible point (in the MINLP problem) and the point on the boundary of the feasible region are cut off. Consequently, all points on the line segment being outside the feasible region (of the convex MINLP problem) are cut off by the supporting hyperplane, as it also acts as a separating hyperplane of the two convex subsets (i.e. the line segments) on the line, wherefrom the point $x_{s}$ on the boundary is calculated. The MILP solution point used in the line search will, thus, be cut off by a supporting hyperplane, in the same manner as a MILP solution point is cut off with a cutting plane.

Computationally there are, though, algorithmic differences when using supports or cutting planes, resulting in different performance. This is also the case when using projected cutting planes instead of standard ones. In this paper we use the accronyme PECP when using projected cutting planes in the ECP algorithm to distinguish, when standard or modified cutting planes in the ECP method have been replaced by projected cutting planes in the algorithm. In the solver GAECP (Generalized Alpha Extended Cutting Plane) [Citation25], used in the calulations of this paper, standard, modified and projected cutting planes as well as supporting hyperplanes can be used.

Standard cutting planes are generated at MILP solution points $x_{k}$ being in the infeasible region of the problem (P). The points $x_{s}$ where supporting hyperplanes are generated need usually satisfy $x_{s} \in {x ∣ g (x) = 0 \land x \in [x_{f}, x_{k}]}$ where $x_{k}$ is a MILP solution point and $x_{f}$ a point inside the relaxed feasible region $C \cap L$ of (P). In order to calculate the point $x_{s}$ a feasibility problem must, in this case, initially be solved to obtain the point $x_{f}$ . Each point $x_{s}$ is, thereafter, found by a line search between the point $x_{f}$ and the corresponding MILP solution point $x_{k}$ . The procedure to calculate the points $x_{p}$ where projected cutting planes are generated, is given in a later section. However, a first projected point $x_{p}$ is obtained when the MILP solution $x_{k}$ is projected orthogonally onto a first linearization $l_{k} (x, x_{k}) = g (x_{k}) + {ξ_{k}}^{T} (x - x_{k}) = 0,$ where $ξ_{k}$ is a subgradient belonging to the subdifferential $\partial g (x_{k}) = \{ξ ∣ g (x_{k}) + ξ^{T} (x - x_{k}) \leq g (x), \forall x \in R^{n}\} .$ For a smooth function $g : R^{n} \to R$ we have $\partial g (x) = {\nabla g (x)} f o r a n y x \in R^{n}$ . In Westerlund et al. [Citation12] basic definitions of subdifferentials, subgradients and their calculations can be found. More detailed information about nonsmooth functions and nonsmooth optimization can be found, for example, in the textbooks [Citation26,Citation27].

The projection results in a point with the closed form expression, $x_{p} = x_{k} - \frac{g (x_{k})}{ξ_{k}^{T} ξ_{k}} ξ_{k} .$ This point can then be used in an initial projected cutting plane. Reprojections from the obtained projected point can be done as well. At each step of the projection procedure the function value and a subgradient need be calculated at the previously obtained point when a new projection point is calculated.

A standard cutting plane (CP), supporting hyperplane (SH) or a projected cutting plane (PCP) generated for a constraint $g (x) \leq 0$ can be expressed as follows: (CP) $\begin{aligned} l_{k} (x, x_{k}) = g (x_{k}) + {ξ_{k}}^{T} (x - x_{k}) \leq 0; ξ_{k} \in \partial g (x_{k}) \end{aligned}$ (CP) (SH) $\begin{aligned} l_{k} (x, x_{s}) = g (x_{s}) + {ξ_{s}}^{T} (x - x_{s}) \leq 0; ξ_{s} \in \partial g (x_{s}) \end{aligned}$ (SH) (PCP) $\begin{aligned} l_{k} (x, x_{p}) = g (x_{p}) + {ξ_{p}}^{T} (x - x_{p}) \leq 0; ξ_{p} \in \partial g (x_{p}) . \end{aligned}$ (PCP) The expressions for (CP), (SH) and (PCP) are written in general form, valid for smooth and nonsmooth functions g.

3.1. The PECP algorithm

In [Citation5] it was shown that the extended cutting plane algorithm converges to a global optimum of the problem (P) using standard cutting planes when the constraint functions are smooth and convex. When supporting hyperplanes are used, Kronqvist et al. [Citation23] showed that also in this case the algorithm converges to a global optimum for smooth convex problems. In Eronen et al. [Citation9,Citation11] it was shown that the algorithm converges to a global optimum of the problem (P) for nonsmooth convex constraint functions as well using standard cutting planes and supporting hyperplanes, by only replacing gradients with subgradients in the cuts. This might look trivial. However, in [Citation9] it was shown that such a modification is not directly applicable to the linear outer approximation method by Fletcher and Leyffer [Citation28], when considering nonsmooth convex problems, since the linear outer approximation algorithm might end in an endless loop using such a modification.

In the following we will study the properties of the PECP algorithm both in the smooth and the nonsmooth convex case, i.e. when we replace standard cutting planes with projected cutting planes in the ECP algorithm. We will initially, reproduce the algorithm, in compact form, utilizing the solution sequence given in [Citation8] where intermediate MILP problems are mainly solved only to feasible solutions and not necessarily to optimal ones. In [Citation8] it was shown that smooth problems (P) can be solved to a global optimum using a sequence of feasible (and/or optimal) MILP problems $(P_{k})$ utilizing cutting planes.

In this paper we will prove that the algorithm also converges to a global optimum when considering nonsmooth problems (P), and solving the subproblems mainly to integer feasible solutions while using projected cutting planes in the algorithm. The sequence of points is generated as follows, $\begin{aligned} x^{*} \in a r g \tilde{min_{x \in L_{k} \cap Y}} c^{T} x k = 0, 1, \dots, K \\ Y = {x \in X ∣ x_{i} \in Z, i \in I_{Z}, | I_{Z} | \leq n} \\ X = {x \in R^{n} ∣ x_{L} \leq x \leq x_{H}} . \end{aligned}$ $L_{k}$ is a polytope, $L_{k} \supset L \cap C$ , overestimating $L \cap C$ in the subproblem $(P_{k})$ (i.e. subproblem k in the sequence, $k = 0, 1, \dots, K$ ). Notation $min^{~}$ indicates that all subproblems need not be solved to optimality. The procedure to solve the problems $(P_{k})$ and to generate the polytopes $L_{k}$ is given in the PECP algorithm. A 'mip solutions limit' parameter ( $m s l$ ) is used in MILP solvers like CPLEX and Gurobi (using a branch and bound procedure) to interrupt the solution of a MILP problem after the $m s l$ -first integer feasible solutions have been found. The solution of the MILP problem can thereafter (if no new cuts are added) be continued to a new integer feasible solution by increasing the value of $m s l$ while utilizing the old branch and bound (B&B) tree. If new cuts are added the old B&B tree cannot be utilized and the new MILP problem is solved from the beginning to the actual msl-value. The total number of integer feasible solutions to be found in a B&B tree before the MILP solution is found optimal is individual for each MILP problem to be solved. Thus, an individual ${m s l}_{k}$ index is used for each subproblem ( $P_{k}$ ).

The first subproblem is, by default, solved to the first integer feasible solution in the algorithm, thus ${m s l}_{0} = 1$ . The solution points of the subproblems $(P_{k})$ , are all integer feasible, when interrupting at a certain value of ${m s l}_{k}$ . If such a solution would be an optimal MILP solution the ${m s l}_{k}$ value is marked by ${m s l}_{k}^{*}$ . The optimal MILP solutions limit is, though, individual for each subproblem $(P_{k})$ and given by the MILP solver first when the MILP subproblem has been solved to optimality. Since all solution points $x_{k}$ are integer feasible and satisfy all linear constraints, a global optimum to (P) can be verified when a solution $x_{k}$ is within the set $C \cap L \cap Y$ and ${m s l}_{k} = {m s l}_{k}^{*}$ .

In the algorithm lower and upper bounds, LB and UB, of the objective function of the problem (P) can be calculated and used to verify the optimal solution. One can also terminate if the max function of the constraints, denoted $\tilde{g} (x) = max_{i} {g_{i} (x)}$ , is satisfied. When starting the algorithm the objective function is constrained in a given interval $L B_{l o w} \leq c^{T} x \leq U B_{h i g h}$ . The limit value ${m s l}_{h i g h}$ can be selected as the default value used, for example, in CPLEX. But typically ${m s l}_{h i g h} \geq 100$ is enough.

The PECP algorithm proceeds as follows:

(Data)	${m s l}_{h i g h}, ϵ_{g} > 0, ϵ > 0, U B_{h i g h} > L B_{l o w}$
(A)	$k = 0, {m s l}_{0} = 1, {m s l}_{0}^{*} = {m s l}_{h i g h}, L_{0} = L, U B = U B_{h i g h}, L B = L B_{l o w}$
(B)	$S o l v e (P_{k}) f r o m t h e b e g i n n i n g t o {m s l}_{k} a n d g o t o (D)$
(C)	$S o l v e (P_{k}) t o {m s l}_{k} u t i l i z i n g t h e o l d B & B - t r e e$
(D)	$I f t h e s o l u t i o n x_{k} o f (P_{k}) i s f o u n d o p t i m a l l e t {m s l}_{k}^{*} = {m s l}_{k}$
(E)	$I f \tilde{g} (x_{k}) \leq ϵ_{g} a n d c^{T} x_{k} < U B l e t U B = c^{T} x_{k}$
(F)	$I f {m s l}_{k} = {m s l}_{k}^{*} a n d c^{T} x_{k} > L B l e t L B = c^{T} x_{k}$
(G)	$I f {m s l}_{k} = {m s l}_{k}^{} a n d \tilde{g} (x_{k}) \leq ϵ_{g} o r \| U B - L B \| \leq ϵ t h e n x_{k}^{} = x_{k}, S T O P$
(H)	$I f {m s l}_{k} \neq {m s l}_{k}^{*} a n d \tilde{g} (x_{k}) \leq ϵ_{g} t h e n$ ${m s l}_{k} = {m s l}_{k} + 1 a n d g o t o (C)$
(I)	$C a l c u l a t e x_{p}, p r o j e c t e d c u t s, l_{k} (x, x_{p}) a n d l e t L_{k + 1} = {x ∣ l_{k} (x, x_{p}) \leq 0} \cap L_{k}$
(J)	$k = k + 1, {m s l}_{k}^{*} = {m s l}_{h i g h} a n d g o t o (B)$

Observe that the number of MILP iterations is $k + {m s l}_{k}$ when an increment of one is used on ${m s l}_{k}$ . The index k indicates which polytope $L_{k}$ is used when solving problem $(P_{k})$ . In the first iteration k = 0 and ${m s l}_{0} = 1$ . In the next iteration, either k or ${m s l}_{k}$ is increased. Thus at iteration 2 we have (k = 1 and ${m s l}_{1} = 1$ ) or (k = 0 and ${m s l}_{0} = 2$ ). When the algorithm proceeds, at each iteration, either k or ${m s l}_{k}$ is increased and the MILP iteration number will be equal to $k + {m s l}_{k}$ . In the following sections we will show how projected points $x_{p}$ and the projected cutting planes $l_{k} (x, x_{p})$ are calculated. The convergence properties of the algorithm are analysed in the section thereafter.

3.2. The projection step

In this section we give the projection procedure in the PECP algorithm and its closed form projections. Given an MILP/MIQP solution point $x_{k}$ the projections are used to generate a point where tighter cuts can be generated improving the numerical performance of the extended cutting plane algorithm. The projection procedure is, thus, primarily aimed to produce projection points where several tight cuts can be generated and not to generate points on the boundary of the feasible region where only one, or exceptionally only a few, supporting hyperplanes can be generated. However, in special cases, supporting hyperplanes can also be generated with the projection procedure.

A projected point, utilizing the max function, $\tilde{g}$ , and using successive projections is calculated as follows:

(Data)	$ϵ_{P} > ϵ_{g} > 0, n \times n m a t r i x D, P > 0$
(a)	$L e t p = 0; x_{0} = x_{k}; ξ_{0} = ξ_{k} w h e r e ξ_{k} \in \partial \tilde{g} (x_{k})$
(b)	$I f \tilde{g} (x_{p}) \leq ϵ_{P} t h e n g o t o (i)$
(c)	$L e t d_{p} = D ξ_{p} . I f d_{p}^{T} d_{p} = 0, g o t o (i) . O t h e r w i s e, c a l c u l a t e x_{p + 1} = x_{p} - \frac{\tilde{g} (x_{p})}{d_{p}^{T} d_{p}} d_{p}$
(d)	$C a l c u l a t e \tilde{g} (x_{p + 1}) = max_{i} {g_{i} (x_{p + 1})} a n d ξ_{p + 1}$
(e)	$I f \tilde{g} (x_{p + 1}) + ξ_{p + 1}^{T} (x_{k} - x_{p + 1}) \leq ϵ_{g} t h e n g o t o (i)$
(f)	$p = p + 1$
(g)	$I f p < P t h e n g o t o (b)$
(h)	$T h e r e s u l t i n g p o i n t i s x_{p} (w i t h ξ_{p})$
(i)	$S T O P : x_{p} i s t h e p r o j e c t e d p o i n t$

The parameter P used in the procedure is the maximum number of allowed repeated projections at a main iteration (k). The projections are additionally limited by a limit value restriction,

\tilde{g} (x_{p}) > ϵ_{P}

, considered before a projection or reprojection is done. A limit value,

ϵ_{P} = 1

has been used in the considered example problems. By the matrix

D

the direction of the projections can be selected. If the matrix

D

is an identity matrix the whole subgradient

ξ_{p}

at the point

x_{p}

is used in the projection. As the diagonal elements of

D

are connected to the elements in

x_{p}

certain variables can be left unaffected by the projections. Thus, for example, by letting the diagonal elements corresponding to the integer variables be zero, the projections are done only in the directions of the continuous variables. In this way one can, for example, avoid that nonlinear functions need be calculated at points with real values on the integer variables at a projected point. This may be an advantage when applying the algorithm to certain applied problems.

In [Citation13] a projection procedure to obtain supporting hyperplanes was suggested. The projection procedure in this paper, is on the other hand, primarily intended to tighten standard cuts, not to obtain (SH), of reasons discussed in the introduction. In special cases the procedure can, though, result in (PCP) equal to (SH) or (CP). In the next section we will show that the PECP algorithm converges to a global optimum when projected cutting planes are used and the algorithm is applied to smooth or nonsmooth convex MINLP problems (P).

4. Convergence properties of the PECP algorithm

In this section we prove that PECP algorithm converges to an $ϵ_{g}$ -feasible minimum when solving problem (P) assuming that L is a compact set and an optimal solution exists. A point $\tilde{x}$ is $ϵ_{g}$ -feasible minimum of problem (P), if $\tilde{g} (\tilde{x}) \leq ϵ_{g}$ and there are no feasible points yielding smaller objective function value than $c^{T} \tilde{x}$ . The convergence proof is quite similar to that for ECP method [Citation9] and its core ideas can be found in [Citation13], as well. We first prove that projected cutting planes do not cut off feasible points but will cut the previous MILP solution. The solution sequence will consist of different points and will have an accumulation point on a compact set. Any accumulation point turns out to be a global minimum and, thus, $ϵ_{g}$ -feasible minimum will be found after a finite number of iterations.

In this section we must consider two things not existing in the proof of [Citation9]. In this paper we do not assume that every MILP problem is solved to the optimum. Furthermore, we need an assumption that ${m s l}_{h i g h} = \infty$ and projected cutting plane is generated at the point $x_{p}$ if $l_{k} (x_{k}, x_{p}) > ϵ_{g}$ . We first prove that no feasible points are cut off.

Lemma 4.1

The projected cutting plane (1) $\tilde{g} (x_{p}) + ξ^{T} (x - x_{p}) \leq 0, ξ \in \partial \tilde{g} (x_{p})$ (1) does not cut off feasible points.

Proof.

Let $x \in C$ and $x_{p} \in R^{n}$ be arbitrary. By convexity $\tilde{g} (x_{p}) + ξ^{T} (x - x_{p}) \leq \tilde{g} (x) \leq 0,$ where the last inequality follows from the feasibility of $x$ . Hence, the feasible points will remain feasible after introducing projected cutting planes to the model.

Next, we prove that previous MILP solution is cut off.

Lemma 4.2

The point $x_{k}$ is cut off by projected cutting plane $l_{k} (x, x_{p}) \leq 0$ .

Proof.

If $x_{p} = x_{k}$ , then the projected cutting plane is a standard cutting plane. In this case $\tilde{g} (x_{k}) + ξ^{T} (x_{k} - x_{k}) = \tilde{g} (x_{k}) ≰ 0$ and $x_{k}$ is cut off. Otherwise, the projected point is chosen by the projection step so that $x_{k}$ will be cut off.

Next, we prove that $ϵ_{g}$ -feasible solution is found after a finite number of iterations. For the proof we recall that a convex function is locally Lipschitz continuous and on a compact set L this implies Lipschitz continuity on that set. Furthermore, if Lipschitz constant is K then for any subgradient of the function on the set L the inequality $∥ ξ ∥ \leq K$ holds true. This result can be found, for example, in [Citation27].

Theorem 4.3

Suppose that the projected point $x_{p}$ is accepted if $l_{k} (x_{k}, x_{p}) \geq ϵ_{g}$ . Then, the PECP algorithm will find a point $x_{k}$ such that $\tilde{g} (x_{k}) \leq ϵ_{g}$ after a finite number of iterations.

Proof.

Suppose that $ϵ_{g}$ -feasible point is not found after a finite number of iterations. By Lemma 4.2 MILP solutions generate a sequence of different points. In a compact set L this sequence will have an accumulation point. Thus, there are MILP solutions $x_{k_{1}}, x_{k_{2}}$ such that $k_{1} < k_{2}$ and $∥ x_{k_{2}} - x_{k_{1}} ∥ \leq \frac{ϵ_{g}}{2 K}$ , where K is the Lipschitz constant of $\tilde{g}$ in a compact set L.

Suppose that a standard cutting plane is created at the point $x_{k_{1}}$ . Then, $\begin{aligned} \tilde{g} (x_{k_{1}}) + ξ^{T} (x_{k_{2}} - x_{k_{1}}) & \geq ϵ_{g} - ∥ξ∥ ∥x_{k_{2}} - x_{k_{1}}∥ \\ \geq ϵ_{g} - K \cdot \frac{ϵ_{g}}{2 K} = \frac{ϵ_{g}}{2} > 0. \end{aligned}$ Thus, the cutting plane would cut off the point $x_{k_{2}}$ contradicting assumptions $k_{1} < k_{2}$ and $x_{k_{2}}$ being a MILP solution point.

Suppose then that a projected cutting plane is created at the point $x_{p}$ . Then, $\begin{aligned} \tilde{g} (x_{p}) + ξ^{T} (x_{k_{2}} - x_{p}) & = \tilde{g} (x_{p}) + ξ^{T} (x_{k_{1}} - x_{p}) + ξ^{T} (x_{k_{2}} - x_{k_{1}}) \\ \geq l_{k} (x_{k_{1}}, x_{p}) - ∥ξ∥ ∥x_{k_{2}} - x_{k_{1}}∥ \\ \geq ϵ_{g} - K \cdot \frac{ϵ_{g}}{2 K} = \frac{ϵ_{g}}{2} > 0. \end{aligned}$ Thus, the projected cutting plane would also cut off the point $x_{k_{2}}$ contradicting assumptions. Hence, an $ϵ_{g}$ -feasible point is found after a finite number of iterations.

The following lemma helps us deal with the case that all MILP solutions are not optimal.

Lemma 4.4

If L is a compact set, every MILP solution will be optimal after a finite number of iterations.

Proof.

Since L is compact and hence bounded, there are a finite number of different integer solutions. When solving MILP problem with branch and bound method, certain integer solution can be found only once, and thus, can not be revisited. Hence, if the parameter msl is greater than the number of different integer solutions, the optimal MILP solution will be found. By Theorem 4.3 and the PECP algorithm, the parameter msl is increased by one after a finite number of iterations. Hence, msl will be great enough to guarantee optimal MILP solution after a finite number of iterations.

Finally, we can prove the convergence theorem.

Theorem 4.5

Suppose that in the problem (P) the set L is compact. Then the PECP algorithm will find an $ϵ_{g}$ -feasible solution to the problem (P) after a finite number of iterations.

Proof.

The algorithm will stop if it finds a point $x_{k}$ that was optimal MILP solution and $\tilde{g} (x_{k}) \leq ϵ_{g}$ . Since the feasible set of any MILP includes the feasible set of the original problem by Theorem 4.1 and the objective functions of the problems are the same, $x_{k}$ is $ϵ_{g}$ -feasible solution to problem (P).

It is still to be proven that such $x_{k}$ is found after a finite number of iterations. By Lemma 4.4 after some iteration k, every MILP solution is optimal. By Theorem 4.3 we will find a point $x_{k}$ satisfying $\tilde{g} (x_{k}) \leq ϵ_{g}$ after a finite number of iterations.

Finally it may be mentioned that the given algorithm is easily extended to $f^{\circ}$ -pseudoconvex constraint functions. For such generalized convex constraints the subgradients have, though, to be taken from Clarke's subdifferentials (see [Citation27] for definitions). Additionally, the alpha-procedure in [Citation7,Citation8,Citation10] need to be applied on the projected (and standard) cutting planes.

5. Numerical examples

In this section we will solve some convex MILP problems with the PECP algorithm and illustrate the obtained solution results. First we consider a small smooth convex example problem from Kronqvist et al. [Citation23] and thereafter some larger smooth and nonsmooth convex facility layout problems from Castillo et al. [Citation29].

5.1. A small illustrative example

In [Citation23] an extended supporting hyperplane (ESH) method to solve smooth convex MILP problems was given. A comparison of the ESH algorithm with other algorithms, including the ECP method, was done in the paper and the ESH method was found very efficient. The paper contained a small smooth convex MINLP example problem, by which it was illustrated that the number of supporting hyperplanes was significantly lower than the number of standard cutting planes, when solving the problem with the ECP method. The example problem is given below letting us a nice opportunity to compare the results with those obtained with the PECP algorithm. The example problem is the following: (EP1) $\begin{aligned} m i n & - x_{1} - x_{2} \\ Subject to & g_{1} (x_{1}, x_{2}) = 0.15 (x_{1} - 8)^{2} + 0.1 (x_{2} - 6)^{2} + 0.025 e^{x_{1}} x_{2}^{- 2} - 5 \leq 0 \\ g_{2} (x_{1}, x_{2}) = \frac{1}{x_{1}} + \frac{1}{x_{2}} - x_{1}^{0.5} x_{2}^{0.5} + 4 \leq 0 \\ 2 x_{1} - 3 x_{2} - 2 \leq 0 \\ 1 \leq x_{1}, x_{2} \leq 20 x_{1} \in R x_{2} \in Z . \end{aligned}$ (EP1) The optimal solution to the problem, obtained in [Citation23] was: $x_{1}^{*} = 8.90363$ , $x_{2}^{*} = 12$ with an objective function value equal to $- 20.9036$ . The number of iterations, supports and cutting planes when solving the problem using supporting hyperplanes and standard cutting planes were also given in [Citation23] and can in this paper be found in the first two rows of Table .

Table 1. Results when solving the example problem (EP1) with ESH, PECP and ECP.

Download CSV Display Table

When solving the problem with the ESH method an interior point needs initially be found. In [Citation23] this point was found by solving a relaxed feasibility problem by letting the right hand side (RHS) of the two nonlinear constrains be a variable to be minimized. Solving this NLP problem the feasible solution $x_{1} = 7.45$ , $x_{2} = 8.54$ was found with the right hand side of the nonlinear constraints RHS $= - 3.72$ . Neglecting the computations to solve the feasibility problem the MINLP problem could, thereafter, be solved with the ESH method in six iterations using in total five supporting hyperplanes to obtain the optimal solution. In [Citation23] the feasibility problem was solved with the ECP algorithm, using the GAMS/AlphaECP solver. As illustrated in Figure in [Citation23] the problem (EP1) was solved to optimality with the ECP algorithm using in total 17 iterations when adding one cutting plane per iteration. These solution results together with our results when solving the example problem using supporting hyperplanes, standard cutting planes and projected cutting planes with the GAECP solver [Citation25] are given in Table as well.

Since this small example problem contains only one integer variable all MILP subproblems were solved to optimality. The parameter $m s l$ , used in the PECP algorithm was, therefore, initially given the value ${m s l}_{0} = 100$ . The projections, in the PECP algorithm, were in this small example calculated using both variables, i.e. the $D$ matrix was selected to be the identity matrix. In Table we report the solution results when using ESH, ECP and PECP, as well as the computational effort needed to obtain the feasible point in the ESH method. The feasibility problem is in our computations, solved as a relaxed NLP problem, in the same way as it was reported to be solved in [Citation23]. The effort needed to solve the feasibility problem should be added to the remaining computations needed to solve the optimization problem with the ESH algorithm. The total number of function evaluations as well as the CPU time needed to solve the problem are given in the last column of Table .

When using the PECP procedure different values on the parameter P, allowing different numbers of reprojections, were also tested. In Table the parameter value of P is indicated after the acronyme PCP. From Table one observe that the total number of projected cuts needed to solve the problem to optimality can clearly be reduced already by one allowed projection per cut, i.e. P = 1. The total number of cuts needed to obtain the optimal solution can be significantly reduced if the number of allowed reprojections is higher. In the table one can find that the total number of cuts needed to solve the MINLP problem to optimality is 10 if P = 1 and the number is reduced to 7 if the reprojections are allowed to be done in 2 steps per iteration. In case P is increased to 5 then the considered MINLP problem can be solved to global optimality using only 4 projected cuts. In Table the whole solution sequence for the PECP algorithm, with P = 5 in the projection procedure, is given. The projected points as well as all projected cuts to obtain a global optimum are given in the table. An $ϵ_{P}$ value equal to 1 was used in the projection procedure and an $ϵ_{g}$ value equal to 0.001 was used in the termination criterion of the PECP algorithm. From Table one can find that 5 MILP subproblems were solved and termination occured when $\tilde{g} (x_{k})$ was equal to 0.00000382. Observe, that no projections or reprojections were needed in the two last iterations. It may also be noted that only 3 projected cuts would have been needed to obtain the optimal solution (by solving only 4 MILP subproblems) in case an $ϵ_{g}$ value equal to 0.01 had been used.

Table 2. Solution sequence when solving the example problem (EP1) with PECP.

Display Table

In Table the corresponding solution sequence for ECP, including the generated cutting planes, are given. An $ϵ_{g}$ value equal to 0.001 was used in the ECP calculations as well. From the table it is found that the total number of cutting planes needed to obtain the optimal solution and the number of MILP subproblems to be solved are in this case much higher. In total 17 MILP subproblems were solved and 16 cutting planes had to be generated to obtain an optimum of the considered MINLP problem with the ECP algorithm.

Table 3. Solution sequence when solving the example problem (EP1) with ECP.

Display Table

When comparing the results of PECP with those of ESH we find that the number of projected cuts to solve the MINLP problem to a global optimum is equal to or lower when using projected cutting planes than when using supporting hyperplanes to solve the considered example problem. When comparing the number of function evaluations and the CPU time used we also find that PECP needed fewer function evaluations and less CPU time (except PCP 1) than ESH to solve the example problem. ECP needed even less function evaluations than PECP but the CPU time was the greatest.

5.2. Solving some more challenging MINLP problems

Encouraged of the results when solving the small example problem, we decided to test the projection procedure on some larger MINLP problems. From the papers [Citation30–33], we found that essential improvements on solution methods for facility layout problems (FLPs) have been made, during the last decade. Good feasible solutions are found to many FLPs by genetic algorithms and/or tabu searh procedures but it is still very difficult and a great challenge to solve even moderate size FLPs to global optimality, with exact methods.

In the paper by Castillo et al. [Citation29] the ECP method was found to be an attractive alternative to solve, at least moderate size, FLPs to optimality. The FLPs were formulated as smooth convex MINLP problems in [Citation29] and it was found that several of the problems considered could be solved to optimality by the ECP method. The ECP solution approach when solving intermediate MILP subproblems mainly to feasible and not optimal solutions was found to be very efficient, when applied to the FLPs. The largest FLPs, VC10, BA12 and BA14 (or BA13), considered in [Citation29], could, though, not be solved to optimal solution with the ECP algorithm.

Having this in mind we found it a challenge to try solving those FLPs that could not be solved to optimality with the ECP method in [Citation29]. The facility layout problem formulation in [Citation29] contains a large number of integer variables and the procedure of solving intermediate MILP subproblems mainly to feasible solutions was found attractive. We decided, therefore, to use the same strategy in the PECP procedure. The parameter msl in the PECP algorithm is, thus, given an initial value ${m s l}_{0} = 1$ in all our calculations.

The facility layout problem formulation in [Citation29] was initially convex and nonsmooth. The formulation was, thereafter, reformulated to smooth convex form before the problems were solved. The initial nonsmooth convex facility layout problem formulation in [Citation29] is given by: (FLP1) $\begin{aligned} m i n & \sum_{i = 1}^{N - 1} \sum_{j = i + 1}^{N} c_{i, j} (| x_{i} - x_{j} | + | y_{i} - y_{j} |) \\ {Subject to} & x_{i} + \frac{1}{2} w_{i} \leq W, i = 1, \dots, N \end{aligned}$ (FLP1) (L2) $\begin{aligned} x_{i} - \frac{1}{2} w_{i} \geq 0, i = 1, \dots, N \end{aligned}$ (L2) (L3) $\begin{aligned} y_{i} + \frac{1}{2} h_{i} \leq H, i = 1, \dots, N \end{aligned}$ (L3) (L4) $\begin{aligned} y_{i} - \frac{1}{2} h_{i} \geq 0, i = 1, \dots, N \end{aligned}$ (L4) (L5) $\begin{aligned} \frac{1}{2} (w_{i} + w_{j}) - (x_{i} - x_{j}) \leq W (X_{i, j} + Y_{i, j}), \forall 1 \leq i < j \leq N \end{aligned}$ (L5) (L6) $\begin{aligned} \frac{1}{2} (w_{i} + w_{j}) - (x_{j} - x_{i}) \leq W (1 + X_{i, j} - Y_{i, j}), \forall 1 \leq i < j \leq N \end{aligned}$ (L6) (L7) $\begin{aligned} \frac{1}{2} (h_{i} + h_{j}) - (y_{i} - y_{j}) \leq H (1 - X_{i, j} + Y_{i, j}), \forall 1 \leq i < j \leq N \end{aligned}$ (L7) (L8) $\begin{aligned} \frac{1}{2} (h_{i} + h_{j}) - (y_{j} - y_{i}) \leq H (2 - X_{i, j} - Y_{i, j}), \forall 1 \leq i < j \leq N \end{aligned}$ (L8) (C1) $\begin{aligned} - h_{i} + \frac{A_{i}}{w_{i}} \leq 0, i = 1, \dots, N \end{aligned}$ (C1) (C2) $\begin{aligned} - w_{i} + \frac{A_{i}}{h_{i}} \leq 0, i = 1, \dots, N \end{aligned}$ (C2) (B1) $\begin{aligned} w_{min} \leq w_{i} \leq \frac{A_{i}}{h_{min}}, i = 1, \dots, N \end{aligned}$ (B1) (B2) $\begin{aligned} h_{min} \leq h_{i} \leq \frac{A_{i}}{w_{min}}, i = 1, \dots, N \end{aligned}$ (B2) $\begin{aligned} x_{i}, y_{i}, w_{i}, h_{i} \in R, i = 1, \dots, N \\ X_{i, j}, Y_{i, j} \in {0, 1}, \forall 1 \leq i < j \leq N . \end{aligned}$ The nonsmooth convex MINLP problem (FLP1) has a nonsmooth convex objective function, $2 N (N + 1) + 3$ linear and $2 N$ smooth convex constraints, $4 N$ real and $N (N - 1)$ integer variables. The parameter N corresponds to the number of rectangular departments to be allocated in a rectangular facility area with the total width W and total height(lenght) H. The area of the departments are $A_{i}$ and their widths and height(lenght) are restricted by $w_{min}$ and $h_{min}$ respectively. The problem is to minimize the sum of the rectilinear distances between the midpoints of the departments multiplied by a cost factor $c_{i, j}$ related to flows between the departments. The variables $x_{i}$ and $y_{i}$ correspond to the midpoint coordinates of each department while $w_{i}$ and $h_{i}$ correspond to the department widths and heigths(lenghts) respectively. $X_{i, j}$ and $Y_{i, j}$ are binary variables ensuring that the overlapping prevention constrains for the departments are met. Symmetric layout solutions imply the existence of multiple solutions. To avoid such solutions some symmetry breaking constraints were used in [Citation29]. The following first two constraints avoid upside-down and mirror symmetric solutions. The third constraint (LB3) follows from the two first in the formulation. (LB1) $\begin{aligned} x_{n} - x_{m} \geq 0 \end{aligned}$ (LB1) (LB2) $\begin{aligned} y_{m} - y_{n} \geq 0 \end{aligned}$ (LB2) (LB3) $\begin{aligned} X_{n, m} - Y_{n, m} = 0 \end{aligned}$ (LB3) The constraints (LB1)–(LB2) force the department n to be located South–East of the department m, except for the situation when the centre point of both departments remain on the same horizontal or vertical axis. In this case only one of the considered symmetries is avoided. In the computations we tested some alternatives on n and m. Although at least one of the symmetries is always avoided, we found that n = 1 and m = 2 (or vice versa) worked well for problem VC10 and n = 1 and m = 7 gave good results for the other instances. These values were used in the remaining calculations. Note that in [Citation29] values n = 1 and m = 2 were used.

The problem (FLP1) is a nonsmooth convex MINLP problem since absolute values appear in the objective function. Problem (FLP1) can be written in the form (P) in different ways. A straightforward formulation is obtained by rewriting the objective function in (FLP1) to a single nonsmooth objective function constraint. However, utilizing separability reformulations can often be made numerically more efficient [Citation34]. The rectilinear distances between two departments can, for example, be written as separate constraints for each active connection. Then we obtain the following nonsmooth convex FLP2: (FLP2) $\begin{aligned} min & \sum_{i = 1}^{N - 1} \sum_{j = i + 1}^{N} c_{i, j} μ_{i, j} c_{i, j} \neq 0 \\ {Subject to} & (L1), \dots, (L8), (LB1), \dots, (LB3), (C1), (C2), (B1), (B2) \\ | x_{i} - x_{j} | + | y_{i} - y_{j} | - μ_{i, j} \leq 0 \forall i, j \in {i, j ∣ c_{i, j} \neq 0} \end{aligned}$ (FLP2) (B3) $\begin{aligned} μ_{i, j, min} \leq μ_{i, j} \leq μ_{i, j, max} \forall i, j \in {i, j ∣ c_{i, j} \neq 0} \end{aligned}$ (B3) $\begin{aligned} μ_{i, j} \in R \forall i, j \in {i, j ∣ c_{i, j} \neq 0} \\ x_{i}, y_{i}, w_{i}, h_{i} \in R, i = 1, \dots, N \\ X_{i, j}, Y_{i, j} \in {0, 1}, \forall 1 \leq i < j \leq N . \end{aligned}$ The nonsmooth convex problem (FLP2) has a linear objective function, $2 N (N + 1) + 3$ linear, M nonsmooth and $2 N$ smooth convex constraints, 4N + M real and $N (N - 1)$ integer variables. Here M corresponds to the number of nonzero elements $c_{i, j}$ in the problem.

The problem (FLP1) can also be reformulated to smooth convex form, by modelling each absolute value using an additional variable and two constraints as in [Citation29]. The problem formulation (FLP1) is then reformulated to the following smooth convex FLP3 form: $\begin{aligned} (FLP3) & min & \sum_{i = 1}^{N - 1} \sum_{j = i + 1}^{N} c_{i, j} (d_{i, j}^{x} + d_{i, j}^{y}) \\ (LS1) & S u b j e c t t o & d_{i, j}^{x} \geq x_{i} - x_{j}, \forall 1 \leq i < j \leq N \\ (LS2) & d_{i, j}^{x} \geq x_{j} - x_{i}, \forall 1 \leq i < j \leq N \\ (LS3) & d_{i, j}^{y} \geq y_{i} - y_{j}, \forall 1 \leq i < j \leq N \\ (LS4) & d_{i, j}^{y} \geq y_{j} - y_{i}, \forall 1 \leq i < j \leq N \\ (L1), \dots, (L8), (LB1), \dots, (LB3), (C1), (C2), (B1), (B2) \\ x_{i}, y_{i}, w_{i}, h_{i} \in R, i = 1, \dots, N \\ d_{i, j}^{x}, d_{i, j}^{y} \in R, \forall 1 \leq i < j \leq N \\ X_{i, j}, Y_{i, j} \in {0, 1}, \forall 1 \leq i < j \leq N . \end{aligned}$

The smooth convex problem (FLP3) has a linear objective function, $4 N^{2} + 3$ linear and $2 N$ smooth convex constraints, $N (N + 3)$ real and $N (N - 1)$ integer variables. The corresponding numbers of constraints and variables for the considered problems with different formulations are given in Table .

Table 4. Dimensions for each problem in different formulations.

Download CSV Display Table

In [Citation29] the considered facility layout problems were solved in the smooth convex FLP3 form. We have, for comparison, used the same FLP3 formulation, when solving the FLPs with the PECP algorithm, but also made a few other comparisons using the nonsmooth convex formulation (FLP2). The parameters for the considered FLPs can be found in the tables in the appendix. In Tables the best solutions found when solving the instances VC10, BA12, BA14 and BA14B, with PECP in the FLP3 form are given. In Figures the corresponding layout solutions are shown. In the PECP algorithm we have used the parameter values P = 3, $ϵ_{P} = 1$ , ${m s l}_{0} = 1$ and $ϵ_{g} = 0.001$ , unless stated otherwise. All the problems were solved using version 5.538 (2019-5-17) of the GAECP solver described in [Citation25]. The MILP subsolver was CPLEX version 12.6.1 [Citation19] using default parameters, unless stated otherwise. The computations were done on a computer with a 64 bit Windows 10 operating system, running on an Intel(R) Core(TM) i7-5600U CPU @2.6 GHz with installed 8.00 GB RAM.

Figure 1. VC10, Optimal solution 19973.2.

Figure 2. BA12, Optimal solution 8021.0.

Figure 3. BA14, Best found solution 4628.5.

Figure 4. BA14B, Best found solution 4714.3.

Table 5. Verified optimal solution, 19973.2, of VC10 solved in FLP3 form using PECP.

Display Table

Table 6. Verified optimal solution, 8021.0, of BA12 solved in FLP3 form using PECP.

Display Table

Table 7. Best found solution, 4628.5, of BA14 solved in FLP3 form using PECP.

Display Table

Table 8. Best found solution, 4714.3, of BA14B solved in FLP3 form using PECP.

Display Table

The VC10 problem was solved in the FLP3 form to a verified optimal solution (using $ϵ_{g} = 0.0001$ ) in 154 iterations (msl = 38). The optimal solution was found at iteration 149 (msl = 34). The BA12 problem was solved in the FLP3 form to a verified optimal solution (using $ϵ_{g} = 0.000001$ ) in 44 iterations (msl = 27). The optimal solution was found at iteration 29 (msl = 14). The best found solution of the BA14 problem solved in FLP3 form was found after 73 iterations (msl = 33). The best found solution of the BA14B problem solved in FLP3 form was found after 86 iterations (msl = 48). However, problems BA14 and BA14B could not be solved to a verified optimal solution because of limitation in memory space for the MILP subsolver.

In Table the results, when solving VC10, BA12, BA14 and BA14B in FLP3 form, with the PECP algorithm are summarized. Equal or almost equal solution results to the considered FLPs have, though, been found by genetic [Citation32] and tabu search [Citation30] algorithms some years ago. In Table we have summarized results from some papers where the considered FLPs have been solved together with our results when solving the problems with PECP, ECP and ESH in the smooth convex FLP3 form. From the table it is found that genetic and tabu search algorithms are very efficient in finding good feasible solutions for FLPs. However, genetic and tabu search algorithms lack the property of verifying if an obtained solution, is optimal or not, while the PECP algorithm has this property. From the results in Table it is, though, found that the final verification of optimality was very time consuming even if the problems were solved in smooth convex FLP3 form. However, the solution points given in Tables and , obtained with PECP, are the global optimal ones and to the authors knowledge this is the first time ever, that proven global optimal solutions to these problems have been published. The problems BA14 and BA14B could, however, not be solved to global optimality with the PECP algorithm in the FLP3 form. These solution results are, given in Tables and as well as illustrated in the Figures and . These solutions with PECP are the best solutions published as far as the authors know. All the results with PECP in Tables were obtained by solving the problems in smooth convex FLP3 form in order to obtain a comparison with the results in [Citation29], where this formulation initially was used. The ECP method used in this paper correspond to the ECP solution method in [Citation29] but it has, though, been solved with a faster computer and a newer version of the MILP subsolver CPLEX in the ECP algorithm.

Table 9. Results when solving four challenging facility layout design problems.

Display Table

In the final rows of Table the results using the ECP and ESH methods are included as well. From the table it is found that the VC10 problem could also be solved to a verified optimal solution with the ECP and ESH methods, but the required CPU time was shorter for the PECP method. The feasibility problem in ESH could be solved with the algorithm in [Citation12] in a fraction of a second, resulting in a RHS of the constraints equal to $- 16.8$ . But the total CPU time to solve the VC10 problem was magnitudes longer and also much longer than with PECP. Surprisingly, no other of the considered FLP-problems could be solved with ESH. The reason for this is that no relaxed interior points $x_{f} \in {x \in X ∣ g (x) < 0}$ can be found for the problems BA12, BA14 and BA14B. One could use the technique of [Citation23] to create approximate interior points but this is sensitive to numerical accuracies which might be hard to deal with. Supporting hyperplanes can, therefore, not be generated at points $x_{s} \in {x ∣ \tilde{g} (x) = 0 \land x \in [x_{f}, x_{k}]}$ by using a line-search procedure. All relaxed feasible, as well as all integer feasible solutions for the problems BA12, BA14 and BA14B are, thus, found on the boundary of $L \cap C$ . One reason for the lack of an interior point on those problems is that for i = 12 the width and height of the department is constrained to 1. Thus the nonlinear constraints (C1) and (C2) are always active for i = 12. From Table it is, furthermore, found that the BA12 problem could not be solved to the same solution with ECP as with PECP and ECP could not verify an optimal solution, because of limitation in memory space for the B&B tree in the MILP subsolver. This was also the case when solving the BA14 and BA14B problems with both ECP and PECP. The solutions found with PECP where, though, better with PECP than with ECP.

As ECP, PECP and ESH are able to solve smooth and nonsmooth convex MINLP problems to optimality we did also a small comparative study with the problem VC10 to find out in which form (FLP2) or (FLP3) and by which approach (ECP, PECP or ESH) this FLP is most efficiently solved. In the comparison we used the parameter values: ${m s l}_{0} = 1$ and $ϵ_{g} = 0.001$ in ECP, PECP and ESH as well as $ϵ_{P} = 1$ and P = 3 in PECP. The results are given in Table , where one finds that the problem VC10 was solved to global optimality with ECP, PECP and ESH using both the nonsmooth convex FLP2 and the smooth convex FLP3 form of the problem. From Table it is found that the VC10 problem could be solved faster with PECP than with ECP or ESH and the problem was solved faster in the nonsmooth convex FLP2 form than in the smooth convex FLP3 form with all the methods. Since the smooth convex FLP3 formulation include linear expressions (LS1)–(LS4) for all $1 \leq i < j \leq N$ while the nonsmooth convex FLP2 form include corresponding nonlinear inequalities (C3-NS) only for those $i, j \in {i, j ∣ c_{i, j} \neq 0}$ one could assume that the formulation FLP3 would be faster if the linear expressions (LS1)–(LS4) would also be restricted to only those $i, j \in {i, j ∣ c_{i, j} \neq 0}$ . We call this more compact FLP3 formulation as FLP3c in Table . From the Table we can see that the higher computational time of the FLP3 formulation compared to the FLP2 formulation can not be explained by a higher number of (LS1)–(LS4) constraints in the original FLP3 form than the corresponding number of (C3-NS) constraints in the FLP2 form where the tighter index set $i, j \in {i, j ∣ c_{i, j} \neq 0}$ was used. From Table it is also found that the MINLP problem VC10 was solved to global optimality by only solving one MILP subproblem to optimality in each of the cases with ECP and PECP.

Table 10. Results when solving VC10 in FLP2, FLP3 and FLP3c form by ECP, PECP and ESH (using $ϵ_{g} = 0.001$ ).

Display Table

From Table it is found that the number of iterations is higher with ESH than with ECP and PECP. This has its explanation in that the number of supports generated per iteration with ESH is much lower than the number of cuts that can be generated per iteration with ECP and PECP. When using ESH, it was found that a maximum of two supports could be generated and added in a few iterations while only one could be added in the other iterations. The final polytope $L_{k}$ approximating the relaxed feasible region, $L \cap C$ , (in the problem $(P_{k})$ ) was, thus, obtained with a higher k-value and a higher total computational load with ESH than with ECP and PECP. Furthermore, it is notable that the CPU time needed to build up the final polytope, $L_{k}$ , and the final MILP problem, $P_{k}$ , was in some cases shorter than the CPU time needed to solve the final MILP problem, $P_{k}$ , once to optimality This has an additional value, because not only an optimal solution result is obtained for each case, but also a final MILP problem, $P_{k}$ , wherefrom the solution can be obtained.

In connection to all results it should be mentioned that when comparing our solution results with those given in other papers we found that even better solution results than our optimal solutions have been reported. However, in the papers, where better solutions than ours were found, we also found differencies in the parameters or in the formulation used, explaining this discrepancy. For example, Zarali et al. [Citation35] reported a solution (with the objective function value) 7715.03 on the instance BA12, while the (global optimal) solution we obtained, has an objective function value equal to 8021.0 with at least 6 decimals accuracy. But, when considering the solution in [Citation35], we found that the authors have used a total facility area of $7 \cdot 9 = 63$ , while the total facility area for the BA12 problem in [Citation36] was $6 \cdot 10 = 60$ . In addition Zarali et al. [Citation35] used euclidean distances between the departments in the objective function and not rectilinear. Using a rectilinear distances criteria the solution of BA12 in [Citation35] would have been 9703. The best solution value 4165.24 on the objective function for the instance BA14 found in [Citation35] was, as well, much lower than the best found solution value 4628.5 we obtained. However, also in this case, Zarali et al. [Citation35] has used euclidean distances in the objective function. Thus, these solution values are not comparable.

We found, furthermore, some smaller, but clearly notable, differences between our solution results and earlier reported ones. We found, though, that some of these differences could be explained by different solution accuracy only. For example, Goncalves and Resende [Citation32] reported a best known solution 19951.2 to the problem VC10 from [Citation37] while we report a global optimal solution with the objective function value equal to 19973.2. When solving this problem with different accuracies, we found that it is, most probably, the accuracy that has been used by Goncalves and Resende [Citation32] that makes the difference. When we solved the VC10 problem to optimality with different accuracy, we obtained optimal objective function values of the problem between 19933.5 and 19973.2. These solution results are given in Table . It may be noted that the results in Table correspond to those when solving the VC10 problem in the smooth convex FLP3 form. The computational times are slightly lower in case the nonsmooth FLP2 form is used. The accuracy measured as the maximum relative error (in percentage) of any department area is a simple measure, given in the papers [Citation29,Citation32]. This accuracy measure is calculated as follows, (2) $E_{max} = max_{i = 1, \dots, N} \frac{| A_{i} - w_{i} h_{i} |}{A_{i}} \cdot 100 % .$ (2) For the solutions between 19933.5 to 19973.2 the $E_{max}$ value was between 0.2 % and 0.0002 % in our calculations, as can be found in Table . According to these results, a solution with an objective function value equal to 19951.7, would, most likely have an $E_{max}$ value close to 0.05%. Thus we feel the solution reported in [Citation31], would, in principle, be the same as ours, but obtained with a lower accuracy.

Table 11. Optimal results when solving VC10 in FLP3 form with PECP using different $ϵ_{g}$ .

Display Table

As the parameters, for the considered instances may deviate in different papers we report in the appendix the values of all parameters and restrictions that we have used for the instances VC10, BA12, BA14 and BA14B. From the appendix it can also be observed that the only difference between the instances BA14 and BA14B is the restrictions on the width and length of the 14th department which are restricted to $w_{min} = l_{min} = 1$ in the problem BA14B while they are not resticted at all in the problem BA14. We report, for comparison, our results for both these cases, because this problem has been solved with and without restrictions on the width and length of the department 14 in several papers, without mentioning which formulation has been used.

6. Conclusions

In this paper it was shown that projected cutting planes can be an attractive alternative in optimization algorithms using cutting planes and especially in the extended cutting plane algorithm. The algorithm was proven to converge to a global optimum for both smooth and nonsmooth convex MINLP problems. The computational efficiency of the algorithm was, further, demonstrated by solving some very difficult facility layout problems to global optimality. To the authors knowledge this is the first time when it has been shown that the considered facility layout problems, VC10 and BA12, have been solved to global optimality.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Correction Statement

This article has been republished with minor changes. These changes do not impact the academic content of the article.

References

Duran M, Grossmann IE. An outer approximation algorithm for a class of mixed-integer nonlinear programs. Math Program. 1986;36:307–339.
Web of Science ®Google Scholar
MINLP Library 2. Mixed-integer nonlinear programming library; 2020. Available from: http://www.minlplib.org/
Google Scholar
Bussieck MR, Vigerske S. MINLP solver software. In Wiley encyclopedia of operations research and management science;. 2010. doi:10.1002/9780470400531.eorms0527.
Google Scholar
Bonami P, Kilinic M, Linderoth J. Algorithms and software for convex mixed integer nonlinear programs. In: Lee J, Leyffer S, editors. Mixed integer nonlinear programming. The IMA volumes in mathematics and its applications, Vol. 154. New York: Springer; 2012. p. 1–39.
Google Scholar
Westerlund T, Pettersson F. An extended cutting plane method for solving convex MINLP problems. Comput Chem Engin Sup. 1995;19:131–136.
Web of Science ®Google Scholar
Kelley JE. The cutting plane method for solving convex programs. J SIAM. 1960;8:703–712.
Web of Science ®Google Scholar
Westerlund T, Skrifvars H, Harjunkoski I, et al. An extended cutting plane method for a class of non-convex MINLP problems. Comput Chem Eng. 1998;22:357–365.
Web of Science ®Google Scholar
Westerlund T, Pörn R. Solving pseudo-convex mixed integer optimization problems by cutting plane techniques. Optim Eng. 2002;3:253–280.
Web of Science ®Google Scholar
Eronen VP, Mäkelä MM, Westerlund T. On the generalization of ECP and OA methods to nonsmooth convex MINLP problems. Optimization. 2014;63:1057–1073.
Web of Science ®Google Scholar
Eronen VP, Mäkelä MM, Westerlund T. Extended cutting plane method for a class of nonsmooth nonconvex MINLP problems. Optimization. 2015;64:641–661.
Web of Science ®Google Scholar
Eronen VP, Kronqvist J, Westerlund T, et al. Method for solving generalized convex nonsmooth mixed-integer nonlinear programming problems. J Global Optim. 2017;69:443–459.
Web of Science ®Google Scholar
Westerlund T, Eronen V-P, Mäkelä MM. On solving generalized convex MINLP problems using supporting hyperplane techniques. J Global Optim. 2018;71:987–1011.
Web of Science ®Google Scholar
Horst R, Tuy H. Global optimization. 3rd ed. Heidelberg: Springer-Verlag; 1996.
Google Scholar
Pörn R. Mixed integer non-linear programming: convexification techniques and algorithm development [PhD thesis]. Abo Akademi University; 2000.
Google Scholar
Censor Y, Lent A. Cyclic subgradient projections. Math Program. 1982;24:233–235.
Web of Science ®Google Scholar
Bauschke HH, Borwein JM. On projection algorithms for solving convex feasibilty problems. SIAM Rev. 1996;38:367–426.
Web of Science ®Google Scholar
Bauschke HH, Combettes PL. A weak-to-strong convergence principle for Fejér-monotone methods in Hilbert spaces. Math Oper Res. 2001;26:248–264.
Web of Science ®Google Scholar
D'Antonio GH, Frangioni F. Convergence analysis of deflated conditional approximate subgradient methods. SIAM J Optim. 2009;20:357–386.
Web of Science ®Google Scholar
IBM ILOG Optimization Studio. CPLEX User's Manual, version 12.7. IBM; 2017.
Google Scholar
Gurobi. Gurobi optimizer reference manual. version 9.0. Gurobi Optimization, LCC; 2020.
Google Scholar
Emet S, Westerlund T. Comparisons of solving a chromatographic separation problem using MINLP methods. Comput Chem Eng. 2004;28:673–682.
Web of Science ®Google Scholar
Serrano F, Schwarz R, Gleixner A. On the relation between the extended supporting hyperplane algorithm and Kelley's cutting plane algorithm. J Global Optim. 2020;78:161–179.
Web of Science ®Google Scholar
Kronqvist J, Lundell A, Westerlund T. The extended supporting hyperplane algorithm for convex mixed-integer nonlinear programming. J Global Optim. 2016;64:249–272.
Web of Science ®Google Scholar
Veinott Jr AF. The supporting hyperplane method for unimodal programming. Oper Res. 1967;15:147–152.
Web of Science ®Google Scholar
Westerlund T. User's guide for GAECP, version 5.537. An interactive solver for generalized convex MINLP-problems using cutting plane and supporting hyperplane teqniques. Abo Akademi University. 2017. Available from: http://www.abo.fi/twesterl/GAECPDocumentation.pdf
Google Scholar
Mäkelä MM, Neittaanmäki P. Nonsmooth optimization: analysis and algorithms with applications to optimal control. Singapore: World Scientific Publishing Co.; 1992.
Google Scholar
Bagirov A, Karmitsa N, Mäkelä MM. Introduction to nonsmooth optimization. Heidelberg: Springer Cham; 2014. (Theory, practice and software).
Google Scholar
Fletcher R, Leyffer S. Solving mixed integer nonlinear programs by outer approximation. Math Program. 1994;66:327–349.
Web of Science ®Google Scholar
Castillo I, Westerlund J, Emet S, et al. Optimization of block layout design problems with unequal areas: a comparison of MILP and MINLP optimization methods. Comput Chem Engin. 2005;30:54–69.
Web of Science ®Google Scholar
Scholz D, Petrick A, Domschke W. STaTS: A slicing tree and tabu search based heuristic for unequal area facility layout. Eur J Oper Res. 2009;197:166–178.
Web of Science ®Google Scholar
Kulturel-Konak S, Konak A. Unequal area flexible bay facility layout using ant colony optimization. Int J Product Optim. 2011;49:1877–1902.
Web of Science ®Google Scholar
Goncalves JF, Resende MGC. A biased random-key genetic algorithm for unequal area facility layout problem. Eur J Oper Res. 2015;246:86–107.
Web of Science ®Google Scholar
Konak A, Kulturel-Konak S, Norman BA, et al. A new mixed integer programming formulation for facility design using flexible bays. Oper Res Lett. 2006;34:660–672.
Web of Science ®Google Scholar
Kronqvist J, Lundell A, Westerlund T. Reformulations for utilizing separability when solving convex MINLP problems. J Global Optim. 2018;71:571–592.
Web of Science ®Google Scholar
Zarali F, Yazagan HR, Delice Y. A new solution method of ant colony-based logistic center area layout problem. Sãdhanã. 2018;83:1–17.
Google Scholar
Bazaraa MS. Computerized layout design; a branch and bound approach. AIIE Trans. 1975;7:432–438.
Google Scholar
van Camp DJ, Carter MW, Vanelli A. A nonlinear optimization approach for solving facility layout problems. Eur J Oper Res. 1991;57:174–189.
Web of Science ®Google Scholar
Castillo I, Westerlund T. An epsilon-accurate model for optimal unequal-area block layout design. Comput Oper Res. 2005;32:429–447.
Web of Science ®Google Scholar
Liu Q, Meller RD. A sequence-pair representation and MIP-model-based heuristic for the facility layout problem with rectangular departments. IIE Trans. 2007;39:377–394.
Web of Science ®Google Scholar

Appendix. Initial data to facility layout problems

Using projected cutting planes in the extended cutting plane method

Abstract

1. Introduction

2. The MINLP problem