Integer programming

From HandWiki
Short description: A mathematical optimization problem restricted to integers

An integer programming problem is a mathematical optimization or feasibility program in which some or all of the variables are restricted to be integers. In many settings the term refers to integer linear programming (ILP), in which the objective function and the constraints (other than the integer constraints) are linear.

Integer programming is NP-complete. In particular, the special case of 0–1 integer linear programming, in which unknowns are binary, and only the restrictions must be satisfied, is one of Karp's 21 NP-complete problems.[1]

If some decision variables are not discrete, the problem is known as a mixed-integer programming problem.[2]

Canonical and standard form for ILPs

In integer linear programming, the canonical form is distinct from the standard form. An integer linear program in canonical form is expressed thus (note that it is the [math]\displaystyle{ \mathbf{x} }[/math] vector which is to be decided):[3]

[math]\displaystyle{ \begin{align} & \underset{\mathbf{x} \in \mathbb{Z}^n}{\text{maximize}} && \mathbf{c}^\mathrm{T} \mathbf{x}\\ & \text{subject to} && A \mathbf{x} \le \mathbf{b}, \\ & && \mathbf{x} \ge \mathbf{0} \end{align} }[/math]

and an ILP in standard form is expressed as

[math]\displaystyle{ \begin{align} & \underset{\mathbf{x} \in \mathbb{Z}^n}{\text{maximize}} && \mathbf{c}^\mathrm{T} \mathbf{x}\\ & \text{subject to} && A \mathbf{x} + \mathbf{s} = \mathbf{b}, \\ & && \mathbf{s} \ge \mathbf{0}, \\ & && \mathbf{x} \ge \mathbf{0}, \end{align} }[/math]

where [math]\displaystyle{ \mathbf{c}\in \mathbb{R}^n, \mathbf{b} \in \mathbb{R}^m }[/math] are vectors and [math]\displaystyle{ A \in \mathbb{R}^{m \times n} }[/math] is a matrix. As with linear programs, ILPs not in standard form can be converted to standard form by eliminating inequalities, introducing slack variables ([math]\displaystyle{ \mathbf{s} }[/math]) and replacing variables that are not sign-constrained with the difference of two sign-constrained variables.

Example

IP polytope with LP relaxation

The plot on the right shows the following problem.

[math]\displaystyle{ \begin{align} \underset{x,y \in \mathbb{Z}}{\text{maximize}} \quad & y \\ \text{subject to} \quad &-x +y \leq 1 \\ &3x + 2y \leq 12 \\ &2x + 3y \leq 12 \\ & x,y \ge 0 \end{align} }[/math]

The feasible integer points are shown in red, and the red dashed lines indicate their convex hull, which is the smallest convex polyhedron that contains all of these points. The blue lines together with the coordinate axes define the polyhedron of the LP relaxation, which is given by the inequalities without the integrality constraint. The goal of the optimization is to move the black dashed line as far upward while still touching the polyhedron. The optimal solutions of the integer problem are the points [math]\displaystyle{ (1,2) }[/math] and [math]\displaystyle{ (2,2) }[/math] that both have an objective value of 2. The unique optimum of the relaxation is [math]\displaystyle{ (1.8,2.8) }[/math] with objective value of 2.8. If the solution of the relaxation is rounded to the nearest integers, it is not feasible for the ILP.

Proof of NP-hardness

The following is a reduction from minimum vertex cover to integer programming that will serve as the proof of NP-hardness.

Let [math]\displaystyle{ G = (V,E) }[/math] be an undirected graph. Define a linear program as follows:

[math]\displaystyle{ \begin{align} \min \sum_{v \in V} y_v \\ y_v + y_u & \ge 1 && \forall uv \in E\\ y_v & \ge 0 && \forall v \in V\\ y_v & \in \mathbb{Z} && \forall v \in V \end{align} }[/math]

Given that the constraints limit [math]\displaystyle{ y_v }[/math] to either 0 or 1, any feasible solution to the integer program is a subset of vertices. The first constraint implies that at least one end point of every edge is included in this subset. Therefore, the solution describes a vertex cover. Additionally given some vertex cover C, [math]\displaystyle{ y_v }[/math] can be set to 1 for any [math]\displaystyle{ v\in C }[/math] and to 0 for any [math]\displaystyle{ v\not\in C }[/math] thus giving us a feasible solution to the integer program. Thus we can conclude that if we minimize the sum of [math]\displaystyle{ y_v }[/math] we have also found the minimum vertex cover.[4]

Variants

Mixed-integer linear programming (MILP) involves problems in which only some of the variables, [math]\displaystyle{ x_i }[/math], are constrained to be integers, while other variables are allowed to be non-integers.

Zero–one linear programming (or binary integer programming) involves problems in which the variables are restricted to be either 0 or 1. Any bounded integer variable can be expressed as a combination of binary variables.[5] For example, given an integer variable, [math]\displaystyle{ 0\le x\le U }[/math], the variable can be expressed using [math]\displaystyle{ \lfloor \log_2U\rfloor+1 }[/math] binary variables:

[math]\displaystyle{ x = x_1+2x_2+4x_3+\cdots+2^{\lfloor \log_2U\rfloor}x_{\lfloor \log_2U\rfloor+1}. }[/math]

Applications

There are two main reasons for using integer variables when modeling problems as a linear program:

  1. The integer variables represent quantities that can only be integer. For example, it is not possible to build 3.7 cars.
  2. The integer variables represent decisions (e.g. whether to include an edge in a graph) and so should only take on the value 0 or 1.

These considerations occur frequently in practice and so integer linear programming can be used in many applications areas, some of which are briefly described below.

Production planning

Mixed-integer programming has many applications in industrial productions, including job-shop modelling. One important example happens in agricultural production planning and involves determining production yield for several crops that can share resources (e.g. land, labor, capital, seeds, fertilizer, etc.). A possible objective is to maximize the total production, without exceeding the available resources. In some cases, this can be expressed in terms of a linear program, but the variables must be constrained to be integer.

Scheduling

These problems involve service and vehicle scheduling in transportation networks. For example, a problem may involve assigning buses or subways to individual routes so that a timetable can be met, and also to equip them with drivers. Here binary decision variables indicate whether a bus or subway is assigned to a route and whether a driver is assigned to a particular train or subway. The zero–one programming technique has been successfully applied to solve a project selection problem in which projects are mutually exclusive and/or technologically interdependent. It is used in a special case of integer programming, in which all the decision variables are integers. Variable can assume only the values zero or one.

Territorial partitioning

Territorial partitioning or districting problems consist of partitioning a geographical region into districts in order to plan some operations while considering different criteria or constraints. Some requirements for this problem are: contiguity, compactness, balance or equity, respect of natural boundaries, and socio-economic homogeneity. Some applications for this type of problem include: political districting, school districting, health services districting and waste management districting.

Telecommunications networks

The goal of these problems is to design a network of lines to install so that a predefined set of communication requirements are met and the total cost of the network is minimal.[6] This requires optimizing both the topology of the network along with setting the capacities of the various lines. In many cases, the capacities are constrained to be integer quantities. Usually there are, depending on the technology used, additional restrictions that can be modeled as linear inequalities with integer or binary variables.

Cellular networks

The task of frequency planning in GSM mobile networks involves distributing available frequencies across the antennas so that users can be served and interference is minimized between the antennas.[7] This problem can be formulated as an integer linear program in which binary variables indicate whether a frequency is assigned to an antenna.

Other applications

Algorithms

The naive way to solve an ILP is to simply remove the constraint that x is integer, solve the corresponding LP (called the LP relaxation of the ILP), and then round the entries of the solution to the LP relaxation. But, not only may this solution not be optimal, it may not even be feasible; that is, it may violate some constraint.

Using total unimodularity

While in general the solution to LP relaxation will not be guaranteed to be integral, if the ILP has the form [math]\displaystyle{ \max\mathbf{c}^\mathrm{T} \mathbf{x} }[/math] such that [math]\displaystyle{ A\mathbf{x} = \mathbf{b} }[/math] where [math]\displaystyle{ A }[/math] and [math]\displaystyle{ \mathbf{b} }[/math] have all integer entries and [math]\displaystyle{ A }[/math] is totally unimodular, then every basic feasible solution is integral. Consequently, the solution returned by the simplex algorithm is guaranteed to be integral. To show that every basic feasible solution is integral, let [math]\displaystyle{ \mathbf{x} }[/math] be an arbitrary basic feasible solution . Since [math]\displaystyle{ \mathbf{x} }[/math] is feasible, we know that [math]\displaystyle{ A\mathbf{x}=\mathbf{b} }[/math]. Let [math]\displaystyle{ \mathbf{x}_0=[x_{n_1},x_{n_2},\cdots,x_{n_j}] }[/math] be the elements corresponding to the basis columns for the basic solution [math]\displaystyle{ \mathbf{x} }[/math]. By definition of a basis, there is some square submatrix [math]\displaystyle{ B }[/math] of [math]\displaystyle{ A }[/math] with linearly independent columns such that [math]\displaystyle{ B\mathbf{x}_0=\mathbf{b} }[/math].

Since the columns of [math]\displaystyle{ B }[/math] are linearly independent and [math]\displaystyle{ B }[/math] is square, [math]\displaystyle{ B }[/math] is nonsingular, and therefore by assumption, [math]\displaystyle{ B }[/math] is unimodular and so [math]\displaystyle{ \det(B)=\pm1 }[/math]. Also, since [math]\displaystyle{ B }[/math] is nonsingular, it is invertible and therefore [math]\displaystyle{ \mathbf{x}_0=B^{-1}\mathbf{b} }[/math]. By definition, [math]\displaystyle{ B^{-1}=\frac{B^\mathrm{adj}}{\det(B)}=\pm B^\mathrm{adj} }[/math]. Here [math]\displaystyle{ B^\mathrm{adj} }[/math] denotes the adjugate of [math]\displaystyle{ B }[/math] and is integral because [math]\displaystyle{ B }[/math] is integral. Therefore, [math]\displaystyle{ \begin{align} &\Rightarrow B^{-1}=\pm B^\mathrm{adj} \text{ is integral.} \\ &\Rightarrow \mathbf{x}_0=B^{-1}b \text{ is integral.} \\ &\Rightarrow \text{Every basic feasible solution is integral.} \end{align} }[/math] Thus, if the matrix [math]\displaystyle{ A }[/math] of an ILP is totally unimodular, rather than use an ILP algorithm, the simplex method can be used to solve the LP relaxation and the solution will be integer.

Exact algorithms

When the matrix [math]\displaystyle{ A }[/math] is not totally unimodular, there are a variety of algorithms that can be used to solve integer linear programs exactly. One class of algorithms are cutting plane methods, which work by solving the LP relaxation and then adding linear constraints that drive the solution towards being integer without excluding any integer feasible points.

Another class of algorithms are variants of the branch and bound method. For example, the branch and cut method that combines both branch and bound and cutting plane methods. Branch and bound algorithms have a number of advantages over algorithms that only use cutting planes. One advantage is that the algorithms can be terminated early and as long as at least one integral solution has been found, a feasible, although not necessarily optimal, solution can be returned. Further, the solutions of the LP relaxations can be used to provide a worst-case estimate of how far from optimality the returned solution is. Finally, branch and bound methods can be used to return multiple optimal solutions.

Exact algorithms for a small number of variables

Suppose [math]\displaystyle{ A }[/math] is an m-by-n integer matrix and [math]\displaystyle{ \mathbf{b} }[/math] is an m-by-1 integer vector. We focus on the feasibility problem, which is to decide whether there exists an n-by-1 vector [math]\displaystyle{ \mathbf{x} }[/math] satisfying [math]\displaystyle{ A \mathbf{x} \le \mathbf{b} }[/math].

Let V be the maximum absolute value of the coefficients in [math]\displaystyle{ A }[/math] and [math]\displaystyle{ \mathbf{b} }[/math]. If n (the number of variables) is a fixed constant, then the feasibility problem can be solved in time polynomial in m and log V. This is trivial for the case n=1. The case n=2 was solved in 1981 by Herbert Scarf.[13] The general case was solved in 1983 by Hendrik Lenstra, combining ideas by László Lovász and Peter van Emde Boas.[14] Doignon's theorem asserts that an integer program is feasible whenever every subset of [math]\displaystyle{ 2^n }[/math] constraints is feasible; a method combining this result with algorithms for LP-type problems can be used to solve integer programs in time that is linear in [math]\displaystyle{ m }[/math] and fixed-parameter tractable (but possibly doubly exponential) in [math]\displaystyle{ n }[/math], with no dependence on [math]\displaystyle{ V }[/math].[15]

In the special case of 0-1 ILP, Lenstra's algorithm is equivalent to complete enumeration: the number of all possible solutions is fixed (2n), and checking the feasibility of each solution can be done in time poly(m, log V). In the general case, where each variable can be an arbitrary integer, complete enumeration is impossible. Here, Lenstra's algorithm uses ideas from Geometry of numbers. It transforms the original problem into an equivalent one with the following property: either the existence of a solution [math]\displaystyle{ \mathbf{x} }[/math] is obvious, or the value of [math]\displaystyle{ x_n }[/math] (the n-th variable) belongs to an interval whose length is bounded by a function of n. In the latter case, the problem is reduced to a bounded number of lower-dimensional problems. The run-time complexity of the algorithm has been improved in several steps:

  • The original algorithm of Lenstra[14] had run-time [math]\displaystyle{ 2^{O(n^3)}\cdot (m\cdot \log V)^{O(1)} }[/math].
  • Kannan[16] presented an improved algorithm with run-time [math]\displaystyle{ n^{O(n)}\cdot (m\cdot \log V)^{O(1)} }[/math].[17]
  • Frank and Tardos[18] presented an improved algorithm with run-time [math]\displaystyle{ n^{2.5 n} \cdot 2^{O(n)} \cdot (m\cdot \log V)^{O(1)} }[/math].[19][20]:Prop.8
  • Dadush[21] presented an improved algorithm with run-time [math]\displaystyle{ n^n \cdot 2^{O(n)} \cdot (m \cdot \log V)^{O(1)} }[/math].
  • Reis and Rothvoss[22] presented an improved algorithm with run-time [math]\displaystyle{ (\log n)^{O(n)} \cdot (m\cdot \log V)^{O(1)} }[/math].

Heuristic methods

Since integer linear programming is NP-hard, many problem instances are intractable and so heuristic methods must be used instead. For example, tabu search can be used to search for solutions to ILPs.[23] To use tabu search to solve ILPs, moves can be defined as incrementing or decrementing an integer constrained variable of a feasible solution while keeping all other integer-constrained variables constant. The unrestricted variables are then solved for. Short-term memory can consist of previously tried solutions while medium-term memory can consist of values for the integer constrained variables that have resulted in high objective values (assuming the ILP is a maximization problem). Finally, long-term memory can guide the search towards integer values that have not previously been tried.

Other heuristic methods that can be applied to ILPs include

There are also a variety of other problem-specific heuristics, such as the k-opt heuristic for the traveling salesman problem. A disadvantage of heuristic methods is that if they fail to find a solution, it cannot be determined whether it is because there is no feasible solution or whether the algorithm simply was unable to find one. Further, it is usually impossible to quantify how close to optimal a solution returned by these methods are.

Sparse integer programming

It is often the case that the matrix [math]\displaystyle{ A }[/math] that defines the integer program is sparse. In particular, this occurs when the matrix has a block structure, which is the case in many applications. The sparsity of the matrix can be measured as follows. The graph of [math]\displaystyle{ A }[/math] has vertices corresponding to columns of [math]\displaystyle{ A }[/math], and two columns form an edge if [math]\displaystyle{ A }[/math] has a row where both columns have nonzero entries. Equivalently, the vertices correspond to variables, and two variables form an edge if they share an inequality. The sparsity measure [math]\displaystyle{ d }[/math] of [math]\displaystyle{ A }[/math] is the minimum of the tree-depth of the graph of [math]\displaystyle{ A }[/math] and the tree-depth of the graph of the transpose of [math]\displaystyle{ A }[/math]. Let [math]\displaystyle{ a }[/math] be the numeric measure of [math]\displaystyle{ A }[/math] defined as the maximum absolute value of any entry of [math]\displaystyle{ A }[/math]. Let [math]\displaystyle{ n }[/math] be the number of variables of the integer program. Then it was shown in 2018[24] that integer programming can be solved in strongly polynomial and fixed-parameter tractable time parameterized by [math]\displaystyle{ a }[/math] and [math]\displaystyle{ d }[/math]. That is, for some computable function [math]\displaystyle{ f }[/math] and some constant [math]\displaystyle{ k }[/math], integer programming can be solved in time [math]\displaystyle{ f(a,d)n^k }[/math]. In particular, the time is independent of the right-hand side [math]\displaystyle{ b }[/math] and objective function [math]\displaystyle{ c }[/math]. Moreover, in contrast to the classical result of Lenstra, where the number [math]\displaystyle{ n }[/math] of variables is a parameter, here the number [math]\displaystyle{ n }[/math] of variables is a variable part of the input.

See also

References

  1. Karp, Richard M. (1972). "Reducibility among Combinatorial Problems". in R. E. Miller. Complexity of Computer Computations. New York: Plenum. pp. 85–103. doi:10.1007/978-1-4684-2001-2_9. ISBN 978-1-4684-2003-6. http://cgi.di.uoa.gr/~sgk/teaching/grad/handouts/karp.pdf. 
  2. "Mixed-Integer Linear Programming (MILP): Model Formulation". http://macc.mcmaster.ca/maccfiles/chachuatnotes/07-MILP-I_handout.pdf. 
  3. Papadimitriou, C. H.; Steiglitz, K. (1998). Combinatorial optimization: algorithms and complexity. Mineola, NY: Dover. ISBN 0486402584. 
  4. Erickson, J. (2015). "Integer Programming Reduction". https://courses.engr.illinois.edu/cs498dl1/sp2015/solutions/hw10sol.pdf. 
  5. Williams, H.P. (2009). Logic and integer programming. International Series in Operations Research & Management Science. 130. ISBN 978-0-387-92280-5. 
  6. Borndörfer, R.; Grötschel, M. (2012). "Designing telecommunication networks by integer programming". http://www.zib.de/groetschel/teaching/SS2012/120503Vorlesung-DesigningTelcomNetworks-reduced.pdf. 
  7. Sharma, Deepak (2010). "Frequency Planning". http://www.slideshare.net/deepakecrbs/gsm-frequency-planning. 
  8. Morais, Hugo; Kádár, Péter; Faria, Pedro; Vale, Zita A.; Khodr, H. M. (2010-01-01). "Optimal scheduling of a renewable micro-grid in an isolated load area using mixed-integer linear programming" (in en). Renewable Energy 35 (1): 151–156. doi:10.1016/j.renene.2009.02.031. ISSN 0960-1481. http://www.sciencedirect.com/science/article/pii/S0960148109001001. 
  9. Omu, Akomeno; Choudhary, Ruchi; Boies, Adam (2013-10-01). "Distributed energy resource system optimisation using mixed integer linear programming" (in en). Energy Policy 61: 249–266. doi:10.1016/j.enpol.2013.05.009. ISSN 0301-4215. http://www.sciencedirect.com/science/article/pii/S0301421513003418. 
  10. Schouwenaars, T.; Valenti, M.; Feron, E.; How, J. (2005). "Implementation and Flight Test Results of MILP-based UAV Guidance". 2005 IEEE Aerospace Conference. pp. 1–13. doi:10.1109/AERO.2005.1559600. ISBN 0-7803-8870-4. 
  11. Radmanesh, Mohammadreza; Kumar, Manish (2016-03-01). "Flight formation of UAVs in presence of moving obstacles using fast-dynamic mixed integer linear programming" (in en). Aerospace Science and Technology 50: 149–160. doi:10.1016/j.ast.2015.12.021. ISSN 1270-9638. 
  12. Bast, Hannah; Brosi, Patrick; Storandt, Sabine (2017-10-05). "Efficient Generation of Geographically Accurate Transit Maps". arXiv:1710.02226 [cs.CG].
  13. Scarf, Herbert E. (1981). "Production Sets with Indivisibilities, Part I: Generalities". Econometrica 49 (1): 1–32. doi:10.2307/1911124. ISSN 0012-9682. https://www.jstor.org/stable/1911124. 
  14. 14.0 14.1 Lenstra, H. W. (1983-11-01). "Integer Programming with a Fixed Number of Variables". Mathematics of Operations Research 8 (4): 538–548. doi:10.1287/moor.8.4.538. ISSN 0364-765X. https://pubsonline.informs.org/doi/abs/10.1287/moor.8.4.538. 
  15. "Proceedings of the AMS Special Session on Algebraic and Geometric Methods in Applied Discrete Mathematics held in San Antonio, TX, January 11, 2015". 685. Providence, Rhode Island: American Mathematical Society. 2017. pp. 55–95. doi:10.1090/conm/685. ISBN 9781470423216. 
  16. Kannan, Ravi (1987-08-01). "Minkowski's Convex Body Theorem and Integer Programming". Mathematics of Operations Research 12 (3): 415–440. doi:10.1287/moor.12.3.415. ISSN 0364-765X. https://pubsonline.informs.org/doi/abs/10.1287/moor.12.3.415. 
  17. Goemans, Michel X.; Rothvoss, Thomas (2020-11-07). "Polynomiality for Bin Packing with a Constant Number of Item Types". Journal of the ACM 67 (6): 38:1–38:21. doi:10.1145/3421750. ISSN 0004-5411. 
  18. Frank, András; Tardos, Éva (1987-03-01). "An application of simultaneous diophantine approximation in combinatorial optimization" (in en). Combinatorica 7 (1): 49–65. doi:10.1007/BF02579200. ISSN 1439-6912. https://doi.org/10.1007/BF02579200. 
  19. Bliem, Bernhard; Bredereck, Robert; Niedermeier, Rolf (2016-07-09). "Complexity of efficient and envy-free resource allocation: few agents, resources, or utility levels". Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence. IJCAI'16 (New York, New York, USA: AAAI Press): 102–108. ISBN 978-1-57735-770-4. https://dl.acm.org/doi/abs/10.5555/3060621.3060636. 
  20. Bredereck, Robert; Kaczmarczyk, Andrzej; Knop, Dušan; Niedermeier, Rolf (2019-06-17). "High-Multiplicity Fair Allocation: Lenstra Empowered by N-fold Integer Programming". Proceedings of the 2019 ACM Conference on Economics and Computation. EC '19. Phoenix, AZ, USA: Association for Computing Machinery. pp. 505–523. doi:10.1145/3328526.3329649. ISBN 978-1-4503-6792-9. https://doi.org/10.1145/3328526.3329649. 
  21. Dadush, Daniel (2012-06-14). "Integer Programming, Lattice Algorithms, and Deterministic Volume Estimation.
  22. Reis, Victor; Rothvoss, Thomas (2023-03-26). "The Subspace Flatness Conjecture and Faster Integer Programming".
  23. Glover, F. (1989). "Tabu search-Part II". ORSA Journal on Computing 1 (3): 4–32. doi:10.1287/ijoc.2.1.4. 
  24. Koutecký, Martin; Levin, Asaf; Onn, Shmuel (2018) (in en). A Parameterized Strongly Polynomial Algorithm for Block Structured Integer Programs. Michael Wagner. pp. 14 pages. doi:10.4230/LIPICS.ICALP.2018.85. http://drops.dagstuhl.de/opus/volltexte/2018/9089/. 

Further reading

  • George L. Nemhauser; Laurence A. Wolsey (1988). Integer and combinatorial optimization. Wiley. ISBN 978-0-471-82819-8. 
  • Alexander Schrijver (1998). Theory of linear and integer programming. John Wiley and Sons. ISBN 978-0-471-98232-6. 
  • Laurence A. Wolsey (1998). Integer programming. Wiley. ISBN 978-0-471-28366-9. 
  • Dimitris Bertsimas; Robert Weismantel (2005). Optimization over integers. Dynamic Ideas. ISBN 978-0-9759146-2-5. 
  • John K. Karlof (2006). Integer programming: theory and practice. CRC Press. ISBN 978-0-8493-1914-3. 
  • H. Paul Williams (2009). Logic and Integer Programming. Springer. ISBN 978-0-387-92279-9. 
  • 50 Years of Integer Programming 1958-2008: From the Early Years to the State-of-the-Art. Springer. 2009. ISBN 978-3-540-68274-5. 
  • Der-San Chen; Robert G. Batson; Yu Dang (2010). Applied Integer Programming: Modeling and Solution. John Wiley and Sons. ISBN 978-0-470-37306-4. 
  • Gerard Sierksma; Yori Zwols (2015). Linear and Integer Optimization: Theory and Practice. CRC Press. ISBN 978-1-498-71016-9. 

External links