Maximum cardinality matching

From HandWiki
Short description: Graph theory problem: find a matching containing the most edges

Maximum cardinality matching is a fundamental problem in graph theory.[1] We are given a graph G, and the goal is to find a matching containing as many edges as possible; that is, a maximum cardinality subset of the edges such that each vertex is adjacent to at most one edge of the subset. As each edge will cover exactly two vertices, this problem is equivalent to the task of finding a matching that covers as many vertices as possible.

An important special case of the maximum cardinality matching problem is when G is a bipartite graph, whose vertices V are partitioned between left vertices in X and right vertices in Y, and edges in E always connect a left vertex to a right vertex. In this case, the problem can be efficiently solved with simpler algorithms than in the general case.

Algorithms for bipartite graphs

Flow-based algorithm

The simplest way to compute a maximum cardinality matching is to follow the Ford–Fulkerson algorithm. This algorithm solves the more general problem of computing the maximum flow. A bipartite graph (X + Y, E) can be converted to a flow network as follows.

  • Add a source vertex s; add an edge from s to each vertex in X.
  • Add a sink vertex t; add an edge from each vertex in Y to t.
  • Assign a capacity of 1 to each edge.

Since each edge in the network has integral capacity, there exists a maximum flow where all flows are integers; these integers must be either 0 or 1 since the all capacities are 1. Each integral flow defines a matching in which an edge is in the matching if and only if its flow is 1. It is a matching because:

  • The incoming flow into each vertex in X is at most 1, so the outgoing flow is at most 1 too, so at most one edge adjacent to each vertex in X is present.
  • The outgoing flow from each vertex in Y is at most 1, so the incoming flow is at most 1 too, so at most one edge adjacent to each vertex in Y is present.

The Ford–Fulkerson algorithm proceeds by repeatedly finding an augmenting path from some xX to some yY and updating the matching M by taking the symmetric difference of that path with M (assuming such a path exists). As each path can be found in O(E) time, the running time is O(VE), and the maximum matching consists of the edges of E that carry flow from X to Y.

Advanced algorithms

An improvement to this algorithm is given by the more elaborate Hopcroft–Karp algorithm, which searches for multiple augmenting paths simultaneously. This algorithm runs in [math]\displaystyle{ O(\sqrt{V}E) }[/math] time.

The algorithm of Chandran and Hochbaum[2] for bipartite graphs runs in time that depends on the size of the maximum matching k, which for |X| < |Y| is

[math]\displaystyle{ O\left(\min\{|X|k,E\}+ \sqrt{k} \min \{k^2,E\}\right). }[/math]

Using boolean operations on words of size [math]\displaystyle{ \lambda }[/math] the complexity is further improved to[2]

[math]\displaystyle{ O\left(\min \left\{|X|k, \frac{|X||Y|}{\lambda}, E\right\} + k^2 + \frac{k^{2.5}}{\lambda}\right). }[/math]

More efficient algorithms exist for special kinds of bipartite graphs:

  • For sparse bipartite graphs, the maximum matching problem can be solved in [math]\displaystyle{ \tilde{O}(E^{10/7}) }[/math] with Madry's algorithm based on electric flows.[3]
  • For planar bipartite graphs, the problem can be solved in time O(n log3 n) where n is the number of vertices, by reducing the problem to maximum flow with multiple sources and sinks.[4]

Algorithms for arbitrary graphs

The blossom algorithm finds a maximum-cardinality matching in general (not necessarily bipartite) graphs. It runs in time [math]\displaystyle{ O(|V|^2 \cdot |E|) }[/math]. A better performance of O(VE) for general graphs, matching the performance of the Hopcroft–Karp algorithm on bipartite graphs, can be achieved with the much more complicated algorithm of Micali and Vazirani.[5] The same bound was achieved by an algorithm by Blum (de)[6] and an algorithm by Gabow and Tarjan.[7]

An alternative approach uses randomization and is based on the fast matrix multiplication algorithm. This gives a randomized algorithm for general graphs with complexity [math]\displaystyle{ O(V^{2.376}) }[/math].[8] This is better in theory for sufficiently dense graphs, but in practice the algorithm is slower.[2]

Other algorithms for the task are reviewed by Duan and Pettie[9] (see Table I). In terms of approximation algorithms, they also point out that the blossom algorithm and the algorithms by Micali and Vazirani can be seen as approximation algorithms running in linear time for any fixed error bound.

Applications and generalizations

References

  1. West, Douglas Brent (1999), Introduction to Graph Theory (2nd ed.), Prentice Hall, Chapter 3, ISBN 0-13-014400-2 
  2. 2.0 2.1 2.2 Chandran, Bala G.; Hochbaum, Dorit S. (2011), Practical and theoretical improvements for bipartite matching using the pseudoflow algorithm, Bibcode2011arXiv1105.1569C, "the theoretically efficient algorithms listed above tend to perform poorly in practice" .
  3. Madry, A (2013), "Navigating Central Path with Electrical Flows: From Flows to Matchings, and Back", Foundations of Computer Science (FOCS), 2013 IEEE 54th Annual Symposium on, pp. 253–262, Bibcode2013arXiv1307.2205M 
  4. Borradaile, Glencora; Klein, Philip N.; Mozes, Shay; Nussbaum, Yahav; Wulff–Nilsen, Christian (2017), "Multiple-source multiple-sink maximum flow in directed planar graphs in near-linear time", SIAM Journal on Computing 46 (4): 1280–1303, doi:10.1137/15M1042929 
  5. Micali, S.; Vazirani, V. V. (1980), "An [math]\displaystyle{ \scriptstyle O(\sqrt{|V|}\cdot|E|) }[/math] algorithm for finding maximum matching in general graphs", Proc. 21st IEEE Symp. Foundations of Computer Science, pp. 17–27, doi:10.1109/SFCS.1980.12 .
  6. Blum, Norbert (1990). Paterson, Michael S.. ed. "A new approach to maximum matching in general graphs" (in en). Automata, Languages and Programming. Lecture Notes in Computer Science (Berlin, Heidelberg: Springer) 443: 586–597. doi:10.1007/BFb0032060. ISBN 978-3-540-47159-2. https://web.eecs.umich.edu/~pettie/matching/Blum-matching-ICALP90.pdf. 
  7. Gabow, Harold N; Tarjan, Robert E (1991-10-01). "Faster scaling algorithms for general graph matching problems" (in EN). Journal of the ACM 38 (4): 815–853. doi:10.1145/115234.115366. https://web.eecs.umich.edu/~pettie/matching/Gabow-Tarjan-scaling-general-graph-matching.pdf. 
  8. Mucha, M.; Sankowski, P. (2004), "Maximum Matchings via Gaussian Elimination", Proc. 45th IEEE Symp. Foundations of Computer Science, pp. 248–255, http://www.mimuw.edu.pl/~mucha/pub/mucha_sankowski_focs04.pdf 
  9. Duan, Ran; Pettie, Seth (2014-01-01). "Linear-Time Approximation for Maximum Weight Matching" (in EN). Journal of the ACM 61: 1–23. doi:10.1145/2529989. https://web.eecs.umich.edu/~pettie/papers/ApproxMWM-JACM.pdf. 
  10. Karp, Richard M. (1972), Miller, Raymond E.; Thatcher, James W.; Bohlinger, Jean D., eds., "Reducibility among Combinatorial Problems" (in en), Complexity of Computer Computations: Proceedings of a symposium on the Complexity of Computer Computations, held March 20–22, 1972, at the IBM Thomas J. Watson Research Center, Yorktown Heights, New York, and sponsored by the Office of Naval Research, Mathematics Program, IBM World Trade Corporation, and the IBM Research Mathematical Sciences Department, The IBM Research Symposia Series (Boston, MA: Springer US): pp. 85–103, doi:10.1007/978-1-4684-2001-2_9, ISBN 978-1-4684-2001-2