Computer-assisted proof

From HandWiki
Short description: Mathematical proof at least partially generated by computer

A computer-assisted proof is a mathematical proof that has been at least partially generated by computer.

Most computer-aided proofs to date have been implementations of large proofs-by-exhaustion of a mathematical theorem. The idea is to use a computer program to perform lengthy computations, and to provide a proof that the result of these computations implies the given theorem. In 1976, the four color theorem was the first major theorem to be verified using a computer program.

Attempts have also been made in the area of artificial intelligence research to create smaller, explicit, new proofs of mathematical theorems from the bottom up using automated reasoning techniques such as heuristic search. Such automated theorem provers have proved a number of new results and found new proofs for known theorems.[citation needed] Additionally, interactive proof assistants allow mathematicians to develop human-readable proofs which are nonetheless formally verified for correctness. Since these proofs are generally human-surveyable (albeit with difficulty, as with the proof of the Robbins conjecture) they do not share the controversial implications of computer-aided proofs-by-exhaustion.


One method for using computers in mathematical proofs is by means of so-called validated numerics or rigorous numerics. This means computing numerically yet with mathematical rigour. One uses set-valued arithmetic and inclusion principle[clarify] in order to ensure that the set-valued output of a numerical program encloses the solution of the original mathematical problem. This is done by controlling, enclosing and propagating round-off and truncation errors using for example interval arithmetic. More precisely, one reduces the computation to a sequence of elementary operations, say [math]\displaystyle{ (+, -, \times, /) }[/math]. In a computer, the result of each elementary operation is rounded off by the computer precision. However, one can construct an interval provided by upper and lower bounds on the result of an elementary operation. Then one proceeds by replacing numbers with intervals and performing elementary operations between such intervals of representable numbers.[citation needed]

Philosophical objections

Main page: Non-surveyable proof

Computer-assisted proofs are the subject of some controversy in the mathematical world, with Thomas Tymoczko first to articulate objections. Those who adhere to Tymoczko's arguments believe that lengthy computer-assisted proofs are not, in some sense, 'real' mathematical proofs because they involve so many logical steps that they are not practically verifiable by human beings, and that mathematicians are effectively being asked to replace logical deduction from assumed axioms with trust in an empirical computational process, which is potentially affected by errors in the computer program, as well as defects in the runtime environment and hardware.[1]

Other mathematicians believe that lengthy computer-assisted proofs should be regarded as calculations, rather than proofs: the proof algorithm itself should be proved valid, so that its use can then be regarded as a mere "verification". Arguments that computer-assisted proofs are subject to errors in their source programs, compilers, and hardware can be resolved by providing a formal proof of correctness for the computer program (an approach which was successfully applied to the four-color theorem in 2005) as well as replicating the result using different programming languages, different compilers, and different computer hardware.

Another possible way of verifying computer-aided proofs is to generate their reasoning steps in a machine-readable form, and then use a proof checker program to demonstrate their correctness. Since validating a given proof is much easier than finding a proof, the checker program is simpler than the original assistant program, and it is correspondingly easier to gain confidence into its correctness. However, this approach of using a computer program to prove the output of another program correct does not appeal to computer proof skeptics, who see it as adding another layer of complexity without addressing the perceived need for human understanding.

Another argument against computer-aided proofs is that they lack mathematical elegance—that they provide no insights or new and useful concepts. In fact, this is an argument that could be advanced against any lengthy proof by exhaustion.

An additional philosophical issue raised by computer-aided proofs is whether they make mathematics into a quasi-empirical science, where the scientific method becomes more important than the application of pure reason in the area of abstract mathematical concepts. This directly relates to the argument within mathematics as to whether mathematics is based on ideas, or "merely" an exercise in formal symbol manipulation. It also raises the question whether, if according to the Platonist view, all possible mathematical objects in some sense "already exist", whether computer-aided mathematics is an observational science like astronomy, rather than an experimental one like physics or chemistry. This controversy within mathematics is occurring at the same time as questions are being asked in the physics community about whether twenty-first century theoretical physics is becoming too mathematical, and leaving behind its experimental roots.

The emerging field of experimental mathematics is confronting this debate head-on by focusing on numerical experiments as its main tool for mathematical exploration.


Theorems proved with the help of computer programs

Inclusion in this list does not imply that a formal computer-checked proof exists, but rather, that a computer program has been involved in some way. See the main articles for details.

See also


  1. Tymoczko, Thomas (1979), "The Four-Color Problem and its Mathematical Significance", The Journal of Philosophy 76 (2): 57–83, doi:10.2307/2025976 .
  2. Gonthier, Georges (2008), "Formal Proof—The Four-Color Theorem", Notices of the American Mathematical Society 55 (11): 1382–1393, 
  3. Hass, J.; Hutchings, M.; Schlafly, R. (1995). "The double bubble conjecture". Electronic Research Announcements of the American Mathematical Society 1 (3): 98–102. doi:10.1090/S1079-6762-95-03001-0. 
  4. Kouril, Michal (2006). A Backtracking Framework for Beowulf Clusters with an Extension to Multi-Cluster Computation and Sat Benchmark Problem Implementation (Ph.D. thesis). University of Cincinnati.
  5. Kouril, Michal (2012). "Computing the van der Waerden number W(3,4)=293". Integers 12: A46. 
  6. Kouril, Michal (2015). "Leveraging FPGA clusters for SAT computations". Parallel Computing: On the Road to Exascale: 525–532. 
  7. Ahmed, Tanbir (2009). "Some new van der Waerden numbers and some van der Waerden-type numbers". Integers 9: A6. doi:10.1515/integ.2009.007. 
  8. 8.0 8.1 Ahmed, Tanbir (2010). "Two new van der Waerden numbers w(2;3,17) and w(2;3,18)". Integers 10 (4): 369–377. doi:10.1515/integ.2010.032. 
  9. Ahmed, Tanbir (2012). "On computation of exact van der Waerden numbers". Integers 12 (3): 417–425. doi:10.1515/integ.2011.112. 
  10. Ahmed, Tanbir (2013). "Some More Van der Waerden numbers". Journal of Integer Sequences 16 (4): 13.4.4. 
  11. Ahmed, Tanbir; Kullmann, Oliver; Snevily, Hunter (2014). "On the van der Waerden numbers w(2;3,t)". Discrete Applied Mathematics 174 (2014): 27–51. doi:10.1016/j.dam.2014.05.007. 
  12. "God's Number is 20". July 2010. 
  13. Cesare, Chris (1 October 2015). "Maths whizz solves a master's riddle". Nature 526 (7571): 19–20. doi:10.1038/nature.2015.18441. PMID 26432222. Bibcode2015Natur.526...19C. 
  14. Lamb, Evelyn (26 May 2016). "Two-hundred-terabyte maths proof is largest ever". Nature 534 (7605): 17–18. doi:10.1038/nature.2016.19990. PMID 27251254. Bibcode2016Natur.534...17L. 
  15. Celletti, A.; Chierchia, L. (1987). "Rigorous estimates for a computer-assisted KAM theory". Journal of Mathematical Physics 28 (9): 2078–86. doi:10.1063/1.527418. Bibcode1987JMP....28.2078C. 
  16. Figueras, J.L.; Haro, A.; Luque, A. (2017). "Rigorous computer-assisted application of KAM theory: a modern approach". Foundations of Computational Mathematics 17 (5): 1123–93. doi:10.1007/s10208-016-9339-3. 
  17. Heule, Marijn J. H. (2017). "Schur Number Five". arXiv:1711.08076 [cs.LO].
  18. "Schur Number Five". 
  19. Brakensiek, Joshua; Heule, Marijn; Mackey, John; Narváez, David (2020). "The Resolution of Keller's Conjecture". in Peltier, Nicolas; Sofronie-Stokkermans, Viorica. Automated Reasoning. Lecture Notes in Computer Science. 12166. Springer. pp. 48–65. doi:10.1007/978-3-030-51074-9_4. ISBN 978-3-030-51074-9. 
  20. Hartnett, Kevin (2020-08-19). "Computer Search Settles 90-Year-Old Math Problem" (in en). 
  21. Subercaseaux, Bernardo; Heule, Marijn J. H. (2023-01-23). "The Packing Chromatic Number of the Infinite Square Grid is 15". arXiv:2301.09757 [cs.DM].
  22. Hartnett, Kevin (2023-04-20). "The Number 15 Describes the Secret Limit of an Infinite Grid" (in en). 

Further reading

External links