Kolmogorov–Arnold representation theorem
In real analysis and approximation theory, the Kolmogorov–Arnold representation theorem (or superposition theorem) states that every multivariate continuous function [math]\displaystyle{ f\colon[0,1]^n\to \R }[/math] can be represented as a superposition of the two-argument addition of continuous functions of one variable. It solved a more constrained form of Hilbert's thirteenth problem, so the original Hilbert's thirteenth problem is a corollary.[1][2][3]
The works of Vladimir Arnold and Andrey Kolmogorov established that if f is a multivariate continuous function, then f can be written as a finite composition of continuous functions of a single variable and the binary operation of addition.[4] More specifically,
- [math]\displaystyle{ f(\mathbf x) = f(x_1,\ldots ,x_n) = \sum_{q=0}^{2n} \Phi_{q}\!\left(\sum_{p=1}^{n} \phi_{q,p}(x_{p})\right) }[/math].
where [math]\displaystyle{ \phi_{q,p}\colon[0,1]\to \R }[/math] and [math]\displaystyle{ \Phi_{q}\colon \R \to \R }[/math].
There are proofs with specific constructions.[5]
In a sense, they showed that the only true multivariate function is the sum, since every other function can be written using univariate functions and summing.[6]:180
History
The Kolmogorov–Arnold representation theorem is closely related to Hilbert's 13th problem. In his Paris lecture at the International Congress of Mathematicians in 1900, David Hilbert formulated 23 problems which in his opinion were important for the further development of mathematics.[7] The 13th of these problems dealt with the solution of general equations of higher degrees. It is known that for algebraic equations of degree 4 the solution can be computed by formulae that only contain radicals and arithmetic operations. For higher orders, Galois theory shows us that the solutions of algebraic equations cannot be expressed in terms of basic algebraic operations. It follows from the so called Tschirnhaus transformation that the general algebraic equation
- [math]\displaystyle{ x^{n}+a_{n-1}x^{n-1}+\cdots +a_{0}=0 }[/math]
can be translated to the form [math]\displaystyle{ y^{n}+b_{n-4}y^{n-4}+\cdots +b_{1}y+1=0 }[/math]. The Tschirnhaus transformation is given by a formula containing only radicals and arithmetic operations and transforms. Therefore, the solution of an algebraic equation of degree [math]\displaystyle{ n }[/math] can be represented as a superposition of functions of two variables if [math]\displaystyle{ n\lt 7 }[/math] and as a superposition of functions of [math]\displaystyle{ n-4 }[/math] variables if [math]\displaystyle{ n\geq 7 }[/math]. For [math]\displaystyle{ n=7 }[/math] the solution is a superposition of arithmetic operations, radicals, and the solution of the equation [math]\displaystyle{ y^{7}+b_{3}y^{3}+b_{2}y^{2}+b_{1}y+1=0 }[/math].
A further simplification with algebraic transformations seems to be impossible which led to Hilbert's conjecture that "A solution of the general equation of degree 7 cannot be represented as a superposition of continuous functions of two variables". This explains the relation of Hilbert's thirteenth problem to the representation of a higher-dimensional function as superposition of lower-dimensional functions. In this context, it has stimulated many studies in the theory of functions and other related problems by different authors.[8]
Variants
A variant of Kolmogorov's theorem that reduces the number of outer functions [math]\displaystyle{ \Phi_{q} }[/math] is due to George Lorentz.[9] He showed in 1962 that the outer functions [math]\displaystyle{ \Phi_{q} }[/math] can be replaced by a single function [math]\displaystyle{ \Phi }[/math]. More precisely, Lorentz proved the existence of functions [math]\displaystyle{ \phi _{q,p} }[/math], [math]\displaystyle{ q=0,1,\ldots, 2n }[/math], [math]\displaystyle{ p=1,\ldots,n, }[/math] such that
- [math]\displaystyle{ f(\mathbf x) = \sum_{q=0}^{2n} \Phi\!\left(\sum_{p=1}^{n} \phi_{q,p}(x_{p})\right) }[/math].
David Sprecher [10] replaced the inner functions [math]\displaystyle{ \phi_{q,p} }[/math] by one single inner function with an appropriate shift in its argument. He proved that there exist real values [math]\displaystyle{ \eta, \lambda_1,\ldots,\lambda_n }[/math], a continuous function [math]\displaystyle{ \Phi\colon \mathbb{R} \rightarrow \R }[/math], and a real increasing continuous function [math]\displaystyle{ \phi\colon [0,1] \rightarrow [0,1] }[/math] with [math]\displaystyle{ \phi \in \operatorname{Lip}(\ln 2/\ln (2N+2)) }[/math], for [math]\displaystyle{ N \geq n \geq 2 }[/math], such that
- [math]\displaystyle{ f(\mathbf x) = \sum_{q=0}^{2n} \Phi\!\left(\sum_{p=1}^{n} \lambda_p \phi(x_{p}+\eta q)+q \right) }[/math].
Phillip A. Ostrand [11] generalized the Kolmogorov superposition theorem to compact metric spaces. For [math]\displaystyle{ p=1,\ldots,m }[/math] let [math]\displaystyle{ X_p }[/math] be compact metric spaces of finite dimension [math]\displaystyle{ n_p }[/math] and let [math]\displaystyle{ n = \sum_{p=1}^{m} n_p }[/math]. Then there exists continuous functions [math]\displaystyle{ \phi_{q,p}\colon X_p \rightarrow [0,1], q=0,\ldots,2n, p=1,\ldots,m }[/math] and continuous functions [math]\displaystyle{ G_q\colon [0,1] \rightarrow \R, q=0,\ldots,2n }[/math] such that any continuous function [math]\displaystyle{ f\colon X_1 \times \dots \times X_m \rightarrow \mathbb{R} }[/math] is representable in the form
- [math]\displaystyle{ f(x_1,\ldots,x_m) = \sum_{q=0}^{2n} G_{q}\!\left(\sum_{p=1}^{m} \phi_{q,p}(x_{p})\right) }[/math].
Limitations
The theorem does not hold in general for complex multi-variate functions, as discussed here.[2] Furthermore, the non-smoothness of the inner functions and their "wild behavior" has limited the practical use of the representation,[12] although there is some debate on this.[13]
See also
References
- ↑ Khesin, Boris A.; Tabachnikov, Serge L. (2014). Arnold: Swimming Against the Tide. American Mathematical Society. p. 165. ISBN 978-1-4704-1699-7. https://books.google.com/books?id=aBWHBAAAQBAJ&pg=PA165.
- ↑ 2.0 2.1 Akashi, Shigeo (2001). "Application of ϵ-entropy theory to Kolmogorov—Arnold representation theorem" (in en). Reports on Mathematical Physics 48 (1-2): 19–26. doi:10.1016/S0034-4877(01)80060-4.
- ↑ Morris, Sidney A. (2020-07-06). "Hilbert 13: Are there any genuine continuous multivariate real-valued functions?" (in en). Bulletin of the American Mathematical Society 58 (1): 107–118. doi:10.1090/bull/1698. ISSN 0273-0979. https://www.ams.org/bull/2021-58-01/S0273-0979-2020-01698-8/.
- ↑ Bar-Natan, Dror. "Dessert: Hilbert's 13th Problem, in Full Colour". http://www.math.toronto.edu/drorbn/Talks/Fields-0911/.
- ↑ Braun, Jürgen; Griebel, Michael (2009). "On a constructive proof of Kolmogorov’s superposition theorem" (in en). Constructive Approximation 30 (3): 653–675. doi:10.1007/s00365-009-9054-2.
- ↑ Diaconis, Persi; Shahshahani, Mehrdad (1984). "On linear functions of linear combinations". SIAM Journal on Scientific Computing 5 (1): 175–191. http://www-stat.stanford.edu/~cgates/PERSI/papers/nonlin_func.pdf.
- ↑ Hilbert, David (1902). "Mathematical problems". Bulletin of the American Mathematical Society 8 (10): 461–462. doi:10.1090/S0002-9904-1902-00923-3.
- ↑ Jürgen Braun, On Kolmogorov's Superposition Theorem and Its Applications, SVH Verlag, 2010, 192 pp.
- ↑ Lorentz, G. G. (1962). "Metric entropy, widths, and superpositions of functions". American Mathematical Monthly 69 (6): 469–485. doi:10.1080/00029890.1962.11989915.
- ↑ David A. Sprecher, On the structure of continuous functions of several variables, Transactions of the American Mathematical Society, 115 (1965), pp. 340–355.
- ↑ Ostrand, Phillip A. (1965). "Dimension of metric spaces and Hilbert's problem 13". Bulletin of the American Mathematical Society 71 (4): 619–622. doi:10.1090/s0002-9904-1965-11363-5.
- ↑ F. Girosi and T. Poggio, "Representation Properties of Networks: Kolmogorov's Theorem Is Irrelevant," in Neural Computation, vol. 1, no. 4, pp. 465–469, Dec. 1989, doi: 10.1162/neco.1989.1.4.465.
- ↑ Věra Kůrková, "Kolmogorov's Theorem Is Relevant", https://doi.org/10.1162/neco.1991.3.4.617
Sources
- Andrey Kolmogorov, "On the representation of continuous functions of several variables by superpositions of continuous functions of a smaller number of variables", Proceedings of the USSR Academy of Sciences, 108 (1956), pp. 179–182; English translation: Amer. Math. Soc. Transl., 17 (1961), pp. 369–373.
- Vladimir Arnold, "On functions of three variables", Proceedings of the USSR Academy of Sciences, 114 (1957), pp. 679–681; English translation: Amer. Math. Soc. Transl., 28 (1963), pp. 51–54.
Further reading
- S. Ya. Khavinson, Best Approximation by Linear Superpositions (Approximate Nomography), AMS Translations of Mathematical Monographs (1997)
External links
- A deep machine learning algorithm for construction of the Kolmogorov-Arnold representation.
- Practical way of building Kolmogorov-Arnold model.
Original source: https://en.wikipedia.org/wiki/Kolmogorov–Arnold representation theorem.
Read more |