Coordinate-wise descent method

One of the methods for minimizing a function of several variables based only on the values of the function to be minimized. The method is used when the function is not differentiable or if a calculation of the derivatives involves a large amount of computation. Below the use of the coordinate-wise descent method for minimizing a function <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c0264601.png" /> on a set

i.e. one performs a cyclic selection of the coordinate vectors <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646023.png" />. First one checks if the conditions

(1)

are fulfilled. If (1) is fulfilled, one sets <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646025.png" />, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646026.png" />. If on the other hand (1) is not fulfilled, one checks the condition

(2)

If (2) is fulfilled, one sets <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646028.png" />, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646029.png" />. If conditions (1) and (2) are both not fulfilled, one sets <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646030.png" />,

(3)

where <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646032.png" /> is the parameter of the method, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646033.png" />. Condition (3) means that if at least one of the conditions (1) and (2) is fulfilled in a single cycle of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646034.png" /> iterations involving a selection of all coordinate vectors <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646035.png" /> with step <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646036.png" />, then the length of the step <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646037.png" /> is not reduced and is retained at least during the following cycle of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646038.png" /> iterations; if on the other hand neither (1) nor (2) is ever fulfilled in the subsequent <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646039.png" /> iterations, the step <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646040.png" /> is reduced.

Let <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646041.png" /> be convex and continuously differentiable on <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646042.png" />, let the set <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646043.png" /> be bounded and let <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646044.png" /> be a positive number. Then the methods (1)–(3) converge, i.e.

and the sequence <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646046.png" /> converges to the set of minima for <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646047.png" /> in <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646048.png" />. If <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646049.png" /> is not differentiable on <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/c/c026/c026460/c02646050.png" />, the method need not converge [1], [2].

References

[1]	F.P. Vasil'ev, "Numerical methods for solving extremum problems" , Moscow (1980) (In Russian)
[2]	V.G. Karmanov, "Mathematical programming" , Moscow (1975) (In Russian)

Comments

References

[a1]	W.I. Zangwill, "Nonlinear programming: a unified approach" , Prentice-Hall (1969)

0.00

(0 votes)

From The Encyclopedia of Math: Coordinate-wise descent method.

Anonymous

Search

Coordinate-wise descent method

Namespaces

More

Page actions

References

Comments

References

Navigation

Navigation

Resources

Help

googletranslator

Navigation

Wiki tools

Wiki tools

Anonymous

Search

Coordinate-wise descent method

References

Comments

References

Navigation

Wiki tools

Page tools

Other projects

Categories