Multivariable calculus

From HandWiki
Short description: Calculus of functions of several variables


Multivariable calculus (also known as multivariate calculus) is the extension of calculus in one variable to calculus with functions of several variables: the differentiation and integration of functions involving multiple variables (multivariate), rather than just one.[1]

Multivariable calculus may be thought of as an elementary part of advanced calculus. For advanced calculus, see calculus on Euclidean space. The special case of calculus in three dimensional space is often called vector calculus.

Introduction

In single-variable calculus, operations like differentiation and integration are made to functions of a single variable. In multivariate calculus, it is required to generalize these to multiple variables, and the domain is therefore multi-dimensional. Care is therefore required in these generalizations, because of two key differences between 1D and higher dimensional spaces:

  1. There are infinite ways to approach a single point in higher dimensions, as opposed to two (from the positive and negative direction) in 1D;
  2. There are multiple extended objects associated with the dimension; for example, for a 1D function, it must be represented as a curve on the 2D Cartesian plane, but a function with two variables is a surface in 3D, while curves can also live in 3D space.

The consequence of the first difference is the difference in the definition of the limit and differentiation. Directional limits and derivatives define the limit and differential along a 1D parametrized curve, reducing the problem to the 1D case; further higher-dimensional objects can be constructed from these operators.

The consequence of the second difference is the existence of multiple types of integration, including line integrals, surface integrals and volume integrals. Due to the non-uniqueness of these integrals, an antiderivative or indefinite integral cannot be properly defined.

Limits

A study of limits and continuity in multivariable calculus yields many counterintuitive results not demonstrated by single-variable functions.

A limit along a path may be defined by considering a parametrised path [math]\displaystyle{ s(t): \mathbb{R} \to \mathbb{R}^n }[/math] in n-dimensional Euclidean space. Any function [math]\displaystyle{ f(\overrightarrow{x}): \mathbb{R}^n \to \mathbb{R}^m }[/math] can then be projected on the path as a 1D function [math]\displaystyle{ f(s(t)) }[/math]. The limit of [math]\displaystyle{ f }[/math] to the point [math]\displaystyle{ s(t_0) }[/math] along the path [math]\displaystyle{ s(t) }[/math] can hence be defined as

[math]\displaystyle{ \lim_{\overrightarrow{x} \to s(t_0)} f(\overrightarrow{x}) = \lim_{t \to t_0} f(s(t)) }[/math]

 

 

 

 

(1)

Note that the value of this limit can be dependent on the form of [math]\displaystyle{ s(t) }[/math], i.e. the path chosen, not just the point which the limit approaches.[1]:19–22 For example, consider the function [math]\displaystyle{ f(x,y) = \frac{x^2y}{x^4+y^2} }[/math]. If the point [math]\displaystyle{ (0,0) }[/math] is approached through the line [math]\displaystyle{ y=kx }[/math], or in parametric form:

Plot of the function f(x, y) = (x²y)/(x4 + y2)

[math]\displaystyle{ x(t) = t, y(t) = kt }[/math]

 

 

 

 

(2)

Then the limit along the path will be:

[math]\displaystyle{ \lim_{t \to 0} f(x(t),y(t)) = \lim_{t \to 0} \frac{k^2 t^3}{k^4 t^4 + t^2} = 0 }[/math]

 

 

 

 

(3)

On the other hand, if the path [math]\displaystyle{ y=\pm x^2 }[/math] (or parametrically, [math]\displaystyle{ x(t)=t, y(t)=\pm t^2 }[/math]) is chosen, then the limit becomes:

[math]\displaystyle{ \lim_{t \to 0} f(x(t),y(t)) = \lim_{t \to 0} \frac{\pm t^4}{t^4 + t^4} = \pm \frac{1}{2} }[/math]

 

 

 

 

(4)

Since taking different paths towards the same point yields different values, a general limit at the point [math]\displaystyle{ (0,0) }[/math] cannot be defined for the function.

A general limit can be defined if the limits to a point along all possible paths converge to the same value, i.e. we say for a function [math]\displaystyle{ f: \mathbb{R}^n \to \mathbb{R}^m }[/math] that the limit of [math]\displaystyle{ f }[/math] to some point [math]\displaystyle{ x_0 \in \mathbb{R}^n }[/math] is L, if and only if

[math]\displaystyle{ \lim_{t \to t_0} f(s(t)) = L }[/math]

 

 

 

 

(5)

for all continuous functions [math]\displaystyle{ s(t): \mathbb{R} \to \mathbb{R}^n }[/math] such that [math]\displaystyle{ s(t_0)=x_0 }[/math].

Continuity

From the concept of limit along a path, we can then derive the definition for multivariate continuity in the same manner, that is: we say for a function [math]\displaystyle{ f: \mathbb{R}^n \to \mathbb{R}^m }[/math] that [math]\displaystyle{ f }[/math] is continuous at the point [math]\displaystyle{ x_0 }[/math], if and only if

[math]\displaystyle{ \lim_{t \to t_0} f(s(t)) = f(x_0) }[/math]

 

 

 

 

(5)

for all continuous functions [math]\displaystyle{ s(t): \mathbb{R} \to \mathbb{R}^n }[/math] such that [math]\displaystyle{ s(t_0)=x_0 }[/math].

As with limits, being continuous along one path [math]\displaystyle{ s(t) }[/math] does not imply multivariate continuity.

Continuity in each argument not being sufficient for multivariate continuity can also be seen from the following example.[1]:17–19 For example, for a real-valued function [math]\displaystyle{ f: \mathbb{R}^2 \to \mathbb{R} }[/math] with two real-valued parameters, [math]\displaystyle{ f(x,y) }[/math], continuity of [math]\displaystyle{ f }[/math] in [math]\displaystyle{ x }[/math] for fixed [math]\displaystyle{ y }[/math] and continuity of [math]\displaystyle{ f }[/math] in [math]\displaystyle{ y }[/math] for fixed [math]\displaystyle{ x }[/math] does not imply continuity of [math]\displaystyle{ f }[/math].

Consider

[math]\displaystyle{ f(x,y)= \begin{cases} \frac{y}{x}-y & \text{if}\quad 0 \leq y \lt x \leq 1 \\ \frac{x}{y}-x & \text{if}\quad 0 \leq x \lt y \leq 1 \\ 1-x & \text{if}\quad 0 \lt x=y \\ 0 & \text{everywhere else}. \end{cases} }[/math]

It is easy to verify that this function is zero by definition on the boundary and outside of the quadrangle [math]\displaystyle{ (0,1)\times (0,1) }[/math]. Furthermore, the functions defined for constant [math]\displaystyle{ x }[/math] and [math]\displaystyle{ y }[/math] and [math]\displaystyle{ 0 \le a \le 1 }[/math] by

[math]\displaystyle{ g_a(x) = f(x,a)\quad }[/math] and [math]\displaystyle{ \quad h_a(y) = f(a,y)\quad }[/math]

are continuous. Specifically,

[math]\displaystyle{ g_0(x) = f(x,0) = h_0(0,y) = f(0,y) = 0 }[/math] for all x and y. Therefore, [math]\displaystyle{ f(0,0)=0 }[/math] and moreover, along the coordinate axes, [math]\displaystyle{ \lim_{x \to 0} f(x,0) = 0 }[/math] and [math]\displaystyle{ \lim_{y \to 0} f(0,y) = 0 }[/math]. Therefore the function is continuous along both individual arguments.

However, consider the parametric path [math]\displaystyle{ x(t) = t, y(t) = t }[/math]. The parametric function becomes

[math]\displaystyle{ f(x(t),y(t))= \begin{cases} 1-t & \text{if}\quad t \gt 0 \\ 0 & \text{everywhere else}. \end{cases} }[/math]

 

 

 

 

(6)

Therefore,

[math]\displaystyle{ \lim_{t \to 0^+} f(x(t),y(t)) = 1 \neq f(0,0) = 0 }[/math]

 

 

 

 

(7)

It is hence clear that the function is not multivariate continuous, despite being continuous in both coordinates.

Theorems regarding multivariate limits and continuity

  • All properties of linearity and superposition from single-variable calculus carry over to multivariate calculus.
  • Composition: If [math]\displaystyle{ f: \mathbb{R}^n \to \mathbb{R}^m }[/math] and [math]\displaystyle{ g: \mathbb{R}^m \to \mathbb{R}^p }[/math] are both multivariate continuous functions at the points [math]\displaystyle{ x_0 \in \mathbb{R}^n }[/math] and [math]\displaystyle{ f(x_0) \in \mathbb{R}^m }[/math] respectively, then [math]\displaystyle{ g \circ f: \mathbb{R}^n \to \mathbb{R}^p }[/math] is also a multivariate continuous function at the point [math]\displaystyle{ x_0 }[/math].
  • Multiplication: If [math]\displaystyle{ f: \mathbb{R}^n \to \mathbb{R} }[/math] and [math]\displaystyle{ g: \mathbb{R}^n \to \mathbb{R} }[/math] are both continuous functions at the point [math]\displaystyle{ x_0 \in \mathbb{R}^n }[/math], then [math]\displaystyle{ fg: \mathbb{R}^n \to \mathbb{R} }[/math] is continuous at [math]\displaystyle{ x_0 }[/math], and [math]\displaystyle{ \frac{f}{g}: \mathbb{R}^n \to \mathbb{R} }[/math] is also continuous at [math]\displaystyle{ x_0 }[/math] provided that [math]\displaystyle{ g(x_0) \neq 0 }[/math].
  • If [math]\displaystyle{ f: \mathbb{R}^n \to \mathbb{R} }[/math] is a continuous function at point [math]\displaystyle{ x_0 \in \mathbb{R}^n }[/math], then [math]\displaystyle{ |f| }[/math] is also continuous at the same point.
  • If [math]\displaystyle{ f: \mathbb{R}^n \to \mathbb{R}^m }[/math] is Lipschitz continuous (with the appropiate normed spaces as needed) in the neighbourhood of the point [math]\displaystyle{ x_0 \in \mathbb{R}^n }[/math], then [math]\displaystyle{ f }[/math] is multivariate continuous at [math]\displaystyle{ x_0 }[/math].
Proof

From the Lipschitz continuity condition for [math]\displaystyle{ f }[/math] we have

[math]\displaystyle{ |f(s(t))-f(s(t_0))| \leq K|s(t)-s(t_0)| }[/math]

 

 

 

 

(8)

where [math]\displaystyle{ K }[/math] is the Lipschitz constant. Note also that, as [math]\displaystyle{ s(t) }[/math] is continuous at [math]\displaystyle{ t_0 }[/math], for every [math]\displaystyle{ \delta \gt 0 }[/math] there exists a [math]\displaystyle{ \epsilon \gt 0 }[/math] such that [math]\displaystyle{ |s(t)-s(t_0)| \lt \delta }[/math] [math]\displaystyle{ \forall |t-t_0| \lt \epsilon }[/math].

Hence, for every [math]\displaystyle{ \alpha \gt 0 }[/math], choose [math]\displaystyle{ \delta = \frac{\alpha}{K} }[/math]; there exists an [math]\displaystyle{ \epsilon \gt 0 }[/math] such that for all [math]\displaystyle{ t }[/math] satisfying [math]\displaystyle{ |t-t_0| \lt \epsilon }[/math], [math]\displaystyle{ |s(t)-s(t_0)| \lt \delta }[/math], and [math]\displaystyle{ |f(s(t)) - f(s(t_0))| \leq K|s(t)-s(t_0)| \lt K\delta = \alpha }[/math]. Hence [math]\displaystyle{ \lim_{t \to t_0} f(s(t)) }[/math] converges to [math]\displaystyle{ f(s(t_0)) }[/math] regardless of the precise form of [math]\displaystyle{ s(t) }[/math].

Differentiation

Directional derivative

The derivative of a single-variable function is defined as

[math]\displaystyle{ \frac{df}{dx} = \lim_{h \to 0} \frac{f(x+h)-f(x)}{h} }[/math]

 

 

 

 

(9)

Using the extension of limits discussed above, one can then extend the definition of the derivative to a scalar-valued function [math]\displaystyle{ f: \mathbb{R}^n \to \mathbb{R} }[/math] along some path [math]\displaystyle{ s(t): \mathbb{R} \to \mathbb{R}^n }[/math]:

[math]\displaystyle{ \left . \frac{df}{dx} \right |_{s(t),t=t_0} = \lim_{h \to 0} \frac{f(s(t_0+h))-f(s(t_0))}{|s(t_0+h)-s(t_0)|} }[/math]

 

 

 

 

(10)

Unlike limits, for which the value depends on the exact form of the path [math]\displaystyle{ s(t) }[/math], it can be shown that the derivative along the path depends only on the tangent vector of the path at [math]\displaystyle{ s(t_0) }[/math], i.e. [math]\displaystyle{ s'(t_0) }[/math], provided that [math]\displaystyle{ f }[/math] is Lipschitz continuous at [math]\displaystyle{ s(t_0) }[/math], and that the limit exits for at least one such path.

Proof

For [math]\displaystyle{ s(t) }[/math] continuous up to the first derivative (this statement is well defined as [math]\displaystyle{ s }[/math] is a function of one variable), we can write the Taylor expansion of [math]\displaystyle{ s }[/math] around [math]\displaystyle{ t_0 }[/math] using Taylor's theorem to construct the remainder:

[math]\displaystyle{ s(t) = s(t_0) + s'(\tau) (t-t_0) }[/math]

 

 

 

 

(11)

where [math]\displaystyle{ \tau \in [t_0,t] }[/math].

Substituting this into 10,

[math]\displaystyle{ \left . \frac{df}{dx} \right |_{s(t),t=t_0} = \lim_{h \to 0} \frac{f(s(t_0)+s'(\tau)h)-f(s(t_0))}{|s'(\tau)h|} }[/math]

 

 

 

 

(12)

where [math]\displaystyle{ \tau(h) \in [t_0,t_0+h] }[/math].

Lipschitz continuity gives us [math]\displaystyle{ |f(x)-f(y)| \leq K|x-y| }[/math] for some finite [math]\displaystyle{ K }[/math], [math]\displaystyle{ \forall x,y\in \mathbb{R}^n }[/math]. It follows that [math]\displaystyle{ |f(x+O(h))-f(x)| \sim O(h) }[/math].

Note also that given the continuity of [math]\displaystyle{ s'(t) }[/math], [math]\displaystyle{ s'(\tau) = s'(t_0)+O(h) }[/math] as [math]\displaystyle{ h \to 0 }[/math].

Substituting these two conditions into 12,

[math]\displaystyle{ \left . \frac{df}{dx} \right |_{s(t),t=t_0} = \lim_{h \to 0} \frac{f(s(t_0)+s'(t_0)h)-f(s(t_0))+O(h^2)}{|s'(t_0)h|+O(h^2)} }[/math]

 

 

 

 

(13)

whose limit depends only on [math]\displaystyle{ s'(t_0) }[/math] as the dominant term.

It is therefore possible to generate the definition of the directional derivative as follows: The directional derivative of a scalar-valued function [math]\displaystyle{ f:\mathbb{R}^n \to \mathbb{R} }[/math] along the unit vector [math]\displaystyle{ \hat{\bold{u}} }[/math] at some point [math]\displaystyle{ x_0 \in \mathbb{R}^n }[/math] is

[math]\displaystyle{ \nabla_{\hat{\bold{u}}} f(x_0) = \lim_{t \to 0} \frac{f(x_0+\hat{\bold{u}} t) - f(x_0)}{t} }[/math]

 

 

 

 

(14)

or, when expressed in terms of ordinary differentiation,

[math]\displaystyle{ \nabla_{\hat{\bold{u}}} f(x_0) = \left . \frac{df(x_0+\hat{\bold{u}}t)}{dt} \right |_{t=0} }[/math]

 

 

 

 

(15)

which is a well defined expression because [math]\displaystyle{ f(x_0+\hat{\bold{u}}t) }[/math] is a scalar function with one variable in [math]\displaystyle{ t }[/math].

It is not possible to define a unique scalar derivative without a direction; it is clear for example that [math]\displaystyle{ \nabla_{\hat{\bold{u}}}f(x_0) = - \nabla_{-\hat{\bold{u}}}f(x_0) }[/math]. It is also possible for directional derivatives to exist for some directions but not for others.

Partial derivative

The partial derivative generalizes the notion of the derivative to higher dimensions. A partial derivative of a multivariable function is a derivative with respect to one variable with all other variables held constant.[1]:26ff

A partial derivative may be thought of as the directional derivative of the function along a coordinate axis.

Partial derivatives may be combined in interesting ways to create more complicated expressions of the derivative. In vector calculus, the del operator ([math]\displaystyle{ \nabla }[/math]) is used to define the concepts of gradient, divergence, and curl in terms of partial derivatives. A matrix of partial derivatives, the Jacobian matrix, may be used to represent the derivative of a function between two spaces of arbitrary dimension. The derivative can thus be understood as a linear transformation which directly varies from point to point in the domain of the function.

Differential equations containing partial derivatives are called partial differential equations or PDEs. These equations are generally more difficult to solve than ordinary differential equations, which contain derivatives with respect to only one variable.[1]:654ff

Multiple integration

The multiple integral expands the concept of the integral to functions of any number of variables. Double and triple integrals may be used to calculate areas and volumes of regions in the plane and in space. Fubini's theorem guarantees that a multiple integral may be evaluated as a repeated integral or iterated integral as long as the integrand is continuous throughout the domain of integration.[1]:367ff

The surface integral and the line integral are used to integrate over curved manifolds such as surfaces and curves.

Fundamental theorem of calculus in multiple dimensions

In single-variable calculus, the fundamental theorem of calculus establishes a link between the derivative and the integral. The link between the derivative and the integral in multivariable calculus is embodied by the integral theorems of vector calculus:[1]:543ff

In a more advanced study of multivariable calculus, it is seen that these four theorems are specific incarnations of a more general theorem, the generalized Stokes' theorem, which applies to the integration of differential forms over manifolds.[2]

Applications and uses

Techniques of multivariable calculus are used to study many objects of interest in the material world. In particular,

Type of functions Applicable techniques
Curves Osculating circle.svg [math]\displaystyle{ f: \mathbb{R} \to \mathbb{R}^n }[/math]
for [math]\displaystyle{ n \gt 1 }[/math]
Lengths of curves, line integrals, and curvature.
Surfaces Helicoid.svg [math]\displaystyle{ f: \mathbb{R}^2 \to \mathbb{R}^n }[/math]
for [math]\displaystyle{ n \gt 2 }[/math]
Areas of surfaces, surface integrals, flux through surfaces, and curvature.
Scalar fields Surface-plot.png [math]\displaystyle{ f: \mathbb{R}^n \to \mathbb{R} }[/math] Maxima and minima, Lagrange multipliers, directional derivatives, level sets.
Vector fields Vector field.svg [math]\displaystyle{ f: \mathbb{R}^m \to \mathbb{R}^n }[/math] Any of the operations of vector calculus including gradient, divergence, and curl.

Multivariable calculus can be applied to analyze deterministic systems that have multiple degrees of freedom. Functions with independent variables corresponding to each of the degrees of freedom are often used to model these systems, and multivariable calculus provides tools for characterizing the system dynamics.

Multivariate calculus is used in the optimal control of continuous time dynamic systems. It is used in regression analysis to derive formulas for estimating relationships among various sets of empirical data.

Multivariable calculus is used in many fields of natural and social science and engineering to model and study high-dimensional systems that exhibit deterministic behavior. In economics, for example, consumer choice over a variety of goods, and producer choice over various inputs to use and outputs to produce, are modeled with multivariate calculus.

Non-deterministic, or stochastic systems can be studied using a different kind of mathematics, such as stochastic calculus.

See also

References

  1. 1.0 1.1 1.2 1.3 1.4 1.5 1.6 Richard Courant; Fritz John (14 December 1999). Introduction to Calculus and Analysis Volume II/2. Springer Science & Business Media. ISBN 978-3-540-66570-0. 
  2. Spivak, Michael (1965). Calculus on Manifolds. New York: W. A. Benjamin, Inc.. ISBN 9780805390216. https://archive.org/details/SpivakM.CalculusOnManifolds_201703. 

External links