Directional derivative
Part of a series of articles about |
Calculus |
---|
A directional derivative is a concept in multivariable calculus that measures the rate at which a function changes in a particular direction at a given point.[citation needed]
The directional derivative of a multivariable differentiable (scalar) function along a given vector v at a given point x intuitively represents the instantaneous rate of change of the function, moving through x with a velocity specified by v.
The directional derivative of a scalar function f with respect to a vector v at a point (e.g., position) x may be denoted by any of the following: [math]\displaystyle{ \nabla_{\mathbf{v}}{f}(\mathbf{x})=f'_\mathbf{v}(\mathbf{x})=D_\mathbf{v}f(\mathbf{x})=Df(\mathbf{x})(\mathbf{v})=\partial_\mathbf{v}f(\mathbf{x})=\mathbf{v}\cdot{\nabla f(\mathbf{x})}=\mathbf{v}\cdot \frac{\partial f(\mathbf{x})}{\partial\mathbf{x}}. }[/math]
It therefore generalizes the notion of a partial derivative, in which the rate of change is taken along one of the curvilinear coordinate curves, all other coordinates being constant. The directional derivative is a special case of the Gateaux derivative.
Definition
The directional derivative of a scalar function [math]\displaystyle{ f(\mathbf{x}) = f(x_1, x_2, \ldots, x_n) }[/math] along a vector [math]\displaystyle{ \mathbf{v} = (v_1, \ldots, v_n) }[/math] is the function [math]\displaystyle{ \nabla_{\mathbf{v}}{f} }[/math] defined by the limit[1] [math]\displaystyle{ \nabla_{\mathbf{v}}{f}(\mathbf{x}) = \lim_{h \to 0}{\frac{f(\mathbf{x} + h\mathbf{v}) - f(\mathbf{x})}{h}}. }[/math]
This definition is valid in a broad range of contexts, for example where the norm of a vector (and hence a unit vector) is undefined.[2]
For differentiable functions
If the function f is differentiable at x, then the directional derivative exists along any unit vector v at x, and one has
[math]\displaystyle{ \nabla_{\mathbf{v}}{f}(\mathbf{x}) = \nabla f(\mathbf{x}) \cdot \mathbf{v} }[/math]
where the [math]\displaystyle{ \nabla }[/math] on the right denotes the gradient, [math]\displaystyle{ \cdot }[/math] is the dot product and v is a unit vector.[3] This follows from defining a path [math]\displaystyle{ h(t) = x + tv }[/math] and using the definition of the derivative as a limit which can be calculated along this path to get: [math]\displaystyle{ \begin{align} 0 &=\lim_{t \to 0}\frac {f(x+tv)-f(x)-tDf(x)(v)} t \\ &=\lim_{t \to 0}\frac {f(x+tv)-f(x)} t - Df(x)(v) \\ &=\nabla_v f(x)-Df(x)(v). \end{align} }[/math]
Intuitively, the directional derivative of f at a point x represents the rate of change of f, in the direction of v with respect to time, when moving past x.
Using only direction of vector
thumb|The angle α between the tangent A and the horizontal will be maximum if the cutting plane contains the direction of the gradient A. In a Euclidean space, some authors[4] define the directional derivative to be with respect to an arbitrary nonzero vector v after normalization, thus being independent of its magnitude and depending only on its direction.[5]
This definition gives the rate of increase of f per unit of distance moved in the direction given by v. In this case, one has [math]\displaystyle{ \nabla_{\mathbf{v}}{f}(\mathbf{x}) = \lim_{h \to 0}{\frac{f(\mathbf{x} + h\mathbf{v}) - f(\mathbf{x})}{h|\mathbf{v}|}}, }[/math] or in case f is differentiable at x, [math]\displaystyle{ \nabla_{\mathbf{v}}{f}(\mathbf{x}) = \nabla f(\mathbf{x}) \cdot \frac{\mathbf{v}}{|\mathbf{v}|} . }[/math]
Restriction to a unit vector
In the context of a function on a Euclidean space, some texts restrict the vector v to being a unit vector. With this restriction, both the above definitions are equivalent.[6]
Properties
Many of the familiar properties of the ordinary derivative hold for the directional derivative. These include, for any functions f and g defined in a neighborhood of, and differentiable at, p:
- sum rule: [math]\displaystyle{ \nabla_{\mathbf{v}} (f + g) = \nabla_{\mathbf{v}} f + \nabla_{\mathbf{v}} g. }[/math]
- constant factor rule: For any constant c, [math]\displaystyle{ \nabla_{\mathbf{v}} (cf) = c\nabla_{\mathbf{v}} f. }[/math]
- product rule (or Leibniz's rule): [math]\displaystyle{ \nabla_{\mathbf{v}} (fg) = g\nabla_{\mathbf{v}} f + f\nabla_{\mathbf{v}} g. }[/math]
- chain rule: If g is differentiable at p and h is differentiable at g(p), then [math]\displaystyle{ \nabla_{\mathbf{v}}(h\circ g)(\mathbf{p}) = h'(g(\mathbf{p})) \nabla_{\mathbf{v}} g (\mathbf{p}). }[/math]
In differential geometry
Let M be a differentiable manifold and p a point of M. Suppose that f is a function defined in a neighborhood of p, and differentiable at p. If v is a tangent vector to M at p, then the directional derivative of f along v, denoted variously as df(v) (see Exterior derivative), [math]\displaystyle{ \nabla_{\mathbf{v}} f(\mathbf{p}) }[/math] (see Covariant derivative), [math]\displaystyle{ L_{\mathbf{v}} f(\mathbf{p}) }[/math] (see Lie derivative), or [math]\displaystyle{ {\mathbf{v}}_{\mathbf{p}}(f) }[/math] (see Tangent space § Definition via derivations), can be defined as follows. Let γ : [−1, 1] → M be a differentiable curve with γ(0) = p and γ′(0) = v. Then the directional derivative is defined by [math]\displaystyle{ \nabla_{\mathbf{v}} f(\mathbf{p}) = \left.\frac{d}{d\tau} f\circ\gamma(\tau)\right|_{\tau=0}. }[/math] This definition can be proven independent of the choice of γ, provided γ is selected in the prescribed manner so that γ(0) = p and γ′(0) = v.
The Lie derivative
The Lie derivative of a vector field [math]\displaystyle{ W^\mu(x) }[/math] along a vector field [math]\displaystyle{ V^\mu(x) }[/math] is given by the difference of two directional derivatives (with vanishing torsion): [math]\displaystyle{ \mathcal{L}_V W^\mu=(V\cdot\nabla) W^\mu-(W\cdot\nabla) V^\mu. }[/math] In particular, for a scalar field [math]\displaystyle{ \phi(x) }[/math], the Lie derivative reduces to the standard directional derivative: [math]\displaystyle{ \mathcal{L}_V \phi=(V\cdot\nabla) \phi. }[/math]
The Riemann tensor
Directional derivatives are often used in introductory derivations of the Riemann curvature tensor. Consider a curved rectangle with an infinitesimal vector [math]\displaystyle{ \delta }[/math] along one edge and [math]\displaystyle{ \delta' }[/math] along the other. We translate a covector [math]\displaystyle{ S }[/math] along [math]\displaystyle{ \delta }[/math] then [math]\displaystyle{ \delta' }[/math] and then subtract the translation along [math]\displaystyle{ \delta' }[/math] and then [math]\displaystyle{ \delta }[/math]. Instead of building the directional derivative using partial derivatives, we use the covariant derivative. The translation operator for [math]\displaystyle{ \delta }[/math] is thus [math]\displaystyle{ 1+\sum_\nu \delta^\nu D_\nu=1+\delta\cdot D, }[/math] and for [math]\displaystyle{ \delta' }[/math], [math]\displaystyle{ 1+\sum_\mu \delta'^\mu D_\mu=1+\delta'\cdot D. }[/math] The difference between the two paths is then [math]\displaystyle{ (1+\delta'\cdot D)(1+\delta\cdot D)S^\rho-(1+\delta\cdot D)(1+\delta'\cdot D)S^\rho=\sum_{\mu,\nu}\delta'^\mu \delta^\nu[D_\mu,D_\nu]S_\rho. }[/math] It can be argued[7] that the noncommutativity of the covariant derivatives measures the curvature of the manifold: [math]\displaystyle{ [D_\mu,D_\nu]S_\rho=\pm \sum_\sigma R^\sigma{}_{\rho\mu\nu}S_\sigma, }[/math] where [math]\displaystyle{ R }[/math] is the Riemann curvature tensor and the sign depends on the sign convention of the author.
In group theory
Translations
In the Poincaré algebra, we can define an infinitesimal translation operator P as [math]\displaystyle{ \mathbf{P}=i\nabla. }[/math] (the i ensures that P is a self-adjoint operator) For a finite displacement λ, the unitary Hilbert space representation for translations is[8] [math]\displaystyle{ U(\boldsymbol{\lambda})=\exp\left(-i\boldsymbol{\lambda}\cdot\mathbf{P}\right). }[/math] By using the above definition of the infinitesimal translation operator, we see that the finite translation operator is an exponentiated directional derivative: [math]\displaystyle{ U(\boldsymbol{\lambda})=\exp\left(\boldsymbol{\lambda}\cdot\nabla\right). }[/math] This is a translation operator in the sense that it acts on multivariable functions f(x) as [math]\displaystyle{ U(\boldsymbol{\lambda}) f(\mathbf{x})=\exp\left(\boldsymbol{\lambda}\cdot\nabla\right) f(\mathbf{x}) = f(\mathbf{x}+\boldsymbol{\lambda}). }[/math]
In standard single-variable calculus, the derivative of a smooth function f(x) is defined by (for small ε) [math]\displaystyle{ \frac{df}{dx} = \frac{f(x+\varepsilon) - f(x)}{\varepsilon}. }[/math] This can be rearranged to find f(x+ε): [math]\displaystyle{ f(x+\varepsilon)=f(x)+\varepsilon \,\frac{df}{dx}=\left(1+\varepsilon\,\frac{d}{dx}\right)f(x). }[/math] It follows that [math]\displaystyle{ [1+\varepsilon\,(d/dx)] }[/math] is a translation operator. This is instantly generalized[9] to multivariable functions f(x) [math]\displaystyle{ f(\mathbf{x}+\boldsymbol{\varepsilon}) = \left(1+\boldsymbol{\varepsilon}\cdot\nabla\right) f(\mathbf{x}). }[/math] Here [math]\displaystyle{ \boldsymbol{\varepsilon}\cdot\nabla }[/math] is the directional derivative along the infinitesimal displacement ε. We have found the infinitesimal version of the translation operator: [math]\displaystyle{ U(\boldsymbol{\varepsilon}) = 1 + \boldsymbol{\varepsilon}\cdot\nabla. }[/math] It is evident that the group multiplication law[10] U(g)U(f)=U(gf) takes the form [math]\displaystyle{ U(\mathbf{a})U(\mathbf{b})=U(\mathbf{a+b}). }[/math] So suppose that we take the finite displacement λ and divide it into N parts (N→∞ is implied everywhere), so that λ/N=ε. In other words, [math]\displaystyle{ \boldsymbol{\lambda} = N \boldsymbol{\varepsilon}. }[/math] Then by applying U(ε) N times, we can construct U(λ): [math]\displaystyle{ [U(\boldsymbol{\varepsilon})]^N = U(N\boldsymbol{\varepsilon}) = U(\boldsymbol{\lambda}). }[/math] We can now plug in our above expression for U(ε): [math]\displaystyle{ [U(\boldsymbol{\varepsilon})]^N = \left[1+\boldsymbol{\varepsilon}\cdot\nabla\right]^N = \left[1+\frac{\boldsymbol{\lambda}\cdot\nabla}{N}\right]^N. }[/math] Using the identity[11] [math]\displaystyle{ \exp(x)=\left[1+\frac{x}{N}\right]^N, }[/math] we have [math]\displaystyle{ U(\boldsymbol{\lambda})=\exp\left(\boldsymbol{\lambda}\cdot\nabla\right). }[/math] And since U(ε)f(x) = f(x+ε) we have [math]\displaystyle{ [U(\boldsymbol{\varepsilon})]^N f(\mathbf{x}) = f(\mathbf{x}+N\boldsymbol{\varepsilon}) = f(\mathbf{x}+\boldsymbol{\lambda}) = U(\boldsymbol{\lambda})f(\mathbf{x}) = \exp\left(\boldsymbol{\lambda}\cdot\nabla\right)f(\mathbf{x}), }[/math] Q.E.D.
As a technical note, this procedure is only possible because the translation group forms an Abelian subgroup (Cartan subalgebra) in the Poincaré algebra. In particular, the group multiplication law U(a)U(b) = U(a+b) should not be taken for granted. We also note that Poincaré is a connected Lie group. It is a group of transformations T(ξ) that are described by a continuous set of real parameters [math]\displaystyle{ \xi^a }[/math]. The group multiplication law takes the form [math]\displaystyle{ T(\bar{\xi})T(\xi) = T(f(\bar{\xi},\xi)). }[/math] Taking [math]\displaystyle{ \xi^a = 0 }[/math] as the coordinates of the identity, we must have [math]\displaystyle{ f^a(\xi,0)=f^a(0,\xi)=\xi^a. }[/math] The actual operators on the Hilbert space are represented by unitary operators U(T(ξ)). In the above notation we suppressed the T; we now write U(λ) as U(P(λ)). For a small neighborhood around the identity, the power series representation [math]\displaystyle{ U(T(\xi))=1+i\sum_a\xi^a t_a+\frac{1}{2}\sum_{b,c}\xi^b\xi^c t_{bc}+\cdots }[/math] is quite good. Suppose that U(T(ξ)) form a non-projective representation, i.e., [math]\displaystyle{ U(T(\bar{\xi}))U(T(\xi))=U(T(f(\bar{\xi},\xi))). }[/math] The expansion of f to second power is [math]\displaystyle{ f^a(\bar{\xi},\xi)=\xi^a+\bar{\xi}^a+\sum_{b,c}f^{abc}\bar{\xi}^b\xi^c. }[/math] After expanding the representation multiplication equation and equating coefficients, we have the nontrivial condition [math]\displaystyle{ t_{bc}=-t_b t_c-i\sum_a f^{abc}t_a. }[/math] Since [math]\displaystyle{ t_{ab} }[/math] is by definition symmetric in its indices, we have the standard Lie algebra commutator: [math]\displaystyle{ [t_b, t_c]=i\sum_a(-f^{abc}+f^{acb})t_a=i\sum_a C^{abc}t_a, }[/math] with C the structure constant. The generators for translations are partial derivative operators, which commute: [math]\displaystyle{ \left[\frac{\partial}{\partial x^b},\frac{\partial }{\partial x^c}\right]=0. }[/math] This implies that the structure constants vanish and thus the quadratic coefficients in the f expansion vanish as well. This means that f is simply additive: [math]\displaystyle{ f^a_\text{abelian}(\bar{\xi},\xi)=\xi^a+\bar{\xi}^a, }[/math] and thus for abelian groups, [math]\displaystyle{ U(T(\bar{\xi}))U(T(\xi))=U(T(\bar{\xi}+\xi)). }[/math] Q.E.D.
Rotations
The rotation operator also contains a directional derivative. The rotation operator for an angle θ, i.e. by an amount θ = |θ| about an axis parallel to [math]\displaystyle{ \hat{\theta} = \boldsymbol{\theta}/\theta }[/math] is [math]\displaystyle{ U(R(\mathbf{\theta}))=\exp(-i\mathbf{\theta}\cdot\mathbf{L}). }[/math] Here L is the vector operator that generates SO(3): [math]\displaystyle{ \mathbf{L}=\begin{pmatrix} 0& 0 & 0\\ 0& 0 & 1\\ 0& -1 & 0 \end{pmatrix}\mathbf{i}+\begin{pmatrix} 0 &0 & -1\\ 0& 0 &0 \\ 1 & 0 & 0 \end{pmatrix}\mathbf{j}+\begin{pmatrix} 0&1 &0 \\ -1&0 &0 \\ 0 & 0 & 0 \end{pmatrix}\mathbf{k}. }[/math] It may be shown geometrically that an infinitesimal right-handed rotation changes the position vector x by [math]\displaystyle{ \mathbf{x}\rightarrow \mathbf{x}-\delta\boldsymbol{\theta}\times\mathbf{x}. }[/math] So we would expect under infinitesimal rotation: [math]\displaystyle{ U(R(\delta\boldsymbol{\theta})) f(\mathbf{x}) = f(\mathbf{x}-\delta\boldsymbol{\theta}\times\mathbf{x})=f(\mathbf{x})-(\delta\boldsymbol{\theta}\times\mathbf{x})\cdot\nabla f. }[/math] It follows that [math]\displaystyle{ U(R(\delta\mathbf{\theta}))=1-(\delta\mathbf{\theta}\times\mathbf{x})\cdot\nabla. }[/math] Following the same exponentiation procedure as above, we arrive at the rotation operator in the position basis, which is an exponentiated directional derivative:[12] [math]\displaystyle{ U(R(\mathbf{\theta}))=\exp(-(\mathbf{\theta}\times\mathbf{x})\cdot\nabla). }[/math]
Normal derivative
A normal derivative is a directional derivative taken in the direction normal (that is, orthogonal) to some surface in space, or more generally along a normal vector field orthogonal to some hypersurface. See for example Neumann boundary condition. If the normal direction is denoted by [math]\displaystyle{ \mathbf{n} }[/math], then the normal derivative of a function f is sometimes denoted as [math]\displaystyle{ \frac{ \partial f}{\partial \mathbf{n}} }[/math]. In other notations, [math]\displaystyle{ \frac{ \partial f}{\partial \mathbf{n}} = \nabla f(\mathbf{x}) \cdot \mathbf{n} = \nabla_{\mathbf{n}}{f}(\mathbf{x}) = \frac{\partial f}{\partial \mathbf{x}} \cdot \mathbf{n} = Df(\mathbf{x})[\mathbf{n}]. }[/math]
In the continuum mechanics of solids
Several important results in continuum mechanics require the derivatives of vectors with respect to vectors and of tensors with respect to vectors and tensors.[13] The directional directive provides a systematic way of finding these derivatives.
See also
- Del in cylindrical and spherical coordinates – Mathematical gradient operator in certain coordinate systems
- Differential form – Expression that may be integrated over a region
- Ehresmann connection
- Fréchet derivative – Derivative defined on normed spaces
- Gateaux derivative – Generalization of the concept of directional derivative
- Generalizations of the derivative – Fundamental construction of differential calculus
- Semi-differentiability
- Hadamard derivative
- Lie derivative – A derivative in Differential Geometry
- Material derivative – Time rate of change of some physical quantity of a material element in a velocity field
- Structure tensor – Tensor related to gradients
- Physics:Tensor derivative (continuum mechanics)
- Total derivative – Type of derivative in mathematics
Notes
- ↑ R. Wrede; M.R. Spiegel (2010). Advanced Calculus (3rd ed.). Schaum's Outline Series. ISBN 978-0-07-162366-7.
- ↑ The applicability extends to functions over spaces without a metric and to differentiable manifolds, such as in general relativity.
- ↑ If the dot product is undefined, the gradient is also undefined; however, for differentiable f, the directional derivative is still defined, and a similar relation exists with the exterior derivative.
- ↑ Thomas, George B. Jr.; and Finney, Ross L. (1979) Calculus and Analytic Geometry, Addison-Wesley Publ. Co., fifth edition, p. 593.
- ↑ This typically assumes a Euclidean space – for example, a function of several variables typically has no definition of the magnitude of a vector, and hence of a unit vector.
- ↑ Hughes Hallett, Deborah; McCallum, William G.; Gleason, Andrew M. (2012-01-01). Calculus : Single and multivariable.. John wiley. pp. 780. ISBN 9780470888612. OCLC 828768012.
- ↑ Zee, A. (2013). Einstein gravity in a nutshell. Princeton: Princeton University Press. p. 341. ISBN 9780691145587.
- ↑ Weinberg, Steven (1999). The quantum theory of fields (Reprinted (with corr.). ed.). Cambridge [u.a.]: Cambridge Univ. Press. ISBN 9780521550017. https://archive.org/details/quantumtheoryoff00stev.
- ↑ Zee, A. (2013). Einstein gravity in a nutshell. Princeton: Princeton University Press. ISBN 9780691145587.
- ↑ Cahill, Kevin Cahill (2013). Physical mathematics (Repr. ed.). Cambridge: Cambridge University Press. ISBN 978-1107005211.
- ↑ Larson, Ron; Edwards, Bruce H. (2010). Calculus of a single variable (9th ed.). Belmont: Brooks/Cole. ISBN 9780547209982.
- ↑ Shankar, R. (1994). Principles of quantum mechanics (2nd ed.). New York: Kluwer Academic / Plenum. p. 318. ISBN 9780306447907.
- ↑ J. E. Marsden and T. J. R. Hughes, 2000, Mathematical Foundations of Elasticity, Dover.
References
- Hildebrand, F. B. (1976). Advanced Calculus for Applications. Prentice Hall. ISBN 0-13-011189-9.
- K.F. Riley; M.P. Hobson; S.J. Bence (2010). Mathematical methods for physics and engineering. Cambridge University Press. ISBN 978-0-521-86153-3. https://archive.org/details/mathematicalmeth00rile.
- Shapiro, A. (1990). "On concepts of directional differentiability". Journal of Optimization Theory and Applications 66 (3): 477–487. doi:10.1007/BF00940933.
External links
Original source: https://en.wikipedia.org/wiki/Directional derivative.
Read more |