Directional derivative

From HandWiki
Short description: Instantaneous rate of change of the function

A directional derivative is a concept in multivariable calculus that measures the rate at which a function changes in a particular direction at a given point.[citation needed]

The directional derivative of a multivariable differentiable (scalar) function along a given vector v at a given point x intuitively represents the instantaneous rate of change of the function, moving through x with a velocity specified by v.

The directional derivative of a scalar function f with respect to a vector v at a point (e.g., position) x may be denoted by any of the following: 𝐯f(𝐱)=f'𝐯(𝐱)=D𝐯f(𝐱)=Df(𝐱)(𝐯)=𝐯f(𝐱)=𝐯f(𝐱)=𝐯f(𝐱)𝐱.

It therefore generalizes the notion of a partial derivative, in which the rate of change is taken along one of the curvilinear coordinate curves, all other coordinates being constant. The directional derivative is a special case of the Gateaux derivative.

Definition

A contour plot of f(x,y)=x2+y2, showing the gradient vector in black, and the unit vector 𝐮 scaled by the directional derivative in the direction of 𝐮 in orange. The gradient vector is longer because the gradient points in the direction of greatest rate of increase of a function.

The directional derivative of a scalar function f(𝐱)=f(x1,x2,,xn) along a vector 𝐯=(v1,,vn) is the function 𝐯f defined by the limit[1] 𝐯f(𝐱)=limh0f(𝐱+h𝐯)f(𝐱)h.

This definition is valid in a broad range of contexts, for example where the norm of a vector (and hence a unit vector) is undefined.[2]

For differentiable functions

If the function f is differentiable at x, then the directional derivative exists along any unit vector v at x, and one has

𝐯f(𝐱)=f(𝐱)𝐯

where the on the right denotes the gradient, is the dot product and v is a unit vector.[3] This follows from defining a path h(t)=x+tv and using the definition of the derivative as a limit which can be calculated along this path to get: 0=limt0f(x+tv)f(x)tDf(x)(v)t=limt0f(x+tv)f(x)tDf(x)(v)=vf(x)Df(x)(v).

Intuitively, the directional derivative of f at a point x represents the rate of change of f, in the direction of v with respect to time, when moving past x.

Using only direction of vector

thumb|The angle α between the tangent A and the horizontal will be maximum if the cutting plane contains the direction of the gradient A. In a Euclidean space, some authors[4] define the directional derivative to be with respect to an arbitrary nonzero vector v after normalization, thus being independent of its magnitude and depending only on its direction.[5]

This definition gives the rate of increase of f per unit of distance moved in the direction given by v. In this case, one has 𝐯f(𝐱)=limh0f(𝐱+h𝐯)f(𝐱)h|𝐯|, or in case f is differentiable at x, 𝐯f(𝐱)=f(𝐱)𝐯|𝐯|.

Restriction to a unit vector

In the context of a function on a Euclidean space, some texts restrict the vector v to being a unit vector. With this restriction, both the above definitions are equivalent.[6]

Properties

Many of the familiar properties of the ordinary derivative hold for the directional derivative. These include, for any functions f and g defined in a neighborhood of, and differentiable at, p:

  1. sum rule: 𝐯(f+g)=𝐯f+𝐯g.
  2. constant factor rule: For any constant c, 𝐯(cf)=c𝐯f.
  3. product rule (or Leibniz's rule): 𝐯(fg)=g𝐯f+f𝐯g.
  4. chain rule: If g is differentiable at p and h is differentiable at g(p), then 𝐯(hg)(𝐩)=h(g(𝐩))𝐯g(𝐩).

In differential geometry

Let M be a differentiable manifold and p a point of M. Suppose that f is a function defined in a neighborhood of p, and differentiable at p. If v is a tangent vector to M at p, then the directional derivative of f along v, denoted variously as df(v) (see Exterior derivative), 𝐯f(𝐩) (see Covariant derivative), L𝐯f(𝐩) (see Lie derivative), or 𝐯𝐩(f) (see Tangent space § Definition via derivations), can be defined as follows. Let γ : [−1, 1] → M be a differentiable curve with γ(0) = p and γ′(0) = v. Then the directional derivative is defined by 𝐯f(𝐩)=ddτfγ(τ)|τ=0. This definition can be proven independent of the choice of γ, provided γ is selected in the prescribed manner so that γ(0) = p and γ′(0) = v.

The Lie derivative

The Lie derivative of a vector field Wμ(x) along a vector field Vμ(x) is given by the difference of two directional derivatives (with vanishing torsion): VWμ=(V)Wμ(W)Vμ. In particular, for a scalar field ϕ(x), the Lie derivative reduces to the standard directional derivative: Vϕ=(V)ϕ.

The Riemann tensor

Directional derivatives are often used in introductory derivations of the Riemann curvature tensor. Consider a curved rectangle with an infinitesimal vector δ along one edge and δ along the other. We translate a covector S along δ then δ and then subtract the translation along δ and then δ. Instead of building the directional derivative using partial derivatives, we use the covariant derivative. The translation operator for δ is thus 1+νδνDν=1+δD, and for δ, 1+μδ'μDμ=1+δD. The difference between the two paths is then (1+δD)(1+δD)Sρ(1+δD)(1+δD)Sρ=μ,νδ'μδν[Dμ,Dν]Sρ. It can be argued[7] that the noncommutativity of the covariant derivatives measures the curvature of the manifold: [Dμ,Dν]Sρ=±σRσρμνSσ, where R is the Riemann curvature tensor and the sign depends on the sign convention of the author.

In group theory

Translations

In the Poincaré algebra, we can define an infinitesimal translation operator P as 𝐏=i. (the i ensures that P is a self-adjoint operator) For a finite displacement λ, the unitary Hilbert space representation for translations is[8] U(λ)=exp(iλ𝐏). By using the above definition of the infinitesimal translation operator, we see that the finite translation operator is an exponentiated directional derivative: U(λ)=exp(λ). This is a translation operator in the sense that it acts on multivariable functions f(x) as U(λ)f(𝐱)=exp(λ)f(𝐱)=f(𝐱+λ).

Rotations

The rotation operator also contains a directional derivative. The rotation operator for an angle θ, i.e. by an amount θ = |θ| about an axis parallel to θ^=θ/θ is U(R(θ))=exp(iθ𝐋). Here L is the vector operator that generates SO(3): 𝐋=(000001010)𝐢+(001000100)𝐣+(010100000)𝐤. It may be shown geometrically that an infinitesimal right-handed rotation changes the position vector x by 𝐱𝐱δθ×𝐱. So we would expect under infinitesimal rotation: U(R(δθ))f(𝐱)=f(𝐱δθ×𝐱)=f(𝐱)(δθ×𝐱)f. It follows that U(R(δθ))=1(δθ×𝐱). Following the same exponentiation procedure as above, we arrive at the rotation operator in the position basis, which is an exponentiated directional derivative:[12] U(R(θ))=exp((θ×𝐱)).

Normal derivative

A normal derivative is a directional derivative taken in the direction normal (that is, orthogonal) to some surface in space, or more generally along a normal vector field orthogonal to some hypersurface. See for example Neumann boundary condition. If the normal direction is denoted by 𝐧, then the normal derivative of a function f is sometimes denoted as f𝐧. In other notations, f𝐧=f(𝐱)𝐧=𝐧f(𝐱)=f𝐱𝐧=Df(𝐱)[𝐧].

In the continuum mechanics of solids

Several important results in continuum mechanics require the derivatives of vectors with respect to vectors and of tensors with respect to vectors and tensors.[13] The directional directive provides a systematic way of finding these derivatives.

See also


Notes

  1. R. Wrede; M.R. Spiegel (2010). Advanced Calculus (3rd ed.). Schaum's Outline Series. ISBN 978-0-07-162366-7. 
  2. The applicability extends to functions over spaces without a metric and to differentiable manifolds, such as in general relativity.
  3. If the dot product is undefined, the gradient is also undefined; however, for differentiable f, the directional derivative is still defined, and a similar relation exists with the exterior derivative.
  4. Thomas, George B. Jr.; and Finney, Ross L. (1979) Calculus and Analytic Geometry, Addison-Wesley Publ. Co., fifth edition, p. 593.
  5. This typically assumes a Euclidean space – for example, a function of several variables typically has no definition of the magnitude of a vector, and hence of a unit vector.
  6. Hughes Hallett, Deborah; McCallum, William G.; Gleason, Andrew M. (2012-01-01). Calculus : Single and multivariable.. John wiley. pp. 780. ISBN 9780470888612. OCLC 828768012. 
  7. Zee, A. (2013). Einstein gravity in a nutshell. Princeton: Princeton University Press. p. 341. ISBN 9780691145587. 
  8. Weinberg, Steven (1999). The quantum theory of fields (Reprinted (with corr.). ed.). Cambridge [u.a.]: Cambridge Univ. Press. ISBN 9780521550017. https://archive.org/details/quantumtheoryoff00stev. 
  9. Zee, A. (2013). Einstein gravity in a nutshell. Princeton: Princeton University Press. ISBN 9780691145587. 
  10. Cahill, Kevin Cahill (2013). Physical mathematics (Repr. ed.). Cambridge: Cambridge University Press. ISBN 978-1107005211. 
  11. Larson, Ron; Edwards, Bruce H. (2010). Calculus of a single variable (9th ed.). Belmont: Brooks/Cole. ISBN 9780547209982. 
  12. Shankar, R. (1994). Principles of quantum mechanics (2nd ed.). New York: Kluwer Academic / Plenum. p. 318. ISBN 9780306447907. 
  13. J. E. Marsden and T. J. R. Hughes, 2000, Mathematical Foundations of Elasticity, Dover.

References