Outer product

From HandWiki
Revision as of 19:36, 6 February 2024 by Wikisleeper (talk | contribs) (update)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Short description: Vector operation

In linear algebra, the outer product of two coordinate vectors is the matrix whose entries are all products of an element in the first vector with an element in the second vector. If the two coordinate vectors have dimensions n and m, then their outer product is an n × m matrix. More generally, given two tensors (multidimensional arrays of numbers), their outer product is a tensor. The outer product of tensors is also referred to as their tensor product, and can be used to define the tensor algebra.

The outer product contrasts with:

Definition

Given two vectors of size [math]\displaystyle{ m \times 1 }[/math] and [math]\displaystyle{ n \times 1 }[/math] respectively

[math]\displaystyle{ \mathbf{u} = \begin{bmatrix} u_1 \\ u_2 \\ \vdots \\ u_m \end{bmatrix}, \quad \mathbf{v} = \begin{bmatrix} v_1 \\ v_2 \\ \vdots \\ v_n \end{bmatrix} }[/math]

their outer product, denoted [math]\displaystyle{ \mathbf{u} \otimes \mathbf{v}, }[/math] is defined as the [math]\displaystyle{ m \times n }[/math] matrix [math]\displaystyle{ \mathbf{A} }[/math] obtained by multiplying each element of [math]\displaystyle{ \mathbf{u} }[/math] by each element of [math]\displaystyle{ \mathbf{v} }[/math]:[1]

[math]\displaystyle{ \mathbf{u} \otimes \mathbf{v} = \mathbf{A} = \begin{bmatrix} u_1v_1 & u_1v_2 & \dots & u_1v_n \\ u_2v_1 & u_2v_2 & \dots & u_2v_n \\ \vdots & \vdots & \ddots & \vdots \\ u_mv_1 & u_mv_2 & \dots & u_mv_n \end{bmatrix} }[/math]

Or, in index notation:

[math]\displaystyle{ (\mathbf{u} \otimes \mathbf{v})_{ij} = u_i v_j }[/math]

Denoting the dot product by [math]\displaystyle{ \,\cdot,\, }[/math] if given an [math]\displaystyle{ n \times 1 }[/math] vector [math]\displaystyle{ \mathbf{w}, }[/math] then [math]\displaystyle{ (\mathbf{u} \otimes \mathbf{v}) \mathbf{w} = (\mathbf{v} \cdot \mathbf{w}) \mathbf{u}. }[/math] If given a [math]\displaystyle{ 1 \times m }[/math] vector [math]\displaystyle{ \mathbf{x}, }[/math] then [math]\displaystyle{ \mathbf{x} (\mathbf{u} \otimes \mathbf{v}) = (\mathbf{x} \cdot \mathbf{u}) \mathbf{v}^{\operatorname{T}}. }[/math]

If [math]\displaystyle{ \mathbf{u} }[/math] and [math]\displaystyle{ \mathbf{v} }[/math] are vectors of the same dimension bigger than 1, then [math]\displaystyle{ \det (\mathbf{u} \otimes\mathbf{v}) = 0 }[/math].

The outer product [math]\displaystyle{ \mathbf{u} \otimes \mathbf{v} }[/math] is equivalent to a matrix multiplication [math]\displaystyle{ \mathbf{u} \mathbf{v}^{\operatorname{T}}, }[/math] provided that [math]\displaystyle{ \mathbf{u} }[/math] is represented as a [math]\displaystyle{ m \times 1 }[/math] column vector and [math]\displaystyle{ \mathbf{v} }[/math] as a [math]\displaystyle{ n \times 1 }[/math] column vector (which makes [math]\displaystyle{ \mathbf{v}^{\operatorname{T}} }[/math] a row vector).[2][3] For instance, if [math]\displaystyle{ m = 4 }[/math] and [math]\displaystyle{ n = 3, }[/math] then[4]

[math]\displaystyle{ \mathbf{u} \otimes \mathbf{v} = \mathbf{u}\mathbf{v}^\textsf{T} = \begin{bmatrix}u_1 \\ u_2 \\ u_3 \\ u_4\end{bmatrix} \begin{bmatrix}v_1 & v_2 & v_3\end{bmatrix} = \begin{bmatrix} u_1 v_1 & u_1 v_2 & u_1 v_3 \\ u_2 v_1 & u_2 v_2 & u_2 v_3 \\ u_3 v_1 & u_3 v_2 & u_3 v_3 \\ u_4 v_1 & u_4 v_2 & u_4 v_3 \end{bmatrix}. }[/math]

For complex vectors, it is often useful to take the conjugate transpose of [math]\displaystyle{ \mathbf{v}, }[/math] denoted [math]\displaystyle{ \mathbf{v}^\dagger }[/math] or [math]\displaystyle{ \left(\mathbf{v}^\textsf{T}\right)^* }[/math]:

[math]\displaystyle{ \mathbf{u} \otimes \mathbf{v} = \mathbf{u} \mathbf{v}^\dagger = \mathbf{u} \left(\mathbf{v}^\textsf{T}\right)^*. }[/math]

Contrast with Euclidean inner product

If [math]\displaystyle{ m = n, }[/math] then one can take the matrix product the other way, yielding a scalar (or [math]\displaystyle{ 1 \times 1 }[/math] matrix):

[math]\displaystyle{ \left\langle\mathbf{u}, \mathbf{v}\right\rangle = \mathbf{u}^\textsf{T} \mathbf{v} }[/math]

which is the standard inner product for Euclidean vector spaces,[3] better known as the dot product. The dot product is the trace of the outer product.[5] Unlike the dot product, the outer product is not commutative.

Multiplication of a vector [math]\displaystyle{ \mathbf{w} }[/math] by the matrix [math]\displaystyle{ \mathbf{u} \otimes \mathbf{v} }[/math] can be written in terms of the inner product, using the relation [math]\displaystyle{ \left(\mathbf{u} \otimes \mathbf{v}\right)\mathbf{w} = \mathbf{u}\left\langle\mathbf{v}, \mathbf{w}\right\rangle }[/math].

The outer product of tensors

Given two tensors [math]\displaystyle{ \mathbf{u}, \mathbf{v} }[/math] with dimensions [math]\displaystyle{ (k_1, k_2, \dots, k_m) }[/math] and [math]\displaystyle{ (l_1, l_2, \dots, l_n) }[/math], their outer product [math]\displaystyle{ \mathbf{u} \otimes \mathbf{v} }[/math] is a tensor with dimensions [math]\displaystyle{ (k_1, k_2, \dots, k_m, l_1, l_2, \dots, l_n) }[/math] and entries

[math]\displaystyle{ (\mathbf{u} \otimes \mathbf{v})_{i_1, i_2, \dots i_m, j_1, j_2, \dots, j_n} = u_{i_1, i_2, \dots, i_m} v_{j_1, j_2, \dots, j_n} }[/math]

For example, if [math]\displaystyle{ \mathbf{A} }[/math] is of order 3 with dimensions [math]\displaystyle{ (3, 5, 7) }[/math] and [math]\displaystyle{ \mathbf{B} }[/math] is of order 2 with dimensions [math]\displaystyle{ (10, 100), }[/math] then their outer product [math]\displaystyle{ \mathbf{C} }[/math] is of order 5 with dimensions [math]\displaystyle{ (3, 5, 7, 10, 100). }[/math] If [math]\displaystyle{ \mathbf{A} }[/math] has a component A[2, 2, 4] = 11 and [math]\displaystyle{ \mathbf{B} }[/math] has a component B[8, 88] = 13, then the component of [math]\displaystyle{ \mathbf{C} }[/math] formed by the outer product is C[2, 2, 4, 8, 88] = 143.

Connection with the Kronecker product

The outer product and Kronecker product are closely related; in fact the same symbol is commonly used to denote both operations.

If [math]\displaystyle{ \mathbf{u} = \begin{bmatrix}1 & 2 & 3\end{bmatrix}^\textsf{T} }[/math] and [math]\displaystyle{ \mathbf{v} = \begin{bmatrix}4 & 5\end{bmatrix}^\textsf{T} }[/math], we have:

[math]\displaystyle{ \begin{align} \mathbf{u} \otimes_\text{Kron} \mathbf{v} &= \begin{bmatrix} 4 \\ 5 \\ 8 \\ 10 \\ 12 \\ 15\end{bmatrix}, & \mathbf{u} \otimes_\text{outer} \mathbf{v} &= \begin{bmatrix} 4 & 5 \\ 8 & 10 \\ 12 & 15\end{bmatrix} \end{align} }[/math]

In the case of column vectors, the Kronecker product can be viewed as a form of vectorization (or flattening) of the outer product. In particular, for two column vectors [math]\displaystyle{ \mathbf{u} }[/math] and [math]\displaystyle{ \mathbf{v} }[/math], we can write:

[math]\displaystyle{ \mathbf{u} \otimes_{\text{Kron}} \mathbf{v} = \operatorname{vec}(\mathbf{v} \otimes_\text{outer} \mathbf{u}) }[/math]

(The order of the vectors is reversed on the right side of the equation.)

Another similar identity that further highlights the similarity between the operations is

[math]\displaystyle{ \mathbf{u} \otimes_{\text{Kron}} \mathbf{v}^\textsf{T} = \mathbf u \mathbf{v}^\textsf{T} = \mathbf{u} \otimes_{\text{outer}} \mathbf{v} }[/math]

where the order of vectors needs not be flipped. The middle expression uses matrix multiplication, where the vectors are considered as column/row matrices.

Connection with the matrix product

Given a pair of matrices [math]\displaystyle{ \mathbf{A} }[/math] of size [math]\displaystyle{ m\times p }[/math] and [math]\displaystyle{ \mathbf{B} }[/math] of size [math]\displaystyle{ p\times n }[/math], consider the matrix product [math]\displaystyle{ \mathbf{C} = \mathbf{A}\,\mathbf{B} }[/math] defined as usual as a matrix of size [math]\displaystyle{ m\times n }[/math].

Now let [math]\displaystyle{ \mathbf a^\text{col}_k }[/math] be the [math]\displaystyle{ k }[/math]-th column vector of [math]\displaystyle{ \mathbf A }[/math] and let [math]\displaystyle{ \mathbf b^\text{row}_k }[/math] be the [math]\displaystyle{ k }[/math]-th row vector of [math]\displaystyle{ \mathbf B }[/math]. Then [math]\displaystyle{ \mathbf{C} }[/math] can be expressed as a sum of column-by-row outer products:

[math]\displaystyle{ \mathbf{C} = \mathbf{A}\, \mathbf{B} = \left( \sum_{k=1}^p {A}_{ik}\, {B}_{kj} \right)_{ \begin{matrix} 1\le i \le m \\[-20pt] 1 \le j\le n \end{matrix} } = \begin{bmatrix} & & \\ \mathbf a^\text{col}_{1} & \cdots & \mathbf a^\text{col}_{p} \\ & & \end{bmatrix} \begin{bmatrix} & \mathbf b^\text{row}_{1} & \\ & \vdots & \\ & \mathbf b^\text{row}_{p} & \end{bmatrix} = \sum_{k=1}^p \mathbf a^\text{col}_k \otimes \mathbf b^\text{row}_k }[/math]

This expression has duality with the more common one as a matrix built with row-by-column inner product entries (or dot product): [math]\displaystyle{ C_{ij} = \langle{\mathbf a^\text{row}_i,\,\mathbf b_j^\text{col}}\rangle }[/math]

This relation is relevant[6] in the application of the Singular Value Decomposition (SVD) (and Spectral Decomposition as a special case). In particular, the decomposition can be interpreted as the sum of outer products of each left ([math]\displaystyle{ \mathbf{u}_k }[/math]) and right ([math]\displaystyle{ \mathbf{v}_k }[/math]) singular vectors, scaled by the corresponding nonzero singular value [math]\displaystyle{ \sigma_k }[/math]:

[math]\displaystyle{ \mathbf{A} = \mathbf{U \Sigma V^T} = \sum_{k=1}^{\operatorname{rank}(A)}(\mathbf{u}_k \otimes \mathbf{v}_k) \, \sigma_k }[/math]

This result implies that [math]\displaystyle{ \mathbf{A} }[/math] can be expressed as a sum of rank-1 matrices with spectral norm [math]\displaystyle{ \sigma_k }[/math] in decreasing order. This explains the fact why, in general, the last terms contribute less, which motivates the use of the truncated SVD as an approximation. The first term is the least squares fit of a matrix to an outer product of vectors.

Properties

The outer product of vectors satisfies the following properties:

[math]\displaystyle{ \begin{align} (\mathbf{u} \otimes \mathbf{v})^\textsf{T} &= (\mathbf{v} \otimes \mathbf{u}) \\ (\mathbf{v} + \mathbf{w}) \otimes \mathbf{u} &= \mathbf{v} \otimes \mathbf{u} + \mathbf{w} \otimes \mathbf{u} \\ \mathbf{u} \otimes (\mathbf{v} + \mathbf{w}) &= \mathbf{u} \otimes \mathbf{v} + \mathbf{u} \otimes \mathbf{w} \\ c (\mathbf{v} \otimes \mathbf{u}) &= (c\mathbf{v}) \otimes \mathbf{u} = \mathbf{v} \otimes (c\mathbf{u}) \end{align} }[/math]

The outer product of tensors satisfies the additional associativity property:

[math]\displaystyle{ (\mathbf{u} \otimes \mathbf{v}) \otimes \mathbf{w} = \mathbf{u} \otimes (\mathbf{v} \otimes \mathbf{w}) }[/math]

Rank of an outer product

If u and v are both nonzero, then the outer product matrix uvT always has matrix rank 1. Indeed, the columns of the outer product are all proportional to the first column. Thus they are all linearly dependent on that one column, hence the matrix is of rank one.

("Matrix rank" should not be confused with "tensor order", or "tensor degree", which is sometimes referred to as "rank".)

Definition (abstract)

Let V and W be two vector spaces. The outer product of [math]\displaystyle{ \mathbf v \in V }[/math] and [math]\displaystyle{ \mathbf w \in W }[/math] is the element [math]\displaystyle{ \mathbf v \otimes \mathbf w \in V \otimes W }[/math].

If V is an inner product space, then it is possible to define the outer product as a linear map VW. In this case, the linear map [math]\displaystyle{ \mathbf x \mapsto \langle \mathbf v, \mathbf x\rangle }[/math] is an element of the dual space of V, as this maps linearly a vector into its underlying field, of which [math]\displaystyle{ \langle \mathbf v, \mathbf x\rangle }[/math] is an element. The outer product VW is then given by

[math]\displaystyle{ (\mathbf w \otimes \mathbf v) (\mathbf x) = \left\langle \mathbf v, \mathbf x \right\rangle \mathbf w. }[/math]

This shows why a conjugate transpose of v is commonly taken in the complex case.

In programming languages

In some programming languages, given a two-argument function f (or a binary operator), the outer product, f, of two one-dimensional arrays, A and B, is a two-dimensional array C such that C[i, j] = f(A[i], B[j]). This is syntactically represented in various ways: in APL, as the infix binary operator ∘.f; in J, as the postfix adverb f/; in R, as the function outer(A, B, f) or the special %o%;[7] in Mathematica, as Outer[f, A, B]. In MATLAB, the function kron(A, B) is used for this product. These often generalize to multi-dimensional arguments, and more than two arguments.

In the Python library NumPy, the outer product can be computed with function np.outer().[8] In contrast, np.kron results in a flat array. The outer product of multidimensional arrays can be computed using np.multiply.outer.

Applications

As the outer product is closely related to the Kronecker product, some of the applications of the Kronecker product use outer products. These applications are found in quantum theory, signal processing, and image compression.[9]

Spinors

Suppose s, t, w, zC so that (s, t) and (w, z) are in C2. Then the outer product of these complex 2-vectors is an element of M(2, C), the 2 × 2 complex matrices:

[math]\displaystyle{ \begin{pmatrix} sw & tw \\ sz & tz \end{pmatrix}. }[/math]

The determinant of this matrix is swtzsztw = 0 because of the commutative property of C.

In the theory of spinors in three dimensions, these matrices are associated with isotropic vectors due to this null property. Élie Cartan described this construction in 1937,[10] but it was introduced by Wolfgang Pauli in 1927[11] so that M(2,C) has come to be called Pauli algebra.

Concepts

The block form of outer products is useful in classification. Concept analysis is a study that depends on certain outer products:

When a vector has only zeros and ones as entries, it is called a logical vector, a special case of a logical matrix. The logical operation and takes the place of multiplication. The outer product of two logical vectors (ui) and (vj) is given by the logical matrix [math]\displaystyle{ \left(a_{ij}\right) = \left(u_i \land v_j\right) }[/math]. This type of matrix is used in the study of binary relations, and is called a rectangular relation or a cross-vector.[12]

See also

Products

Duality

References

  1. Lerner, R. G.; Trigg, G. L. (1991). Encyclopaedia of Physics (2nd ed.). VHC. ISBN 0-89573-752-3. https://archive.org/details/encyclopediaofph00lern. 
  2. Lipschutz, S.; Lipson, M. (2009). Linear Algebra. Schaum’s Outlines (4th ed.). McGraw-Hill. ISBN 978-0-07-154352-1. 
  3. 3.0 3.1 Keller, Frank (February 23, 2020). "Algebraic Properties of Matrices; Transpose; Inner and Outer Product". https://www.inf.ed.ac.uk/teaching/courses/cfcs1/lectures/cfcs_l10.pdf. 
  4. James M. Ortega (1987) Matrix Theory: A Second Course, page 7, Plenum Press ISBN:0-306-42433-9
  5. Stengel, Robert F. (1994). Optimal Control and Estimation. New York: Dover Publications. p. 26. ISBN 0-486-68200-5. https://books.google.com/books?id=jDjPxqm7Lw0C&pg=PA26. 
  6. Trefethen, Lloyd N.; Bau III, David (1997). Numerical linear algebra. Philadelphia: Society for Industrial and Applied Mathematics. ISBN 978-0-89871-361-9. 
  7. "outer function | R Documentation". https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/outer. 
  8. "numpy.outer — NumPy v1.19 Manual". https://numpy.org/doc/stable/reference/generated/numpy.outer.html. 
  9. Steeb, Willi-Hans; Hardy, Yorick (2011). "Applications (Chapter 3)". Matrix Calculus and Kronecker Product: A Practical Approach to Linear and Multilinear Algebra (2 ed.). World Scientific. ISBN 978-981-4335-31-7. 
  10. Élie Cartan (1937) Lecons sur la theorie des spineurs, translated 1966: The Theory of Spinors, Hermann, Paris
  11. Pertti Lounesto (1997) Clifford Algebras and Spinors, page 51, Cambridge University Press ISBN:0-521-59916-4
  12. Ki Hang Kim (1982) Boolean Matrix Theory and Applications, page 37, Marcel Dekker ISBN:0-8247-1788-0

Further reading