Pushforward and Pullback - Everything2.com

The pushforward and pullback maps in mathematics are natural ways of mapping objects like vectors and one-forms from one differential manifold to another, given a map Φ: M₁ → M₂ between points on the two manifolds. The concepts are very abstract, but they can be concretely represented in computational form when we look at specific coordinate systems on the manifolds.

The Pullback of a Function

We start with two manifolds M₁ and M₂, of dimension n₁ and n₂, respectively. Additionally, we have a smooth map Φ: M₁ → M₂. This map need only be well-defined and smooth; it does not have to be a one-to-one map, nor must it map onto the entire manifold M₂. Now, let's look at the space of smooth functions f(q) on points {q} in M₂. Is there a natural way to map these to the set of functions g(p) on points {p} in M₁? That is, given a function f(q), can we use the map Φ to naturally produce a function g(p) defined on M₁?

The answer is much simpler than the question. Simply note that the function f(Φ(p)) is a function well-defined on M₁. In other words, we can use the original map Φ from the points in M₁ to the points in M₂ to produce a new map which we call Φ^∗, the pullback map from functions on M₂ to functions on M₁, by the formula:

Φ^∗ [ f ] (p) = f(Φ(p))

This is clearly well-defined; given any smooth function f(q) on M₂, we can always use this formula to produce a function g(p) = f(Φ(p)) on M₁. The pullback map is not generally one-to-one, nor does it always map onto the entire space of smooth functions on M₁. In other words, we cannot generally "push forward" functions. In the special case that Φ is invertible, so is Φ^∗; we can push functions forward simply by using the pullback of the inverse map.

To summarize, the pullback of a function defined on M₂ is simply the function's representation on points in M₁ which get mapped to points in M₂ by the map Φ.

The Pushforward of a Vector

Now we abstract the concept further by considering the space of vectors at a given point p in the manifold M₁, i.e. the tangent space T_pM. We know that the space of vectors at this point can be considered the space of directional derivatives on smooth functions, evaluated at p. In other words, vectors are smooth maps acting on functions on M₁. The space of smooth functions on M₁ is itself a manifold, as is the space of smooth functions on M₂. Let's call these manifolds F₁ and F₂. We have already constructed a map between these spaces; this is just the pullback map:

Φ^∗: F₂ → F₁

Since vectors are themselves linear maps on F₁, we should be able to find a natural map to vectors on F₂, by pulling back again! We have to check that this map indeed produces a directional derivative, and not some general map, but for now accept that it will.

What does this map look like? Given a vector V in T_pM₁, this is a directional derivative map on functions on M₁, given by the formula:

V [ g(p) ] = Vⁱ∂g/∂xⁱ|_p,

where {x_i} is a local coordinate system in the vicinity of p on M₁. Now use the pullback map on functions to get a vector acting on functions in M₂. This is what we will call the pushforward map, Φ_∗:

Φ_∗ V [ f(q) ] = V [ Φ^∗f ] = V [ f(Φ(p)) ]

Let's get our head straight about things. V is a map on functions on M₁. f(q) is a function on M₂. Φ_∗ V is a vector field defined on M₂, which acts on functions f(q). Φ^∗f is a function on points in M₁, which is acted on by V.

Computation

We can just use the chain rule to evaluate this:

Φ_∗ V [ f(q) ] = V [ f(Φ(p)) ] = Vⁱ ∂f/∂xⁱ = Vⁱ (∂Φ^k/∂xⁱ) ∂f/∂Φ^k = (∂y^k/∂xⁱ) ∂f/∂y^k

Here, we're also using a local coordinate chart on M₂, {y^k}. We have simplified the notation by expressing Φ(p) in both of the coordinate representations as y^k(xⁱ).

We now have a natural map from T_pM₁ to T_Φ(p)M₂. We can think of this map as a matrix A_ik = ∂y^k/∂xⁱ acting on the vector V. A_ik is an n₂ × n₁ matrix, mapping the n₁ components of V to the n₂ components of Φ_∗V. Note that A_ik is coordinate-dependent. Notice also that we can push vectors forward, but we cannot pull them back, unless Φ has an inverse map. In other words, A_ik is not generally an invertible matrix (It might not even have the same number of rows as columns).

Another way to see how the pushforward map acts on vectors is to look at another natural manifestation of the tangent vector space T_pM₁: velocities of curves passing through p. Since Φ maps points in M₁ to points in M₂, it naturally maps curves in M₁ to curves in M₂. It is easy to show that this is the same pushforward map that we defined on directional derivatives. The velocity of the mapped curve is given by the same chain rule as above. This is a nice way of picturing the pushforward map on vectors, as it requires no computation to properly visualize.

The Pullback of a One-Form

Just when you thought we couldn't take the definitions any deeper, we are about to define yet another pullback map. We know that one-forms are linear maps on vector fields, and therefore we can define the pullback of a one-form to be the one-form's action on pushed-forward vector fields:

Φ^∗ω[ V ] = ω[ Φ_∗V ]

Again, let's make sure we know what we're looking at. ω is a one-form in M₂; that is, a linear map on vectors in M₂. V is a vector in M₁. ω cannot act directly on V, because they live on different manifolds. Φ^∗ω a one-form in M₁, which acts on vectors V in M₁. This action of Φ^∗ω on V in M₁ is dictated by ω's action on the vector pushed forward to M₂. That is exactly what we've written down.

Computation

Φ^∗ω[ V ] = ω[ A_ik Vⁱ ∂/∂x^k ]

= A_ik Vⁱ ω[ ∂/∂x^k ]

= Vⁱ A_ik ω_k

= (A^T ω)_i Vⁱ

Thus, the pullback of a one-form is given by the action of the transposed matrix A^T acting on its components. We might have expected this, especially if we note that A is an n₂ × n₁ matrix, while A^T is an n₁ × n₂ matrix, which is exactly what we'd need to send the n₂ components of ω to an n₁-component one-form defined on M₁.

In general, we can pull back any (0,k) tensor, and push forward any (k,0) tensor, the generalization being to simply multiply by additional copies of the matrix A or A^T. However, we cannot do the reverse, nor can we push or pull any (l,m) tensor, for nonzero l and m, unless Φ is invertible. In the case where Φ is invertible, it is completely possible to push or pull tensors of any rank, essentially by multiplying by inverse matrices, where applicable. Obviously there are some details to be filled in, but you get the idea.

Examples

A readily available example shows up in the coordinate charts on a given manifold, M. Notice that any coordinate chart (usually also labeled Φ, which might normally confuse us, but in this case they are referring to the same function) is an invertible map between an open set in M and an open set in Rⁿ. Therefore, we can pull functions on M back to functions on Rⁿ. This is how we define things like continuity and differentiability, and in the case of complex manifolds, analyticity. We pull functions back to Rⁿ, and evaluate these properties in a more concrete setting.

We can also push vectors from Rⁿ to T_pM, using a coordinate chart in a neighborhood of p. This is essentially what we are doing when we determine the components of a vector V, for a given coordinate system, given by Φ. Different coordinate charts will generally give us different push-forward maps, which give us different components for V.

The Special Case M₁ = M₂

Set M₁ = M₂ = M. In other words, look at bijective maps Φ: M → M from a manifold onto itself. In particular, look at a smoothly varying family of such maps, {Φ_pq}, where Φ_pq is a smooth, bijective map from M onto itself, which maps p to q. An example of this is the family of rotations on the manifold of S¹, the circle. For any two points p and q on the circle, there is a unique rotation Φ_pq which sends p to q (rotations by θ and θ + 2π are considered to be indistinct).

We can define Φ-invariant vector fields on M to be vector fields which are invariant under the pushforward map Φ_pq∗: T_pM → T_qM. Thus, V(q) = Φ_pq∗ V(p). The set of Φ-invariant vector fields forms a vector space, since any two invariant vector fields can be added together to find another invariant vector field. It is easily seen that this vector space is equivalent to T_pM, for any point p in the manifold, for we can push any tangent vector forward from p to every point q in the manifold, using Φ_pq∗, producing a manifestly invariant vector field: V(q) = Φ_pq∗ V_p. Since we can do this for any vector V at p, and since Φ_pq is bijective (by assumption here), we can output a unique invariant vector field V(q) given any vector V_p at p. Going back the other direction is even easier. Given an invariant vector field, we can get a unique vector V_p ∈ T_pM, simply by evaluating the vector field at p: V_p = V(p). Notice also that these two maps are linear.

Let's make this more concrete with the example of rotations on the circle. Given a point p on the circle, and a vector V_p at p, we should be able to produce a Φ-invariant vector field for all {q} on the circle. In this simple case, the vector space is one-dimensional, so V_p = V ∂/∂θ|_p, and the vector field is just constant: V(q) = V ∂/∂θ|_q. In higher-dimensional examples, the Φ-invariant vector fields are less trivial.

We have now shown that there is a natural linear correspondence between T_pM and the Φ-invariant fields on the manifold M. This is useful because it provides yet another manifestation of the tangent space, T_pM, whenever the manifold M possesses a smooth family of maps Φ_pq, for all q ∈ M. This picture may seem circular, since it requires an understanding of vector fields, but it can be a powerful manifestation nonetheless, as it provides a connection between the local and the global.

The notation for pullback should be an upper-asterisk, and a lower-asterisk for pushforward. This will probably not resolve properly on most computers; it will most likely just look like a dot to you.

abstract nonsense	lie group	tangent space	fiber bundle
Pullback	Tadashi Nakashima	directional derivative	sonic boom
manifold	mathematics