This page is a blog article in progress, written by Tim van Beek. To see discussions of this article while it was being written, visit the Azimuth Forum. For the final polished article, go to the Azimuth Blog.
Water waves can do a lot of things that light waves cannot, like “breaking”:
In mathematical models this difference shows up through the kind of partial differential equation (PDE) that models the waves:
• light waves are modelled by linear equations while
• water waves are modelled by nonlinear equations.
Physicists like to point out that linear equations model things that do not interact, while nonlinear equations model things that interact with each other. In quantum field theory, people speak of “free fields” versus “interacting fields”.
Some nonlinear PDE that describe fluid flows turn out to also describe geodesics on infinite-dimensional Riemannian manifolds. This fascinating observation is due to the Russian mathematician Vladimir Arnold. In this blog post I would like to talk a little bit about the concepts involved and show you a little toy example.
The Euler viewpoint on fluids is that a fluid is made of tiny “packages” or “particles”. The fluid flow is described by specifying where each package or particle is at a given time . When we start at some time on a given manifold , the flow of every fluid package is described by a path on parametrized by time, and for every time there is a diffeomorphism defined by the requirement that it maps the initial position of each fluid package to its position at time :
This picture is taken from the book
• V.I. Arnold and B.A. Khesin, Topological Methods in Hydrodynamics, Springer, Berlin, 1998. (Review at Zentralblatt Mathematik.)
We will take as a model of the domain of the fluid flow a compact Riemannian manifold . A fluid flow, as pictured above, is then a path in the diffeomorphism group . In order to apply geometric concepts in this situation, we will have to turn or some closed subgroup of it into a manifold, which will be infinite dimensional.
The curvature of such a manifold can provide a great deal about the stability of fluid flows: On a manifold with negative curvature geodesics will diverge from each other. If we can model fluid flows as geodesics in a Riemannian manifold and calculate the curvature, we could try to infer a bound on weather forecasts (in fact, that is what Vladimir Arnold did!): The solution that you calculate is one geodesic. But if you take into account errors with determining your starting point (involving the measurement of the state of the flow at the given start time), what you are actually looking at is a bunch of geodesics starting in a neighborhood of your starting point. If they diverge fast, that means that measurement errors make your result unreliable fast.
If you never thought about manifolds in infinite dimensions, you may feel a little bit insecure as to how the concepts that you know from differential geometry can be generalized from finite dimensions. At least I felt this way when I first read about it. But it turns out that the part of the theory one needs to know in order to understand Arnold’s insight is not that scary, so I will talk a little bit about it next.
The basic strategy when handling finite-dimensional, smooth, real manifolds is that you have a complicated manifold , but also locally for every point a neighborhood and an isomorphism (a “chart”) of to an open subset of the vastly simpler space , the “model space”. These isomorphisms can be used to transport concepts from to . In infinite dimensions it is however not that clear what kind of model space should be taken in place of . What structure should have?
Since we would like to differentiate, we should for example be able to define the derivative of a curve in :
If we write down the usual formula for a derivative“
we see that to make sense of this we need to be able to add elements, have a scalar multiplication, and a topology such that the algebraic operations are continuous. Sets with this structure are called topological vector spaces.
A curve that has a first derivative, second derivative, third derivative… and so on at every point is called a smooth curve, just as in the finite dimensional case.
So should at least be a topological vector space. We can, of course, put more structure on to make it “more similar” to , and choose as model space in ascending order of generality:
1) A Hilbert space, which has an inner product,
2) a Banach space that does not have a inner product, but a norm,
3) a Fréchet space that does not have a norm, but a metric,
4) a general topological vector space that need not be metrizable.
People talk accordingly of Hilbert, Banach and Fréchet manifolds. Since the space of smooth maps of , for example, is not a Banach space but a Fréchet space, we should not expect that we can model diffeomorphism groups on Banach spaces, but on Fréchet spaces. So we will use the concept of Fréchet manifolds.
But if you are interested in a more general theory using locally convex topological vector spaces as model spaces, you can look it up here:
• Andreas Kriegl and Peter W. Michor, The Convenient Setting of Global Analysis, American Mathematical Society, Providence Rhode Island, 1999.
Note that Kriegl and Michor use a different definition of “smooth function of Fréchet spaces” than we will below.
If you learn functional analysis, you will most likely start with operators on Hilbert spaces. One could say that the theory of topological vector spaces is about abstracting away as much structure from a Hilbert space and look what structure you need for important theorems to still hold true, like the open mapping/closed graph theorem. If you would like to learn more about this, my favorite book is this one:
• Francois Treves, Topological Vector Spaces, Distributions and Kernels, Dover Publications, 2006.
Since we replace the model space with a Fréchet space , there will be certain things that won’t work out as easily as for the finite dimensional , or not at all.
It is nevertheless possible to define both integrals and differentials that behave much in the expected way. You can find a nice exposition of how this can be done in this paper:
• Richard S. Hamilton, The inverse function theorem of Nash and Moser, Bulletin of the American Mathematical Society 7 (1982), pages 65-222.
The story starts with the definition of the directional derivative that can be done just as in finite dimensions:
Let and be Fréchet spaces, open and a continuous map. The derivative of at the point in the direction is the map
A simple, but nontrivial example is the operator
with the derivative
It is possible to define higher derivatives and also prove that the chain rule holds, so that we can define that a function between Fréchet spaces is smooth if it has derivatives at every point of all orders. The definition of a smooth Fréchet manifold is then straightforward: you can copy the usual definition of a smooth manifold word for word, replacing by some Fréchet space.
With tangent vectors, you may remember that there are several different ways to define them in the finite dimensional case, which turn out to be equivalent. Since there are situations in infinite dimensions where these definitions turn out to not be equivalent, I will be explicit and define tangent vectors in the “kinematic way”:
The (kinematic) tangent vector space of a Fréchet manifold at a point consists of all pairs where is a smooth curve
With this definition, the set of pairs forms a Fréchet manifold, the tangent bundle , just as in finite dimensions.
The first serious (more or less) problem we encounter is the definition of the cotangent bundle: is isomorphic to its dual vector space. This is still true for every Hilbert space (this is known as the Riesz representation theorem). It fails already for Banach spaces: The dual space will still be a Banach space, but a Banach space does not need to be isomorphic to its dual, or even the dual of its dual (though the latter situation happens quite often, and such Banach spaces are called reflexive).
With Fréchet spaces things are even a little bit worse, because the dual of a Fréchet space (which is not a Banach space) is not even a Fréchet space! Since I did not know that and could not find a reference, I asked about this on mathoverflow here and promptly got an answer. Mathoverflow is a really amazing platform for this kind of question!
So, if we naively define the cotangent space as in finite dimensions by taking the dual space of every tangent space, then the cotangent bundle won’t be a Fréchet manifold.
We will therefore have to be careful with the definition of differential forms for Fréchet manifolds and define it explicitly:
A differential form (a one form) is a smooth map
where is the tangent bundle, such that restricts to a linear map on every tangent space .
Another pitfall is that theorems from multivariable calculus may fail in Fréchet spaces, like the existence and uniqueness theorem of Picard-Lindelöf for ordinary differential equations. Things are much easier in Banach spaces: If you take a closer look at multivariable calculus, you will notice that a lot of definitions and theorems actually make use of the Banach space structure of only, so that a lot generalizes straight forward to infinite dimensional Banach spaces. But that is less so for Fréchet spaces.
By now you should feel reasonably comfortable with the notion of a Fréchet manifold, so let us talk about the kind of gadget that Arnold used to describe fluid flows: diffeomorphism groups that are both infinite-dimensional Riemannian manifolds and Lie groups.
If is both a Riemannian manifold and a Lie group, it is possible to define the concept of left or right invariant metric. A left or right invariant metric on is one that does not change if we multiply the arguments with a group element:
A metric is left invariant iff for all :
Similarly, is right invariant iff:
How does one get a one-sided invariant metric?
Here is one possibility: If you take a Lie group off the shelf, you get two automorphisms for free, namely the left and right multiplication by a group element :
Pictorially speaking, you can use the differentials of these to transport vectors from the Lie algebra of - which is the tangential space at the group identity - to any other tangent space . If you can define a scalar product on the Lie algebra, you can use this trick to transport the scalar product to all the other tangential spaces by left or right multiplication, which will get you a left or right invariant metric.
To be more precise, for every tangent vectors of a tangent space there are unique vectors that are mapped to by the differential of the right multiplication , that is
and we can define the scalar product of and to have the value of that of and :
This works for the left multiplication , too, of course.
For a one-sided invariant metric, the geodesic equation looks somewhat simpler than for general metrics. Let us take a look at that:
On a Riemannian manifold with tangential bundle there is a unique connection, the Levi-Civita connection, with the following properties for vector fields :
If we combine both formulas we get
If the scalar products are constant along every flow, i.e. the metric is (left or right) invariant, then the first three terms on the right hand side vanish, so that we get
This latter formula can be written in a more succinct way if we introduce the coadjoint operator. Remeber the adjoint operator defined to be
With the help of the scalar product we can define the adjoint of the adjoint operator:
Beware! We’re using the word ‘adjoint’ in two completely different ways here, both of which are very common in math. One way is to use ‘adjoint’ for the operation of taking a Lie bracket: . Another is to use ‘adjoint’ for the linear map coming from a linear map by . Please don’t blame me for this terminology.
Then the formula above for the covariant derivative can be written as
Since the inner product is nondegenerate, we can eliminate and get
A geodesic curve is one whose tangent vector is transported parallel to itself, that is we have
Using the formula for the covariant derivative for an invariant metric above we get
as a reformulation of the geodesic equation.
For time dependent dynamical systems, we have the time axis as an additional dimension and every vector field has as an additional summand. So, in this case we get as geodesic equation (again: for an invariant metric)
As a simple example we will look at the circle and its diffeomorphism group . The Lie algebra of can be identified with the space of all vector fields on . If we sloppily identify with with coordinate , then we can write for vector fields and the commutator
where is short for the derivative:
And of course we have a scalar product via
which we can use to define either a left or a right invariant metric on , by transporting it via left or right multiplication to every tangent space.
Let us evaluate the geodesic equation for this example. We have to calculate the effect of the coadjoint operator:
If we write for the vector fields , and , this results in
where the last step employs integration by parts and uses the periodic boundary condition for the involved functions.
So we get for the coadjoint operator
Finally, the geodesic equation
turns out to be
A similar equation,
is known as Hopf or inviscid Burgers’ equation. It looks simple, but its solutions can produce behaviour that looks like turbulence, so it is interesting in its own right.
If we take a somewhat more sophisticated diffeomorphism group, we can get slightly more complicated and therefore more interesting partial differential equations like the Korteweg-de Vries equation. But since this post is quite long already, that topic will have to wait for another post.