Showing changes from revision #8 to #9:
Added | Removed | Changed
This page is a blog article in progress, written by Blake Stacey. To discuss as it is being written, go to the Azimuth Forum.
Moment closures are a way of forgetting information about a system in a controlled fashion, in the hope that an incomplete, fairly heavily “coarse-grained” picture of the system will still be useful in figuring out what will happen to it. Sometimes, this is a justifiable hope, but in other cases, we are right to wonder whether all the algebra it generates actually leads us to any insights. Here, we’ll be concerned with a particular application of this technology: studying the vulnerability of an ecosystem to invasion. We shall find expressions for invasion fitness, the expected relative growth rate of an initially-rare species or variety.
Consider a lattice, each site of which can occupied by an individual of “resident” type (), occupied by a mutant (), or empty (). The difference between the two is encoded in the choice of transition rules representing death, birth and migration. We can get an aggregate measure of the situation by finding the probability that a randomly chosen site will be in state , where can take values in the set . A finer degree of distinction is provided by the conditional probabilities , where, for example, denotes the probability that a randomly chosen neighbor site to a randomly chosen mutant is of resident type. Note that if a mutant is injected into a native resident population and its offspring form a geographical cluster, can be much larger than : few individuals are mutants overall, but the probability of a mutant life-form interacting with another mutant is high.
The pair dynamics of the system involves the time evolution of the probabilities , that is, the probability that a randomly selected lattice edge will have on one end and on the other. The differential equation for , for example, will have terms reflecting the processes which can form and destroy pairs: is one possibility, and is another. Death, which comes for organisms and leaves empty spaces behind, introduces processes like , and . Reproduction can lead to formerly empty spaces becoming occupied: and . In language more like we’ve used in the Azimuth discussions of stochastic Petri net theory, we’ve moved beyond just creating and annihilating residents and mutants, and now we’re dynamically changing the number of “resident–resident” and “resident–mutant” pairs. So, we probably could make a Petri net diagram for these processes, but the labels on the boxes might start looking a little funny.
Each term in our differential equations will have a transition rate dependent upon a conditional probability of the form , denoting the probability that a of a pair will have a neighbor of type . The differential equations for the pair probabilities thus depend on triplet probabilities , which depend upon quadruplet probabilities and so forth. To make progress, we truncate this hierarchy in a manner reminiscent to cutting off the BBGKY hierarchy in statistical mechanics: we impose the pair approximation that
This approximation destroys information about spatial structure and thereby introduces bias which in an ideal world ought to be accounted for.
(It just occurred to me that if a Petri net specifies a symmetric monoidal category, then successive truncations of the moment-dynamics hierarchy yield mappings between categories. Going from a pair approximation to a mean-field approximation, for example, transforms a Petri net whose circles are labelled with pair states to one labelled by site states. Category theory might be able to say something interesting here. Anything which can tame the horrible spew of equations which arises in these problems would be great to have.)
Invasion fitness is judged in the following manner. We start with a lattice devoid of mutants () and find the equilibrium densities and by setting
The exact form of and will depend upon interaction details which we won’t worry about just yet. We then inject a mutant strain into this situation; as the mutants are initially rare, we can say they do not affect the large-scale dynamics of the resident population. Summarizing the pair probabilities with the shorthand , we write the differential equation in matrix form
where the matrix encapsulates the details of our chosen dynamics. The pair approximation, in which we discard correlations of third and higher order, lets us simplify this to
When people started doing simulations of lattice models like these, they found that the conditional probabilities equilibrate. That is to say, even if the global density of mutants changes, the local statistical structure of a mutant cluster remains constant. This is the key statement which allows us to linearize the dynamics and write the behavior of in terms of eigenvectors and eigenvalues:
The dominant eigenvalue of is the “invasion exponent” which characterizes whether an invasion will fail () or succeed (). The eigenvector associated with describes the vehicle of selection for the mutants’ particular genetic variation, by summarizing the structure of their geographical cluster.
The extent to which pair-dynamics models are satisfactory depends on the goal of the modeler. As we have seen, these models do not capture all of the phenomena that can be observed in simulations of fully spatial probabilistic cellular automata. Basically, the approximation fails whenever spatial structures arise that are difficult to “describe” using pairs alone. More technically, the method fails whenever significant higher-order correlations arise – that is, whenever the frequency of parituclar triplets (or triangles, squares, or all sorts of star-like configurations) starts to diverge from what one would expect on the basis of pair densities. Thus, pair-dynamics models satisfactorily describe probabilistic cellular automata in which only “small-scale” patterns arise. Larger, “meso-scale” patterns such as spirals are difficult to capture using this method.
—in Dieckmann et al. (2000), chapter 19.
We follow van Baalen (in Dieckmann et al. (2000), chapter 19).
(Petri net pictures will go here.)
We write for the coordination number of the lattice. Birth:
with rate .
Death:
with rate .
Movement or migration:
with rate .
If we ignore spatial structure altogether, we can say that
which by normalization of probability means
So,
This should look familiar: it’s a logistic equation for population growth, with growth rate and equilibrium population .
(This might spin off into its own page later, depending on whether I decide to emphasize more the idea of invasion fitness or the general relation between stochastic Petri nets and moment closures. Saving it here so I don’t forget about it.)
In a mean-field world view, we could say , but this can’t capture rewiring.
Pair approximation:
The “mean excess degree” is the number of additional links we expect to find after we follow a random link. It turns out that is a reasonable approximation.
Our dynamical system is defined by three equations. First, we have the rule we wrote before,
and then the rules we deduced using pair approximation:
and
… One application of this machinery has been to understand how evolution can producealtruistic behaviour. A behaviouristic definition of “altruism” would go something like, “Acting to increase the reproductive success of another individual at the expense of one’s own”. This can be written out using game theory; one defines a parameter to stand for benefit, to stand for cost and a payoff function which depends on , and the strategy employed by an organism.
occur at rate , and occur at rate .
In the differential equation for , the transition contributes a term proportional to the density of pairs:
where we have written for , to save a little ink. All things told, the rate of change of is given by
Yuck.
In particular, is often taken to be the average relatedness of interacting individuals, as compared to the average relatedness in the population, in which case inequality (1) is referred to as Hamilton’s rule. It is important to note that inequality (1) is only a description of whether the current level of assortment as subsumed in the parameter is sufficient to favour cooperation, but not a description of the mechanisms that would lead to such assortment. It has been suggested repeatedly that the problem of cooperation can be understood entirely based on Hamilton’s rules of the form (1). Even though often taken as gospel, this claim is wrong in general, for two reasons.
First, and foremost, even if a rule of the form (1) predicts the direction of selection for cooperation at a given point in time, the long-term evolution of cooperation cannot be understood without having a dynamic equation for the quantity , i.e., without understanding the temporal dynamics of assortment. The dynamics of in turn cannot be understood based solely on the current level of cooperation, and hence expressions of type (1) are in general insufficient to describe the evolutionary dynamics of cooperation. Second, the quantity , which measures the average relatedness among interacting individuals, is insufficient to construct Hamilton’s rule in models that account for variable individual-level death rates and/or group-level events.
Contrary to the popular use of the word, “relatedness” describes a population of interacting individuals, where refers to how assorted similar individuals are in the population.
every definition of relatedness must take into account the population. Therefore, relatedness is not the percent of genome shared, genetic distance, or any extent of similarity between two isolated individuals in a larger population. Also, because horizontal gene transfer is commonplace between microbes and selection is strong, phylogenetic distance or any other indirect genetic measure is likely to be inaccurate. Many of these false definitions live on partly because ambiguous heuristics like “ for brothers, for cousins,” which require very specific assumptions, are repeated in the primary literature. Also, most non-theoretical papers simply define relatedness as “a measure of genetic similarity” and do not elaborate or instead leave the precise definition to the supplemental information …. Unfortunately, scientists can easily misinterpret this “measure of genetic similarity” to be anything that is empirically convenient such as genetic distance or percent of genome shared. Largely because of this confusion, we support the more widespread use of the term ‘‘assortment,’’ which is harder to misinterpret …. For similar reasons of reader understanding, we also encourage authors to make calculations more explicit, either in the main or supplemental text, and to avoid repeating previous results without giving the assumptions that went into deriving them.