Integration of Forces

INTEGRATION OF EVOLUTIONARY FORCES

Selection leads to an increase in the average fitness of the population. Illustrated by the following example. Recall that the average fitness ("w bar") = w_AAp²+ w_Aa2pq + w_aa q². Now consider the case where p=.2, q= .8 so f(AA)=0.04, f(Aa)=0.32 and f(aa)=0.64. If their respective fitnesses are 1.0, 0.2 and 0.1, then w bar = .04(1)+.32(.2)+.64(.1) = .132. If selection were to continue for a number of generations q would decrease and p would increase. Lets say that we recalculated the average fitness when p=0.8 and q=0.2: thus f(AA)=.64, f(Aa)=.32 and f(aa)=.04 so w bar = .64(1)+.32(.2)+.04(.1)=.708. When will the average fitness reach a maximum? when the deleterious allele (a) is selected out of the population so the only genotype is AA, the most fit genotype. (How would this differ if there was dominance?)

The sickle cell story with a new twist. There are actually more than just two alleles relevant to the sickle cell story. The normal allele (A) and the sickle allele (S) produce the genotypes traditionally considered: AA is normal but susceptible to malaria, AS individuals are not severely debilitated, but they are resistant to malaria, SS individuals are severely affected and rarely survive to reproduce. A third allele (C) results in the following genotypes and phenotypes: AC is normal but susceptible, SC has mild anemia and CC is normal and resistant to malaria. The following approximate fitnesses have been assigned to these genotypes:

Genotype	w	phenotype
AA	0.9	malarial susceptibility
AS	1.0	malarial resistance
SS	0.2	anemia
AC	0.9	malarial susceptibility
SC	0.7	anemia
CC	1.3	malarial resistance

Bantu speaking people moved into central and western Africa. A slash and burn agriculture opened up habitat for mosquitoes and along with them came Plasmodium the malarial agent. Consider a large population with the above fitness exposed to malaria for the first time. f(A) ~ 1.0 f(S) and f(C) very low, thus all S and C alleles will be in heterozygous states as AC or SC. Virtually No SS or CC genotypes will be present (the product of two very small numbers). Selection will lead to an increase of the S allele due to its higher fitness as a AS heterozygote. Expect no change in the C allele because heterozygotes have equal or lower fitness than other common genotypes. Can't "pass through" the heterozygote stage.

Selection is "short-sighted" cannot "see" the best solution, i.e., the highest fitness state where C goes to fixation. How do we get to the "best" condition?

Consider the effects of population structure: Bantu people establish local breeding groups of small effective population size. This necessarily will bring on some inbreeding which serves to increase homozygotes and decrease heterozygotes. Some CC genotypes will be produced by the chance effects of inbreeding and now the "best" genotype is present in the population and selection can "see" it so the frequency of the C allele increases, and the population would go to fixation for C.

The highest fitness state could not be reached by selection alone; Drift can affect the outcome of selection. This has been conceptualized as an Adaptive landscape:

Pick two allele frequencies, one for the A locus and one for the B locus. This defines a "population" (obviously this would be done in many dimensions for all loci in the genome; only two here for illustration). The population will evolve by selection to the top of the nearest peak. If a population starts with f(A) ~ f(B) ~ 0.2, this population would evolve to the top of the peak at the lower right of the diagram. This is not the highest peak, but selection acts to increase average fitness and can only "see" the nearest peak. If drift due to low effective population size rapidly shifted both f(A) and f(B) to higher frequencies, then the population might be in the "domain of attraction" of the highest peak in the upper right corner.

The population would stop evolving when it reached the top of the highest peak because there is no higher peak to shift to unless the environment changes at which point we would have to redraw the adaptive landscape.

Sewell Wright conceived of this view of evolution and believed that this was a more accurate description of how "real" populations evolved sine most species do have some structure to their populations and experience drift. So there will be a shifting balance between allele frequencies and a shifting balance between drift and selection as the causative agents in evolution. This is the so called shifting balance theory.

Wright envisioned different stages of evolution by shifting balance: 1) Drift in local populations would shift allele frequencies to new values and the demes may evolve up a local peak because the allele frequency in such a deme drifted near the 'domain of attraction' of a peak. The assumption is that without drift, a population on a 'flat' section of the adaptive landscape would not evolve by natural selection, because there would be no fitness variation in a flat region of the landscape. 2) intrademic selection where selection within local populations (demes) would drive the various demes to the top of their nearest peak. Even if several populations were at different "locations" on the adaptive landscape, the highest peak may not be reached. One can invoke stage 2.5 by saying that drift in local populations might move such a population's allele frequency off one peak and into the 'domain of attraction' of an adjacent peak with a different maximum fitness. This peak may be lower or higher than the old one, but after several rounds of drift at least one population may evolve to the top of the highest peak. 3) This (these) high-fitness populations will produce many emigrants and tend to change the other demes' allele frequencies closer to their own as a result of gene flow; this third stage is called interdemic selection (selection among demes); emigration rates are proportional to the extent that the fitness of a given population is greater or less than the average fitness of all populations. When all populations are homogenized to the allele frequencies of maximal fitness a new balance will be achieved and the allele frequencies will be maintained by selection until the environment changes the adaptive landscape. (For empirical support of this theory see Wade and Goodnight, 1991 Science vol. 253 pg. 1015-1018 and a commentary on page 973 by Crow).

Alternative way to view the shifting balance theory: consider a surface with troughs and pits in it. Put several marbles on the surface. If marble is near pit it falls in selection ~ gravity. Shake surface and balls will roll up out of pits against gravity and make their way to new pit. Shaking ~ drift. See figures 8.8-8.11, pgs. 215-219.

Before discussing the shifting balance view of evolution we considered selection as if it were acting on a single locus. This is a gross oversimplification because many loci are linked along the chromosome. Who's to say that selection is acting the same way on both loci? Things get much more interesting (but more complicated) when we face the reality of linked loci. Consider the cross between the two two-locus genotypes:

The offspring can be AB/AB, AB/ab or ab/ab. Other two locus genotypes are possible: or . But these can only be produced in the cross if there is recombination between the two loci. We can thus refer to four two-locus gametes AB and ab are the coupling gametes and aB and Ab are the repulsion gametes (another way to think about gametes is to just refer to them as a "chromosome" since this will reflect the linear array of whatever alleles are linked together). The frequency of these four gametes will be determined by two things 1) the frequencies of the respective alleles (p and q for the A locus and a different p and q for the B locus) and 2) the degree of linkage disequilibrium which describes whether recombination has broken up any association between the two linked loci.

When allele frequencies all = 0.5 and all gametes are in equal frequency then f(AB) = f(ab) = f(Ab) = f(aB). But if A alleles tend to be associated (linked) to B alleles then AB gametes will be in higher frequency than expected at random. We can quantify the disequilibrium as follows:

D = [f(AB) f(ab)] - [f(Ab) f(aB)]. (Note frequencies are multiplied) When all gametes are in equal frequency D = 0 i.e., linkage equilibrium. When only the coupling gametes are present D = 0.25; when only the repulsion gametes are present D = -0.25. If the frequencies of the alleles are less than 0.5, then the maximum value for D will be less than 0.25.

Note that when allele frequencies are different from p = 0.5 = q, the maximum value of D (absolute value)will be less than 0.25. For example if p=0.8, q=0.2 and if only the coupling gametes were in the population then D = 0.16

A worked example: gamete frequencies in 1000 observations: 580 AB's, 140 Ab's, 60 aB's and 280 ab's. Thus f(A) allele = (520+140)/1000 = 0.66 so f(a) = .34. f(B) = (520+60)/1000 = 0.58 so f(b) = 0.42. At random we expect the following gamete frequencies: f(AB) should be .66(.58)1000 = 383. f(Ab) should be .66(.42)1000 = 277. f(aB) should be .34(.58)1000 = 197 and f(ab) should be .34(.42)1000 = 143. These numbers of expected gametes are clearly different from the observed gametes. We can thus calculate the linkage disequilibrium as d = [.52(.28)] - [.14(.06)] = 0.1372. This tells us that the A and B alleles are in linkage disequilibrium.

This disequilibrium will be broken up by recombination and the rate of breakup will be determined by the rate of recombination (see figure 8.2, pg. 202).

Now let's say that the A locus was under selection with A alleles favored. If the A and B loci were in linkage disequilibrium in the coupling state what would happen to the B alleles? They too would be selected for, but not because they were under selection. This is a very important phenomenon in population and evolutionary genetics called hitchhiking. It demonstrates a very important distinction we must make about selection and phenotype: we need to distinguish between selection "of" and selection "for" If the A allele is favored, there is selection for the A allele and selection of the B allele due to its linkage to the A allele (i.e., linkage between the A and B loci).

Now consider the situation where the nose length is the result of interactions between the two loci. In the first case the interaction is additive, in the second case there is epistasis

	AA	Aa	aa		AA	Aa	aa
BB	1	2	3	BB	1	2	3
Bb	2	4	6	Bb	4	6	4
bb	3	6	9	bb	9	6	3

In the first case if we selected for long noses, we would tend to drive the a and b alleles to high frequency. If we selected for long noses in the second case we would tend to drive the A and b alleles to high frequency. The important distinction between these two tables is that in the simple two-locus additive case on the left, heterozygotes at one locus are intermediate between the two homozygotes regardless of the genotype at the other locus. In contrast, the table on the right shows that the relationship between genotype and phenotype at one locus depends on the genotype at the other, interacting locus. In a sense, one locus is modifying the expression of the other locus. If selection were to act in favor of nose length in the right-hand epistatic system, the way alleles "marched to fixation" would be very different.

Now consider how linkage and epistasis can affect the response to selection. In the second case above if there was high linkage disequilibrium so that all we had we AB and ab chromosomes in the population (= AB or ab gametes in the gamete pool), there would be less variation to select on (sizes 1, 6 and 3). Now if there was recombination such that Ab and aB chromosomes were produced, then the full range of phenotypic variation would be exposed (up to 9) and selection would rapidly shift the mean phenotype to longer noses and to high frequency of A and b alleles.

The general point is that loci do not act independently and their response to selection depends critically on their linkage relationships and their interaction with other loci. For the ecologically minded, there are some interesting parallels between community ecology and population genetics: there are an uncountable number of ways that the interacting participants can interact. In community ecology one considers species in a community; in population genetics one considers genes in the genome. The fate of each player depends on the degree to which it is "connected" to the other players in the system. Darwin referred to the complexity of nature as a "tangled bank"; this is very true of the genes within genomes within populations.