• No results found

Gene Duplication, Gene Conversion and the Evolution of the Y Chromosome

N/A
N/A
Protected

Academic year: 2020

Share "Gene Duplication, Gene Conversion and the Evolution of the Y Chromosome"

Copied!
16
0
0

Loading.... (view fulltext now)

Full text

(1)

DOI: 10.1534/genetics.110.116756

Gene Duplication, Gene Conversion and the Evolution

of the Y Chromosome

Tim Connallon

1

and Andrew G. Clark

Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853-2703 Manuscript received March 17, 2010

Accepted for publication May 31, 2010

ABSTRACT

Nonrecombining chromosomes, such as the Y, are expected to degenerate over time due to reduced efficacy of natural selection compared to chromosomes that recombine. However, gene duplication, coupled with gene conversion between duplicate pairs, can potentially counteract forces of evolutionary decay that accompany asexual reproduction. Using a combination of analytical and computer simulation methods, we explicitly show that, although gene conversion has little impact on the probability that duplicates become fixed within a population, conversion can be effective at maintaining the functionality of Y-linked duplicates that have already become fixed. The coupling of Y-linked gene duplication and gene conversion between paralogs can also prove costly by increasing the rate of nonhomologous crossovers between duplicate pairs. Such crossovers can generate an abnormal Y chromosome, as was recently shown to reduce male fertility in humans. The results represent a step toward explaining some of the more peculiar attributes of the human Y as well as preliminary Y-linked sequence data from other mammals and Drosophila. The results may also be applicable to the recently observed pattern of tetraploidy and gene conversion in asexual, bdelloid rotifers.

N

ONRECOMBINING chromosomes are often asso-ciated with genetic degradation and a loss of functional genes, and nowhere is this pattern more exaggerated than on the Y chromosome (Charlesworth and Charlesworth2000; Bachtrog2006). However, in addition to the more widely recognized pattern of gene loss, genome sequences of mammals and Drosoph-ila are also yielding evidence for Y-linked functional gene gain followed by amplification of duplicate genes (Skaletskyet al.2003; Koerichet al.2008; Carvalho et al.2009; Krsticevicet al.2009; Hugheset al.2010). Duplication and retention of functional Y-linked gene copies is somewhat surprising because evolutionary the-ory predicts an opposing pattern. First, to the extent that gene duplicates are fixed via positive selection, they are less likely to become fixed on nonrecombining relative to recombining chromosomes (Otto and Goldstein 1992; Clark 1994; Yong1998; Otto and Yong 2002; Tanaka and Takahasi 2009). Second, regardless of whether Y-linked duplicates become fixed via genetic drift or by natural selection, the actions of Muller’s ratchet, genetic hitchhiking, and background selection are expected to greatly increase the probabil-ity that Y-linked genes degenerate into nonfunctional

pseudogenes (Charlesworth and Charlesworth 2000; Bachtrog2006; Engelstadter2008).

The issue is more complex when one considers data from the well-characterized human Y chromosome. A majority of functional Y-linked genes are members of duplicate gene pairs residing within large palindromes and are almost exclusively testis expressed (Skaletsky et al.2003). In contrast to many of the single-copy genes with X-linked homologs, members of Y-linked gene families are apparently not degenerating, but rather have become fixed and maintained over many millions of years (Skaletskyet al.2003; Yuet al.2008). Although Y chromosomes are not well characterized in other taxa, currently available data suggest that duplication is a common feature of Y chromosomes in other mammal species as well as Drosophila (Rozenet al.2003; Verkaar et al.2004; Murphyet al.2006; Alfo¨ ldi2008; Wilkerson et al.2008; Krsticevicet al.2009; Geraldeset al.2010). Thus, patterns of gene duplication and retention, for at least a subset of Y-linked genes, may be a general rule of Y chromosome evolution.

Another attribute of the mammalian Y appears to be relevant for duplicate gene evolution. Comparative analysis between humans and chimpanzees suggests ongoing recombination between the gene duplicate pairs that reside on the same Y chromosome. Such ‘‘intrachromosomal’’ recombination includes both non-reciprocal (gene conversion) and non-reciprocal exchange (crossing over) between gene duplicate pairs (Rozen et al.2003; Langeet al.2009). Gene conversion between Supporting information is available online athttp://www.genetics.org/

cgi/content/full/genetics.110.116756/DC1.

1Corresponding author:Department of Molecular Biology and Genetics,

Cornell University, Biotechnology Bldg. (Room 227), Ithaca, NY 14853-2703. E-mail: [email protected]

(2)

the duplicates potentially maintains gene function by counteracting stochastic forces of Y chromosome de-generation (Rozen et al. 2003; Charlesworth 2003; Noordam and Repping 2006). The rationale behind this hypothesis is subtle. As with other clonally inherited chromosomes, each evolutionary lineage of the Y is phys-ically coupled to, and its evolutionary fate is influenced by, the presence of deleterious mutations. Mutation-bearing lineages represent evolutionary dead ends unless they can somehow remove or compensate for deleterious mutations. Recombination between duplicates can ‘‘res-cue’’ functionality via gene conversion between func-tional and nonfuncfunc-tional copies.

On the other hand, double-strand DNA breaks, which precede gene conversion events (Marais 2003), also precede crossing over. Crossovers between Y-linked genes can generate acentric and dicentric Y chromo-somes, resulting in infertility and disruption of the sex determination pathway (e.g., Repping et al. 2002; Heinritzet al.2005; Lange et al.2009). Considering both gene conversion and crossing over on the Y, re-combination can be viewed as a factor that either con-strains (via gene conversion) or promotes (via crossing over) Y chromosome degeneration.

These observations concerning Y chromosome gene content and recombination raise interesting questions that have not been formally addressed by evolutionary theory (but see the recent study by Maraiset al.2010). First, what conditions favor the evolutionary invasion of Y-linked gene duplicates, and does recombination in-fluence the probability that duplicates eventually become fixed within a population? Second, what affect does recombination have on Y-linked fitness and the mainte-nance of functional duplicate genes? To address these questions, we develop and analyze a series of population-genetic models of Y chromosome evolution. We show that, when direct selection on gene duplicates is weak, biased gene conversion can increase, whereas crossing over will decrease, their probability of fixation. For dupli-cates with larger fitness effects, the probability of fixation is largely independent of Y-linked recombination. Finally, gene conversion has a major impact on the retention of functional Y-linked genes that are already fixed within the population and maintains multiple gene copies with or without selection favoring these duplicates.

MODEL AND RESULTS

Gene conversion and the invasion of new gene duplicates: We first consider conditions favoring the evolutionary invasion of new Y-linked duplicate genes at low initial frequency within the population. Determin-istic invasion dynamics are described for a two-locus model, and it is shown separately that the two-locus model characterizes duplicate gene invasion conditions on a Y chromosome carrying an arbitrary number of genes (seesupporting information, File S1). We then

develop and analyze a diffusion approximation and perform stochastic simulations to examine the proba-bility that a rare gene duplicate eventually becomes fixed within a population of small size.

Invasion of a new gene duplicate: Consider a single Y-linked locus with a functional allele, A, and a non-functional allele,a. Mutation fromAtoaoccurs at rateu per generation and there is no back mutation. By introducing a duplication of the locus, the population is expanded to include five genotypic classes: the original single-copy classes (A and a), those with two functional gene copies (AA), those with one functional and one nonfunctional copy (Aa), and those with two nonfunc-tional copies (aa). As in the single-locus case, transitions between states (AA/AaoraA;AaoraA/aa) can occur by mutation, at rate ofuper locus; because there are now two loci, the mutation rate per chromosome is 2u.

For Y chromosomes carrying duplicates, recombina-tion (crossing over and gene conversion) can poten-tially occur between loci. Throughout our analysis, we examine cases where recombination occurs at a rate ofd per paralog pair, per generation. The probability that a single recombination event is a crossover, which gen-erates an abnormal (sterile) Y chromosome (e.g., Reppinget al.2002; Heinritzet al. 2005; Lange et al. 2009), is equal to the constant c . The remainder of recombination events (1c) represent gene conversion events between duplicate pairs. Gene conversion in-volvingAaoraAindividuals yieldsAAoraasperm at rate b and 1 b, respectively. Thus, b can be viewed as a biased gene conversion parameter, where the functional copy A preferentially replaces the nonfunctional a wheneverb.0.5 (there is no bias whenb¼0.5).

Compared to individuals with two functional gene copies, individuals with zero functional copies suffer a fitness reduction ofs, while those with one functional copy suffer a reduction ofsh, wherehis equivalent to a dominance coefficient. Complete masking of a non-functional allele occurs when h ¼0, and there is no direct fitness benefit of carrying twovs.one functional gene. Partial masking occurs when 1.h .0; in such cases, there is a fitness benefit of having two functional copies. Genotypes, genotypic fitness, and zygotic fre-quencies are described in Table 1.

TABLE 1

Parameterization for the gene duplicate invasion model

Genotype Frequency Fitness

AA x11 1

Aa, aA x10 1sh

A x1 1sh

aa x00 1s

a x0 1s

(3)

For a sequence of events of (i) birth, (ii) selection, (iii) mutation, (iv) recombination, and (v) random mating (and ignoring factors of u2), the frequency

change of each genotype, per generation, is given by the following six recursions,

x119¼ x11

w ð12uÞð1dcÞ

1x11

w 2udð1cÞb1

x10ð1shÞ

w ð1uÞdð1cÞb

x109¼ x11

w 2uð1dÞ1

x10ð1shÞ

w ð1uÞð1dÞ

x009¼

x11

w 2udð1cÞð1bÞ1

x10ð1shÞ

w uð1dcÞ

1x10ð1shÞ

w ð1uÞdð1cÞð1bÞ1

x00ð1sÞ

w ð1dcÞ

xs9¼ x11

w dc1

x10ð1shÞ

w dc1

x00ð1sÞ

w dc

x19¼

x1ð1shÞ w ð1uÞ

x09¼

x1ð1shÞ

w u1

x0ð1sÞ

w ;

where mean fitness is w ¼x111ðx101x1Þð1shÞ 1

ðx001x0Þð1sÞ.

To describe conditions promoting the invasion of duplicates, we analyzed the stability of an evolutionary equilibrium in which duplicated genotypes are absent from the population. Under such a condition, the fre-quencies x1 and x0 equilibrate to ˆx1 ¼1uð1hsÞ=

½sð1hÞ ¼1ˆx0 and the leading eigenvalue of the

stability matrix is

l¼ð12uÞð1dcÞ12udð1cÞb1ð1shÞð1uÞð1dÞ 2ð1shÞð1uÞ

1

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi

fð12uÞð1dcÞ12udð1cÞb1ð1shÞð1uÞð1dÞg2

4ð12uÞð1dcÞð1dÞð1shÞð1uÞ

v u u t

2ð1shÞð1uÞ :

ð1aÞ

Selection favors the invasion of a duplicate when the leading eigenvalue is greater than one (Ottoand Day 2007). The magnitude of the leading eigenvalue also represents the strength of selection acting in favor of a rare duplicate gene [i.e., the probability of fixation is pro-portional to l (Otto and Bourguet 1999; Otto and Yong 2002); see below for additional details]. Without recombination (d¼0), the leading eigenvalue reduces to

l¼1

21

12u1jshð1uÞ uj

2ð1shÞð1uÞ ð1bÞ

and evolutionary invasion of a duplicate-bearing Y is favored whensh . u/(1 u). Duplicates are favored

when the direct fitness benefit of additional functional gene copies outweighs the indirect consequences of doubling the deleterious mutation rate, as previously reported for both haploid and diploid systems without recombination (Clark1994; Ottoand Yong2002; also see Ottoand Goldstein1992).

How does recombination alter the evolutionary dy-namics of Y chromosomes? When duplicates do not directly increase fitness (sh ¼ 0), and there is no re-combination, selection never favors invasion (Equation 1b above). We can ask whether gene conversion expands the conditions favorable to invasion of a duplicate in a way that is similar to previous models of gene duplication with crossing over (Ottoand Yong2002). By permitting Y-linked recombination between duplicates, and assum-ing that the crossover rate is zero (dc ¼ 0; hence, all recombination is by gene conversion), the leading ei-genvalue can be approximated for low rates of gene conversion (d0, per generation),

l¼l

d¼0 1@l

@d d¼0

d1Oðd2Þ 11dð2b1Þ; ð1cÞ

which indicates that selection favors duplicates (l.1) when gene conversion is biased toward transmission of functional over nonfunctional gene copies (b . 0.5). Numerical evaluation of Equation 1a indicates that, although higher rates of gene conversion can increase the leading eigenvalue (and hence the probability of invasion), this positive relationship quickly saturates. Thus, a little bit of gene conversion has about as much of an impact on the leading eigenvalue as a high rate of gene conversion does. Nevertheless, the strength of such positive selection (with magnitude ofl1) is on the order of the mutation rate (u) and is therefore extremely weak. Stochastic simulations (see below) show that the probability of duplicate fixation is marginally influenced by biased gene conversion alone.

Further analysis of Equation 1a shows that, as with the case of no recombination (Otto and Yong 2002), selection will favor duplicates if they directly increase fitness (sh.0). Gene conversion (including unbiased gene conversion: b¼0.5) can increase the strength of selection favoring invasion of a duplicate (l1; Figure 1). However, the relative impact of gene conversion is minor when sh ? u. In other words, when there are weak direct benefits of having multiple gene copies, the strength of natural selection favoring Y-linked gene duplicates will be enhanced by gene conversion be-tween paralogs. This conclusion holds if the crossover rate between duplicate pairs (dc) is small (Figure 1). As the rate of crossing over increases, the production of abnormal Y haplotypes can generate purifying selection against Y chromosomes that carry gene duplicates.

(4)

dynamics for a population fixed for the single-gene hap-lotype. Because this explanation is heuristic, we ignore crossovers and assume that they do not occur (c¼0). The rate of increase for a rare haplotype with two functional gene copies depends on its relative competitiveness against the resident, single-copy haplotype. For initial condition x11 ¼ 1/N and x10 ¼ x00 ¼ 0, the expected

proportion of functional duplicate haplotypes (x11) within

the gamete pool isE½x119 ¼x11½12uð1dbÞ=w, and

the duplicate is favored when ½12uð1dbÞ=w.1. Invasion is clearly facilitated by gene conversion (db .

0). Nevertheless, because the term 2u(1db) is extremely small, gene conversion will marginally influence the probability of fixation wheneversh?u.

Probability of duplicate fixation: The deterministic model presented above can be modified to describe the evolutionary dynamics in finite populations. Follow-ing Ottoand Bourguet(1999) and Ottoand Yong (2002), the selection coefficient for a rare gene dupli-cate can be approximated as l 1, where l is the leading eigenvalue of the stability matrix (Equation 1a, above). Given this selection coefficient, the probability that a rare duplicate is eventually fixed can be estimated by diffusion approximation (Kimura1957, 1962), with drift and diffusion coefficientsM¼(l1)x(1x) and V¼x(1x)/N, respectively, wherexis the frequency of a duplicate-bearing Y haplotype andNis the Y chromo-some effective population size. For an initial frequency of 1/N, the probability that a duplicate is fixed will be

PrðfixationÞ ¼ 1e 2ð1lÞ 1e2Nð1lÞ

2ðl1Þ

1e2Nð1lÞ: ð2Þ To assess the validity of Equation 2, we conducted computer simulations that incorporate mutation, selec-tion, and genetic drift. Each simulation was initiated at x11¼1/N,x0¼u(1hs)/(ssh), andx1¼1x11x0.

To generate genotypic frequencies for the next gener-ation, N genotypes were randomly drawn from a multinomial distribution, after selection, from the six genotypes described above. Mutation–selection–drift recursions were iterated until the duplicate genotype was either fixed or lost from the population. Equation 2 provides a good approximation for the probability of duplicate fixation over a broad range of parameter space (Figure 2 andFigure S1). As direct selection on a duplicate approaches zero (sh/0), the probability of fixation approaches 1/N. As direct selection increases in strength (1?1l?1/N), the probability of fixation approaches 2(l1).

Gene conversion had little impact on the probability of duplicate fixation (seeFigure S1). As shown above, the leading eigenvalue of the stability matrix is not substantially influenced by gene conversion unlessshis of similar order to u. Even though the selection co-efficient approximation (l1) can increase with gene conversion, its absolute magnitude under weak direct selection (sh0) will generally be too small for natural selection to be effective, unless of courseNu.1, which is particularly unlikely for Y-linked loci. Thus, gene conversion is unlikely to significantly enhance the rate of duplicate gene fixation, but can potentially reduce the fixation rate of duplicates if the rate of deleterious crossovers between paralogs is high.

Gene conversion and the maintenance of gene duplicates:A major hypothesis inspired by the human Y chromosome is that gene conversion between duplicates Figure1.—Gene conversion can enhance the strength of

positive selection for rare duplicate genes, whereas crossovers select against duplicates. Selection coefficient approxima-tions (l1) are based on the leading eigenvalue (Equation 1a), as described and justified in the text, and are presented as a ratio of selection with (d.0)vs. without recombination (d ¼ 0). Representative results are presented for u ¼ 105

(5)

may prevent the accumulation of mutations and ulti-mately prevent or slow down Y chromosome degenera-tion due to Muller’s ratchet (Charlesworth 2003; Rozenet al.2003; Noordamand Repping2006). To for-mally evaluate this possibility, we considered two models for the maintenance of functional Y-linked genes. We first conducted simulations of our two-locus model with initial conditionx11¼1 (a pair of functional duplicates is

ini-tially fixed within the population) and analyzed whether gene conversion prevented the loss of one or both of the functional gene copies. Gene conversion between Y-linked paralogs decreased the rate of gene loss under a wide range of fitness conditions, including the extreme case where there was no direct benefit of having two, as opposed to one, functional gene copies (Figure S2). Although gene conversion can substantially reduce the rate of gene loss, the results indicate that loss of completely redundant genes (wheresh¼0) will persist under gene conversion, albeit at a substantially reduced rate.

Prior models of Muller’s ratchet generally find that the rate at which deleterious mutations become fixed depends upon both the strength of purifying selection and the number of loci evolving on an asexual chromo-some (Charlesworth and Charlesworth 2000; Bachtrog 2008). To account for selection and gene conversion across many loci, we extended our model to describe the degeneration of Y chromosomes carrying an arbitrary number of genes. To permit gene conver-sion, we assumed that each Y initially carriesndistinct gene types, each with a duplicate copy (for a total of 2n loci). Because the increased number of genes greatly expands the number of possible genotypic and fitness states (and consequently the matrix of transition

prob-abilities between states), we made a simplifying assump-tion that each of thengene types represents an essential male fertility factor. Males lacking a functional copy of one or more gene types are sterile and comprise a heterogeneous genotypic class with reproductive suc-cess of zero. Although the essentiality assumption is useful for modeling purposes, it will often be biologi-cally reasonable because Y-linked genes, at least in mammals and Drosophila, are often essential for male fertility. For example, human Y chromosome micro-deletions within Y-palindromic regions are often associ-ated with spermatogenic failure (Noordamand Repping 2006; Lange et al. 2009). In Drosophila melanogaster, mutations in at least three of seven currently Y-annotated genes (kl-2,kl-3, andkl-5, as well as an additional set of unannotated genes: kl-1, ks-1, and ks-2; data obtained fromhttp://flybase.org/) are known to cause male-sterile phenotypes. Nevertheless, the overall agreement be-tween our multilocus and two-locus results (the latter does not assume essentiality; seeFigure S2) suggests that a violation of the essentiality assumption is unlikely to strongly affect our conclusions.

For each paralog pair, there are three possible genotypes: both loci functional, one functional and one nonfunctional, and both nonfunctional. Transi-tions between genotypic states can occur by mutation, by gene conversion, or by crossing over, with crossover yielding an abnormal Y chromosome. For individuals carrying a structurally normal Y, fitness follows the functionw¼(1sh)k(0)j, where

jrefers to the number of gene pairs with both copies nonfunctional, and k refers to the number of pairs where one of the two gene copies is functional (0#k#n). Individuals withj.0 and individuals carrying abnormal Y chromosomes are sterile. After selection, the reproductive contribution of an individual withkY-linked mutations is

xkS¼

xkwk w ;

wherexkis the zygotic frequency ofk-bearing males,wk¼

(1sh)k

is the fitness of a male withkmutations, and mean male fitness with respect to the Y isw ¼Pnk¼0xkwk.

(The reproductive contribution of sterile individuals is zero.)

To facilitate analytical tractability, we assume that the rates of recombination and mutation are both small enough to ignore multiple mutation and multiple re-combination events per generation. In other words, there is a zero probability of an individual with k mutations producing a fertile son withk2 ork12 mutations. This assumption is justified as long as 2nu>1 andnd>1, which requires that the mutation and recombination rate per locus is small, and the number of loci mutable to a nonfunctional allele is much smaller than the reciprocal of the mutation or gene conversion rate:n>min[1/u, 1/ d]. Because n represents a small fraction of Y-linked nucleotides (i.e., it represents a very specific functional Figure2.—The probability of fixation for Y-linked

dupli-cate genes. The solid line depicts the analytical approxima-tion from Equaapproxima-tion 2. Circles represent the proporapproxima-tion of duplicate genotypes (out of 100,000 replicate simulations for each data point) that eventually become fixed within the population. Results are shown ford¼0,N¼1000, and u¼105, per locus, per generation. Values ofd.0 yield

(6)

class), this assumption is biologically reasonable. Never-theless, a violation of these assumptions is expected to make our results conservative by downwardly biasing the speed of Muller’s ratchet (which is enhanced by a higher mutation rate) and minimizing the positive effect of gene conversion (higher gene conversion rates increasingly counteract Muller’s ratchet). Extending across the 2n loci, the probability that a Y chromosome experiences one mutation is Pr(M¼1)¼2nu¼U. The probability that zero mutations occur is Pr(M ¼0) ¼1 U. The probability of a recombination event between one of then paralog pairs is Pr(R¼1)¼nd¼D. The probability of no recombination is Pr(R¼0)¼1D.

Given a sequence of events of (i) birth, (ii) selection, (iii) mutation, (iv) recombination, and (v) random mat-ing, the frequency of fertile males in the next generation follows the recursion

xk9¼xk1ð1shÞ k1 w

Uðnk11Þ n

1xkð1shÞ k

w

Uk12nð1UÞ

2n

3 Dð1cÞðnkÞ

n 11D

1xkð1shÞ k

w

UðnkÞ

n 1

xk11ð1shÞk11

w

3Uðk11Þ12nð1UÞ

2n

3Dbð1cÞðk11Þ

n :

The ‘‘least-loaded’’ (k¼0) and ‘‘most-loaded’’ (k¼n) classes of fertile males follow the recursion

x09¼ x0 w

ð1UÞð1DcÞn1UDð1cÞb n

1x1ð1shÞ w

U

2n 11U

Dð1cÞb n

and

xn9¼xn1ð1shÞ n1 w

Uð1DÞ n

1xnð1shÞ n

w

ð2UÞð1DÞ

2 ;

respectively. The frequency of sterile males in the next generation (via crossover, mutation, or gene conver-sion) will be

xs9¼1 Xn

k¼0 xk9:

Deterministic equilibria and mean fitness of the Y: When there is no recombination between duplicates (D¼0),

mean Y chromosome fitness as well as the distribution of mutations among individuals can be analytically de-termined. If mutations that eliminate duplicate gene function are deleterious (sh.0), and the number of unique Y-linked genes is large (n ? U/sh), the pop-ulation approaches the equilibrium: ˆxk PoisðU=shÞ, ˆxs0, andw 1U. This is analogous to the case of

mutation–selection balance with incomplete domi-nance (sh.0), with a Y-linked genetic load ofL¼U

1 eU

(e.g., Haldane 1937; Kimura and Maruyama 1966; Kondrashovand Crow1988). When knocking out a duplicate yields no fitness effect (sh¼0), or the number of Y-linked genes is small (n > U/sh), the population approaches the equilibrium: ˆxn1U=2, ˆxsU=2, andw 1U=2. Under this scenario, the

genetic load is reduced by a factor of 2, toL¼U/21 eU/2(Haldane1937).

Gene conversion between duplicates increases the frequency of the least-mutated class (Figure 3 andFigure S3), whether or not there is a gene conversion bias favoring functional over nonfunctional loci. The fre-quency of the least-loaded class represents a quantity of particular importance for adaptation on clonally trans-mitted chromosomes such as the Y (Charlesworth and Charlesworth 2000). Without recombination, the unit of selection is the chromosome rather than the locus. Beneficial mutations that are associated with mutation-free genetic backgrounds are relatively likely Figure3.—Gene conversion increases the frequency of Y

chromosome haplotypes that carry zero deleterious muta-tions (i.e., the ‘‘least-loaded’’ genotypic class). The cost of a mutation eliminating function of a copy of each duplicate pair is represented bysh(this cost increases from left to right on the x-axis). The relative proportion of mutation-free Y chromosomes in recombining vs. nonrecombining popula-tions is presented as a ratio of the two scenarios (gene conver-sion increases the proportion of mutation-free Y’s when this ratio is greater than one). The number of distinct, Y-linked genes is represented by n. Results are presented for c¼ 0, b¼ 0.5, andu¼ 5 3 104, per locus, per generation, and

(7)

to become fixed (Peck1994; Orrand Kim1998) and do not permit hitchhiking of deleterious mutations during a selective sweep (Rice1987). However, as the frequency of the least-loaded class becomes small, virtually all beneficial mutations will arise in inferior genetic back-grounds. This will limit the adaptive potential of the Y chromosome. Because it increases the fraction of mutant-free Y chromosomes, gene conversion is ex-pected to enhance the fixation probability for beneficial mutations and can reduce the deleterious consequen-ces of hitchhiking.

By shifting the mutational distribution toward rela-tively mutation-free genotypes, gene conversion also increases mean Y chromosome fitness. This effect does not depend on a gene conversion bias, but can become exacerbated when conversion events favor functional over nonfunctional variants (for models yielding similar conclusions about the genetic load, albeit by different approaches, see Bengtsson1986, 1990, and especially Ohta1989).

These long-term effects of gene conversion can be accounted for by a straightforward explanation. When the fitness cost of silencing both copies of a duplicate pair is much greater than the cost of silencing one of the copies (when duplicates partially or completely mask deleterious mutations:h,0.5), selection across Y chro-mosomes mimics truncation selection, which is par-ticularly efficient at removing deleterious alleles (e.g., Kondrashov 1988; Ohta 1989). Truncation selection arises because mutations on a relatively mutation-free Y will generally affect one copy of a pair, with the second, functional copy compensating for loss of the first. As the number of mutations on a Y increases, so does the probability of silencing the second copy of a pair. Con-sequently, the deleterious effect of each mutation in-creases faster than linearly with the number of mutations carried on a Y.

Without recombination, the accumulation of muta-tions is unidirectional, and the population will tend to evolve toward the edge of the truncation point (n mutations at distinct genes), particularly if masking by duplicates is strong (i.e., having two functional copies provides the same fitness as one copy). At the extreme of sh¼0 (complete masking), the population evolves to containnfunctional genes, each distinct. Gene conver-sion restores variability by permitting bidirectional transitions (e.g., k to k 1 and k 1 1 mutations). Y chromosomes that are closer to the truncation point have a higher probability of transitioning (by mutation or recombination) beyond the truncation point where they are removed by selection. Consequently, the population distribution shifts toward fewer mutations per Y. However, if selection in favor of functional duplicates is strong relative to the number Y-linked genes (sh.0;nlarge), most individuals will carry few mutations, the truncation point becomes irrelevant to Y chromosome evolution, selection shifts toward

multi-plicative epistasis, and gene conversion does not strongly influence mean fitness or the distribution of mutations among Y chromosomes. This explanation accounts for the decreased impact of gene conversion on mutation-free Y chromosomes, as the strength of selection (sh) increases (Figure 3 andFigure S3).

Muller’s ratchet and the accumulation of nonfunctional genes: The deterministic results (presented above) rep-resent an upper limit for Y chromosome fitness. In finite populations, where Muller’s ratchet operates, mean fitness can further decrease with each successive loss of ‘‘mutation-free’’ individuals. Once lost from the popula-tion, mutation-free genotypes are unlikely to be recov-ered by back mutation or positive selection because they must initially arise within the current least-loaded class and subsequently avoid stochastic loss (Peck1994; Orr and Kim1998; Gordoand Charlesworth2000).

To explore the influence of gene conversion on the rate and severity of Y chromosome degeneration via Muller’s ratchet, we conducted a series of stochastic simulations, varying the selection and recombinational parameters (u,h,n,d,c,b). We first use the recursions presented above to bring the frequencies of each genotypic class to deterministic equilibrium. Conver-gence to equilibrium is followed by 100,000 generations of simulation under a mutation–selection–drift model and constant male population size. For each generation, genotype frequencies were sampled from a pseudoran-dom multinomial distribution (pseudoranpseudoran-dom num-bers generated with R; R Development Core Team 2005), with genotypes randomly sampled after selec-tion, mutaselec-tion, and recombination.

When there is no gene conversion between dupli-cates, Muller’s ratchet can operate rapidly, causing Y-linked fitness decay and loss of functional genes. Representative simulation results are shown in Figures 4 and 5. In agreement with previous theory (Haigh1978; Gordoand Charlesworth2000; Bachtrog2008), the impact of the ratchet is strongest when the ancestral Y carries many functional gene duplicates and when mutations have small individual fitness effects. Relatively low rates of gene conversion can rescue Y-linked genes from stochastic loss via Muller’s ratchet and thereby increase mean fitness of the Y (Figures 4 and 5). Increasing the total mutation and gene conversion rates on the Y (UandD, respectively) amplifies the differences between recombining and nonrecombining chromo-somes, whereas a decrease in these compound parame-ters (U,D/0) eliminates these long-term evolutionary differences. This effect occurs both with and without biased gene conversion between duplicates.

(8)

and/or the chromosome-wide mutation rate (an in-creasing function of the mutation rate per locus and the number of loci) is high (Charlesworth and Charlesworth 2000; Bachtrog 2008). The similar consequences of gene conversion and crossing over are not surprising: both processes permit chromosomal transitions from more to fewer mutations and this, along with purifying selection, can counteract the steady accumulation of new deleterious mutations within a population.

DISCUSSION

Previous theory indicates that selection does not generally favor the invasion of a rare duplicate gene unless there is a direct benefit of carrying an additional gene copy (Clark 1994) or there is recombination between the paralogs (Yong 1998; Otto and Yong 2002; Tanakaand Takahasi2009). We have shown that gene conversion between duplicates can broaden the parameter conditions favoring the invasion of duplicate genes from low initial frequency. Biased gene conversion, with conversion favoring undamaged over damaged gene copies, can generate positive selection for rare duplicates that do not provide a direct fitness benefit (that is, individuals with two functional copies have fitness equal to those with one). However, the strength of positive selection acting on such duplicates is weak (on the order

of the mutation rate). This result is in agreement with a recent simulation study, which also found that gene conversion does not strongly promote the invasion of new Y-linked duplicates (Maraiset al.2010).

The invasion dynamics of rare duplicate genes bear some similarities to models of adaptation within gene families (Walsh 1985; Manoand Innan2008), which show that gene conversion can enhance the probability that a weakly beneficial allele becomes fixed. In our model, gene conversion alone is unlikely to overpower genetic drift unlessNu?1, yet this condition is rarely (if ever) expected to arise within animal populations, par-ticularly with respect to Y-linked loci that have reduced effective size relative to other nuclear genes. Further-more, there is no biological reason to suspect that gene conversion will necessarily be biased against mutant copies of a particular gene. We therefore expect that Y-linked duplicates will most likely become fixed by genetic drift, unless they directly increase the fitness of those who carry them (for additional discussion of duplicate gene fixation, see Innan and Kondrashov 2010). Likewise, deleterious Y-linked crossover events can generate selection against gene duplicates. This factor will have little impact on the probability of fixation or loss unless the crossover rate is relatively high and direct selection on the duplicate is weak or absent.

Y chromosome recombination can exert a profound influence on the retention of functional copies of genes Figure4.—Intrapalindrome gene conversion

prevents the erosion of Y chromosome gene con-tent and enhances adaptation on the Y.N repre-sents the Y-linked effective size,shis the fitness cost associated with mutations to one copy of each duplicate pair, t refers to the generation within the simulation, andnis the number of dis-tinct genes on the chromosome (including dupli-cates, each Y carries 2n genes). Results are presented forc¼0,b¼0.5, andu¼53104,

per locus, per generation. Each data point repre-sents the average of 10 simulation replicates. Since estimates of gene conversion from human– chimp comparisons suggest thatDmay be consid-erably higher than the mutation rate (Rozen

(9)

that have already become fixed within the population. Our simulations show that low rates of gene conversion are sufficient to maintain Y-linked genes and counteract degradation via Muller’s ratchet. These results are conservative, as higher rates enhance the preservation of functional gene copies. Thus, once gene conversion has evolved, it can potentially provide a degree of stability on an otherwise evolutionarily unstable Y chromosome. Interestingly, Marais et al. (2010) ob-served that the rate of invasion for gene conversion modifier alleles does not greatly exceed neutral expect-ations unless they greatly increase the gene conversion rate. This suggests that, while low rates of conversion may slow the rate of Muller’s ratchet, the evolution of the gene conversion rate itself may be much more restrictive.

The large number of genes within the ‘‘ampliconic’’ region of the human Y (Skaletskyet al.2003) should provide a large target for mutations, creating an opportunity for Muller’s ratchet to act. This role of gene conversion on the Y is therefore likely to explain patterns of gene retention on the human Y chromo-some. It is less clear whether similar patterns character-ize other animal species. Current (albeit incomplete) data suggest that gene family amplification and re-tention might be common Y chromosome attributes (Rozen et al.2003; Verkaaret al.2004; Murphyet al. 2006; Alfo¨ ldi2008; Wilkersonet al.2008; Krsticevic et al. 2009), although the prevalence of Y-linked gene conversion outside the human and chimp lineages is less clear (but see Geraldes et al. 2010). Future sequencing efforts, including evidence for gene conver-sion among Y-linked genes in nonhuman species, will help to determine the general relevance of the dupli-cation and gene conversion model presented here.

Within-chromosome crossovers can generate an ab-normal, sterility-inducing Y (Lange et al. 2009) and potentially represent a deleterious fitness consequence of Y-linked recombination. This cost also implies that

the number of Y-linked duplicate genes (or in humans the size of Y-linked palindromes) will have an upper limit. As the number of Y-linked loci that interact via recombination increases, so too should the rate of deleterious crossovers. This suggests an upper limit to Y chromosome gene content, where crossing over becomes unbearably costly. From this perspective, duplication and recombination represent a costly mechanism of Y chromosome preservation.

In addition to the Y chromosome, our findings have implications for asexually reproducing species. Recent reports suggest that the asexual bdelloid rotifers are tetraploid (Mark Welch et al. 2008) and that gene conversion occurs between gene copies (Huret al.2008; Mark Welch et al. 2008). Our model supports the verbal claim that gene conversion between homologous gene copies might aid in DNA damage repair and prevent the genomic degradation that is expected to accompany strict asexual reproduction. Unlike the Y chromosome scenario, crossovers between homolo-gous, tetraploid chromosomes will tend to avoid dele-terious chromosomal aberrations. The relative rate of nonhomologous crossovers is an empirical question that may be difficult to assess, given the likely association between chromosome abnormalities and embryonic death, which will lead to a pronounced bias toward ‘‘normal’’ chromosomes. On the other hand, crossing over between homologous chromatids is likely to generate copy number polymorphism, which adds a level of complexity to the evolutionary dynamics of autosomal gene duplicates or gene families. This may lead to different evolutionary consequences of crossing over and gene conversion in asexual lineages compared to the results that we report for the Y chromosome and represents an interesting avenue for future theoretical research.

We are grateful to Roman Arguello, Clement Chow, Margarida Cardoso-Moreira, Qixin He, Lacey Knowles, Amanda Larracuente, Rich Meisel, Nadia Singh, and two anonymous reviewers for discussion Figure5.—The proportion of loss-of-function

duplicates following 100,000 generations of mu-tation, selection, and genetic drift. Parameters are described in the Figure 4 legend and through-out the text. Results are presented forc¼0,b¼ 0.5,u¼53104, per locus, per generation, and

(10)

and comments that substantially improved the quality of the manu-script and to Sarah Otto for comments about the eigenvalue-selection-coefficient approximation and for sharing an unpublished manu-script. This work was supported by National Institutes of Health grant GM64590 to A.G.C. and A. B. Carvalho.

LITERATURE CITED

Alfo¨ ldi, J. E., 2008 Sequence of mouse Y chromosome. Ph.D. Dis-sertation, MIT, Cambridge, MA.

Bachtrog, D., 2006 A dynamic view of sex chromosome evolution. Curr. Opin. Genet. Dev.16:578–585.

Bachtrog, D., 2008 The temporal dynamics of processes underly-ing Y chromosome degeneration. Genetics179:1513–1525. Bengtsson, B. O., 1986 Biased conversion as the primary function

of recombination. Genet. Res.47:77–80.

Bengtsson, B. O., 1990 The effect of biased conversion on the mu-tation load. Genet. Res.55:183–187.

Carvalho, A. B., L. B. Koerichand A. G. Clark, 2009 Origin and evolution of Y chromosomes: Drosophila tales. Trends Genet.25:

270–277.

Charlesworth, B., 2003 The organization and evolution of the hu-man Y chromosome. Genome Biol.4:226.

Charlesworth, B., and D. Charlesworth, 2000 The degenera-tion of Y chromosomes. Philos. Trans. Biol. Sci.355:1563–1572. Clark, A. G., 1994 Invasion and maintenance of a gene duplication.

Proc. Natl. Acad. Sci. USA91:2950–2954.

Engelstadter, J., 2008 Muller’s ratchet and the degeneration of Y chromosomes: a simulation study. Genetics180:957–967. Geraldes, J., T. Rambo, R. Wing, N. Ferrandand M. W. Nachman,

2010 Extensive gene conversion drives the concerted evolution of paralogous copies of the SRY gene in European rabbits. Mol. Biol. Evol. (in press).

Gordo, I., and B. Charlesworth, 2000 The degeneration of asex-ual haploid populations and the speed of Muller’s ratchet. Genet-ics154:1379–1387.

Haigh, J., 1978 Accumulation of deleterious genes in a population—-Muller’s ratchet. Theor. Popul. Biol.14:251–267.

Haldane, J. B. S., 1937 The effect of variation on fitness. Am. Nat.

71:337–349.

Heinritz, W., D. Kotzot, S. Heinze, A. Kujat, W. J. Eleemannet al., 2005 Molecular and cytogenetic characterization of a non-mosaic isodicentric Y chromosome in a patient with Klinefelter syndrome. Am. J. Med. Genet. A132A:198–201.

Hughes, J. F., H. Skaletsky, T. Pyntikova, T. A. Graves, S. K. M. vanDaalenet al., 2010 Chimpanzee and human Y chromo-somes are remarkably divergent in structure and gene content. Nature463:536–539.

Hur, J. H., K. Van Doninck, M. L. Mandigoand M. Meselson, 2008 Degenerate tetraploidy was established before bdelloid rotifer families diverged. Mol. Biol. Evol.26:375–383.

Innan, H., and F. A. Kondrashov, 2010 The evolution of gene du-plications: classifying and distinguishing between models. Nat. Rev. Genet.11:97–108.

Kimura, M., 1957 Some problems of stochastic processes in genet-ics. Ann. Math. Stat.28:882–901.

Kimura, M., 1962 On the probability of fixation of mutant genes in a population. Genetics47:713–719.

Kimura, M., and T. Maruyama, 1966 Mutational load with epistatic gene interactions in fitness. Genetics54:1337–1351.

Koerich, L. B., X. Wang, A. G. Clark and A. B. Carvalho, 2008 Low conservation of gene content in the Drosophila Y chromosome. Nature456:949–951.

Kondrashov, A. S., 1988 Deleterious mutations and the evolution of sexual reproduction. Nature336:435–440.

Kondrashov, A. S., and J. F. Crow, 1988 King’s formula for the mu-tation load with epistasis. Genetics120:853–856.

Krsticevic, F. J., H. L. Santos, S. Januario, C. G. Schragoand A. B. Carvalho, 2009 Functional copies of theMst77Fgene on the Y chromosome ofDrosophila melanogaster.Genetics184:

295–307.

Lange, J., H. Skaletsky, S. K. M.vanDaalen, S. L. Embry, C. M. Korveret al., 2009 Isodicentric Y chromosomes and sex disor-ders as byproducts of homologous recombination that maintains palindromes. Cell138:855–869.

Mano, S., and H. Innan, 2008 The evolutionary rate of duplicate genes under concerted evolution. Genetics180:493–505. Marais, G., 2003 Biased gene conversion: implications for genome

and sex evolution. Trends Genet.19:330–338.

Marais, G., P. R. A. Camposand I. Gordo, 2010 Can intra-Y gene conversion oppose the degeneration of the human Y chromo-some?: A simulation study. Genome Biol. Evol.2:347–357. Mark Welch, D. B., J. L. Mark Welch and M. Meselson,

2008 Evidence for degenerate tetraploidy in bdelloid rotifers. Proc. Natl. Acad. Sci. USA105:5145–5149.

Murphy, W. J., A. J. P. Wilkerson, T. Raudsepp, R. Agarwala, A. A. Schafferet al., 2006 Novel gene acquisition on carnivore Y chromosomes. PLoS Genet.2:e43.

Noordam, M. J., and S. Repping, 2006 The human Y chromosome: a masculine chromosome. Curr. Opin. Genet. Dev.16:225–232. Ohta, T., 1989 The mutational load of a multigene family with

uni-form members. Genet. Res.53:141–145.

Orr, H. A., and Y. Kim, 1998 An adaptive hypothesis for the evolu-tion of the Y chromosome. Genetics150:1693–1698.

Otto, S. P., and D. Bourguet, 1999 Balanced polymorphisms and the evolution of dominance. Am. Nat.153:561–574.

Otto, S. P., and T. Day, 2007 A Biologist’s Guide to Mathematical Mod-eling in Ecology and Evolution.Princeton University Press, Prince-ton, NJ.

Otto, S. P., and D. B. Goldstein, 1992 Recombination and the evo-lution of diploidy. Genetics131:745–751.

Otto, S. P., and P. Yong, 2002 The evolution of gene duplicates. Homol. Eff.46:451–483.

Peck, J. R., 1994 A ruby in the rubbish: beneficial mutations, delete-rious mutations and the evolution of sex. Genetics137:597–606. R DevelopmentCoreTeam, 2005 R: A Language and Environment for Statistical Computing,reference index version 2.2.1. R Founda-tion for Statistical Computing, Vienna.

Repping, S., H. Skaletsky, J. Lange, S. Silber, F.van derVeenet al., 2002 Recombination between palindromes P5 and P1 on the human Y chromosome causes massive deletions and spermato-genic failure. Am. J. Hum. Genet.71:906–922.

Rice, W. R., 1987 Genetic hitchhiking and the evolution of reduced genetic activity on the Y sex chromosome. Genetics116:161–167. Rozen, S., H. Skaletsky, J. Lange, S. Silber, F.van derVeenet al., 2003 Abundant gene conversion between arms of palindromes in human and ape Y chromosomes. Nature423:873–876. Skaletsky, H., T. Kuroda-Kawaguchi, P. J. Minx, H. S. Cordum, L.

Hillieret al., 2003 The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes. Nature

423:825–837.

Tanaka, K. M., and K. R. Takahasi, 2009 Enhanced fixation and preservation of a newly arisen duplicate gene by masking delete-rious loss-of-function mutations. Genet. Res.91:267–280. Verkaar, E. L. C., C. Zijlstra, E. M.van’tVeld, K. Boutaga, D. C.

J. Boxtelet al., 2004 Organization and concerted evolution of the ampliconic Y-chromosomal TSPY genes from cattle. Genomics84:468–474.

Walsh, B., 1985 Interaction of selection and biased gene conversion in a multigene family. Proc. Natl. Acad. Sci. USA82:153–157. Wilkerson, A. J. P., F. Raudsepp, T. Graves, D. Albracht, W.

Warrenet al., 2008 Gene discovery and comparative analysis of X-degenerate genes from the domestic cat Y chromosome. Genomics92:329–338.

Yong, P., 1998 Theoretical population genetic model of the inva-sion of an initial duplication. Honours Thesis, Department of Zo-ology, University of British Columbia, Vancouver, BC, Canada. Yu, Y.-H., Y.-W. Lin, J.-F. Yu, W. Schempp and P. H. Yen,

2008 Evolution of the DAZ gene and AZFc region on primate Y chromosomes. BMC Evol. Biol.8:96.

(11)

GENETICS

Supporting Information

http://www.genetics.org/cgi/content/full/genetics.110.116756/DC1

Gene Duplication, Gene Conversion and the Evolution

of the Y Chromosome

Tim Connallon and Andrew G. Clark

Copyright

Ó

2010 by the Genetics Society of America

(12)

T. Connallon and A. G. Clark 2 SI

FILE S1

I. Invasion of gene duplicates on Y chromosomes that carry an arbitrary number of linked genes.

Y-linked duplicate genes evolve within the genetic background of the entire Y chromosome, which is likely to contain

multiple functional genes, particularly during early stages of sex chromosome evolution. To determine the generality of the single

gene duplication scenario in the main text, we developed a second model to examine the evolutionary dynamics of rare, Y-linked

duplicates on ancestral chromosomes carrying an arbitrary number (n) of single-copy genes.

Consider a rare, Y-linked duplicate on Y chromosome carrying n single-copy genes. By duplicating one of the n single-copy

genes, the individual has n – 1 single-copy genes and a single duplicated pair. Though expanding the number of loci greatly

increases the number of possible genotypes to follow within the population, subsequent calculations can be simplified by making

each gene essential. In other words, fitness drops to zero (s = 1) unless each of the n genes has at least one functional copy.

Given this simplification, there are four relevant genotypic classes within the population: (i) individuals with n functional

singletons and no duplicates, each at frequency xn and with fitness wn = 1 – sh; (ii) those with n + 1 functional genes (n – 1

singleton) at frequency xn1 and with fitness wn1 = 1; (iii) those with n + 1 genes (n – 1 singleton), of which n are functional, at

frequency xn0 and with fitness wn0 = 1 – sh; and (4) a class of sterile individuals, at frequency xs and with fitness ws = 1 – s = 0, that

either lack a functional copy of an essential gene, or carry an abnormal Y chromosome.

In an individual carrying n singletons, the Y chromosome deleterious mutation rate per gamete per generation is U = nu, and

the distribution of mutations across gametes is reasonably modeled as a Poisson variable with mean of nu. However, given that the

diploid, genomic deleterious mutation rate is unlikely to be much greater than one, and Y chromosomes typically represent a tiny

fraction of a genome, the number of new mutations should be close to the Bernoulli distribution: U = nu is probability of one

mutation, and 1 – U represents the probability of zero mutations, per generation. For an individual carrying n + 1 total genes, the

overall mutational target will be slightly increased, and the Y chromosome mutation rate becomes Udup = U(n + 1)/n, per

generation. The presence of gene duplicates introduces an opportunity for gene conversion, which as before, are governed by

recombination rate (d), crossover (c), and conversion bias (b) parameters.

Following the events order of (i) birth, (ii) selection, (iii) mutation, (iv) recombination, and (v) fertilization, the Y chromosome

recursions are:

x

n1

'=

x

n1

[2

Ud

(1

c

)

b

+

(

n

U

Un

)(1

dc

)]

[

x

n1

+

(

x

n0

+

x

n

)(1

h

)]

n

+

x

n0

(1

h

)(1

U

)

d

(1

c

)

b

x

n1

+

(

x

n0

+

x

n

)(1

h

)

x

n0

'

=

2x

n1

U(1

d)

[x

n1

+

(x

n0

+

x

n

)(1

h)]n

+

x

n0

(1

h)(1

U)(1

d)

x

n1

+

(x

n0

+

x

n

)(1

h)

x

n

'

=

x

n

(1

h

)(1

U

)

(13)

T. Connallon and A. G. Clark 3 SI

x

s

'

=

x

n1

'

+

x

n0

'

+

x

n

'

Stability of the equilibrium

x

n1 = xn0 = 0,

x

ˆ

n

=

1

U

=

1

x

ˆ

s, and

w

=

(1

U

)(1

h)

is governed by the eigenvalue:

=

2Ud(1

c)b

+

(n

U

Un)(1

dc)

+

(1

h)(1

U)(1

d)n

2(1

h)(1

U)n

+

2Ud(1

c)b

+

(n

U

Un)(1

dc)

+

(1

h)(1

U)(1

d)n

{

}

2

4(n

U

Un)(1

dc)(1

d)(1

h)(1

U)n

2(1

h)(1

U)n

When there is no recombination (d = 0), a rare gene duplicate is favored by selection when sh > U/(nnU). Substituting for

U = nu yields sh > u/(1 – nu). This result differs slightly from the previous model of a duplicate linked to a single essential gene (the

former model predicts that a duplicate invades when sh > u/(1 – u)). Multiple Y-linked genes will therefore decrease opportunities

for positive selection in favor of new duplicates.

When selection is weak (sh 0), recombination can promote selection in favor of the duplicate. For sh = c = 0, the Taylor

series approximation around d = 0 gives a leading eigenvalue of:

=

d=0

+

d

d=0

d

+

O

(

d

2

)

1

+

d

(2

b

1)

which is greater than one for b > 0.5, as in the previous model. Numerical simulations of the leading eigenvalue under a broad

range of parameter space show that, as before, the opportunity for positive selection for a new duplicate is greater with

(14)

T. Connallon and A. G. Clark 4 SI

II. Invasion Probability of Duplicate Genes with Gene Conversion

FIGURE S1.—The probability of fixation for Y-linked duplicate genes. The red line depicts the analytical approximation from

Eq. (2). To facilitate comparison between these results and those of Fig. 2 from the main text, we show the approximation for N =

1000, s = 1, d = 0, and u = 10-5, and present representative simulation results for d > 0 and various combinations of the remaining

parameters (c, b). Circles represent the proportion of duplicate genotypes (out of 100,000 replicate simulations for each data point)

(15)

T. Connallon and A. G. Clark 5 SI

III. Maintenance of Functional Gene Duplicates

FIGURE S2.—Gene conversion and the maintenance of functionally redundant paralogs. Results are presented for two extremes of selection: gene conversion between paralogs of an essential gene (s = 1) and between paralogs of a nonessential gene (s

= 0.001). In each case, gene conversion is unbiased (b = 0.5) and the mutation rate is u = 10-5. Under essentiality and

(16)

T. Connallon and A. G. Clark 6 SI

IV. Frequency of the ‘least loaded class’ under biased gene conversion.

Figure

TABLE 1
Figure 2.—The probability of fixation for Y-linked dupli-

References

Related documents

International Journal of Scientific Research in Computer Science, Engineering and Information Technology CSEIT1833399 | Received 01 April 2018 | Accepted 15 April 2018 | March April 2018

Cuando el malware tiene la forma de un archivo ejecutable en algún lugar del disco duro, normalmente se inicia sin la intervención del usuario, invocado de forma automática, como

The UP documents are the formal printed manuals which are widely distributed through the UNIVAC Customer Information Distribution Center.. These manuals, which

(1) Published data or estimates of the absolute number of annual snakebites in a country and the number of persons seeking treatment for snakebite each year, reported by

Face verification is a biometric person recognition technique used to verify (confirm or deny) a claimed identity based on a face image or a set of faces (or a video sequence)..