Molecular Evolution of the Avian CHD1 Genes on the Z and W Sex Chromosomes (original) (raw)
Journal Article
,
Department of Evolutionary Biology
, Evolutionary Biology Centre, Uppsala University, SE-752-36 Uppsala, Sweden
Search for other works by this author on:
Department of Evolutionary Biology
, Evolutionary Biology Centre, Uppsala University, SE-752-36 Uppsala, Sweden
Corresponding author: Hans Ellegren, Department of Evolutionary Biology, Evolutionary Biology Ctr., Uppsala University, Norbyvaägen 18D, SE-752-36 Uppsala, Sweden. E-mail: hans.ellegren@ebc.uu.se
Search for other works by this author on:
Received:
08 December 1999
Published:
01 August 2000
Navbar Search Filter Mobile Enter search term Search
Abstract
Genes shared between the nonrecombining parts of the two types of sex chromosomes offer a potential means to study the molecular evolution of the same gene exposed to different genomic environments. We have analyzed the molecular evolution of the coding sequence of the first pair of genes found to be shared by the avian Z (present in both sexes) and W (female-specific) sex chromosomes, CHD1Z and CHD1W. We show here that these two genes evolve independently but are highly conserved at nucleotide as well as amino acid levels, thus not indicating a female-specific role of the CHD1W gene. From comparisons of sequence data from three avian lineages, the frequency of nonsynonymous substitutions (_K_a) was found to be higher for CHD1W (1.55 per 100 sites) than for CHD1Z (0.81), while the opposite was found for synonymous substitutions (_K_s, 13.5 vs. 22.7). We argue that the lower effective population size and the absence of recombination on the W chromosome will generally imply that nonsynonymous substitutions accumulate faster on this chromosome than on the Z chromosome. The same should be true for the Y chromosome relative to the X chromosome in XY systems. Our data are compatible with a male-biased mutation rate, manifested by the faster rate of neutral evolution (synonymous substitutions) on the Z chromosome than on the female-specific W chromosome.
THE underlying factors affecting the molecular evolution of sex-linked genes differ in some important ways as compared to those governing the evolution of autosomal genes. First, the effective population size of sex-linked genes is always smaller than that of autosomal genes, implying different fixation probabilities of a given selection coefficient (Charlesworth et al. 1987; Li 1997). Second, while autosomal genes spend an equal amount of time in the male as in the female germline, sex-linked genes show a bias with respect to their transmission through the two sexes. The mammalian X chromosome, for example, is two-thirds of the time in the female germline. Moreover, genes from the nonrecombining part of one of the sex chromosomes are exclusively transmitted by a single sex. This means that if the patterns of mutation or selection differ between sexes, sex-linked genes will evolve in a “sex-biased” fashion (Miyata et al. 1987). For instance, several lines of evidence from several organisms indicate that the mutation rate of males is higher than that of females, a situation commonly attributed to the many more mitotic germline cell divisions in spermatogenesis than in oogenesis (Miyata et al. 1987; Shimmin et al. 1993; Ellegren and Fridolfsson 1997; Hurst and Ellegren 1998). Moreover, the degree of methylation of CpG sites, which increases the mutability of such sites, may differ between sexes (Driscoll and Migeon 1990). Third, while there as yet is no evidence for chromosome-specific mutation rates of vertebrate autosomes, a lowered mutation rate has been suggested for the mammalian X chromosome, which could be adaptive by reducing the effect of slightly deleterious mutations being exposed in hemizygote males (McVean and Hurst 1997). Fourth, possible dosage and dominance effects might act differentially on genes on sex chromosomes (Charlesworth et al. 1987) and their recombination rates may also differ. Thus, a number of sex- or chromosome-specific factors may be manifested in the molecular evolution of sex-linked genes.
Since the selection pressure on individual genes varies enormously, empirically addressing the effects of sex- and chromosome-specific factors in molecular evolutionary processes ideally requires analyses of the same gene exposed to different genomic environments. This is obviously not possible for single-copy genes and most multigene families are either autosomal or sex-linked, at least with respect to expressed gene copies. However, a very special class of genes offers a possibility to study these factors, namely, genes shared between the nonrecombining parts of the two types of sex chromosomes. In principle, sex chromosomes are thought to evolve from an ancestral pair of autosomes, where, following the arrest of recombination, one of the chromosomes gradually becomes degraded and devoid of most genes (Charlesworth 1996; Rice 1996). Since degradation will in most cases not be complete, a few genes will remain on the smaller sex chromosome (e.g., the mammalian Y chromosome) and will thus be present in a copy both on this and on the larger nondegraded sex chromosome (e.g., the X chromosome). As shown for mammals, some of these genes are associated with male-specific or male-enhancing functions and may actually become silenced or deleted from the X chromosome (Graves 1995). Others, however, will be expressed from both sex chromosomes. Only a limited number of such genes have yet been identified (Lahn and Page 1997).
In birds, the female is the heterogametic sex and she has one Z and one W chromosome, whereas the male has two Z chromosomes. Physically, the W chromosome resembles the mammalian Y chromosome in several respects; it is small, gene-poor, and mainly heterochromatic (Stefos and Arrighi 1971). Studies of genes shared by the Z and W chromosomes would be important for reasons discussed above, and the avian sex chromosome system makes it possible to distinguish between some sex- and chromosome-specific factors that confound analyses in mammals (cf. Crow 1997; Ellegren and Fridolfsson 1997; Lessels 1997). Moreover, such studies are also motivated by the fact that the role of the W chromosome in avian sex determination is still unclear (Ellegren 2000). The critical issue is whether it is the W chromosome that is required for female development or if it is the number of Z chromosomes that regulates male development, i.e., a dominance (as in mammals) or a balance (as in Drosophila and Caenorhabditis elegans) mode of genic sex determination. If it is the latter, and circumstantial evidence lends some support to this idea (Crew 1954; Halverson and Dvorak 1993; Raymond et al. 1999; Smith et al. 1999), the question is what selective constraints act on W-linked genes and why they have been retained on this chromosome.
Two avian genes have recently been shown to exist in a copy on both the Z and the W chromosome, the CHD1Z/CHD1W gene pair (Ellegren 1996; Griffiths et al. 1996; Griffiths and Korn 1997; Fridolfsson et al. 1998) and the ATP5A1Z/ATP5A1W pair (Dvorak et al. 1992; Fridolfsson et al. 1998; Carmichael et al. 2000). The avian CHD1 genes belong to a family of genes composed of a chromatin organization modifier (chromo) domain, a SNF2-related helicase/ATPase domain, and a DNA-binding domain and the protein has been named CHD to denote these domains. Functional studies in model organisms have indicated that CHD1 alters the chromatin structure and thereby facilitates gene expression (Stokes and Perry 1995; Stokes et al. 1996). It is not yet known if avian CHD1Z and CHD1W are functionally differentiated.
In this study, we present a detailed analysis of the molecular evolution of avian CHD1Z and CHD1W genes. Based on sequence data from three avian species, we show that the two genes are highly conserved both with respect to CHD1 genes in other organisms and to each other. However, the two genes appear to evolve independently, without signs of genetic exchange through recombination. CHD1Z has a lower frequency of nonsynonymous (_K_a) but a higher frequency of synonymous (_K_s) substitutions compared with CHD1W. We attribute these differences to the respective characteristics of effective population size, recombination, and sex-specific mutation rates associated with the two types of sex chromosomes.
MATERIALS AND METHODS
PCR and cloning: mRNA was prepared from 25 μl of fresh whole blood from one male and one female of Tengmalm's owl (Aegolius funerus) and of cockatiel (Lutino cockatiel), with a Quick Prep Micro mRNA purification kit (Pharmacia Biotech, Piscataway, NJ). The Access reverse transcriptase PCR (RT-PCR) system (Promega, Madison, WI) was used with 1/500 of each mRNA preparation, together with the primer combinations described below, to amplify overlapping fragments of the CHD1 genes. Obtaining CHD1Z was straightforward since amplification of male mRNA yields only this gene, even when using primer sequences conserved between CHD1Z and CHD1W. The following five pairs were used to amplify CHD1Z: 1090F (Ellegren 1996) and 2128R (Fridolfsson et al. 1998), 1628F (Fridolfsson et al. 1998) and 2469R (Ellegren 1996), 2421F (Ellegren 1996) and 3112R (Ellegren and Fridolfsson 1997), 2895F (5′-CGGCTAGTCACAAAAGGATC-3′) and 3681R (Ellegren and Fridolfsson 1997), and finally P3 (Griffiths and Tiwari 1995) and 4104R (Ellegren 1996).
Specific amplification of CHD1W in female birds is complicated by the fact that CHD1Z and CHD1W are very similar and are both expressed in females. We used a combined strategy of W-specific primers (underlined below), on the basis of sequence information from chicken CHD1W, and single-strand conformation polymorphism (SSCP) analysis to identify CHD1W products in amplifications of female DNA. The primers used were: 1275F (Ellegren 1996) and 1869R (5′-CATCCATTCATGAGTTCTTAT-3′), 1628F and 2469R, 2421F and 3112R, 2987F (Ellegren and Fridolfsson 1997) and 3829R (5′-GCCAACTCTTCTTCGTGAGAA-3′), and 3468F (Ellegren 1996) and 4105R. RT-PCR conditions were 48° for 45 min, then an initial denaturation step of 94° for 2 min followed by a 10-cycle touchdown profile consisting of 94° for 30 sec, 60–50° (lowering the temperature by 1° per cycle) for 30 sec, and 68° for 1 min. Then 30 cycles of the same profile were run at a constant annealing temperature of 50°, and a final extension step of 68° for 10 min was added after the last cycle.
Amplification products were separated by agarose gel electrophoresis (1.5% agarose, Sea Kem) and fragments of the expected size were excised and purified (Qiaex II gel extraction kit, QIAGEN, Hilden, Germany) and ligated into pGEM-T vector (pGEM-T easy vector systems, Promega). For CHD1W, 10 clones of each ligation were reamplified with the same primers and were analyzed with SSCP, together with clones known to contain the Z copy of the fragment (i.e., amplified from males). Clones containing CHD1W could thereby be identified on the basis of the contrasting SSCP patterns of CHD1Z and CHD1W sequences. Clones were sequenced with vector primers using BigDye terminator cycle sequencing chemistry (Perkin Elmer, Norwalk, CT), followed by analysis on an ABI377 automated sequencing instrument (Perkin Elmer, Foster City, CA). The fact that we used overlap-ping fragments allowed us to ensure that correct clones had always been identified.
Genes were named with a prefix denoting the Latin name of the species of origin (chicken, Gg; Tengmalm's owl, Af; cockatiel, Lc). For use in analyses we obtained from GenBank chicken CHD1Z (AF004397), mouse (Mm, L10410), human (Hs, AF006513), Drosophila melanogaster (Dm, X99021), Saccharomyces cerevisiae (Sc, L10718), and Arabidopsis thaliana (At, AC007209) CHD1 gene sequences. Sequences obtained in this study have been deposited in GenBank under accession nos. AF181824–AF181828.
Sequence analysis: Contigs of the coding sequence of CHD1Z and CHD1W from each species were constructed using Sequencher 3.0 (Gene Codes, Ann Arbor, MI). Avian sequences were aligned with Sequence Navigator (Applied Biosystems, Foster City, CA) and MEGA (Kumar et al. 1993) was used for translation and analyses of amino acid (aa) distances and base composition. Phylogenetic analyses were done by maximum parsimony (MP) and maximum likelihood (ML) as implemented in PAUP* 4.0b2A (Swofford 1998). PHYLIP version 3.5c (Felsenstein 1991) was used for UPGMA clustering of sequences based on synonymous substitutions. The frequency of synonymous and nonsynonymous substitutions and their standard errors were calculated by combining the information from twofold and fourfold degenerate sites and using the Kimura two-parameter model to correct for multiple hits (Li 1993; Pamilo and Bianchi 1993). Patterns of variation in the _K_a/_K_s ratio across genes were calculated by dividing the gene into 18 nonoverlapping sections, each containing 51 codons. Spearman rank correlation was used to test if patterns were significantly repeatable. To test for positive selection in individual CHD1W lineages or among sites, the program codeml included in PAML (Yang 1999) was used. Analysis of CpG sites followed the method described by Smith and Hurst (1999), where sites on both strands were included in the analysis. In several analyses, we present means of _K_a and _K_s in comparisons of different CHD1Z and CHD1W sequences. However, since only three avian species were studied, it should be noted that the three possible comparisons (chicken vs. Tengmalm's owl, chicken vs. cockatiel, and Tengmalm's owl vs. cockatiel) do not represent independent observations.
RESULTS
Independent evolution of CHD1W and CHD1Z: Based on overlapping fragments amplified by RT-PCR of mRNA prepared from blood, we sequenced 2754 bp of the coding region of the CHD1Z and CHD1W genes from two divergent bird species, Tengmalm's owl and cockatiel. This continuous region covers most of the three functional domains of the CHD protein, i.e., the chromo domain, the helicase domain, and the DNA-binding domain. The obtained sequences could be aligned with chicken CHD1Z (Griffiths and Korn 1997) and CHD1W (Ellegren 1996) without gaps.
As a starting point for further analysis, we first asked whether CHD1Z and CHD1W genes are evolving independently. Phylogenetic analysis with both MP and ML, using mouse and human CHD1 as outgroups, clustered the three CHD1Z and the three CHD1W genes separately (Figure 1). The ML tree has a stronger bootstrap support (84/100) than the MP tree (66/100), which is not unexpected given that maximum-likelihood analysis is
Figure 1.
Phylogenetic tree showing the relationship between six avian CHD1 gene sequences, using human and mouse CHD1 as outgroups. Numbers indicate bootstrap support for branches using maximum-parsimony and maximum-likelihood analysis (1000 replicates in each case).
less sensitive to long-branch attraction (Huelsenbeck 1997; the outgroups are only distantly related to birds and the branch leading to Gg_CHD1Z_ is the longest within the avian CHD1 tree; moreover, the rate of evolution varies among the branches; see below). The best alternative MP tree (two steps longer) places the root on the Gg_CHD1Z_ branch, and the best alternative ML tree (Δln L_= −5.34) places the root on the Ag_CHD1Z/Lc_CHD1Z_ branch; both alternatives are indeed unlikely. These results indicate that the CHD1Z and CHD1W genes of the three avian lineages under study have evolved without signs of genetic exchange (e.g., through recombination) between the Z and the W chromosomes. Importantly, the respective molecular evolution of CHD1Z and CHD1W should therefore reflect the intrinsic and different evolutionary forces operating on the two sex chromosomes.
High degree of amino acid conservation in CHD1 genes: The frequency of aa replacements between different gene copies was derived from alignments of avian CHD1Z and CHD1W sequences and of CHD1 from mouse, Drosophila, yeast, and Arabidopsis (Table 1). While avian sequences could be aligned to the mouse sequence without gaps, gaps had to be introduced relative to the more distantly related species. Overall levels of conservation were very high, with, for instance, about five replacements per 100 sites between avian and mouse genes.
Comparisons of avian CHD1Z and CHD1W aa sequences revealed that the two proteins are very similar to each other (mean = 3.2 ± 0.6 aa replacements per 100 sites, range 2.5–3.9; Figure 2), suggesting shared functional properties. Within the respective class of genes, CHD1Z proteins (mean = 1.2 ± 0.1, range 1.2–1.3) are more slowly evolving than CHD1W proteins (3.4 ± 0.5, range 2.8–3.7).
TABLE 1
Number of amino acid replacements per site between CHD1 genes
MmCHD1 | DmCHD1 | AtCHD1 | ScCHD1 | |
---|---|---|---|---|
All domains_a_ | ||||
GgCHDIW | 0.06 | 0.42 | 0.56 | 0.60 |
GgCHDIZ | 0.04 | 0.42 | 0.56 | 0.60 |
Chromo domain_b_ | ||||
GgCHDIW | 0.09 | 0.50 | 0.58 | 0.66 |
GgCHDIZ | 0.04 | 0.51 | 0.58 | 0.67 |
Helicase domain_c_ | ||||
GgCHDIW | 0.03 | 0.27 | 0.40 | 0.47 |
GgCHDIZ | 0.02 | 0.26 | 0.39 | 0.46 |
Intervening region between the H and D domains_d_ | ||||
GgCHDIW | 0.09 | 0.54 | 0.44 | 0.70 |
GgCHDIZ | 0.05 | 0.53 | 0.45 | 0.70 |
DNA-binding domain_e_ | ||||
GgCHDIW | 0.09 | 0.56 | 0.77 | 0.71 |
GgCHDIZ | 0.06 | 0.58 | 0.77 | 0.71 |
MmCHD1 | DmCHD1 | AtCHD1 | ScCHD1 | |
---|---|---|---|---|
All domains_a_ | ||||
GgCHDIW | 0.06 | 0.42 | 0.56 | 0.60 |
GgCHDIZ | 0.04 | 0.42 | 0.56 | 0.60 |
Chromo domain_b_ | ||||
GgCHDIW | 0.09 | 0.50 | 0.58 | 0.66 |
GgCHDIZ | 0.04 | 0.51 | 0.58 | 0.67 |
Helicase domain_c_ | ||||
GgCHDIW | 0.03 | 0.27 | 0.40 | 0.47 |
GgCHDIZ | 0.02 | 0.26 | 0.39 | 0.46 |
Intervening region between the H and D domains_d_ | ||||
GgCHDIW | 0.09 | 0.54 | 0.44 | 0.70 |
GgCHDIZ | 0.05 | 0.53 | 0.45 | 0.70 |
DNA-binding domain_e_ | ||||
GgCHDIW | 0.09 | 0.56 | 0.77 | 0.71 |
GgCHDIZ | 0.06 | 0.58 | 0.77 | 0.71 |
a
981 aa.
b
162 aa.
c
461 aa.
d
140 aa.
e
218 aa.
TABLE 1
Number of amino acid replacements per site between CHD1 genes
MmCHD1 | DmCHD1 | AtCHD1 | ScCHD1 | |
---|---|---|---|---|
All domains_a_ | ||||
GgCHDIW | 0.06 | 0.42 | 0.56 | 0.60 |
GgCHDIZ | 0.04 | 0.42 | 0.56 | 0.60 |
Chromo domain_b_ | ||||
GgCHDIW | 0.09 | 0.50 | 0.58 | 0.66 |
GgCHDIZ | 0.04 | 0.51 | 0.58 | 0.67 |
Helicase domain_c_ | ||||
GgCHDIW | 0.03 | 0.27 | 0.40 | 0.47 |
GgCHDIZ | 0.02 | 0.26 | 0.39 | 0.46 |
Intervening region between the H and D domains_d_ | ||||
GgCHDIW | 0.09 | 0.54 | 0.44 | 0.70 |
GgCHDIZ | 0.05 | 0.53 | 0.45 | 0.70 |
DNA-binding domain_e_ | ||||
GgCHDIW | 0.09 | 0.56 | 0.77 | 0.71 |
GgCHDIZ | 0.06 | 0.58 | 0.77 | 0.71 |
MmCHD1 | DmCHD1 | AtCHD1 | ScCHD1 | |
---|---|---|---|---|
All domains_a_ | ||||
GgCHDIW | 0.06 | 0.42 | 0.56 | 0.60 |
GgCHDIZ | 0.04 | 0.42 | 0.56 | 0.60 |
Chromo domain_b_ | ||||
GgCHDIW | 0.09 | 0.50 | 0.58 | 0.66 |
GgCHDIZ | 0.04 | 0.51 | 0.58 | 0.67 |
Helicase domain_c_ | ||||
GgCHDIW | 0.03 | 0.27 | 0.40 | 0.47 |
GgCHDIZ | 0.02 | 0.26 | 0.39 | 0.46 |
Intervening region between the H and D domains_d_ | ||||
GgCHDIW | 0.09 | 0.54 | 0.44 | 0.70 |
GgCHDIZ | 0.05 | 0.53 | 0.45 | 0.70 |
DNA-binding domain_e_ | ||||
GgCHDIW | 0.09 | 0.56 | 0.77 | 0.71 |
GgCHDIZ | 0.06 | 0.58 | 0.77 | 0.71 |
a
981 aa.
b
162 aa.
c
461 aa.
d
140 aa.
e
218 aa.
_K_a and _K_a/_K_s ratios of avian CHD1Z and CHD1W genes: In accordance with the aa data, _K_a was lower for CHD1Z (mean = 0.81 ± 0.08 nonsynonymous nucleotide substitutions per 100 sites) than for CHD1W (1.55 ± 0.30; Table 2), which in turn was only marginally less than that for CHD1Z vs. CHD1W (1.85 ± 0.31). However, since the overall mutation rate may differ between the Z and W chromosomes (Ellegren and Fridolfsson 1997), a more appropriate measure of the evolutionary forces operating on CHD1Z and CHD1W should be their
Figure 2.
Amino acid alignment of avian CHD1 genes, with mouse CHD1 as master sequence. Identical positions are denoted by dots and positions for which data are lacking are denoted by dashes. There are no gaps. Positions are numbered according to the complete aa sequence of mouse. Known functional domains or motifs are boxed.
_K_a/_K_s ratios. Mean _K_a/_K_s for CHD1Z (0.037 ± 0.01) was considerably lower than for CHD1W (0.11 ± 0.01).
Selective forces upon replacement substitutions can obviously be different for different parts of a gene, leading to variation in the pattern of _K_a/_K_s across genes (Alvarez-Valin et al. 1998). Repeatability of _K_a/_K_s patterns in comparisons of independent pairs of gene lineages is an indication of nonrandom substitution rates (Smith and Hurst 1998) and is suggestive of different gene copies sharing functional properties. The patterns of _K_a/_K_s variation across avian CHD1 genes were roughly similar in the three possible comparisons of CHD1Z and CHD1W genes (Figure 3). For instance, _K_a/_K_s was particularly low in the 3′ end of the helicase domain of both CHD1Z and CHD1W. Repeatability was statistically significant for Tengmalm's owl vs. cockatiel (_R_s = 0.60, P = 0.013), but not so in the two other comparisons.
Although the fact that _K_a/_K_s never exceeded 0.35 suggests an absence of positive selection, a higher _K_a/_K_s ratio in CHD1W than in CHD1Z genes might be indicative of adaptive changes in individual lineages or in parts of the CHD1W gene. To investigate this further, we used a likelihood-ratio test implemented in PAML (Yang and Nielsen 1998; Yang 1999). However, this failed to reject a null hypothesis of equal _K_a/K_s ratios in individual lineages, tested in all possible topologies of CHD1W trees [2Δ_l = 0.92, d.f. = 2, not significant (NS)]. Similarly, a likelihood-ratio test failed to reject a
TABLE 2
Frequency of nonsynonymous substitutions between avian CHD1 genes
AfCHDIZ | GgCHDIZ | LcCHDIZ | AfCHDIW | GgCHDIW | LcCHDIW |
---|---|---|---|---|---|
AfCHDIZ | 0.81 | 0.73 | 1.45 | 2.07 | 1.67 |
GgCHDIZ | 0.15 | 0.88 | 1.61 | 2.30 | 1.82 |
LcCHDIZ | 0.14 | 0.15 | 1.58 | 2.32 | 1.84 |
AfCHDIW | 0.19 | 0.20 | 0.20 | 1.68 | 1.21 |
GgCHDIW | 0.23 | 0.25 | 0.25 | 0.21 | 1.77 |
LcCHDIW | 0.21 | 0.22 | 0.22 | 0.17 | 0.22 |
AfCHDIZ | GgCHDIZ | LcCHDIZ | AfCHDIW | GgCHDIW | LcCHDIW |
---|---|---|---|---|---|
AfCHDIZ | 0.81 | 0.73 | 1.45 | 2.07 | 1.67 |
GgCHDIZ | 0.15 | 0.88 | 1.61 | 2.30 | 1.82 |
LcCHDIZ | 0.14 | 0.15 | 1.58 | 2.32 | 1.84 |
AfCHDIW | 0.19 | 0.20 | 0.20 | 1.68 | 1.21 |
GgCHDIW | 0.23 | 0.25 | 0.25 | 0.21 | 1.77 |
LcCHDIW | 0.21 | 0.22 | 0.22 | 0.17 | 0.22 |
Estimated number and standard error of nucleotide substitutions per 100 nonsynonymous (_K_a) sites between CHD1W and CHD1Z genes. _K_a values are above the diagonal; standard errors are below the diagonal.
TABLE 2
Frequency of nonsynonymous substitutions between avian CHD1 genes
AfCHDIZ | GgCHDIZ | LcCHDIZ | AfCHDIW | GgCHDIW | LcCHDIW |
---|---|---|---|---|---|
AfCHDIZ | 0.81 | 0.73 | 1.45 | 2.07 | 1.67 |
GgCHDIZ | 0.15 | 0.88 | 1.61 | 2.30 | 1.82 |
LcCHDIZ | 0.14 | 0.15 | 1.58 | 2.32 | 1.84 |
AfCHDIW | 0.19 | 0.20 | 0.20 | 1.68 | 1.21 |
GgCHDIW | 0.23 | 0.25 | 0.25 | 0.21 | 1.77 |
LcCHDIW | 0.21 | 0.22 | 0.22 | 0.17 | 0.22 |
AfCHDIZ | GgCHDIZ | LcCHDIZ | AfCHDIW | GgCHDIW | LcCHDIW |
---|---|---|---|---|---|
AfCHDIZ | 0.81 | 0.73 | 1.45 | 2.07 | 1.67 |
GgCHDIZ | 0.15 | 0.88 | 1.61 | 2.30 | 1.82 |
LcCHDIZ | 0.14 | 0.15 | 1.58 | 2.32 | 1.84 |
AfCHDIW | 0.19 | 0.20 | 0.20 | 1.68 | 1.21 |
GgCHDIW | 0.23 | 0.25 | 0.25 | 0.21 | 1.77 |
LcCHDIW | 0.21 | 0.22 | 0.22 | 0.17 | 0.22 |
Estimated number and standard error of nucleotide substitutions per 100 nonsynonymous (_K_a) sites between CHD1W and CHD1Z genes. _K_a values are above the diagonal; standard errors are below the diagonal.
Figure 3.
Variation in _K_a/_K_s across CHD1Z (solid squares) and CHD1W (open squares) genes. Each data point represents 102 codons, with an overlapping window of 51 codons. (a) Tengmalm's owl vs. chicken; (b) Tengmalm's owl vs. cockatiel; and (c) chicken vs. cockatiel.
null hypothesis of equal _K_a/K_s ratios among sites (2Δ_l = −27, d.f. = 2, NS).
A higher _K_s in CHD1Z than in CHD1W: The frequency of synonymous substitutions (_K_s) was higher for CHD1Z (mean = 22.70 ± 6.62) than for CHD1W (mean = 13.48 ± 2.06; Table 3), which contrasts to the situation for _K_a. This indicates an underlying sex difference in the mutation rate, assuming that synonymous substitutions in CHD1Z and CHD1W are selectively neutral or are at least under the same constraints. Pairwise comparisons of _K_s revealed estimates of the male-to-female mutation rate ratio (αm) of 2.1 ± 0.3 (Tengmalm's owl vs. chicken), 2.1 ± 0.3 (cockatiel vs. chicken), and 1.5 ± 0.2 (cockatiel vs. Tengmalm's owl). A mean value of αm ≈ 1.7 was estimated from the branch lengths of a dendrogram based on _K_s distances (cf. Shimmin et al. 1993). Since the phylogenetic relationship of the CHD1W genes was unresolved, this mean value is only an approximation.
Low influence of CpG sites on _K_s: The GC content of CHD1Z (mean = 40.5 ± 0.1) and CHD1W genes (39.1 ± 0.1) was lower than an average of 53.2% estimated from 399 chicken genes (Olivier and Marin 1996), but did not differ between the two types of genes (χ2 = 0.04, NS). The GC3 content was even lower (CHD1Z, 36.0 ± 0.5; CHD1W, 33.0 ± 0.4; chicken average, 69.4%; Bernardi et al. 1988), but again did not differ between CHD1Z and CHD1W (χ2 = 0.20, NS). The observed number of CpG sites was about five times lower than expected based on base composition in both CHD1Z (ratio of observed/expected = 0.17) and CHD1W (0.20). This underrepresentation is of the same magnitude as the average for genes in the human genome (Schorderet and Gartler 1992).
In separate analyses of CHD1Z and CHD1W, we counted the number of synonymous and nonsynonymous sites where all three sequences had a CpG dinucleotide. This number was compared to the number of sites where at least one sequence had a TpG dinucleotide while the other/s had a CpG dinucleotide, i.e., possible cases of C–T transitions at methylated CpG sites. Since both the total number of CpG sites (CHD1Z, 15; CHD1W, 15) and the number of sites with possible C–T transitions (CHD1Z, 4; CHD1W, 5) were low, and did not differ between genes, we conclude that methylated CpG sites seem not to affect the molecular evolution of CHD1Z and CHD1W evolution in a contrasting way.
DISCUSSION
Very few aa changes distinguish avian CHD1Z and CHD1W proteins (eight positions represent fixed differences,
TABLE 3
Frequency of synonymous substitutions between avian CHD1 genes
AfCHD1Z | GgCHD1Z | LcCHD1Z | AfCHD1W | GgCHD1W | LcCHD1W |
---|---|---|---|---|---|
AfCHDIZ | 26.45 | 15.06 | 21.36 | 25.75 | 22.82 |
GgCHDIZ | 1.70 | 26.59 | 29.30 | 31.90 | 29.76 |
LcCHDIZ | 1.20 | 1.74 | 23.53 | 27.31 | 25.46 |
AfCHDIW | 1.50 | 1.87 | 1.57 | 15.19 | 11.19 |
GgCHDIW | 1.65 | 1.93 | 1.75 | 1.23 | 14.07 |
LcCHDIW | 1.54 | 1.88 | 1.66 | 1.07 | 1.14 |
AfCHD1Z | GgCHD1Z | LcCHD1Z | AfCHD1W | GgCHD1W | LcCHD1W |
---|---|---|---|---|---|
AfCHDIZ | 26.45 | 15.06 | 21.36 | 25.75 | 22.82 |
GgCHDIZ | 1.70 | 26.59 | 29.30 | 31.90 | 29.76 |
LcCHDIZ | 1.20 | 1.74 | 23.53 | 27.31 | 25.46 |
AfCHDIW | 1.50 | 1.87 | 1.57 | 15.19 | 11.19 |
GgCHDIW | 1.65 | 1.93 | 1.75 | 1.23 | 14.07 |
LcCHDIW | 1.54 | 1.88 | 1.66 | 1.07 | 1.14 |
Estimated number and standard error of nucleotide substitutions per 100 synonymous (_K_s) sites between CHD1W and CHD1Z genes. _K_s values are above the diagonal; standard errors are below the diagonal.
TABLE 3
Frequency of synonymous substitutions between avian CHD1 genes
AfCHD1Z | GgCHD1Z | LcCHD1Z | AfCHD1W | GgCHD1W | LcCHD1W |
---|---|---|---|---|---|
AfCHDIZ | 26.45 | 15.06 | 21.36 | 25.75 | 22.82 |
GgCHDIZ | 1.70 | 26.59 | 29.30 | 31.90 | 29.76 |
LcCHDIZ | 1.20 | 1.74 | 23.53 | 27.31 | 25.46 |
AfCHDIW | 1.50 | 1.87 | 1.57 | 15.19 | 11.19 |
GgCHDIW | 1.65 | 1.93 | 1.75 | 1.23 | 14.07 |
LcCHDIW | 1.54 | 1.88 | 1.66 | 1.07 | 1.14 |
AfCHD1Z | GgCHD1Z | LcCHD1Z | AfCHD1W | GgCHD1W | LcCHD1W |
---|---|---|---|---|---|
AfCHDIZ | 26.45 | 15.06 | 21.36 | 25.75 | 22.82 |
GgCHDIZ | 1.70 | 26.59 | 29.30 | 31.90 | 29.76 |
LcCHDIZ | 1.20 | 1.74 | 23.53 | 27.31 | 25.46 |
AfCHDIW | 1.50 | 1.87 | 1.57 | 15.19 | 11.19 |
GgCHDIW | 1.65 | 1.93 | 1.75 | 1.23 | 14.07 |
LcCHDIW | 1.54 | 1.88 | 1.66 | 1.07 | 1.14 |
Estimated number and standard error of nucleotide substitutions per 100 synonymous (_K_s) sites between CHD1W and CHD1Z genes. _K_s values are above the diagonal; standard errors are below the diagonal.
six of which are conservative changes). Similarly, comparisons of eukaryotic CHD1 genes, including avian CHD1Z and CHD1W, reveal extensive conservation, particularly in the functional domains. For instance, only one fixed amino acid difference distinguishes birds from mammals over a region of 180 aa residues in the helicase domain (Figure 2). In fact, the helicase domain is highly conserved even between different members of the CHD gene family (Woodage et al. 1997), indicating strong functional constraints. The DNA-binding activity of the CHD1 protein has been located to a domain of 229 aa residues and within this region a sequence of 11 aa is essential for DNA binding by A · T minor-groove interactions (Stokes and Perry 1995). This sequence motif is identical between avian and mouse CHD1 genes. Overall, this suggests (i) that CHD1Z and CHD1W share similar functional properties and (ii) that this function should be more or less the same as in other organisms.
Comparative analyses of nonsynonymous substitution rates are preferably made using the _K_a/_K_s ratio to account for local variation in the mutation rate. In our study, we found _K_a/_K_s to be higher for CHD1W (0.11) than for CHD1Z (0.04), which in turn was higher than for CHD1 in mammals (0.025). Since _K_a/_K_s for CHD1W is ⪡1, which is the strict requirement for demonstration of positive selection, we found no overall suggestion that CHD1W would be rapidly diverging in an adaptive way. Likelihood-ratio tests similarly failed to detect signs of positive selection in terms of variation in _K_a/_K_s among CHD1W lineages or among CHD1W sites. Moreover, the patterns of _K_a/_K_s variation across the gene were similar between CHD1Z and CHD1W genes. This, together with the high degree of aa conservation seen between CHD1Z and CHD1W, strongly argues against a female-specific role of CHD1W. In fact, it might be argued that CHD1Z and CHD1W act in concert and in a sense should be seen as allelic variants of the same functional protein. It should be noted that positive selection has been recognized in male-specific and Y-linked sequences in mammals. For example, the mammalian SRY gene shows a _K_a/_K_s ratio of 1.3 (Tucker and Lundrigan 1993; Whitfield et al. 1993).
We argue that the difference in _K_a/_K_s between CHD1Z and CHD1W is associated with differences in effective population size and recombination characteristics of the two types of sex chromosomes. First, selection is more effective in removing slightly deleterious mutations in a population of larger size (Nei 1970; Li 1997). Other factors being equal, this should imply that such mutations are more easily removed from the Z chromosome since its effective population size is three times that of the W chromosome. Second, since most parts of the W chromosome do not recombine and are thus clonally transmitted, slightly deleterious mutations should be expected to accumulate faster than on the Z chromosome (Charlesworth 1996; Rice 1996). The expectation is in both cases a higher _K_a/_K_s ratio on the W chromosome than on the Z chromosome, as we observe. The same should be true for genes on the Y (analogous to W) and X (analogous to Z) chromosomes of mammals and is indeed supported by available data. The _K_a/_K_s ratio is higher for Ube1y (0.19) than for Ube1x (0.0; Chang and Li 1995), for ZFY (0.42) than for ZFX (0.13; Pamilo and Bianchi 1993), and for SMCY (0.17) than for SMCX (0.02; Agulnik et al. 1997).
In contrast to the situation for _K_a, _K_s was higher for CHD1Z than for CHD1W. From a similar observation based on partial sequence data, we recently interpreted this as evidence for a male-biased mutation rate, given that W is exclusively transmitted through the female germline (Ellegren and Fridolfsson 1997). Applying the formula of Miyata et al. (1987), present data suggest a male bias in the mutation rate of αm ≈ 1.7 in the lineages studied, which is lower than our previous estimate of αm ≈ 3.9 derived from the coding regions of CHD1 genes of two passerine bird species (Ellegren and Fridolfsson 1997). It is not clear if this suggests variation in αm between avian lineages, since the validity of statistical analyses is uncertain due to the difficulty in estimating confidence intervals of αm. Importantly, all presently available bird data indicate more mutations among males than females.
Does the excess of male mutations tie in quantitatively with the difference in the number of germline cell divisions between males and females? This question is difficult to address due to the lack of detailed cytological data, although it seems quite clear that spermatogenesis involves more cell generations than oogenesis in birds (Jones and Lin 1993). Moreover, if there is an intrinsic reduction in the Z chromosome mutation rate, as has been suggested for the mammalian X chromosome (McVean and Hurst 1997), comparisons of the rate of neutral evolution on Z and W chromosomes would tend to underestimate αm. On the other hand, αm could overestimate the difference in the number of cell divisions in male and female germlines if the per cell generation mutation rate differs between sexes. One such potential factor is the degree of germline methylation, which affects the mutability of CpG sites (Li 1997). For example, methylation has been invoked to explain the male-biased mutation rate at hemophilia A CpG sites, which are more strongly methylated in male than in female germline (Oldenburg et al. 1993; Sommer and Ketterling 1996). According to the present data, however, a potential role of methylated CpG sites in explaining the male-biased mutation rate of avian CHD1 genes could be excluded.
In summary, the genomic location of the CHD1Z and CHD1W genes on the avian sex chromosomes is likely to have affected the molecular evolution of these two genes in distinct ways. While the two proteins are highly conserved and do not seem functionally differentiated, they differ with respect to frequency of synonymous and nonsynonymous nucleotide substitutions. Since the respective factors contributing to these differences (effective population size, recombination, and sex-specific mutation rates) should be valid for sex chromosomes in general, we anticipate the observed patterns of molecular evolution to be general characteristics of sex-linked genes.
Acknowledgement
We thank Tim Hipkiss and Jan Högberg for providing avian blood samples. We also thank Nick Smith and Bengt-Olle Bengtsson for valuable comments on the manuscript. Financial support was obtained from the Swedish Natural Sciences Research Council and Lars Hiertas Minne.
Footnotes
Communicating editor: P. D. Keightley
LITERATURE CITED
Agulnik
A I
,
Bishop
C E
,
Lerner
J L
,
Agulnik
S I
,
Solovyev
V V
,
1997
Analysis of mutation rates in the SMCY/SMCX genes shows that mammalian evolution is male driven
.
Mamm. Genome
8
:
134
–
138
.
Alvarez-Valin
F
,
Jabbari
K
,
Bernardi
G
,
1998
Synonymous and nonsynonymous substitutions in mammalian genes: intragenic correlations
.
J. Mol. Evol.
46
:
37
–
44
.
Bernardi
G
,
Mouchiroud
D
,
Gautier
C
,
Bernardi
G
,
1988
Compositional patterns in vertebrate genomes: conservation and change in evolution
.
J. Mol. Evol.
28
:
7
–
18
.
Carmichael
A N
,
Fridolfsson
A K
,
Halverson
J
,
Ellegren
H
,
2000
Male-biased mutation rates revealed from Z- and W-chromosome linked ATP synthase α-subunit (ATP5A1) sequences in birds
.
J. Mol. Evol.
50
:
443
–
446
.
Chang
B H
,
Li
W-H
,
1995
Estimating the intensity of male-driven evolution in rodents by using X-linked and Y-linked Ube1 genes and pseudogenes
.
J. Mol. Evol.
40
:
70
–
77
.
Charlesworth
B
,
1996
The evolution of chromosomal sex determination and dosage compensation
.
Curr. Biol.
6
:
149
–
162
.
Charlesworth
B
,
Coyne
J A
,
Barton
N H
,
1987
The relative rates of evolution of sex chromosomes and autosomes
.
Am. Nat.
130
:
113
–
146
.
Crew
F A E
,
1954
Sex Determination
.
John Wiley & Sons
,
New York
.
Crow
J F
,
1997
Molecular evolution—who is in the driver's seat?
Nat. Genet.
17
:
129
–
130
.
Driscoll
D J
,
Migeon
B R
,
1990
Sex difference in methylation of single-copy genes in human meiotic germ cells: implications for X chromsome inactivation, parental imprinting and origin of CpG mutations
.
Somat. Cell Mol. Genet.
16
:
267
–
282
.
Dvorak
J
,
Halverson
J L
,
Gulick
P
,
Rauen
K A
,
Abbott
U K
et al. ,
1992
cDNA cloning of a Z- and W-linked gene in Gallinaceous birds
.
J. Hered.
83
:
22
–
25
.
Ellegren
H
,
1996
First gene on the avian W chromosome provides a tag for universal sexing of non-ratite birds
.
Proc. R. Soc. Lond. Ser. B
263
:
1635
–
1641
.
Ellegren
H
,
2000
Evolution of the avian sex chromosomes and their role in sex determination
.
Trends Ecol. Evol.
15
:
188
–
192
.
Ellegren
H
,
Fridolfsson
A-K
,
1997
Male-driven evolution of DNA sequences in birds
.
Nat. Genet.
17
:
182
–
184
.
Felsenstein
J
,
1991
PHYLIP: phylogeny inference package. Version 3.4
.
University of Washington
,
Seattle
.
Fridolfsson
A-K
,
Cheng
H
,
Copeland
N G
,
Jenkins
N A
,
Liu
H C
et al. ,
1998
Evolution of the avian sex chromosomes from an ancestral pair of autosomes
.
Proc. Natl. Acad. Sci. USA
95
:
8147
–
8152
.
Graves
J A M
,
1995
The evolution of mammalian sex chromosomes and the origin of sex determining genes
.
Philos. Trans. R. Soc. Lond. Ser. B
350
:
305
–
312
.
Griffiths
R
,
Korn
R M
,
1997
A CHD1 gene is Z chromosome linked in the chicken Gallus domesticus
.
Gene
197
:
225
–
229
.
Griffiths
R
,
Tiwari
B
,
1995
Sex of the last wild Spix's macaw
.
Nature
375
:
454
.
Griffiths
R
,
Daan
S
,
Dijkstra
C
,
1996
Sex identification in birds using two CHD genes
.
Proc. R. Soc. Lond. Ser. B
263
:
1251
–
1256
.
Halverson
J J
,
Dvorak
J
,
1993
Genetic control of sex determination in birds and the potential for its manipulation
.
Poult. Sci.
72
:
890
–
896
.
Huelsenbeck
J P
,
1997
Is the Felsenstein zone a fly trap?
Syst. Biol.
46
:
69
–
74
.
Hurst
L D
,
Ellegren
H
,
1998
Sex bias in the mutation rate
.
Trends Genet.
14
:
446
–
451
.
Jones
R C
,
Lin
M
,
1993
Spermatogenesis in birds
, pp.
233
–
264
in
Oxford Review of Reproductive Biology
, edited by
Milligan
S R
.
Oxford University Press
,
Oxford
.
Kumar
S
,
Tamura
K
,
Nei
M
,
1993
MEGA: Molecular Evolutionary Genetic Analysis, version 1.0
.
Pennsylvania State University
,
University Park, PA
.
Lahn
B T
,
Page
D C
,
1997
Functional coherence of the human Y chromosome
.
Science
278
:
675
–
680
.
Lessels
K
,
1997
More mutations in males
.
Nature
390
:
236
–
237
.
Li
W-H
,
1993
Unbiased estimation of the rates of synonymous and nonsynonymous substitution
.
J. Mol. Evol.
36
:
96
–
99
.
Li
W-H
,
1997
Molecular Evolution
.
Sinauer Associates
,
Sunderland, MA
.
McVean
G T
,
Hurst
L D
,
1997
Evidence for a selectively favourable reduction in the mutation rate of the X chromosome
.
Nature
386
:
388
–
395
.
Miyata
T
,
Hayashida
H
,
Kuma
K
,
Mitsuyasu
K
,
Yasunaga
T
,
1987
Male-driven molecular evolution: a model and nucleotide sequence analysis
.
Cold Spring Harbor Symp. Quant. Biol.
52
:
863
–
867
.
Nei
M
,
1970
Accumulation of nonfunctional genes on a sheltered chromosome
.
Am. Nat.
104
:
311
–
322
.
Oldenburg
J
,
Scwaab
R
,
Grimm
T
,
Zerres
K
,
Hakenberg
K
et al. ,
1993
Direct and indirect estimation of the sex ratio of mutation frequencies in hemophilia A
.
Am. J. Hum. Genet.
53
:
1229
–
1238
.
Olivier
J L
,
Marin
A
,
1996
A relationship between GC content and coding sequence length
.
J. Mol. Evol.
43
:
216
–
223
.
Pamilo
P
,
Bianchi
N O
,
1993
Evolution of the Zfx and Zfy genes: rates and interdependence between the genes
.
Mol. Biol. Evol.
10
:
271
–
281
.
Raymond
C S
,
Kettlewell
J R
,
Hirsch
B
,
Bardwell
V J
,
Zarkower
D
,
1999
Expression of Dmrt1 in the genital ridge of mouse and chicken embryos suggests a role in vertebrate sexual development
.
Dev. Biol.
215
:
208
–
220
.
Rice
W R
,
1996
Evolution of the Y sex chromosome in animals
.
BioScience
46
:
331
–
343
.
Schorderet
D F
,
Gartler
S M
,
1992
Analysis of CpG suppression in methylated and nonmethylated species
.
Proc. Natl. Acad. Sci. USA
89
:
957
–
961
.
Shimmin
L C
,
Chang
B H
,
Li
W-H
,
1993
Male-driven evolution of DNA sequences
.
Nature
362
:
745
–
747
.
Smith
N G C
,
Hurst
L D
,
1998
Molecular evolution of an imprinted gene: repeatability of patterns of evolution within the mammalian insulin-like growth factor type II receptor
.
Genetics
150
:
823
–
833
.
Smith
N G C
,
Hurst
L D
,
1999
The causes of synonymous rate variation in rodent genome: can substitution rates be used to estimate the sex bias in mutation rate?
Genetics
152
:
661
–
673
.
Smith
C A
,
McClive
P J
,
Western
P S
,
Reed
K J
,
Sinclair
A H
,
1999
Conservation of a sex-determining gene
.
Nature
402
:
601
–
602
.
Sommer
S S
,
Ketterling
R P
,
1996
The factor IX gene as a model for analysis of human germline mutations: an update
.
Hum. Mol. Genet.
5
:
1505
–
1514
.
Stefos
K
,
Arrighi
F E
,
1971
Heterochromatic nature of W chromosome in birds
.
Exp. Cell Res.
68
:
228
–
231
.
Stokes
D G
,
Perry
R P
,
1995
DNA-binding and chromatin localization properties of CHD1
.
Mol. Cell. Biol.
5
:
2745
–
2753
.
Stokes
D G
,
Tartof
K D
,
Perry
R P
,
1996
CHD1 is concentrated in interbands and puffed regions of Drosophila polytene chromosomes
.
Proc. Natl. Acad. Sci. USA
93
:
7137
–
7142
.
Swofford
D L
,
1998
PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods), version 4.0b2a
.
Sinauer Associates
,
Sunderland, MA
.
Tucker
P K
,
Lundrigan
B L
,
1993
Rapid evolution of the sex determining locus in Old World mice and rats
.
Nature
364
:
715
–
717
.
Whitfield
S L
,
Lovell-Badge
R
,
Goodfellow
P N
,
1993
Rapid sequence evolution of the mammalian sex-determining gene SRY
.
Nature
364
:
713
–
715
.
Woodage
T
,
Basra
M A
,
Baxevanis
A D
,
Hieter
P
,
Collins
F S
,
1997
Characterization of the CHD family of proteins
.
Proc. Natl. Acad. Sci. USA
94
:
11472
–
11477
.
Yang
Z
,
1999
Phylogenetic Analysis by Maximum Likelihood (PAML), version 2.0
.
University College
,
London
.
Yang
Z
,
Nielsen
R
,
1998
Synonymous and nonsynonymous rate variation in nuclear genes of mammals
.
J. Mol. Evol.
46
:
409
–
418
.
© Genetics 2000
Citations
Views
Altmetric
Metrics
Total Views 681
475 Pageviews
206 PDF Downloads
Since 2/1/2021
Month: | Total Views: |
---|---|
February 2021 | 6 |
March 2021 | 1 |
April 2021 | 10 |
May 2021 | 15 |
June 2021 | 6 |
July 2021 | 1 |
August 2021 | 9 |
September 2021 | 17 |
October 2021 | 30 |
November 2021 | 6 |
December 2021 | 1 |
January 2022 | 9 |
February 2022 | 11 |
March 2022 | 9 |
April 2022 | 6 |
May 2022 | 15 |
June 2022 | 3 |
July 2022 | 7 |
August 2022 | 28 |
September 2022 | 30 |
October 2022 | 5 |
November 2022 | 11 |
December 2022 | 22 |
January 2023 | 7 |
February 2023 | 6 |
March 2023 | 6 |
April 2023 | 7 |
May 2023 | 11 |
June 2023 | 10 |
July 2023 | 11 |
August 2023 | 19 |
September 2023 | 74 |
October 2023 | 64 |
November 2023 | 32 |
December 2023 | 13 |
January 2024 | 10 |
February 2024 | 16 |
March 2024 | 13 |
April 2024 | 18 |
May 2024 | 10 |
June 2024 | 12 |
July 2024 | 13 |
August 2024 | 19 |
September 2024 | 32 |
October 2024 | 20 |
×
Email alerts
Citing articles via
More from Oxford Academic