Extent of gene duplication in the genomes of Drosophila, nematode, and yeast - PubMed (original) (raw)
Extent of gene duplication in the genomes of Drosophila, nematode, and yeast
Zhenglong Gu et al. Mol Biol Evol. 2002 Mar.
Abstract
We conducted a detailed analysis of duplicate genes in three complete genomes: yeast, Drosophila, and Caenorhabditis elegans. For two proteins belonging to the same family we used the criteria: (1) their similarity is > or =I (I = 30% if L > or = 150 a.a. and I = 0.01n + 4.8L(-0.32(1 + exp(-L/1000))) if L < 150 a.a., where n = 6 and L is the length of the alignable region), and (2) the length of the alignable region between the two sequences is > or = 80% of the longer protein. We found it very important to delete isoforms (caused by alternative splicing), same genes with different names, and proteins derived from repetitive elements. We estimated that there were 530, 674, and 1,219 protein families in yeast, Drosophila, and C. elegans, respectively, so, as expected, yeast has the smallest number of duplicate genes. However, for the duplicate pairs with the number of substitutions per synonymous site (K(S)) < 0.01, Drosophila has only seven pairs, whereas yeast has 58 pairs and nematode has 153 pairs. After considering the possible effects of codon usage bias and gene conversion, these numbers became 6, 55, and 147, respectively. Thus, Drosophila appears to have much fewer young duplicate genes than do yeast and nematode. The larger numbers of duplicate pairs with K(S) < 0.01 in yeast and C. elegans were probably largely caused by block duplications. At any rate, it is clear that the genome of Drosophila melanogaster has undergone few gene duplications in the recent past and has much fewer gene families than C. elegans.
Similar articles
- Detection of gene duplications and block duplications in eukaryotic genomes.
Li WH, Gu Z, Cavalcanti AR, Nekrutenko A. Li WH, et al. J Struct Funct Genomics. 2003;3(1-4):27-34. J Struct Funct Genomics. 2003. PMID: 12836682 Review. - Patterns of gene duplication in Saccharomyces cerevisiae and Caenorhabditis elegans.
Cavalcanti AR, Ferreira R, Gu Z, Li WH. Cavalcanti AR, et al. J Mol Evol. 2003 Jan;56(1):28-37. doi: 10.1007/s00239-002-2377-2. J Mol Evol. 2003. PMID: 12569420 - GenomeHistory: a software tool and its application to fully sequenced genomes.
Conant GC, Wagner A. Conant GC, et al. Nucleic Acids Res. 2002 Aug 1;30(15):3378-86. doi: 10.1093/nar/gkf449. Nucleic Acids Res. 2002. PMID: 12140322 Free PMC article. - Pattern and timing of gene duplication in animal genomes.
Friedman R, Hughes AL. Friedman R, et al. Genome Res. 2001 Nov;11(11):1842-7. doi: 10.1101/gr.200601. Genome Res. 2001. PMID: 11691848 Free PMC article. - Birth and death of duplicated genes in completely sequenced eukaryotes.
Wagner A. Wagner A. Trends Genet. 2001 May;17(5):237-9. doi: 10.1016/s0168-9525(01)02243-0. Trends Genet. 2001. PMID: 11335019 Review.
Cited by
- Genome-Wide Analysis and Expression Profiling of Lectin Receptor-like Kinase Genes in Watermelon (Citrullus lanatus).
Lv D, Wang G, You J, Zhu L, Yang H, Cao B, Gu W, Li C. Lv D, et al. Int J Mol Sci. 2024 Jul 29;25(15):8257. doi: 10.3390/ijms25158257. Int J Mol Sci. 2024. PMID: 39125826 Free PMC article. - Genome-Wide Analyses of MADS-Box Genes Reveal Their Involvement in Seed Development and Oil Accumulation of Tea-Oil Tree (Camellia oleifera).
Zhang X, He W, Wang X, Duan Y, Li Y, Wang Y, Jiang Q, Liao B, Zhou S, Li Y. Zhang X, et al. Int J Genomics. 2024 Jul 29;2024:3375173. doi: 10.1155/2024/3375173. eCollection 2024. Int J Genomics. 2024. PMID: 39105136 Free PMC article. - Genome-wide identification and drought stress-induced expression analysis of the NHX gene family in potato.
Yihong J, Zhen L, Chang L, Ziying S, Ning Z, Meiqing S, Yuhui L, Lei W. Yihong J, et al. Front Genet. 2024 Jul 11;15:1396375. doi: 10.3389/fgene.2024.1396375. eCollection 2024. Front Genet. 2024. PMID: 39055260 Free PMC article. - Genome-wide identification of the HSP70 genes in Pacific oyster Magallana gigas and their response to heat stress.
Lu H, Liu C, Yang C, He Z, Wang L, Song L. Lu H, et al. Cell Stress Chaperones. 2024 Aug;29(4):589-602. doi: 10.1016/j.cstres.2024.06.002. Epub 2024 Jun 21. Cell Stress Chaperones. 2024. PMID: 38908469 Free PMC article. - Genome-wide identification and evolutionary analysis of the AP2/EREBP, COX and LTP genes in Zea mays L. under drought stress.
Maghraby A, Alzalaty M. Maghraby A, et al. Sci Rep. 2024 Mar 31;14(1):7610. doi: 10.1038/s41598-024-57376-5. Sci Rep. 2024. PMID: 38556556 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous