Evolutionary dynamics of duplicated genes in plants (original) (raw)

Gene duplication and evolution in recurring polyploidization–diploidization cycles in plants

Genome Biology, 2019

Background The sharp increase of plant genome and transcriptome data provide valuable resources to investigate evolutionary consequences of gene duplication in a range of taxa, and unravel common principles underlying duplicate gene retention. Results We survey 141 sequenced plant genomes to elucidate consequences of gene and genome duplication, processes central to the evolution of biodiversity. We develop a pipeline named DupGen_finder to identify different modes of gene duplication in plants. Genes derived from whole-genome, tandem, proximal, transposed, or dispersed duplication differ in abundance, selection pressure, expression divergence, and gene conversion rate among genomes. The number of WGD-derived duplicate genes decreases exponentially with increasing age of duplication events—transposed duplication- and dispersed duplication-derived genes declined in parallel. In contrast, the frequency of tandem and proximal duplications showed no significant decrease over time, provi...

Prevalent Role of Gene Features in Determining Evolutionary Fates of Whole-Genome Duplication Duplicated Genes in Flowering Plants

PLANT PHYSIOLOGY, 2013

The evolution of genes and genomes after polyploidization has been the subject of extensive studies in evolutionary biology and plant sciences. While a significant number of duplicated genes are rapidly removed during a process called fractionation, which operates after the whole-genome duplication (WGD), another considerable number of genes are retained preferentially, leading to the phenomenon of biased gene retention. However, the evolutionary mechanisms underlying gene retention after WGD remain largely unknown. Through genome-wide analyses of sequence and functional data, we comprehensively investigated the relationships between gene features and the retention probability of duplicated genes after WGDs in six plant genomes, Arabidopsis (Arabidopsis thaliana), poplar (Populus trichocarpa), soybean (Glycine max), rice (Oryza sativa), sorghum (Sorghum bicolor), and maize (Zea mays). The results showed that multiple gene features were correlated with the probability of gene retenti...

Duplications and Turnover in Plant Genomes

Plant Genome Diversity Volume 1, 2012

Plant and other eukaryotic genomes contain an abundance of duplications with a variety of origins. The power of these duplications to influence evolution is attenuated by the rate and nature of gene birth and death. In most eukaryotes, including plants, plots of the age of all gene duplications in

The fate of duplicated genes in a polyploid plant genome

The Plant Journal, 2012

SummaryPolyploidy is generally not tolerated in animals, but is widespread in plant genomes and may result in extensive genetic redundancy. The fate of duplicated genes is poorly understood, both functionally and evolutionarily. Soybean (Glycine max L.) has undergone two separate polyploidy events (13 and 59 million years ago) that have resulted in 75% of its genes being present in multiple copies. It therefore constitutes a good model to study the impact of whole‐genome duplication on gene expression. Using RNA‐seq, we tested the functional fate of a set of approximately 18 000 duplicated genes. Across seven tissues tested, approximately 50% of paralogs were differentially expressed and thus had undergone expression sub‐functionalization. Based on gene ontology and expression data, our analysis also revealed that only a small proportion of the duplicated genes have been neo‐functionalized or non‐functionalized. In addition, duplicated genes were often found in collinear blocks, and...

Whole genome duplications in plants: an overview fromArabidopsis

Journal of Experimental Botany, 2015

Polyploidy is a common event in plants that involves the acquisition of more than two complete sets of chromosomes. Allopolyploidy originates from interspecies hybrids while autopolyploidy originates from intraspecies whole genome duplication (WGD) events. In spite of inconveniences derived from chromosomic rearrangement during polyploidization, natural plant polyploids species often exhibit improved growth vigour and adaptation to adverse environments, conferring evolutionary advantages. These advantages have also been incorporated into crop breeding programmes. Many tetraploid crops show increased stress tolerance, although the molecular mechanisms underlying these different adaptation abilities are poorly known. Understanding the physiological, cellular, and molecular mechanisms coupled to WGD, in both allo-and autopolyploidy, is a major challenge. Over the last few years, several studies, many of them in Arabidopsis, are shedding light on the basis of genetic, genomic, and epigenomic changes linked to WGD. In this review we summarize and discuss the latest advances made in Arabidopsis polyploidy, but also in other agronomic plant species.

Evolutionary Dynamics and Functional Specialization of Plant Paralogs Formed by Whole and Small-Scale Genome Duplications

Molecular Biology and Evolution, 2012

Gene duplicates are a major source of evolutionary novelties in the form of new or specialized functions and play a key role in speciation. Gene duplicates are generated through whole genome duplications (WGD) or small-scale genome duplications (SSD). Although WGD preserves the stoichiometric relationships between duplicates, those arising from SSD are usually unbalanced and are expected to follow different evolutionary dynamics than those formed by WGD. To dissect the role of the mechanism of duplication in these differential dynamics and determine whether this role was shared across species, we performed a genome wide evolutionary analysis of gene duplications arising from the most recent WGD events and contemporary episodes of SSD in four model species representing distinct plant evolutionary lineages. We found an excess of relaxed purifying selection after duplication in SSD paralogs compared with WGD, most of which may have been the result of functional divergence events between gene copies as estimated by measures of genetic distances. These differences were significant in three angiosperm genomes but not in the moss species Physcomitrella patens. Although the comparison of models of evolution does not attribute a relevant role to the mechanism of duplication in the evolution duplicates, distribution of retained genes among Gene Ontology functional categories support the conclusion that evolution of gene duplicates depends on its origin of duplication (WGD and SSD) but, most importantly, on the species. Similar lineage-specific biases were also observed in protein network connectivity, translational efficiency, and selective constraints acting on synonymous codon usage. Although the mechanism of duplication may determine gene retention, our results attribute a dominant role to the species in determining the ultimate pattern of duplicate gene retention and reveal an unanticipated complexity in the evolutionary dynamics and functional specialization of duplicated genes in plants. Downloaded from d LRT for differences in substitution rates (DSR): Model 3 (discrete) vs. clade Model D (K = 3); df = 1. e LRT for PS: Model A null (!2 = 1) vs. Model A (0 < !0 < 1); df = 1.

Locating Large-Scale Gene Duplication Events through Reconciled Trees: Implications for Identifying Ancient Polyploidy Events in Plants

Journal of Computational Biology, 2009

Recent analyses of plant genomic data have found extensive evidence of ancient whole genome duplication (or polyploidy) events, but there are many unresolved questions regarding the number and timing of such events in plant evolutionary history. We describe the first exact and efficient algorithm for the Episode Clustering problem, which, given a collection of rooted gene trees and a rooted species tree, seeks the minimum number of locations on the species tree of gene duplication events. Solving this problem allows one to place gene duplication events onto nodes of a given species tree and potentially detect large-scale gene duplication events. We examined the performance of an implementation of our algorithm using 85 plant gene trees that contain genes from a total of 136 plant taxa. We found evidence of large-scale gene duplication events in Populus, Gossypium, Poaceae, Asteraceae, Brassicaceae, Solanaceae, Fabaceae, and near the root of the eudicot clade that are consistent with previous genomic evidence. However, a lack of phylogenetic signal within the gene trees can produce erroneous evidence of large-scale duplication events, especially near the root of the species tree. Although the results of our algorithm should be interpreted cautiously, they provide hypotheses for precise locations of large-scale gene duplication events with data from relatively few gene trees and can complement other genomic approaches to provide a more comprehensive view of ancient large-scale gene duplication events.

Impact of recurrent gene duplication on adaptation of plant genomes

BMC Plant Biology, 2014

Background: Recurrent gene duplication and retention played an important role in angiosperm genome evolution. It has been hypothesized that these processes contribute significantly to plant adaptation but so far this hypothesis has not been tested at the genome scale. Results: We studied available sequenced angiosperm genomes to assess the frequency of positive selection footprints in lineage specific expanded (LSE) gene families compared to single-copy genes using a d N /d S -based test in a phylogenetic framework. We found 5.38% of alignments in LSE genes with codons under positive selection. In contrast, we found no evidence for codons under positive selection in the single-copy reference set. An analysis at the branch level shows that purifying selection acted more strongly on single-copy genes than on LSE gene clusters. Moreover we detect significantly more branches indicating evolution under positive selection and/or relaxed constraint in LSE genes than in single-copy genes.