Differential expansion of zinc-finger transcription factor loci in homologous human and mouse gene clusters (original) (raw)

Deep Vertebrate Roots for Mammalian Zinc Finger Transcription Factor Subfamilies

Genome Biology and Evolution, 2014

While many vertebrate transcription factor (TF) families are conserved, the C2H2 zinc finger (ZNF) family stands out as a notable exception. In particular, novel ZNF gene types have arisen, duplicated, and diverged independently throughout evolution to yield many lineage-specific TF genes. This evolutionary dynamic not only raises many intriguing questions but also severely complicates identification of those ZNF genes that remain functionally conserved. To address this problem, we searched for vertebrate "DNA binding orthologs" by mining ZNF loci from eight sequenced genomes and then aligning the patterns of DNA-binding amino acids, or "fingerprints," extracted from the encoded ZNF motifs. Using this approach, we found hundreds of lineage-specific genes in each species and also hundreds of orthologous groups. Most groups of orthologs displayed some degree of fingerprint divergence between species, but 174 groups showed fingerprint patterns that have been very rigidly conserved. Focusing on the dynamic KRAB-ZNF subfamily-including nearly 400 human genes thought to possess potent KRAB-mediated epigenetic silencing activitieswe found only three genes conserved between mammals and nonmammalian groups. These three genes, members of an ancient familial cluster, encode an unusual KRAB domain that functions as a transcriptional activator. Evolutionary analysis confirms the ancient provenance of this activating KRAB and reveals the independent expansion of KRAB-ZNFs in every vertebrate lineage. Most human ZNF genes, from the most deeply conserved to the primate-specific genes, are highly expressed in immune and reproductive tissues, indicating that they have been enlisted to regulate evolutionarily divergent biological traits.

Rapid Sequence and Expression Divergence Suggest Selection for Novel Function in Primate-Specific KRAB-ZNF Genes

Molecular Biology and Evolution, 2010

Recent segmental duplications (SDs), arising from duplication events that occurred within the past 35-40 My, have provided a major resource for the evolution of proteins with primate-specific functions. KRAB zinc finger (KRAB-ZNF) transcription factor genes are overrepresented among genes contained within these recent human SDs. Here, we examine the structural and functional diversity of the 70 human KRAB-ZNF genes involved in the most recent primate SD events including genes that arose in the hominid lineage. Despite their recent advent, many parent-daughter KRAB-ZNF gene pairs display significant differences in zinc finger structure and sequence, expression, and splicing patterns, each of which could significantly alter the regulatory functions of the paralogous genes. Paralogs that emerged on the lineage to humans and chimpanzees have undergone more evolutionary changes per unit of time than genes already present in the common ancestor of rhesus macaques and great apes. Taken together, these data indicate that a substantial fraction of the recently evolved primate-specific KRAB-ZNF gene duplicates have acquired novel functions that may possibly define novel regulatory pathways and suggest an active ongoing selection for regulatory diversity in primates.

The ancient mammalian KRAB zinc finger gene cluster on human chromosome 8q24.3 illustrates principles of C2H2 zinc finger evolution associated with unique expression profiles in human tissues

BMC Genomics, 2010

Background Expansion of multi-C2H2 domain zinc finger (ZNF) genes, including the Krüppel-associated box (KRAB) subfamily, paralleled the evolution of tetrapodes, particularly in mammalian lineages. Advances in their cataloging and characterization suggest that the functions of the KRAB-ZNF gene family contributed to mammalian speciation. Results Here, we characterized the human 8q24.3 ZNF cluster on the genomic, the phylogenetic, the structural and the transcriptome level. Six (ZNF7, ZNF34, ZNF250, ZNF251, ZNF252, ZNF517) of the seven locus members contain exons encoding KRAB domains, one (ZNF16) does not. They form a paralog group in which the encoded KRAB and ZNF protein domains generally share more similarities with each other than with other members of the human ZNF superfamily. The closest relatives with respect to their DNA-binding domain were ZNF7 and ZNF251. The analysis of orthologs in therian mammalian species revealed strong conservation and purifying selection of the KRA...

A comprehensive catalog of human KRAB-associated zinc finger genes: Insights into the evolutionary history of a large family of transcriptional repressors

Genome Research, 2006

Krüppel-type zinc finger (ZNF) motifs are prevalent components of transcription factor proteins in all eukaryotes. KRAB-ZNF proteins, in which a potent repressor domain is attached to a tandem array of DNA-binding zinc-finger motifs, are specific to tetrapod vertebrates and represent the largest class of ZNF proteins in mammals. To define the full repertoire of human KRAB-ZNF proteins, we searched the genome sequence for key motifs and then constructed and manually curated gene models incorporating those sequences. The resulting gene catalog contains 423 KRAB-ZNF protein-coding loci, yielding alternative transcripts that altogether predict at least 742 structurally distinct proteins. Active rounds of segmental duplication, involving single genes or larger regions and including both tandem and distributed duplication events, have driven the expansion of this mammalian gene family. Comparisons between the human genes and ZNF loci mined from the draft mouse, dog, and chimpanzee genomes not only identified 103 KRAB-ZNF genes that are conserved in mammals but also highlighted a substantial level of lineage-specific change; at least 136 KRAB-ZNF coding genes are primate specific, including many recent duplicates. KRAB-ZNF genes are widely expressed and clustered genes are typically not coregulated, indicating that paralogs have evolved to fill roles in many different biological processes. To facilitate further study, we have developed a Web-based public resource with access to gene models, sequences, and other data, including visualization tools to provide genomic context and interaction with other public data sets. ; fax (925) 422-2099. Article published online before print. Article and publication date are at http:// www.genome.org/cgi/

Analysis of HomologousXRCC1-Linked Zinc-Finger Gene Families in Human and Mouse: Evidence for Orthologous Genes

Genomics, 1998

tion factors and have been shown in a few instances to Genetic and physical mapping studies indicate that affect cell proliferation and/or development (El-Baradi hundreds of zinc-finger (ZNF)-containing genes popuand Pieler, 1991, and references therein). Many, but late the human genome and that many of these genes apparently not all, of the Kruppel-type ZNF proteins are arranged in familial clusters. However, the extent contain conserved modules in addition to their zinc to which these tandemly arrayed families are confingers that are hypothesized to be involved in proteinserved among mammalian species is largely unknown. protein interactions associated with transcription con-In a previous study, we identified a conserved cluster trol. A non-ZNF motif frequently found in combination of Kruppel-associated box (KRAB)-containing ZNF with zinc fingers is the Kruppel-associated box (KRAB), genes located near the XRCC1 gene in human chromowhich is located at the N-terminus of approximately some 19q13.2 and mouse chromosome 7 and analyzed one-third of the mammalian C2H2-type ZNF proteins two members of the murine gene family, Zfp93 and . This element may be composed Zfp94, in detail. Here we report the identification and of a single A domain or, alternatively, of A and B docharacterization of putative human orthologs of these mains (Constantinou-Deltas et al., 1992; Bellefroid et murine genes. The human genes ZFP93 and ZNF45 are al., 1993). Several studies have demonstrated that the substantially similar to their murine counterparts in KRAB motif acts as a repressor of both activated and overall structure, but two notable differences exist bebasal transcription (Margolin et al., 1994; Witzgall et tween the sets of genes. First, the human genes encode Pengue and Lania, 1996) and that repression more ZNF repeats than their murine counterparts.

Gain, Loss and Divergence in Primate Zinc-Finger Genes: A Rich Resource for Evolution of Gene Regulatory Differences between Species

PLoS ONE, 2011

The molecular changes underlying major phenotypic differences between humans and other primates are not well understood, but alterations in gene regulation are likely to play a major role. Here we performed a thorough evolutionary analysis of the largest family of primate transcription factors, the Krüppel-type zinc finger (KZNF) gene family. We identified and curated gene and pseudogene models for KZNFs in three primate species, chimpanzee, orangutan and rhesus macaque, to allow for a comparison with the curated set of human KZNFs. We show that the recent evolutionary history of primate KZNFs has been complex, including many lineage-specific duplications and deletions. We found 213 speciesspecific KZNFs, among them 7 human-specific and 23 chimpanzee-specific genes. Two human-specific genes were validated experimentally. Ten genes have been lost in humans and 13 in chimpanzees, either through deletion or pseudogenization. We also identified 30 KZNF orthologs with human-specific and 42 with chimpanzee-specific sequence changes that are predicted to affect DNA binding properties of the proteins. Eleven of these genes show signatures of accelerated evolution, suggesting positive selection between humans and chimpanzees. During primate evolution the most extensive re-shaping of the KZNF repertoire, including most gene additions, pseudogenizations, and structural changes occurred within the subfamily homininae. Using zinc finger (ZNF) binding predictions, we suggest potential impact these changes have had on human gene regulatory networks. The large species differences in this family of TFs stands in stark contrast to the overall high conservation of primate genomes and potentially represents a potent driver of primate evolution.

Comparative Analysis of a Conserved Zinc Finger Gene Cluster on Human Chromosome 19q and Mouse Chromosome 7

Genomics, 1996

Several lines of evidence now suggest that many of the zinc-finger-containing (ZNF) genes in the human genome are arranged in clusters. However, little is known about the structure or function of the clusters or about their conservation throughout evolution. Here, we report the analysis of a conserved ZNF gene cluster located in human chromosome 19q13.2 and mouse chromosome 7. Our results indicate that the human cluster consists of at least 10 related Kruppel-associated box (KRAB)-containing ZNF genes organized in tandem over a distance of 350-450 kb. Two cDNA clones representing genes in the murine cluster have been studied in detail. The KRAB A domains of these genes are nearly identical and are highly similar to human 19q13.2-derived KRAB sequences, but DNA-binding ZNF domains and other portions of the genes differ considerably. The two murine genes display distinct expression patterns, but are coexpressed in some adult tissues. These studies pave the way for a systematic analysis of the evolution of structure and function of genes within the numerous clustered ZNF families located on human chromosome 19 and elsewhere in the human and mouse genomes.

KRAB Zinc Finger Proteins: An Analysis of the Molecular Mechanisms Governing Their Increase in Numbers and Complexity During Evolution

Krüppel-related zinc finger proteins, with 564 members in the human genome, probably constitute the largest individual family of transcription factors in mammals. Approximately 30% of these proteins carry a potent repressor domain called the Krüppel associated box (KRAB). Depending on the structure of the KRAB domain, these proteins have been further divided into three subfamilies (A ϩ B, A ϩ b, and A only). In addition, some KRAB zinc finger proteins contain another conserved motif called SCAN. To study their molecular evolution, an extensive comparative analysis of a large panel of KRAB zinc finger genes was performed. The results show that both the KRAB A ϩ b and the KRAB A subfamilies have their origin in a single member or a few closely related members of the KRAB A ϩ B family. The KRAB A ϩ B family is also the most prevalent among the KRAB zinc finger genes. Furthermore, we show that internal duplications of individual zinc finger motifs or blocks of several zinc finger motifs have occurred quite frequently within this gene family. However, zinc finger motifs are also frequently lost from the open reading frame, either by functional inactivation by point mutations or by the introduction of a stop codon. The introduction of a stop codon causes the exclusion of part of the zinc finger region from the coding region and the formation of graveyards of degenerate zinc finger motifs in the 3Ј-untranslated region of these genes. Earlier reports have shown that duplications of zinc finger genes commonly occur throughout evolution. We show that there is a relatively low degree of sequence conservation of the zinc finger motifs after these duplications. In many cases this may cause altered binding specificities of the transcription factors encoded by these genes. The repetitive nature of the zinc finger region and the structural flexibility within the zinc finger motif make these proteins highly adaptable. These factors may have been of major importance for their massive expansion in both number and complexity during metazoan evolution.