TOX defines a conserved subfamily of HMG-box proteins - PubMed (original) (raw)

Comparative Study

TOX defines a conserved subfamily of HMG-box proteins

Emmett O'Flaherty et al. BMC Genomics. 2003.

Abstract

Background: HMG-box proteins are a large and diverse superfamily of architectural factors that share one or more copies of a sequence- and structurally-related DNA binding domain. These proteins can modify chromatin structure by bending and unwinding DNA. HMG-box proteins can be divided into two subfamilies based on whether they recognize DNA in a sequence-dependent or sequence-independent manner. We recently identified an HMG-box protein involved in T cell development, designated TOX, which is highly conserved in humans and mice.

Results: We show here that based on sequence alignment, TOX best fits into the sequence-independent HMG-box family. Three other human and murine predicted proteins are identified that share a common HMG-box domain with TOX, as well as other features. The gene encoding one of these additional family members has a distinct but overlapping pattern of tissue expression when compared to TOX. In addition, we identify genes encoding predicted TOX HMG-box subfamily members in pufferfish and mosquito.

Conclusions: We have identified a novel subfamily of HMG-box proteins that is related to the recently described TOX protein. The highly conserved nature of the TOX family of proteins in humans and mice and differences in the pattern of expression between family members suggest non-overlapping functions of individual proteins. In addition, our data suggest that the TOX subtype of HMG-box domain first appeared in invertebrates, was duplicated in early vertebrates and likely took on new functions in mammalian species.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Comparison of HMG-box sequences. The HMG-box region of TOX is aligned with the HMG-box motifs found in seven other proteins representing the two major subfamilies of HMG-box proteins. Similarities between the HMG-boxes of both groups of proteins are highlighted in gray and include matches to the consensus HMG-box domain (accession number pf00505) defined in the Pfam protein families' database. Residues in red and purple distinguish these two subgroups of HMG-box proteins and are discussed in the text. In addition, the consensus sequence GXXW (or more rarely AXXW, as also found in TOX) found commonly in HMG-boxes is shown in blue.

Figure 2

Figure 2

Murine TOX subfamily members and their human homologues. (A) The HMG-box and putative NLS regions within each predicted protein are shown in blue and green, respectively. The predicted size and molecular weight of each protein and the chromosomal location of the gene that encodes the protein are also indicated. *Note that the predicted protein LOC241768 as it currently exists in the NCBI database has been modified, and there is some question as to chromosomal location (see Methods). (B) Pair-wise comparison of HMG-box domains of murine and human TOX subfamily members. The upper predicted protein of each pair is mouse derived. Residues that differ from TOX are highlighted in red and the AXXW motif is highlighted in blue. The vertical line represents the position of the conserved exon boundary of the respective genes. (C) Comparison of predicted NLS of murine and human TOX subfamily members. The consensus motif is highlighted in green.

Figure 3

Figure 3

Comparison of TOX and LCP1. (A) Graphical representation of an unfiltered BLAST alignment comparing TOX and LCP1 proteins. Breaks in the proteins to allow maximum alignment are represented by a line. Residues in TOX that are identical to the aligned LCP1 protein are shown in black, while differences are shown in red. The NLS and HMG-box regions are indicated by green and blue bars, respectively. (B) Northern analysis of LCP1 gene expression in normalized poly-A+ RNA isolated from various tissues. The signal obtained from a probe to the housekeeping GAPDH gene is also shown as a loading control.

Figure 4

Figure 4

Identification of TOX subfamily predicted proteins in pufferfish and mosquito. Shown is an amino acid comparison of the TOX NLS and HMG-box regions with predicted proteins encoded in the Fugu rubripes or Anopheles gambiae genomes. Fugu rubripes predicted proteins are designated here simply by the scaffold (Scaf) location of the corresponding gene (see Methods). Amino acids that differ from the TOX HMG-box are highlighted in red, the putative NLS regions are highlighted in green, and the AXXW motif is highlighted in blue. The vertical line represents the position of the conserved exon boundary of the respective genes. Alternative amino acids found in other mammalian family members are also shown (family).

Similar articles

Cited by

References

    1. Goodwin GH, Johns EW. Isolation and characterisation of two calf-thymus chromatin non-histone proteins with high contents of acidic and basic amino acids. Eur J Biochem. 1973;40:215–219. - PubMed
    1. Bustin M. Regulation of DNA-dependent activities by the functional motifs of the high-mobility-group chromosomal proteins. Mol Cell Biol. 1999;19:5237–5246. - PMC - PubMed
    1. Thomas JO, Travers AA. HMG1 and 2, and related 'architectural' DNA-binding proteins. Trends Biochem Sci. 2001;26:167–174. doi: 10.1016/S0968-0004(01)01801-1. - DOI - PubMed
    1. Bustin M. Revised nomenclature for high mobility group (HMG) chromosomal proteins. Trends Biochem Sci. 2001;26:152–153. doi: 10.1016/S0968-0004(00)01777-1. - DOI - PubMed
    1. Baxevanis AD, Landsman D. The HMG-1 box protein family: classification and functional relationships. Nucleic Acids Res. 1995;23:1604–1613. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources