Tandem chimerism as a means to increase protein complexity in the human genome - PubMed (original) (raw)

Tandem chimerism as a means to increase protein complexity in the human genome

Genís Parra et al. Genome Res. 2006 Jan.

Abstract

The "one-gene, one-protein" rule, coined by Beadle and Tatum, has been fundamental to molecular biology. The rule implies that the genetic complexity of an organism depends essentially on its gene number. The discovery, however, that alternative gene splicing and transcription are widespread phenomena dramatically altered our understanding of the genetic complexity of higher eukaryotic organisms; in these, a limited number of genes may potentially encode a much larger number of proteins. Here we investigate yet another phenomenon that may contribute to generate additional protein diversity. Indeed, by relying on both computational and experimental analysis, we estimate that at least 4%-5% of the tandem gene pairs in the human genome can be eventually transcribed into a single RNA sequence encoding a putative chimeric protein. While the functional significance of most of these chimeric transcripts remains to be determined, we provide strong evidence that this phenomenon does not correspond to mere technical artifacts and that it is a common mechanism with the potential of generating hundreds of additional proteins in the human genome.

PubMed Disclaimer

Figures

Figure 1.

Figure 1.

Flowchart depicting the protocol used to infer EST-supported TICs.

Figure 2.

Figure 2.

The three computationally predicted chimeras in the ENCODE regions verified by RT-PCR. (A) chimera ENm013_GCL+006; (B) chimera ENr331_GCL+002; (C) chimera ENm005_GCL-002. Only one had known chimeric mRNAs before our experiments. One of the chimeras—ENm005_GCL-002—still lacks known mRNAs. The results of the RT-PCR validation in 12 of the 24 tissues tested are shown. Asterisks mark positive amplimers. The tissues are (1) brain, (2) heart, (3) kidney, (4) spleen, (5) liver, (6) colon, (7) small intestine, (8) muscle, (9) lung, (10) stomach, (11) testis, and (12) placenta. Chimeras were tested in 12 additional tissues (not shown here).

Similar articles

Cited by

References

    1. Akiva, P., Toporik, A., Edelheit, S., Peretz, Y., Diber, A., Shemesh, R., Novik, A., and Sorek, R. 2006. Transcription-mediated gene fusion in the human genome. Genome Res. (this issue). - PMC - PubMed
    1. Athanasiadis, A., Rich, A., and Maas, S. 2004. Widespread A-to-I RNA editing of Alu-containing mRNAs in the human transcriptome. PLoS Biol. 2 e391. - PMC - PubMed
    1. Beaudoing, E. and Gautheret, D. 2001. Identification of alternate polyadenylation sites and analysis of their tissue distribution using EST data. Genome Res. 11 1520-1526. - PMC - PubMed
    1. Blanco, E., Parra, G., and Guigó, R. 2003. Using geneid to identify genes. In Current protocols in bioinformatics (eds. A. Baxevanis and D. Davison), Vol. 1, Unit 4.3. John Wiley, New York. - PubMed
    1. Blow, M., Futreal, P.A., Wooster, R., and Stratton, M.R. 2004. A survey of RNA editing in human brain. Genome Res. 14 2379-2387. - PMC - PubMed

Web site references

    1. http://bind.ca/; BIND databases.
    1. http://genome.cse.ucsc.edu/; UCSC genome browser database.
    1. http://genome.imim.es/datasets/chimeras2005/; Supplemental materials and results.
    1. http://genome.ucsc.edu/ENCODE/regions.html; UCSC ENCODE repository.
    1. http://string.embl.de/; STRING server.

Publication types

MeSH terms

LinkOut - more resources