Complete sequencing and characterization of 21,243 full-length human cDNAs - PubMed (original) (raw)
Comparative Study
doi: 10.1038/ng1285. Epub 2003 Dec 21.
Yutaka Suzuki, Tetsuo Nishikawa, Tetsuji Otsuki, Tomoyasu Sugiyama, Ryotaro Irie, Ai Wakamatsu, Koji Hayashi, Hiroyuki Sato, Keiichi Nagai, Kouichi Kimura, Hiroshi Makita, Mitsuo Sekine, Masaya Obayashi, Tatsunari Nishi, Toshikazu Shibahara, Toshihiro Tanaka, Shizuko Ishii, Jun-ichi Yamamoto, Kaoru Saito, Yuri Kawai, Yuko Isono, Yoshitaka Nakamura, Kenji Nagahari, Katsuhiko Murakami, Tomohiro Yasuda, Takao Iwayanagi, Masako Wagatsuma, Akiko Shiratori, Hiroaki Sudo, Takehiko Hosoiri, Yoshiko Kaku, Hiroyo Kodaira, Hiroshi Kondo, Masanori Sugawara, Makiko Takahashi, Katsuhiro Kanda, Takahide Yokoi, Takako Furuya, Emiko Kikkawa, Yuhi Omura, Kumi Abe, Kumiko Kamihara, Naoko Katsuta, Kazuomi Sato, Machiko Tanikawa, Makoto Yamazaki, Ken Ninomiya, Tadashi Ishibashi, Hiromichi Yamashita, Katsuji Murakawa, Kiyoshi Fujimori, Hiroyuki Tanai, Manabu Kimata, Motoji Watanabe, Susumu Hiraoka, Yoshiyuki Chiba, Shinichi Ishida, Yukio Ono, Sumiyo Takiguchi, Susumu Watanabe, Makoto Yosida, Tomoko Hotuta, Junko Kusano, Keiichi Kanehori, Asako Takahashi-Fujii, Hiroto Hara, Tomo-o Tanase, Yoshiko Nomura, Sakae Togiya, Fukuyo Komai, Reiko Hara, Kazuha Takeuchi, Miho Arita, Nobuyuki Imose, Kaoru Musashino, Hisatsugu Yuuki, Atsushi Oshima, Naokazu Sasaki, Satoshi Aotsuka, Yoko Yoshikawa, Hiroshi Matsunawa, Tatsuo Ichihara, Namiko Shiohata, Sanae Sano, Shogo Moriya, Hiroko Momiyama, Noriko Satoh, Sachiko Takami, Yuko Terashima, Osamu Suzuki, Satoshi Nakagawa, Akihiro Senoh, Hiroshi Mizoguchi, Yoshihiro Goto, Fumio Shimizu, Hirokazu Wakebe, Haretsugu Hishigaki, Takeshi Watanabe, Akio Sugiyama, Makoto Takemoto, Bunsei Kawakami, Masaaki Yamazaki, Koji Watanabe, Ayako Kumagai, Shoko Itakura, Yasuhito Fukuzumi, Yoshifumi Fujimori, Megumi Komiyama, Hiroyuki Tashiro, Akira Tanigami, Tsutomu Fujiwara, Toshihide Ono, Katsue Yamada, Yuka Fujii, Kouichi Ozaki, Maasa Hirao, Yoshihiro Ohmori, Ayako Kawabata, Takeshi Hikiji, Naoko Kobatake, Hiromi Inagaki, Yasuko Ikema, Sachiko Okamoto, Rie Okitani, Takuma Kawakami, Saori Noguchi, Tomoko Itoh, Keiko Shigeta, Tadashi Senba, Kyoka Matsumura, Yoshie Nakajima, Takae Mizuno, Misato Morinaga, Masahide Sasaki, Takushi Togashi, Masaaki Oyama, Hiroko Hata, Manabu Watanabe, Takami Komatsu, Junko Mizushima-Sugano, Tadashi Satoh, Yuko Shirai, Yukiko Takahashi, Kiyomi Nakagawa, Koji Okumura, Takahiro Nagase, Nobuo Nomura, Hisashi Kikuchi, Yasuhiko Masuho, Riu Yamashita, Kenta Nakai, Tetsushi Yada, Yusuke Nakamura, Osamu Ohara, Takao Isogai, Sumio Sugano
Affiliations
- PMID: 14702039
- DOI: 10.1038/ng1285
Comparative Study
Complete sequencing and characterization of 21,243 full-length human cDNAs
Toshio Ota et al. Nat Genet. 2004 Jan.
Abstract
As a base for human transcriptome and functional genomics, we created the "full-length long Japan" (FLJ) collection of sequenced human cDNAs. We determined the entire sequence of 21,243 selected clones and found that 14,490 cDNAs (10,897 clusters) were unique to the FLJ collection. About half of them (5,416) seemed to be protein-coding. Of those, 1,999 clusters had not been predicted by computational methods. The distribution of GC content of nonpredicted cDNAs had a peak at approximately 58% compared with a peak at approximately 42%for predicted cDNAs. Thus, there seems to be a slight bias against GC-rich transcripts in current gene prediction procedures. The rest of the cDNAs unique to the FLJ collection (5,481) contained no obvious open reading frames (ORFs) and thus are candidate noncoding RNAs. About one-fourth of them (1,378) showed a clear pattern of splicing. The distribution of GC content of noncoding cDNAs was narrow and had a peak at approximately 42%, relatively low compared with that of protein-coding cDNAs.
Similar articles
- Characterization of 954 bovine full-CDS cDNA sequences.
Harhay GP, Sonstegard TS, Keele JW, Heaton MP, Clawson ML, Snelling WM, Wiedmann RT, Van Tassell CP, Smith TP. Harhay GP, et al. BMC Genomics. 2005 Nov 23;6:166. doi: 10.1186/1471-2164-6-166. BMC Genomics. 2005. PMID: 16305752 Free PMC article. - A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis).
Ralph SG, Chun HJ, Kolosova N, Cooper D, Oddy C, Ritland CE, Kirkpatrick R, Moore R, Barber S, Holt RA, Jones SJ, Marra MA, Douglas CJ, Ritland K, Bohlmann J. Ralph SG, et al. BMC Genomics. 2008 Oct 14;9:484. doi: 10.1186/1471-2164-9-484. BMC Genomics. 2008. PMID: 18854048 Free PMC article. - [Transcriptome and non-coding RNAs: so many mRNA-like non-coding RNAs are really functional?].
Kanai A. Kanai A. Tanpakushitsu Kakusan Koso. 2004 Dec;49(16):2521-8. Tanpakushitsu Kakusan Koso. 2004. PMID: 15609715 Review. Japanese. No abstract available. - Genome annotation past, present, and future: how to define an ORF at each locus.
Brent MR. Brent MR. Genome Res. 2005 Dec;15(12):1777-86. doi: 10.1101/gr.3866105. Genome Res. 2005. PMID: 16339376 Review.
Cited by
- Development of luciferase-based highly sensitive reporters that detect ER-associated protein biogenesis abnormalities.
Kadokura H, Harada N, Yamaki S, Hirai N, Tsukuda R, Azuma K, Amagai Y, Nakamura D, Yanagitani K, Taguchi H, Kohno K, Inaba K. Kadokura H, et al. iScience. 2024 Oct 16;27(11):111189. doi: 10.1016/j.isci.2024.111189. eCollection 2024 Nov 15. iScience. 2024. PMID: 39555403 Free PMC article. - Overexpression of Glyoxalase 2 in Human Breast Cancer Cells: Implications for Cell Proliferation and Doxorubicin Resistance.
Romaldi B, Scirè A, Minnelli C, Frontini A, Casari G, Cianfruglia L, Mobbili G, de Bari L, Antognelli C, Pallardó FV, Armeni T. Romaldi B, et al. Int J Mol Sci. 2024 Oct 10;25(20):10888. doi: 10.3390/ijms252010888. Int J Mol Sci. 2024. PMID: 39456676 Free PMC article. - Host long noncoding RNAs in bacterial infections.
Cheng Y, Liang Y, Tan X, Liu L. Cheng Y, et al. Front Immunol. 2024 Sep 2;15:1419782. doi: 10.3389/fimmu.2024.1419782. eCollection 2024. Front Immunol. 2024. PMID: 39295861 Free PMC article. Review. - Investigation of TMEM41A's function in breast cancer prognosis and its connection to immune cell infiltration.
Fan F, Feng R, Zhang Y, Li X, Tang Y. Fan F, et al. Clin Transl Oncol. 2024 Sep 12. doi: 10.1007/s12094-024-03714-y. Online ahead of print. Clin Transl Oncol. 2024. PMID: 39264531 - Functional complementation of two splicing variants of Gustavus in Neocaridina denticulata sinensis during ovarian maturation.
Liang M, Feng D, Zhang J, Sun Y. Liang M, et al. Sci Rep. 2024 Sep 9;14(1):20939. doi: 10.1038/s41598-024-72080-0. Sci Rep. 2024. PMID: 39251721 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials
Miscellaneous