Skip to Content
Merck
CN
  • Complete sequencing and characterization of 21,243 full-length human cDNAs.

Complete sequencing and characterization of 21,243 full-length human cDNAs.

Nature genetics (2004-01-01)
Toshio Ota, Yutaka Suzuki, Tetsuo Nishikawa, Tetsuji Otsuki, Tomoyasu Sugiyama, Ryotaro Irie, Ai Wakamatsu, Koji Hayashi, Hiroyuki Sato, Keiichi Nagai, Kouichi Kimura, Hiroshi Makita, Mitsuo Sekine, Masaya Obayashi, Tatsunari Nishi, Toshikazu Shibahara, Toshihiro Tanaka, Shizuko Ishii, Jun-ichi Yamamoto, Kaoru Saito, Yuri Kawai, Yuko Isono, Yoshitaka Nakamura, Kenji Nagahari, Katsuhiko Murakami, Tomohiro Yasuda, Takao Iwayanagi, Masako Wagatsuma, Akiko Shiratori, Hiroaki Sudo, Takehiko Hosoiri, Yoshiko Kaku, Hiroyo Kodaira, Hiroshi Kondo, Masanori Sugawara, Makiko Takahashi, Katsuhiro Kanda, Takahide Yokoi, Takako Furuya, Emiko Kikkawa, Yuhi Omura, Kumi Abe, Kumiko Kamihara, Naoko Katsuta, Kazuomi Sato, Machiko Tanikawa, Makoto Yamazaki, Ken Ninomiya, Tadashi Ishibashi, Hiromichi Yamashita, Katsuji Murakawa, Kiyoshi Fujimori, Hiroyuki Tanai, Manabu Kimata, Motoji Watanabe, Susumu Hiraoka, Yoshiyuki Chiba, Shinichi Ishida, Yukio Ono, Sumiyo Takiguchi, Susumu Watanabe, Makoto Yosida, Tomoko Hotuta, Junko Kusano, Keiichi Kanehori, Asako Takahashi-Fujii, Hiroto Hara, Tomo-o Tanase, Yoshiko Nomura, Sakae Togiya, Fukuyo Komai, Reiko Hara, Kazuha Takeuchi, Miho Arita, Nobuyuki Imose, Kaoru Musashino, Hisatsugu Yuuki, Atsushi Oshima, Naokazu Sasaki, Satoshi Aotsuka, Yoko Yoshikawa, Hiroshi Matsunawa, Tatsuo Ichihara, Namiko Shiohata, Sanae Sano, Shogo Moriya, Hiroko Momiyama, Noriko Satoh, Sachiko Takami, Yuko Terashima, Osamu Suzuki, Satoshi Nakagawa, Akihiro Senoh, Hiroshi Mizoguchi, Yoshihiro Goto, Fumio Shimizu, Hirokazu Wakebe, Haretsugu Hishigaki, Takeshi Watanabe, Akio Sugiyama
ABSTRACT

As a base for human transcriptome and functional genomics, we created the "full-length long Japan" (FLJ) collection of sequenced human cDNAs. We determined the entire sequence of 21,243 selected clones and found that 14,490 cDNAs (10,897 clusters) were unique to the FLJ collection. About half of them (5,416) seemed to be protein-coding. Of those, 1,999 clusters had not been predicted by computational methods. The distribution of GC content of nonpredicted cDNAs had a peak at approximately 58% compared with a peak at approximately 42%for predicted cDNAs. Thus, there seems to be a slight bias against GC-rich transcripts in current gene prediction procedures. The rest of the cDNAs unique to the FLJ collection (5,481) contained no obvious open reading frames (ORFs) and thus are candidate noncoding RNAs. About one-fourth of them (1,378) showed a clear pattern of splicing. The distribution of GC content of noncoding cDNAs was narrow and had a peak at approximately 42%, relatively low compared with that of protein-coding cDNAs.

MATERIALS
Product Number
Brand
Product Description

Sigma-Aldrich
Lactic Dehydrogenase, recombinant from E. coli, ≥90 U/mg
Sigma-Aldrich
Malic Dehydrogenase from porcine heart, ≥600 units/mg protein (biuret), ammonium sulfate suspension
Sigma-Aldrich
L-Lactic Dehydrogenase from bovine heart, 1000 units/mL
Sigma-Aldrich
Galactokinase human, recombinant, expressed in E. coli
Sigma-Aldrich
CMP-Sialic Acid Synthetase from Neisseria meningitidis group B, recombinant, expressed in E. coli BL21, ≥10 units/mg protein
Sigma-Aldrich
Acyl-coenzyme A Synthetase from Pseudomonas sp., ≥2 units/mg protein
Sigma-Aldrich
Monoamine Oxidase B human, recombinant, expressed in baculovirus infected BTI insect cells
Sigma-Aldrich
Nucleoside Phosphorylase bacterial, recombinant, expressed in E. coli, ≥10 units/mg protein
Sigma-Aldrich
Lipase from porcine pancreas, Type VI-S, ≥20,000 units/mg protein, lyophilized powder
Sigma-Aldrich
Lipase from porcine pancreas, Type II, ≥125 units/mg protein (using olive oil (30 min incubation)), 30-90 units/mg protein (using triacetin)
Sigma-Aldrich
β-Galactosidase from Escherichia coli, Grade VI, lyophilized powder, ≥250 units/mg protein
Sigma-Aldrich
β-Galactosidase from Escherichia coli, Grade VIII, lyophilized powder, ≥500 units/mg protein
Sigma-Aldrich
Creatine Phosphokinase from rabbit muscle, Type I, salt-free, lyophilized powder, ≥150 units/mg protein
Sigma-Aldrich
Acetylcholinesterase human, recombinant, expressed in HEK 293 cells, lyophilized powder, ≥1,000 units/mg protein (Lowry)
Sigma-Aldrich
α-Glycerophosphate Dehydrogenase from rabbit muscle, Type X, lyophilized powder, ≥100 units/mg protein
Sigma-Aldrich
L-Lactic Dehydrogenase from bovine muscle, Type X, ammonium sulfate suspension, ≥600 units/mg protein
Sigma-Aldrich
Nucleoside 5′-Diphosphate Kinase from bovine liver, buffered aqueous glycerol solution, ≥1,000 units/mg protein (biuret)
Sigma-Aldrich
Sphingomyelinase from Staphylococcus aureus, buffered aqueous glycerol solution, 100-300 units/mg protein (Lowry)
Sigma-Aldrich
L-Lactic Dehydrogenase from porcine heart, ammonium sulfate suspension, ≥200 units/mg protein
Sigma-Aldrich
Lipase from Candida rugosa, lyophilized powder, ≥40,000 units/mg protein
Sigma-Aldrich
β-Galactosidase from Escherichia coli, lyophilized powder, ≥500 units/mg protein
Sigma-Aldrich
Lipase from Candida rugosa, Type VII, ≥700 unit/mg solid
Sigma-Aldrich
Glutamic-Pyruvic Transaminase from porcine heart, ammonium sulfate suspension, ≥75 units/mg protein
Sigma-Aldrich
Enolase from baker′s yeast (S. cerevisiae), lyophilized powder, ≥50 units/mg protein
Sigma-Aldrich
Galactose-1-phosphate Uridyltransferase from galactose-adapted yeast, Type IV, lyophilized powder, 20-60 units/mg protein (modified Warburg-Christian)
Sigma-Aldrich
L-Glutamine Synthetase from Escherichia coli, lyophilized powder, 400-2,000 units/mg protein
Sigma-Aldrich
β-Galactosidase from Escherichia coli, aqueous glycerol suspension, ≥500 units/mg protein (biuret)
Sigma-Aldrich
Phosphatase, Acid from potato, lyophilized powder, ≥3.0 units/mg solid
Sigma-Aldrich
β-Galactosidase from bovine testes, ammonium sulfate suspension, 1.0-3.0 units/mg protein (modified Warburg-Christian)
Sigma-Aldrich
Glutaminase from Escherichia coli, Grade V, lyophilized powder, ≥50 units/mg protein