Large-scale concatenation cDNA sequencing

Genome Res. 1997 Apr;7(4):353-8. doi: 10.1101/gr.7.4.353.

Abstract

A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7-2.0 kb was sequenced by concatenated cDNA sequencing (CCS), in which multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences with an efficiency similar to that of other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%-97% identity) to known genes from human or other species. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching.
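
The counts reported in the abstract can be checked with a few lines of arithmetic. The sketch below is only an illustrative tally, not part of the authors' analysis. The variable names are hypothetical, and it assumes that the 20 matched clones not flagged as failures are the ones whose end sequences did identify a DNA similarity.

```python
# Counts as reported in the abstract (65 clones analyzed).
TOTAL_ANALYZED = 65
dna_results = {"no_match": 37, "exact_match": 12, "nonexact_match": 16}

# The three DNA categories should account for every analyzed clone.
assert sum(dna_results.values()) == TOTAL_ANALYZED

# Clones with any DNA database match using the full insert sequence.
dna_full = dna_results["exact_match"] + dna_results["nonexact_match"]

# Eight of those matched clones had end sequences that found no similarity.
# Assumption: the remaining matched clones were also found by end sequences.
ends_missed = 8
dna_ends = dna_full - ends_missed

# Protein similarity search counts, as reported.
protein_full = 27
protein_ends = 20


def pct(count: int, total: int = TOTAL_ANALYZED) -> float:
    """Return count as a percentage of total, rounded to one decimal."""
    return round(100 * count / total, 1)


summary = {
    "dna_full_pct": pct(dna_full),
    "dna_ends_pct": pct(dna_ends),
    "protein_full_pct": pct(protein_full),
    "protein_ends_pct": pct(protein_ends),
    "dna_matches_missed_by_ends_pct": pct(ends_missed, dna_full),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

Under that assumption, end sequences alone would have missed 8 of the 28 DNA matches and 7 of the 27 protein matches, which is the comparison behind the abstract's closing sentence.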

Publication types

  • Letter
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • DNA Transposable Elements
  • DNA, Complementary / chemistry
  • DNA, Complementary / genetics*
  • Databases, Factual
  • Gene Library
  • Humans
  • Molecular Sequence Data
  • Proteins / chemistry
  • Proteins / genetics*
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid
  • Software

Substances

  • DNA Transposable Elements
  • DNA, Complementary
  • Proteins

Associated data

  • GENBANK/AF007128
  • GENBANK/AF007129
  • GENBANK/AF007130
  • GENBANK/AF007131
  • GENBANK/AF007132
  • GENBANK/AF007133
  • GENBANK/AF007134
  • GENBANK/AF007135
  • GENBANK/AF007136
  • GENBANK/AF007137
  • GENBANK/AF007138
  • GENBANK/AF007139
  • GENBANK/AF007140
  • GENBANK/AF007141
  • GENBANK/AF007142
  • GENBANK/AF007143
  • GENBANK/AF007144
  • GENBANK/AF007145
  • GENBANK/AF007146
  • GENBANK/AF007147
  • GENBANK/AF007148
  • GENBANK/AF007149
  • GENBANK/AF007150
  • GENBANK/AF007151
  • GENBANK/AF007152
  • GENBANK/AF007153
  • GENBANK/AF007154