Genomic exploration of the hemiascomycetous yeasts: 3. Methods and strategies used for sequence analysis and annotation

FEBS Lett. 2000 Dec 22;487(1):17-30. doi: 10.1016/s0014-5793(00)02274-2.

Abstract

The primary analysis of the sequences for our Hemiascomycete random sequence tag (RST) project was performed using a combination of classical methods for sequence comparison and contig assembly, and of specifically written scripts and computer visualization routines. Comparisons were performed first against DNA and protein sequences from Saccharomyces cerevisiae, then against protein sequences from other completely sequenced organisms and, finally, against protein sequences from all other organisms. Blast alignments were individually inspected to help recognize genes within our random genomic sequences despite the fact that only parts of them were available. For each yeast species, validated alignments were used to infer the proper genetic code, to determine codon usage preferences and to calculate their degree of sequence divergence with S. cerevisiae. The quality of each genomic library was monitored from contig analysis of the DNA sequences. Annotated sequences were submitted to the EMBL database, and the general annotation tables produced served as a basis for our comparative description of the evolution, redundancy and function of the Hemiascomycete genomes described in other articles of this issue.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Ascomycota / genetics*
  • Electronic Data Processing / methods
  • Gene Library
  • Genetic Code
  • Genome, Fungal
  • Genomics / methods*
  • Molecular Sequence Data
  • Reproducibility of Results
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Sequence Homology, Amino Acid