Objective: To annotate the complete 478 kb genomic sequence of the human chromosome 3p24-p25 region.
Methods: Protein-coding genes in the genomic sequence were identified by ab initio gene finding, homology-based database similarity searching, and alignment of full-length or partial mRNA sequences to the genomic sequence. The compositional features of the genomic sequence were analyzed with the EMBOSS package.
Results: Two known genes, SLC6A1 and SLC6A11, were identified. The GC content of the genomic sequence was 47%, and three putative CpG islands were predicted, at positions 130,685-131,516 bp, 307,090-307,870 bp and 415,585-416,308 bp.
Conclusions: The methods described above may be used to annotate biological information in a genomic sequence, such as gene structure, GC content and CpG islands.
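As an illustration of the compositional analysis, the following minimal Python sketch computes GC content and scans for candidate CpG-rich windows. It is not the EMBOSS implementation, and the abstract does not state which parameters were used; the thresholds here (window of 200 bp, GC fraction above 0.5, observed/expected CpG ratio above 0.6) are the commonly cited Gardiner-Garden and Frommer criteria and are an assumption. The function names `gc_content`, `cpg_obs_exp` and `candidate_cpg_windows` are hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T bases of seq."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    total = gc + seq.count("A") + seq.count("T")
    return gc / total if total else 0.0


def cpg_obs_exp(seq: str) -> float:
    """Observed/expected CpG ratio: (#CG * length) / (#C * #G)."""
    seq = seq.upper()
    c, g = seq.count("C"), seq.count("G")
    if c == 0 or g == 0:
        return 0.0
    return seq.count("CG") * len(seq) / (c * g)


def candidate_cpg_windows(seq, window=200, step=1, min_gc=0.5, min_oe=0.6):
    """Slide a fixed window along seq and merge passing windows.

    Returns 1-based inclusive (start, end) coordinates.
    Thresholds are assumed defaults, not values taken from the study.
    """
    regions = []
    for start in range(0, len(seq) - window + 1, step):
        win = seq[start:start + window]
        if gc_content(win) > min_gc and cpg_obs_exp(win) > min_oe:
            s, e = start + 1, start + window
            if regions and s <= regions[-1][1] + 1:
                # Overlapping or adjacent to the previous region: extend it.
                regions[-1] = (regions[-1][0], e)
            else:
                regions.append((s, e))
    return regions
```

Calling `gc_content(sequence)` on the full sequence gives the overall GC fraction, and `candidate_cpg_windows(sequence)` returns merged candidate regions. Boundaries from this simple scan will generally differ from those reported by dedicated tools, which apply their own windowing and merging rules.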