High-throughput sequencing (HTS) is an effective tool for bacteriophage genome and its termini analysis. HTS technology parallelizes the sequencing process, producing thousands to millions of reads concurrently. Terminal information of a bacteriophage genome is important and basic knowledge for understanding the biology of the bacteriophage. We have created a high-occurrence reads as termini theory and developed practical methods to determine the bacteriophage genome termini, which is based on the large data of HTS. With this method, the termini of the bacteriophage genome can be efficiently and reliably identified as a by-product of bacteriophage genome sequencing, by solely analyzing the sequence statistics of the raw sequencing data (reads), without any further lab experiments.
Keywords: Bacteriophage; Genome termini; High Occurrence Read Termini theory; High frequency read sequence (HFS); High-throughput sequencing (HTS).