HomSI: a homozygous stretch identifier from next-generation sequencing data

Bioinformatics. 2014 Feb 1;30(3):445-7. doi: 10.1093/bioinformatics/btt686. Epub 2013 Dec 3.

Abstract

In consanguineous families, as a result of inheriting the same genomic segments through both parents, the individuals have stretches of their genomes that are homozygous. This situation leads to the prevalence of recessive diseases among the members of these families. Homozygosity mapping is based on this observation, and in consanguineous families, several recessive disease genes have been discovered with the help of this technique. The researchers typically use single nucleotide polymorphism arrays to determine the homozygous regions and then search for the disease gene by sequencing the genes within this candidate disease loci. Recently, the advent of next-generation sequencing enables the concurrent identification of homozygous regions and the detection of mutations relevant for diagnosis, using data from a single sequencing experiment. In this respect, we have developed a novel tool that identifies homozygous regions using deep sequence data. Using *.vcf (variant call format) files as an input file, our program identifies the majority of homozygous regions found by microarray single nucleotide polymorphism genotype data.

Availability and implementation: HomSI software is freely available at www.igbam.bilgem.tubitak.gov.tr/softwares/HomSI, with an online manual.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Consanguinity
  • Genes, Recessive
  • Genotype
  • High-Throughput Nucleotide Sequencing / methods*
  • Homozygote*
  • Humans
  • Internet
  • Mutation
  • Polymorphism, Single Nucleotide
  • Software*