Haplotype-resolved genomes provide insights into structural variation and gene content in Angus and Brahman cattle

Nat Commun. 2020 Apr 29;11(1):2071. doi: 10.1038/s41467-020-15848-y.

Abstract

Inbred animals were historically chosen for genome analysis to circumvent assembly issues caused by haplotype variation but this resulted in a composite of the two genomes. Here we report a haplotype-aware scaffolding and polishing pipeline which was used to create haplotype-resolved, chromosome-level genome assemblies of Angus (taurine) and Brahman (indicine) cattle subspecies from contigs generated by the trio binning method. These assemblies reveal structural and copy number variants that differentiate the subspecies and that variant detection is sensitive to the specific reference genome chosen. Six genes with immune related functions have additional copies in the indicine compared with taurine lineage and an indicus-specific extra copy of fatty acid desaturase is under positive selection. The haplotyped genomes also enable transcripts to be phased to detect allele-specific expression. This work exemplifies the value of haplotype-resolved genomes to better explore evolutionary and functional variations.

Publication types

  • Research Support, N.I.H., Intramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Alleles
  • Allelic Imbalance
  • Animals
  • Base Sequence
  • Cattle / genetics*
  • Chromosomes, Mammalian / genetics
  • Female
  • Genetic Loci
  • Genetic Variation*
  • Genome*
  • Haplotypes / genetics*
  • INDEL Mutation / genetics
  • Male
  • Molecular Sequence Annotation
  • Polymorphism, Single Nucleotide / genetics
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Repetitive Sequences, Nucleic Acid / genetics

Substances

  • RNA, Messenger