Analysis of Compositional Bias in a Commercial Phage Display Peptide Library by Next-Generation Sequencing

Viruses. 2022 Oct 29;14(11):2402. doi: 10.3390/v14112402.

Abstract

The principal presumption of phage display biopanning is that the naïve library contains an unbiased repertoire of peptides, and thus, that the enriched variants derive from the affinity selection of an entirely random peptide pool. In the current study, we utilized deep sequencing to characterize the widely used Ph.D.™-12 phage display peptide library (New England Biolabs). The next-generation sequencing (NGS) data indicated the presence of stop codons and a high abundance of wild-type clones in the naïve library, which collectively result in a reduced effective size of the library. The analysis of the DNA sequence logo and of the global and position-specific frequencies of amino acids demonstrated significant bias in the nucleotide and amino acid composition of the library inserts. Principal component analysis (PCA) uncovered the existence of four distinct clusters in the naïve library, and the investigation of peptide frequency distribution revealed a broad range of unequal abundances for peptides. Taken together, our data provide strong evidence that the naïve library exhibits substantial departures from randomness at the nucleotide, amino acid, and peptide levels, despite not having undergone any selective pressure for target binding. This non-uniform sequence representation arises from both M13 phage biology and technical errors in library construction. Our findings highlight the paramount importance of the qualitative assessment of naïve phage display libraries prior to biopanning.
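The composition measures named in the abstract (stop-codon content, position-specific amino acid frequencies, and peptide abundance) can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names `translate` and `summarize` and the three toy inserts are hypothetical, the inserts are assumed to be already trimmed to the in-frame variable region, and translation uses the standard genetic code without modelling how a particular host strain reads stop codons.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(insert):
    """Translate an in-frame DNA insert; '*' marks a stop codon."""
    usable = len(insert) - len(insert) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[insert[i:i + 3]] for i in range(0, usable, 3))


def summarize(inserts):
    """Return stop-codon fraction, per-position residue counts, peptide counts."""
    peptides = [translate(seq.upper()) for seq in inserts]
    n_positions = max(len(p) for p in peptides)
    position_counts = [Counter() for _ in range(n_positions)]
    for peptide in peptides:
        for pos, residue in enumerate(peptide):
            position_counts[pos][residue] += 1
    return {
        "stop_fraction": sum("*" in p for p in peptides) / len(peptides),
        "position_counts": position_counts,
        "peptide_counts": Counter(peptides),
    }


# Hypothetical toy inserts (3 codons each), not data from the study.
toy_inserts = ["GCTCATTGG", "GCTTAGTGG", "GCTCATTGG"]
summary = summarize(toy_inserts)
```

In an unbiased library the per-position counts would be expected to follow the codon scheme's theoretical frequencies and the peptide counts to be close to uniform; deviations from both are what the abstract refers to as compositional bias.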

Keywords: M13 phage; Ph.D.™-12 peptide library; biopanning; compositional bias; deep sequencing; departure from randomness; next-generation sequencing; phage display; principal component analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acids / genetics
  • High-Throughput Nucleotide Sequencing*
  • Nucleotides
  • Peptide Library*
  • Peptides / chemistry

Substances

  • Peptide Library
  • Peptides
  • Amino Acids
  • Nucleotides

Grants and funding

This project received funding from the European Union’s Horizon 2020 research and innovation program under grant agreements no. 670261 (ERC Advanced Grant) and 668532 (Click-It), the Lundbeck Foundation, the Novo Nordisk Foundation, the Innovation Fund Denmark, the Neuroendocrine Tumor Research Foundation, the Danish Cancer Society, Arvid Nilsson Foundation, the Neye Foundation, the Research Foundation of Rigshospitalet, the Danish National Research Foundation (grant 126); PERSIMUNE, the Research Council of the Capital Region of Denmark, the Danish Health Authority, the John and Birthe Meyer Foundation and Research Council for Independent Research. Andreas Kjaer is a Lundbeck Foundation Professor.