tagFinder: A Novel Tag Analysis Methodology That Enables Detection of Molecules from DNA-Encoded Chemical Libraries

SLAS Discov. 2018 Jun;23(5):397-404. doi: 10.1177/2472555217753840. Epub 2018 Jan 23.

Abstract

Available tools to analyze sequencing data coming from DNA-encoded chemical libraries (DELs) are often limited to in-house methods, which usually rely on strictly looking for the particular DEL structure used. Current methods do not take into account technological errors, such as library codification and sequencing errors, when detecting the sequences. The vast amount of data produced by next-generation sequencing of DEL screens is usually enough to extract the minimum information needed for compound identification. Here, we report a methodology to deconvolute encoding oligonucleotides, thus optimizing the sequencing power regardless of the library size, design complexity, or sequencing technology chosen. tagFinder is a highly flexible tool for fast tag detection and thorough DEL results characterization, which requires minimal hardware resources, scales linearly, and does not introduce any analytical error. The methodology can even deal with sequencing errors and PCR duplicates on single- or double-stranded DNA, enhancing the analytical detection and quantification of molecules and the informativeness of the entire process. Source code is available at https://github.com/jamigo/tagFinder .

Keywords: DNA-encoded chemical libraries; affinity selection; algorithm; sequencing; tag.

MeSH terms

  • DNA / chemistry*
  • Drug Discovery / methods*
  • Gene Library
  • Oligonucleotides / chemistry
  • Polymerase Chain Reaction / methods
  • Small Molecule Libraries / chemistry*

Substances

  • Oligonucleotides
  • Small Molecule Libraries
  • DNA