A New Robust Epigenetic Model for Forensic Age Prediction

J Forensic Sci. 2020 Sep;65(5):1424-1431. doi: 10.1111/1556-4029.14460. Epub 2020 May 26.

Abstract

Forensic DNA phenotyping refers to an emerging field of forensic sciences aimed at the prediction of externally visible characteristics of unknown sample donors directly from biological materials. The aging process significantly affects most of the above characteristics making the development of a reliable method of age prediction very important. Today, the so-called "epigenetic clocks" represent the most accurate models for age prediction. Since they are technically not achievable in a typical forensic laboratory, forensic DNA technology has triggered efforts toward the simplification of these models. The present study aimed to build an epigenetic clock using a set of methylation markers of five different genes in a sample of the Italian population of different ages covering the whole span of adult life. In a sample of 330 subjects, 42 selected markers were analyzed with a machine learning approach for building a prediction model for age prediction. A ridge linear regression model including eight of the proposed markers was identified as the best performing model across a plethora of candidates. This model was tested on an independent sample of 83 subjects providing a median error of 4.5 years. In the present study, an epigenetic model for age prediction was validated in a sample of the Italian population. However, its applicability to advanced ages still represents the main limitation in forensic caseworks.

Keywords: ELOVL2; FDP; age prediction; automated machine learning; epigenetic clock; externally visible characteristics; methylation.

Publication types

  • Validation Study

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Aging / genetics*
  • CpG Islands
  • DNA Methylation
  • Epigenesis, Genetic*
  • Fatty Acid Elongases / genetics
  • Female
  • Forensic Genetics / methods*
  • Genetic Markers
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Intracellular Signaling Peptides and Proteins / genetics
  • LIM-Homeodomain Proteins / genetics
  • Linear Models
  • Machine Learning
  • Male
  • Middle Aged
  • Muscle Proteins / genetics
  • Polymerase Chain Reaction
  • Transcription Factors / genetics
  • Tripartite Motif Proteins / genetics
  • Young Adult

Substances

  • ELOVL2 protein, human
  • FHL2 protein, human
  • Genetic Markers
  • Intracellular Signaling Peptides and Proteins
  • LIM-Homeodomain Proteins
  • Muscle Proteins
  • TRIM59 protein, human
  • Transcription Factors
  • Tripartite Motif Proteins
  • Fatty Acid Elongases