Enhancing late postmortem interval prediction: a pilot study integrating proteomics and machine learning to distinguish human bone remains over 15 years

Biol Res. 2024 Oct 24;57(1):75. doi: 10.1186/s40659-024-00552-8.

Abstract

Background: Determining the postmortem interval (PMI) accurately remains a significant challenge in forensic sciences, especially for intervals greater than 5 years (late PMI). Traditional methods often fail because soft tissues are extensively degraded, forcing reliance on the examination of bone material. The accuracy of PMI estimates diminishes with time and drops to about 50% for intervals between 1 and 5 years. This study aims to address this issue by identifying key protein biomarkers through proteomics and machine learning, with the goal of improving PMI estimation for intervals exceeding 15 years.

Methods: Proteomic analysis was conducted using LC-MS/MS on skeletal remains, focusing on the tibia and ribs. Protein identification was performed using two strategies: a tryptic-specific search and a semitryptic search, the latter being particularly useful when proteins have degraded naturally. A Random Forest algorithm was used to model protein abundance data and predict PMI. A screening process combining importance scores and SHAP values was used to identify the proteins most informative for model training and accuracy.
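The importance-based screening step can be illustrated with a minimal sketch. It is not the study's pipeline: it assumes synthetic abundance values, hypothetical protein names (protein_A to protein_D), and a deliberately tiny forest of one-split trees written with the Python standard library only. It ranks features by mean decrease in Gini impurity; the SHAP part of the screening is not reproduced here.

```python
import random


def gini(labels):
    """Gini impurity of a list of binary (0/1) labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)


def best_stump(rows, labels, features):
    """Return (feature, threshold, impurity_decrease) for the best single split."""
    parent = gini(labels)
    best = (None, None, 0.0)
    n = len(rows)
    for f in features:
        values = sorted(set(r[f] for r in rows))
        for lo, hi in zip(values, values[1:]):
            thr = (lo + hi) / 2.0
            left = [y for r, y in zip(rows, labels) if r[f] <= thr]
            right = [y for r, y in zip(rows, labels) if r[f] > thr]
            child = (len(left) * gini(left) + len(right) * gini(right)) / n
            if parent - child > best[2]:
                best = (f, thr, parent - child)
    return best


def stump_forest_importance(rows, labels, n_trees=200, n_sub=2, seed=0):
    """Normalized impurity-based importance from a forest of one-split trees.

    Each tree sees a bootstrap sample and a random subset of features,
    which are the two sources of randomness in a random forest.
    """
    rng = random.Random(seed)
    n_feat = len(rows[0])
    totals = [0.0] * n_feat
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]  # bootstrap sample
        feats = rng.sample(range(n_feat), n_sub)        # random feature subset
        f, _, decrease = best_stump(
            [rows[i] for i in idx], [labels[i] for i in idx], feats
        )
        if f is not None:
            totals[f] += decrease
    total = sum(totals)
    return [t / total for t in totals] if total > 0 else totals


# Synthetic data (assumption): protein_A separates the two classes,
# the other three columns are pure noise.
data_rng = random.Random(42)
names = ["protein_A", "protein_B", "protein_C", "protein_D"]
rows, labels = [], []
for label in (0, 1):  # 0 and 1 stand in for the two PMI classes
    for _ in range(10):
        informative = label + data_rng.uniform(0.0, 0.4)
        noise = [data_rng.random() for _ in range(3)]
        rows.append([informative] + noise)
        labels.append(label)

importances = stump_forest_importance(rows, labels)
ranking = sorted(zip(names, importances), key=lambda pair: pair[1], reverse=True)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In practice a library implementation with deeper trees would normally be used, and the top-ranked features would be cross-checked against SHAP values before a minimal biomarker set is retained.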

Results: A minimal set of three biomarkers (K1C13, PGS1, and CO3A1) was identified that significantly improved prediction accuracy in distinguishing PMIs of 15 and 20 years. The model, trained on protein abundance data from semitryptic peptides in tibia samples, achieved 100% accuracy sustained across 100 iterations. In contrast, unsupervised methods such as PCA and MCA did not yield comparable results. Additionally, semitryptic peptides outperformed tryptic peptides, particularly in tibia proteomes, suggesting their potential reliability for late PMI prediction.
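The abstract does not describe how the 100 iterations were set up. One plausible reading is repeated random train/test splits, sketched below under several assumptions: the data are synthetic and well separated by construction, the class labels "15y" and "20y" and the three feature columns are hypothetical, and a nearest-centroid classifier stands in for the Random Forest to keep the example short. Only the evaluation loop is the point.

```python
import random


def centroid(rows):
    """Column-wise mean of a list of equal-length feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def predict(x, centroids):
    """Return the label whose centroid is closest to x (squared Euclidean)."""
    def dist2(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))
    return min(centroids, key=lambda label: dist2(x, centroids[label]))


def repeated_holdout(data, n_iter=100, n_test_per_class=2, seed=0):
    """Accuracy on a stratified random hold-out set, repeated n_iter times."""
    rng = random.Random(seed)
    class_labels = sorted(set(y for _, y in data))
    accuracies = []
    for _ in range(n_iter):
        train, test = [], []
        for label in class_labels:
            group = [(x, y) for x, y in data if y == label]
            rng.shuffle(group)
            test += group[:n_test_per_class]   # held out for scoring
            train += group[n_test_per_class:]  # used to fit the centroids
        centroids = {
            label: centroid([x for x, y in train if y == label])
            for label in class_labels
        }
        correct = sum(predict(x, centroids) == y for x, y in test)
        accuracies.append(correct / len(test))
    return accuracies


# Synthetic abundances (assumption): three features per sample,
# eight samples per class, classes centered at 1.0 and 2.0.
data_rng = random.Random(7)
data = []
for label, center in (("15y", 1.0), ("20y", 2.0)):
    for _ in range(8):
        x = [center + data_rng.uniform(-0.2, 0.2) for _ in range(3)]
        data.append((x, label))

accuracies = repeated_holdout(data)
mean_accuracy = sum(accuracies) / len(accuracies)
print(f"mean accuracy over {len(accuracies)} splits: {mean_accuracy:.2f}")
```

Because the synthetic classes cannot overlap, every split here is classified perfectly; with real abundance data the spread of the per-split accuracies is the quantity of interest.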

Conclusions: Despite limitations such as sample size and PMI range, this study demonstrates the feasibility of combining proteomics and machine learning for accurate late PMI predictions. Future research should focus on broader PMI ranges and various bone types to further refine and standardize forensic proteomic methodologies for PMI estimation.

Keywords: Biomarker discovery; Forensic science; Machine learning; Postmortem interval; Proteomics.

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Biomarkers* / analysis
  • Body Remains / chemistry
  • Bone and Bones / chemistry
  • Bone and Bones / metabolism
  • Chromatography, Liquid
  • Female
  • Humans
  • Machine Learning*
  • Male
  • Middle Aged
  • Pilot Projects
  • Postmortem Changes*
  • Proteomics* / methods
  • Tandem Mass Spectrometry / methods
  • Tibia / chemistry
  • Time Factors

Substances

  • Biomarkers