Machine learning identifies abnormal Ca2+ transients in human induced pluripotent stem cell-derived cardiomyocytes

Sci Rep. 2020 Oct 12;10(1):16977. doi: 10.1038/s41598-020-73801-x.

Abstract

Human-induced pluripotent stem cell-derived cardiomyocytes (hiPSC-CMs) provide an excellent platform for potential clinical and research applications. Identifying abnormal Ca2+ transients is crucial for evaluating cardiomyocyte function that requires labor-intensive manual effort. Therefore, we develop an analytical pipeline for automatic assessment of Ca2+ transient abnormality, by employing advanced machine learning methods together with an Analytical Algorithm. First, we adapt an existing Analytical Algorithm to identify Ca2+ transient peaks and determine peak abnormality based on quantified peak characteristics. Second, we train a peak-level Support Vector Machine (SVM) classifier by using human-expert assessment of peak abnormality as outcome and profiled peak variables as predictive features. Third, we train another cell-level SVM classifier by using human-expert assessment of cell abnormality as outcome and quantified cell-level variables as predictive features. This cell-level SVM classifier can be used to assess additional Ca2+ transient signals. By applying this pipeline to our Ca2+ transient data, we trained a cell-level SVM classifier using 200 cells as training data, then tested its accuracy in an independent dataset of 54 cells. As a result, we obtained 88% training accuracy and 87% test accuracy. Further, we provide a free R package to implement our pipeline for high-throughput CM Ca2+ analysis.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Calcium / metabolism*
  • Cells, Cultured
  • High-Throughput Screening Assays / methods*
  • Humans
  • Induced Pluripotent Stem Cells / metabolism*
  • Machine Learning*
  • Myocytes, Cardiac / metabolism*
  • Sensitivity and Specificity

Substances

  • Calcium