-
Smooth Kolmogorov Arnold networks enabling structural knowledge representation
Authors:
Moein E. Samadi,
Younes Müller,
Andreas Schuppert
Abstract:
Kolmogorov-Arnold Networks (KANs) offer an efficient and interpretable alternative to traditional multi-layer perceptron (MLP) architectures due to their finite network topology. However, according to the results of Kolmogorov and Vitushkin, the representation of generic smooth functions by KAN implementations using analytic functions constrained to a finite number of cutoff points cannot be exact…
▽ More
Kolmogorov-Arnold Networks (KANs) offer an efficient and interpretable alternative to traditional multi-layer perceptron (MLP) architectures due to their finite network topology. However, according to the results of Kolmogorov and Vitushkin, the representation of generic smooth functions by KAN implementations using analytic functions constrained to a finite number of cutoff points cannot be exact. Hence, the convergence of KAN throughout the training process may be limited. This paper explores the relevance of smoothness in KANs, proposing that smooth, structurally informed KANs can achieve equivalence to MLPs in specific function classes. By leveraging inherent structural knowledge, KANs may reduce the data required for training and mitigate the risk of generating hallucinated predictions, thereby enhancing model reliability and performance in computational biomedicine.
△ Less
Submitted 27 May, 2024; v1 submitted 18 May, 2024;
originally announced May 2024.
-
CMDA: a tool for Continuous Monitoring Data Analysis
Authors:
Pejman Farhadi Ghalati,
Andreas Schuppert
Abstract:
Over the last few years, with the growth of time-series collecting and storing, there has been a great demand for tools and software for temporal data engineering and modeling. This paper presents a generic workflow for time series data research, including temporal data importing, preprocessing, and feature extraction. This framework is developed and built as a robust and easy-to-use Python packag…
▽ More
Over the last few years, with the growth of time-series collecting and storing, there has been a great demand for tools and software for temporal data engineering and modeling. This paper presents a generic workflow for time series data research, including temporal data importing, preprocessing, and feature extraction. This framework is developed and built as a robust and easy-to-use Python package, called CMDA, with a modular structure that offers tools to prepare raw data, allowing both scientists and non-experts to analyze various temporal data structures.
△ Less
Submitted 22 October, 2023;
originally announced October 2023.
-
Tree-Based Learning on Amperometric Time Series Data Demonstrates High Accuracy for Classification
Authors:
Jeyashree Krishnan,
Zeyu Lian,
Pieter E. Oomen,
Xiulan He,
Soodabeh Majdi,
Andreas Schuppert,
Andrew Ewing
Abstract:
Elucidating exocytosis processes provide insights into cellular neurotransmission mechanisms, and may have potential in neurodegenerative diseases research. Amperometry is an established electrochemical method for the detection of neurotransmitters released from and stored inside cells. An important aspect of the amperometry method is the sub-millisecond temporal resolution of the current recordin…
▽ More
Elucidating exocytosis processes provide insights into cellular neurotransmission mechanisms, and may have potential in neurodegenerative diseases research. Amperometry is an established electrochemical method for the detection of neurotransmitters released from and stored inside cells. An important aspect of the amperometry method is the sub-millisecond temporal resolution of the current recordings which leads to several hundreds of gigabytes of high-quality data. In this study, we present a universal method for the classification with respect to diverse amperometric datasets using data-driven approaches in computational science. We demonstrate a very high prediction accuracy (greater than or equal to 95%). This includes an end-to-end systematic machine learning workflow for amperometric time series datasets consisting of pre-processing; feature extraction; model identification; training and testing; followed by feature importance evaluation - all implemented. We tested the method on heterogeneous amperometric time series datasets generated using different experimental approaches, chemical stimulations, electrode types, and varying recording times. We identified a certain overarching set of common features across these datasets which enables accurate predictions. Further, we showed that information relevant for the classification of amperometric traces are neither in the spiky segments alone, nor can it be retrieved from just the temporal structure of spikes. In fact, the transients between spikes and the trace baselines carry essential information for a successful classification, thereby strongly demonstrating that an effective feature representation of amperometric time series requires the full time series. To our knowledge, this is one of the first studies that propose a scheme for machine learning, and in particular, supervised learning on full amperometry time series data.
△ Less
Submitted 6 February, 2023;
originally announced February 2023.
-
Critical Transitions in Intensive Care Units: A Sepsis Case Study
Authors:
Pejman F. Ghalati,
Satya S. Samal,
Jayesh S. Bhat,
Robert Deisz,
Gernot Marx,
Andreas Schuppert
Abstract:
The progression of complex human diseases is associated with critical transitions across dynamical regimes. These transitions often spawn early-warning signals and provide insights into the underlying disease-driving mechanisms. In this paper, we propose a computational method based on surprise loss (SL) to discover data-driven indicators of such transitions in a multivariate time series dataset o…
▽ More
The progression of complex human diseases is associated with critical transitions across dynamical regimes. These transitions often spawn early-warning signals and provide insights into the underlying disease-driving mechanisms. In this paper, we propose a computational method based on surprise loss (SL) to discover data-driven indicators of such transitions in a multivariate time series dataset of septic shock and non-sepsis patient cohorts (MIMIC-III database). The core idea of SL is to train a mathematical model on time series in an unsupervised fashion and to quantify the deterioration of the model's forecast (out-of-sample) performance relative to its past (in-sample) performance. Considering the highest value of the moving average of SL as a critical transition, our retrospective analysis revealed that critical transitions occurred at a median of over 35 hours before the onset of septic shock, which suggests the applicability of our method as an early-warning indicator. Furthermore, we show that clinical variables at critical-transition regions are significantly different between septic shock and non-sepsis cohorts. Therefore, our paper contributes a critical-transition-based data-sampling strategy that can be utilized for further analysis, such as patient classification. Moreover, our method outperformed other indicators of critical transition in complex systems, such as temporal autocorrelation and variance.
△ Less
Submitted 23 October, 2019; v1 submitted 15 February, 2019;
originally announced February 2019.