Showing 1–13 of 13 results for author: Schmidt, M N

Searching in archive cs.
  1. arXiv:2406.13584  [pdf, other]

    cs.LG

    Explaining time series models using frequency masking

    Authors: Thea Brüsch, Kristoffer K. Wickstrøm, Mikkel N. Schmidt, Tommy S. Alstrøm, Robert Jenssen

    Abstract: Time series data is fundamentally important for describing many critical domains such as healthcare, finance, and climate, where explainable models are necessary for safe automated decision-making. To develop eXplainable AI (XAI) in these domains therefore implies explaining salient information in the time series. Current methods for obtaining saliency maps assume localized information in the raw…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Submitted to the Next Generation of AI Safety workshop at ICML 2024

  2. arXiv:2312.04174  [pdf, other]

    stat.ML cs.LG physics.comp-ph

    Coherent energy and force uncertainty in deep learning force fields

    Authors: Peter Bjørn Jørgensen, Jonas Busk, Ole Winther, Mikkel N. Schmidt

    Abstract: In machine learning energy potentials for atomic systems, forces are commonly obtained as the negative derivative of the energy function with respect to atomic positions. To quantify aleatoric uncertainty in the predicted energies, a widely used modeling approach involves predicting both a mean and variance for each energy value. However, this model is not differentiable under the usual white nois…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Presented at Advancing Molecular Machine Learning - Overcoming Limitations [ML4Molecules], ELLIS workshop, VIRTUAL, December 8, 2023, unofficial NeurIPS 2023 side-event

  3. arXiv:2307.09614  [pdf, other]

    stat.ML cs.LG eess.SP

    Multi-view self-supervised learning for multivariate variable-channel time series

    Authors: Thea Brüsch, Mikkel N. Schmidt, Tommy S. Alstrøm

    Abstract: Labeling of multivariate biomedical time series data is a laborious and expensive process. Self-supervised contrastive learning alleviates the need for large, labeled datasets through pretraining on unlabeled data. However, for multivariate time series data, the set of input channels often varies between applications, and most existing work does not allow for transfer between datasets with differe…

    Submitted 20 July, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: To appear in proceedings of 2023 IEEE International workshop on Machine Learning for Signal Processing

  4. arXiv:2306.13263  [pdf, other]

    cs.LG cs.CV cs.DC

    Synthetic data shuffling accelerates the convergence of federated learning under data heterogeneity

    Authors: Bo Li, Yasin Esfandiari, Mikkel N. Schmidt, Tommy S. Alstrøm, Sebastian U. Stich

    Abstract: In federated learning, data heterogeneity is a critical challenge. A straightforward solution is to shuffle the clients' data to homogenize the distribution. However, this may violate data access rights, and how and when shuffling can accelerate the convergence of a federated optimization algorithm is not theoretically well understood. In this paper, we establish a precise and quantifiable corresp…

    Submitted 8 April, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: Accepted at TMLR

  5. arXiv:2305.16325  [pdf, other]

    physics.chem-ph cs.LG

    Graph Neural Network Interatomic Potential Ensembles with Calibrated Aleatoric and Epistemic Uncertainty on Energy and Forces

    Authors: Jonas Busk, Mikkel N. Schmidt, Ole Winther, Tejs Vegge, Peter Bjørn Jørgensen

    Abstract: Inexpensive machine learning potentials are increasingly being used to speed up structural optimization and molecular dynamics simulations of materials by iteratively predicting and applying interatomic forces. In these settings, it is crucial to detect when predictions are unreliable to avoid wrong or misleading results. Here, we present a complete framework for training and recalibrating graph n…

    Submitted 11 September, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

  6. arXiv:2212.02191  [pdf, other]

    cs.LG cs.DC

    On the effectiveness of partial variance reduction in federated learning with heterogeneous data

    Authors: Bo Li, Mikkel N. Schmidt, Tommy S. Alstrøm, Sebastian U. Stich

    Abstract: Data heterogeneity across clients is a key challenge in federated learning. Prior works address this by either aligning client and server models or using control variates to correct client model drift. Although these methods achieve fast convergence in convex or simple non-convex problems, the performance in over-parameterized models such as deep neural networks is lacking. In this paper, we first…

    Submitted 9 June, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted to CVPR 2023

  7. End-to-End Learning for VCSEL-based Optical Interconnects: State-of-the-Art, Challenges, and Opportunities

    Authors: Muralikrishnan Srinivasan, Jinxiang Song, Alexander Grabowski, Krzysztof Szczerba, Holger K. Iversen, Mikkel N. Schmidt, Darko Zibar, Jochen Schröder, Anders Larsson, Christian Häger, Henk Wymeersch

    Abstract: Optical interconnects (OIs) based on vertical-cavity surface-emitting lasers (VCSELs) are the main workhorse within data centers, supercomputers, and even vehicles, providing low-cost, high-rate connectivity. VCSELs must operate under extremely harsh and time-varying conditions, thus requiring adaptive and flexible designs of the communication chain. Such designs can be built based on mathematical…

    Submitted 25 November, 2022; originally announced November 2022.

  8. Raman Spectrum Matching with Contrastive Representation Learning

    Authors: Bo Li, Mikkel N. Schmidt, Tommy S. Alstrøm

    Abstract: Raman spectroscopy is an effective, low-cost, non-intrusive technique often used for chemical identification. Typical approaches are based on matching observations to a reference database, which requires careful preprocessing, or supervised machine learning, which requires a fairly large number of training observations from each class. We propose a new machine learning technique for Raman spectrum…

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: Under review at Analytical Chemistry

  9. arXiv:2201.06863  [pdf, other]

    cs.AI

    Programmatic Policy Extraction by Iterative Local Search

    Authors: Rasmus Larsen, Mikkel Nørgaard Schmidt

    Abstract: Reinforcement learning policies are often represented by neural networks, but programmatic policies are preferred in some cases because they are more interpretable, amenable to formal verification, or generalize better. While efficient algorithms for learning neural policies exist, learning programmatic policies is challenging. Combining imitation-projection and dataset aggregation with a local se…

    Submitted 18 January, 2022; originally announced January 2022.

  10. arXiv:2107.06068  [pdf, ps, other]

    cs.LG stat.ML

    Calibrated Uncertainty for Molecular Property Prediction using Ensembles of Message Passing Neural Networks

    Authors: Jonas Busk, Peter Bjørn Jørgensen, Arghya Bhowmik, Mikkel N. Schmidt, Ole Winther, Tejs Vegge

    Abstract: Data-driven methods based on machine learning have the potential to accelerate computational analysis of atomic structures. In this context, reliable uncertainty estimates are important for assessing confidence in predictions and enabling decision making. However, machine learning models can produce badly calibrated uncertainty estimates and it is therefore crucial to detect and handle uncertainty…

    Submitted 3 November, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

  11. arXiv:1806.08195  [pdf, other]

    stat.ML cs.LG

    Probabilistic PARAFAC2

    Authors: Philip J. H. Jørgensen, Søren F. V. Nielsen, Jesper L. Hinrich, Mikkel N. Schmidt, Kristoffer H. Madsen, Morten Mørup

    Abstract: The PARAFAC2 is a multimodal factor analysis model suitable for analyzing multi-way data when one of the modes has incomparable observation units, for example because of differences in signal sampling or batch sizes. A fully probabilistic treatment of the PARAFAC2 is desirable in order to improve robustness to noise and provide a well founded principle for determining the number of factors, but ch…

    Submitted 21 June, 2018; originally announced June 2018.

    Comments: 16 pages (incl. 4 pages of supplemental material), 5 figures

  12. arXiv:1806.03146  [pdf, other]

    stat.ML cs.LG

    Neural Message Passing with Edge Updates for Predicting Properties of Molecules and Materials

    Authors: Peter Bjørn Jørgensen, Karsten Wedel Jacobsen, Mikkel N. Schmidt

    Abstract: Neural message passing on molecular graphs is one of the most promising methods for predicting formation energy and other properties of molecules and materials. In this work we extend the neural message passing model with an edge update network which allows the information exchanged between atoms to depend on the hidden state of the receiving atom. We benchmark the proposed model on three publicly…

    Submitted 8 June, 2018; originally announced June 2018.

  13. arXiv:1101.5097  [pdf, ps, other]

    cs.SI cs.LG physics.soc-ph

    Infinite Multiple Membership Relational Modeling for Complex Networks

    Authors: Morten Mørup, Mikkel N. Schmidt, Lars Kai Hansen

    Abstract: Learning latent structure in complex networks has become an important problem fueled by many types of networked data originating from practically all fields of science. In this paper, we propose a new non-parametric Bayesian multiple-membership latent feature model for networks. Contrary to existing multiple-membership models that scale quadratically in the number of vertices, the proposed model sc…

    Submitted 26 January, 2011; originally announced January 2011.

    Comments: 8 pages, 4 figures