Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Labach, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.18780  [pdf, other

    cs.LG

    MultiResFormer: Transformer with Adaptive Multi-Resolution Modeling for General Time Series Forecasting

    Authors: Linfeng Du, Ji Xin, Alex Labach, Saba Zuberi, Maksims Volkovs, Rahul G. Krishnan

    Abstract: Transformer-based models have greatly pushed the boundaries of time series forecasting recently. Existing methods typically encode time series data into $\textit{patches}$ using one or a fixed set of patch lengths. This, however, could result in a lack of ability to capture the variety of intricate temporal dependencies present in real-world multi-periodic time series. In this paper, we propose Mu… ▽ More

    Submitted 8 February, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  2. arXiv:2304.13017  [pdf, other

    cs.LG

    DuETT: Dual Event Time Transformer for Electronic Health Records

    Authors: Alex Labach, Aslesha Pokhrel, Xiao Shi Huang, Saba Zuberi, Seung Eun Yi, Maksims Volkovs, Tomi Poutanen, Rahul G. Krishnan

    Abstract: Electronic health records (EHRs) recorded in hospital settings typically contain a wide range of numeric time series data that is characterized by high sparsity and irregular observations. Effective modelling for such data must exploit its time series nature, the semantic relationship between different types of observations, and information in the sparsity structure of the data. Self-supervised Tr… ▽ More

    Submitted 15 August, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted at MLHC 2023, camera-ready version

  3. A Framework for Neural Network Pruning Using Gibbs Distributions

    Authors: Alex Labach, Shahrokh Valaee

    Abstract: Modern deep neural networks are often too large to use in many practical scenarios. Neural network pruning is an important technique for reducing the size of such models and accelerating inference. Gibbs pruning is a novel framework for expressing and designing neural network pruning methods. Combining approaches from statistical physics and stochastic regularization methods, it can train and prun… ▽ More

    Submitted 28 December, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: v1 was presented at IEEE GLOBECOM 2020. v2 is a substantially expanded revision, also written in 2020

  4. arXiv:1911.09669  [pdf, other

    cs.LG stat.ML

    Regularizing Neural Networks by Stochastically Training Layer Ensembles

    Authors: Alex Labach, Shahrokh Valaee

    Abstract: Dropout and similar stochastic neural network regularization methods are often interpreted as implicitly averaging over a large ensemble of models. We propose STE (stochastically trained ensemble) layers, which enhance the averaging properties of such methods by training an ensemble of weight matrices with stochastic regularization while explicitly averaging outputs. This provides stronger regular… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

  5. arXiv:1904.13310  [pdf, other

    cs.NE cs.AI cs.LG

    Survey of Dropout Methods for Deep Neural Networks

    Authors: Alex Labach, Hojjat Salehinejad, Shahrokh Valaee

    Abstract: Dropout methods are a family of stochastic techniques used in neural network training or inference that have generated significant research interest and are widely used in practice. They have been successfully applied in neural network regularization, model compression, and in measuring the uncertainty of neural network outputs. While original formulated for dense neural network layers, recent adv… ▽ More

    Submitted 25 October, 2019; v1 submitted 25 April, 2019; originally announced April 2019.