Zum Hauptinhalt springen

Showing 1–11 of 11 results for author: Kolbeinsson, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.06483  [pdf, other

    cs.LG cs.CL

    Composable Interventions for Language Models

    Authors: Arinbjorn Kolbeinsson, Kyle O'Brien, Tianjin Huang, Shanghua Gao, Shiwei Liu, Jonathan Richard Schwarz, Anurag Vaidya, Faisal Mahmood, Marinka Zitnik, Tianlong Chen, Thomas Hartvigsen

    Abstract: Test-time interventions for language models can enhance factual accuracy, mitigate harmful outputs, and improve model efficiency without costly retraining. But despite a flood of new methods, different types of interventions are largely developing independently. In practice, multiple interventions must be applied sequentially to the same model, yet we lack standardized ways to study how interventi… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2307.16664  [pdf, other

    cs.LG eess.SP

    Generative models for wearables data

    Authors: Arinbjörn Kolbeinsson, Luca Foschini

    Abstract: Data scarcity is a common obstacle in medical research due to the high costs associated with data collection and the complexity of gaining access to and utilizing data. Synthesizing health data may provide an efficient and cost-effective solution to this shortage, enabling researchers to explore distributions and populations that are not represented in existing observations or difficult to access… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 14 pages, 4 figures

  3. arXiv:2112.13755  [pdf, other

    cs.LG

    Self-supervision of wearable sensors time-series data for influenza detection

    Authors: Arinbjörn Kolbeinsson, Piyusha Gade, Raghu Kainkaryam, Filip Jankovic, Luca Foschini

    Abstract: Self-supervision may boost model performance in downstream tasks. However, there is no principled way of selecting the self-supervised objectives that yield the most adaptable models. Here, we study this problem on daily time-series data generated from wearable sensors used to detect onset of influenza-like illness (ILI). We first show that using self-supervised learning to predict next-day time-s… ▽ More

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: The workshop on Self-Supervised Learning at NeurIPS (2021)

  4. arXiv:2011.07407  [pdf, other

    cs.LG cs.NE math.DG

    GENNI: Visualising the Geometry of Equivalences for Neural Network Identifiability

    Authors: Daniel Lengyel, Janith Petangoda, Isak Falk, Kate Highnam, Michalis Lazarou, Arinbjörn Kolbeinsson, Marc Peter Deisenroth, Nicholas R. Jennings

    Abstract: We propose an efficient algorithm to visualise symmetries in neural networks. Typically, models are defined with respect to a parameter space, where non-equal parameters can produce the same input-output map. Our proposed method, GENNI, allows us to efficiently identify parameters that are functionally equivalent and then visualise the subspace of the resulting equivalence class. By doing so, we a… ▽ More

    Submitted 14 November, 2020; originally announced November 2020.

  5. arXiv:2008.12965  [pdf, other

    cs.CV

    Patch-based Brain Age Estimation from MR Images

    Authors: Kyriaki-Margarita Bintsi, Vasileios Baltatzis, Arinbjörn Kolbeinsson, Alexander Hammers, Daniel Rueckert

    Abstract: Brain age estimation from Magnetic Resonance Images (MRI) derives the difference between a subject's biological brain age and their chronological age. This is a potential biomarker for neurodegeneration, e.g. as part of Alzheimer's disease. Early detection of neurodegeneration manifesting as a higher brain age can potentially facilitate better medical care and planning for affected individuals. Ma… ▽ More

    Submitted 1 October, 2020; v1 submitted 29 August, 2020; originally announced August 2020.

    Comments: Accepted (oral) at the MLCN workshop, MICCAI 2020

  6. arXiv:1911.11285  [pdf, other

    cs.LG cs.NE stat.ML

    Biologically inspired architectures for sample-efficient deep reinforcement learning

    Authors: Pierre H. Richemond, Arinbjörn Kolbeinsson, Yike Guo

    Abstract: Deep reinforcement learning requires a heavy price in terms of sample efficiency and overparameterization in the neural networks used for function approximation. In this work, we use tensor factorization in order to learn more compact representation for reinforcement learning policies. We show empirically that in the low-data regime, it is possible to learn online policies with 2 to 10 times less… ▽ More

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: Deep Reinforcement Learning Workshop, NeurIPS 2019, Vancouver, Canada

  7. arXiv:1909.10662  [pdf, other

    cs.LG

    How to Incorporate Monotonicity in Deep Networks While Preserving Flexibility?

    Authors: Akhil Gupta, Naman Shukla, Lavanya Marla, Arinbjörn Kolbeinsson, Kartik Yellepeddi

    Abstract: The importance of domain knowledge in enhancing model performance and making reliable predictions in the real-world is critical. This has led to an increased focus on specific model properties for interpretability. We focus on incorporating monotonic trends, and propose a novel gradient-based point-wise loss function for enforcing partial monotonicity with deep neural networks. While recent develo… ▽ More

    Submitted 2 December, 2019; v1 submitted 23 September, 2019; originally announced September 2019.

    Comments: 8 pages, 5 figures. NeurIPS 2019 Workshop on Machine Learning with Guarantees

  8. arXiv:1905.08874  [pdf, other

    cs.LG stat.ML

    Adaptive Model Selection Framework: An Application to Airline Pricing

    Authors: Naman Shukla, Arinbjörn Kolbeinsson, Lavanya Marla, Kartik Yellepeddi

    Abstract: Multiple machine learning and prediction models are often used for the same prediction or recommendation task. In our recent work, where we develop and deploy airline ancillary pricing models in an online setting, we found that among multiple pricing models developed, no one model clearly dominates other models for all incoming customer requests. Thus, as algorithm designers, we face an exploratio… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

  9. arXiv:1902.10758  [pdf, other

    cs.LG stat.ML

    Tensor Dropout for Robust Learning

    Authors: Arinbjörn Kolbeinsson, Jean Kossaifi, Yannis Panagakis, Adrian Bulat, Anima Anandkumar, Ioanna Tzoulaki, Paul Matthews

    Abstract: CNNs achieve remarkable performance by leveraging deep, over-parametrized architectures, trained on large datasets. However, they have limited generalization ability to data outside the training domain, and a lack of robustness to noise and adversarial attacks. By building better inductive biases, we can improve robustness and also obtain smaller networks that are more memory and computationally e… ▽ More

    Submitted 11 December, 2020; v1 submitted 27 February, 2019; originally announced February 2019.

  10. arXiv:1902.02236  [pdf, other

    stat.ML cs.CY cs.LG

    Dynamic Pricing for Airline Ancillaries with Customer Context

    Authors: Naman Shukla, Arinbjörn Kolbeinsson, Ken Otwell, Lavanya Marla, Kartik Yellepeddi

    Abstract: Ancillaries have become a major source of revenue and profitability in the travel industry. Yet, conventional pricing strategies are based on business rules that are poorly optimized and do not respond to changing market conditions. This paper describes the dynamic pricing model developed by Deepair solutions, an AI technology provider for travel suppliers. We present a pricing model that provides… ▽ More

    Submitted 6 February, 2019; originally announced February 2019.

  11. arXiv:1707.08308  [pdf, other

    cs.LG

    Tensor Regression Networks

    Authors: Jean Kossaifi, Zachary C. Lipton, Arinbjorn Kolbeinsson, Aran Khanna, Tommaso Furlanello, Anima Anandkumar

    Abstract: Convolutional neural networks typically consist of many convolutional layers followed by one or more fully connected layers. While convolutional layers map between high-order activation tensors, the fully connected layers operate on flattened activation vectors. Despite empirical success, this approach has notable drawbacks. Flattening followed by fully connected layers discards multilinear struct… ▽ More

    Submitted 20 July, 2020; v1 submitted 26 July, 2017; originally announced July 2017.