Zum Hauptinhalt springen

Showing 1–23 of 23 results for author: Padhy, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.03349  [pdf, ps, other

    cs.PF cs.DC cs.LG

    Toward Smart Scheduling in Tapis

    Authors: Joe Stubbs, Smruti Padhy, Richard Cardone

    Abstract: The Tapis framework provides APIs for automating job execution on remote resources, including HPC clusters and servers running in the cloud. Tapis can simplify the interaction with remote cyberinfrastructure (CI), but the current services require users to specify the exact configuration of a job to run, including the system, queue, node count, and maximum run time, among other attributes. Moreover… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  2. arXiv:2406.01781  [pdf, other

    cs.LG

    DEFT: Efficient Finetuning of Conditional Diffusion Models by Learning the Generalised $h$-transform

    Authors: Alexander Denker, Francisco Vargas, Shreyas Padhy, Kieran Didi, Simon Mathis, Vincent Dutordoir, Riccardo Barbano, Emile Mathieu, Urszula Julia Komorowska, Pietro Lio

    Abstract: Generative modelling paradigms based on denoising diffusion processes have emerged as a leading candidate for conditional sampling in inverse problems. In many real-world applications, we often have access to large, expensively trained unconditional diffusion models, which we aim to exploit for improving conditional sampling. Most recent approaches are motivated heuristically and lack a unifying f… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2312.09236

  3. arXiv:2405.18457  [pdf, other

    cs.LG stat.ML

    Improving Linear System Solvers for Hyperparameter Optimisation in Iterative Gaussian Processes

    Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, Javier Antorán, José Miguel Hernández-Lobato

    Abstract: Scaling hyperparameter optimisation to very large datasets remains an open problem in the Gaussian process community. This paper focuses on iterative methods, which use linear system solvers, like conjugate gradients, alternating projections or stochastic gradient descent, to construct an estimate of the marginal likelihood gradient. We discuss three key improvements which are applicable across so… ▽ More

    Submitted 6 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Preprint. arXiv admin note: text overlap with arXiv:2405.18328

  4. arXiv:2405.18328  [pdf, other

    cs.LG stat.ML

    Warm Start Marginal Likelihood Optimisation for Iterative Gaussian Processes

    Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, José Miguel Hernández-Lobato

    Abstract: Gaussian processes are a versatile probabilistic machine learning model whose effectiveness often depends on good hyperparameters, which are typically learned by maximising the marginal likelihood. In this work, we consider iterative methods, which use iterative linear system solvers to approximate marginal likelihood gradients up to a specified numerical precision, allowing a trade-off between co… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Advances in Approximate Bayesian Inference 2024

  5. arXiv:2403.01946  [pdf, other

    cs.LG

    A Generative Model of Symmetry Transformations

    Authors: James Urquhart Allingham, Bruno Kacper Mlodozeniec, Shreyas Padhy, Javier Antorán, David Krueger, Richard E. Turner, Eric Nalisnick, José Miguel Hernández-Lobato

    Abstract: Correctly capturing the symmetry transformations of data can lead to efficient models with strong generalization capabilities, though methods incorporating symmetries often require prior knowledge. While recent advancements have been made in learning those symmetries directly from the dataset, most of this work has focused on the discriminative setting. In this paper, we take inspiration from grou… ▽ More

    Submitted 20 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  6. arXiv:2310.20581  [pdf, other

    cs.LG stat.ML

    Stochastic Gradient Descent for Gaussian Processes Done Right

    Authors: Jihao Andreas Lin, Shreyas Padhy, Javier Antorán, Austin Tripp, Alexander Terenin, Csaba Szepesvári, José Miguel Hernández-Lobato, David Janz

    Abstract: As is well known, both sampling from the posterior and computing the mean of the posterior in Gaussian process regression reduces to solving a large linear system of equations. We study the use of stochastic gradient descent for solving this linear system, and show that when \emph{done right} -- by which we mean using specific insights from the optimisation and kernel communities -- stochastic gra… ▽ More

    Submitted 28 April, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  7. arXiv:2307.05745  [pdf, other

    cs.CR cs.DC

    CloudSec: An Extensible Automated Reasoning Framework for Cloud Security Policies

    Authors: Joe Stubbs, Smruti Padhy, Richard Cardone, Steven Black

    Abstract: Users increasingly create, manage and share digital resources, including sensitive data, via cloud platforms and APIs. Platforms encode the rules governing access to these resources, referred to as \textit{security policies}, using different systems and semantics. As the number of resources and rules grows, the challenge of reasoning about them collectively increases. Formal methods tools, such as… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  8. arXiv:2307.01050  [pdf, other

    stat.ML cs.LG

    Transport meets Variational Inference: Controlled Monte Carlo Diffusions

    Authors: Francisco Vargas, Shreyas Padhy, Denis Blessing, Nikolas Nüsken

    Abstract: Connecting optimal transport and variational inference, we present a principled and systematic framework for sampling and generative modelling centred around divergences on path space. Our work culminates in the development of the \emph{Controlled Monte Carlo Diffusion} sampler (CMCD) for Bayesian computation, a score-based annealing technique that crucially adapts both forward and backward dynami… ▽ More

    Submitted 3 July, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Workshop on New Frontiers in Learning, Control, and Dynamical Systems at the International Conference on Machine Learning (ICML), Honolulu, Hawaii, USA, 2023

  9. arXiv:2306.11589  [pdf, other

    cs.LG stat.ML

    Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent

    Authors: Jihao Andreas Lin, Javier Antorán, Shreyas Padhy, David Janz, José Miguel Hernández-Lobato, Alexander Terenin

    Abstract: Gaussian processes are a powerful framework for quantifying uncertainty and for sequential decision-making but are limited by the requirement of solving linear systems. In general, this has a cubic cost in dataset size and is sensitive to conditioning. We explore stochastic gradient algorithms as a computationally efficient method of approximately solving these linear systems: we develop low-varia… ▽ More

    Submitted 15 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Journal ref: Advances in Neural Information Processing Systems, 2023

  10. arXiv:2304.08557  [pdf, other

    cs.CR cs.DC

    A Decentralized Authorization and Security Framework for Distributed Research Workflows

    Authors: Richard Cardone, Smruti Padhy, Steven Black, Sean Cleveland, Joe Stubbs

    Abstract: Research challenges such as climate change and the search for habitable planets increasingly use academic and commercial computing resources distributed across different institutions and physical sites. Furthermore, such analyses often require a level of automation that precludes direct human interaction, and securing these workflows involves adherence to security policies across institutions. In… ▽ More

    Submitted 13 May, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: 10 pages. Short version of this paper to be published on COMPSAC 2023 proceedings

    ACM Class: H.4.0

  11. arXiv:2303.05420  [pdf, other

    stat.ML cs.CV cs.LG

    Kernel Regression with Infinite-Width Neural Networks on Millions of Examples

    Authors: Ben Adlam, Jaehoon Lee, Shreyas Padhy, Zachary Nado, Jasper Snoek

    Abstract: Neural kernels have drastically increased performance on diverse and nonstandard data modalities but require significantly more compute, which previously limited their application to smaller datasets. In this work, we address this by massively parallelizing their computation across many GPUs. We combine this with a distributed, preconditioned conjugate gradients algorithm to enable kernel regressi… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

  12. arXiv:2212.02346  [pdf, other

    cs.LG

    Accu-Help: A Machine Learning based Smart Healthcare Framework for Accurate Detection of Obsessive Compulsive Disorder

    Authors: Kabita Patel, Ajaya Kumar Tripathy, Laxmi Narayan Padhy, Sujita Kumar Kar, Susanta Kumar Padhy, Saraju Prasad Mohanty

    Abstract: In recent years the importance of Smart Healthcare cannot be overstated. The current work proposed to expand the state-of-art of smart healthcare in integrating solutions for Obsessive Compulsive Disorder (OCD). Identification of OCD from oxidative stress biomarkers (OSBs) using machine learning is an important development in the study of OCD. However, this process involves the collection of OCD c… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

  13. arXiv:2210.04994  [pdf, other

    stat.ML cs.AI cs.LG

    Sampling-based inference for large linear models, with application to linearised Laplace

    Authors: Javier Antorán, Shreyas Padhy, Riccardo Barbano, Eric Nalisnick, David Janz, José Miguel Hernández-Lobato

    Abstract: Large-scale linear models are ubiquitous throughout machine learning, with contemporary application as surrogate models for neural network uncertainty quantification; that is, the linearised Laplace method. Alas, the computational cost associated with Bayesian linear models constrains this method's application to small networks, small output spaces and small datasets. We address this limitation by… ▽ More

    Submitted 16 March, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Published at ICLR 2023. This latest Arxiv version is extended with a demonstration of the proposed methods on the Imagenet dataset

  14. arXiv:2205.00403  [pdf, other

    cs.LG stat.ML

    A Simple Approach to Improve Single-Model Deep Uncertainty via Distance-Awareness

    Authors: Jeremiah Zhe Liu, Shreyas Padhy, Jie Ren, Zi Lin, Yeming Wen, Ghassen Jerfel, Zack Nado, Jasper Snoek, Dustin Tran, Balaji Lakshminarayanan

    Abstract: Accurate uncertainty quantification is a major challenge in deep learning, as neural networks can make overconfident errors and assign high confidence predictions to out-of-distribution (OOD) inputs. The most popular approaches to estimate predictive uncertainty in deep learning are methods that combine predictions from multiple neural networks, such as Bayesian neural networks (BNNs) and deep ens… ▽ More

    Submitted 30 December, 2022; v1 submitted 1 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: text overlap with arXiv:2006.10108

  15. arXiv:2106.09022  [pdf, other

    cs.LG

    A Simple Fix to Mahalanobis Distance for Improving Near-OOD Detection

    Authors: Jie Ren, Stanislav Fort, Jeremiah Liu, Abhijit Guha Roy, Shreyas Padhy, Balaji Lakshminarayanan

    Abstract: Mahalanobis distance (MD) is a simple and popular post-processing method for detecting out-of-distribution (OOD) inputs in neural networks. We analyze its failure modes for near-OOD detection and propose a simple fix called relative Mahalanobis distance (RMD) which improves performance and is more robust to hyperparameter choice. On a wide selection of challenging vision, language, and biology OOD… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  16. arXiv:2106.04015  [pdf, other

    cs.LG

    Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning

    Authors: Zachary Nado, Neil Band, Mark Collier, Josip Djolonga, Michael W. Dusenberry, Sebastian Farquhar, Qixuan Feng, Angelos Filos, Marton Havasi, Rodolphe Jenatton, Ghassen Jerfel, Jeremiah Liu, Zelda Mariet, Jeremy Nixon, Shreyas Padhy, Jie Ren, Tim G. J. Rudner, Faris Sbahi, Yeming Wen, Florian Wenzel, Kevin Murphy, D. Sculley, Balaji Lakshminarayanan, Jasper Snoek, Yarin Gal , et al. (1 additional authors not shown)

    Abstract: High-quality estimates of uncertainty and robustness are crucial for numerous real-world applications, especially for deep learning which underlies many deployed ML systems. The ability to compare techniques for improving these estimates is therefore very important for research and practice alike. Yet, competitive comparisons of methods are often lacking due to a range of reasons, including: compu… ▽ More

    Submitted 5 January, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

  17. arXiv:2105.14602  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    On the geometry of generalization and memorization in deep neural networks

    Authors: Cory Stephenson, Suchismita Padhy, Abhinav Ganesh, Yue Hui, Hanlin Tang, SueYeon Chung

    Abstract: Understanding how large neural networks avoid memorizing training data is key to explaining their high generalization performance. To examine the structure of when and where memorization occurs in a deep network, we use a recently developed replica-based mean field theoretic geometric analysis method. We find that all layers preferentially learn from examples which share features, and link this be… ▽ More

    Submitted 30 May, 2021; originally announced May 2021.

    Comments: ICLR 2021

  18. arXiv:2007.05134  [pdf, other

    cs.LG stat.ML

    Revisiting One-vs-All Classifiers for Predictive Uncertainty and Out-of-Distribution Detection in Neural Networks

    Authors: Shreyas Padhy, Zachary Nado, Jie Ren, Jeremiah Liu, Jasper Snoek, Balaji Lakshminarayanan

    Abstract: Accurate estimation of predictive uncertainty in modern neural networks is critical to achieve well calibrated predictions and detect out-of-distribution (OOD) inputs. The most promising approaches have been predominantly focused on improving model uncertainty (e.g. deep ensembles and Bayesian neural networks) and post-processing techniques for OOD detection (e.g. ODIN and Mahalanobis distance). H… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

  19. arXiv:2006.10963  [pdf, other

    cs.LG stat.ML

    Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift

    Authors: Zachary Nado, Shreyas Padhy, D. Sculley, Alexander D'Amour, Balaji Lakshminarayanan, Jasper Snoek

    Abstract: Covariate shift has been shown to sharply degrade both predictive accuracy and the calibration of uncertainty estimates for deep learning models. This is worrying, because covariate shift is prevalent in a wide range of real world deployment settings. However, in this paper, we note that frequently there exists the potential to access small unlabeled batches of the shifted data just before predict… ▽ More

    Submitted 14 January, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

  20. arXiv:2006.10108  [pdf, other

    cs.LG stat.ML

    Simple and Principled Uncertainty Estimation with Deterministic Deep Learning via Distance Awareness

    Authors: Jeremiah Zhe Liu, Zi Lin, Shreyas Padhy, Dustin Tran, Tania Bedrax-Weiss, Balaji Lakshminarayanan

    Abstract: Bayesian neural networks (BNN) and deep ensembles are principled approaches to estimate the predictive uncertainty of a deep learning model. However their practicality in real-time, industrial-scale applications are limited due to their heavy memory and inference cost. This motivates us to study principled approaches to high-quality uncertainty estimation that require only a single deep neural net… ▽ More

    Submitted 25 October, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

  21. arXiv:2003.01787  [pdf, other

    cs.LG cond-mat.dis-nn cs.CL cs.SD eess.AS

    Untangling in Invariant Speech Recognition

    Authors: Cory Stephenson, Jenelle Feather, Suchismita Padhy, Oguz Elibol, Hanlin Tang, Josh McDermott, SueYeon Chung

    Abstract: Encouraged by the success of deep neural networks on a variety of visual tasks, much theoretical and experimental work has been aimed at understanding and interpreting how vision networks operate. Meanwhile, deep neural networks have also achieved impressive performance in audio processing applications, both as sub-components of larger systems and as complete end-to-end systems by themselves. Desp… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: Advances in Neural Information Processing Systems. 2019

  22. arXiv:1910.12587  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Label-efficient audio classification through multitask learning and self-supervision

    Authors: Tyler Lee, Ting Gong, Suchismita Padhy, Andrew Rouditchenko, Anthony Ndirango

    Abstract: While deep learning has been incredibly successful in modeling tasks with large, carefully curated labeled datasets, its application to problems with limited labeled data remains a challenge. The aim of the present work is to improve the label efficiency of large neural networks operating on audio data through a combination of multitask learning and self-supervised learning on unlabeled data. We t… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

    Comments: Presented at ICLR 2019 Limited Labeled Data (LLD) Workshop

  23. arXiv:1703.01641  [pdf, other

    cs.NI

    End-to-End Network Delay Guarantees for Real-Time Systems using SDN

    Authors: Rakesh Kumar, Monowar Hasan, Smruti Padhy, Konstantin Evchenko, Lavanya Piramanayagam, Sibin Mohan, Rakesh B. Bobba

    Abstract: We propose a novel framework that reduces the management and integration overheads for real-time network flows by leveraging the capabilities (especially global visibility and management) of software-defined networking (SDN) architectures. Given the specifications of flows that must meet hard real-time requirements, our framework synthesizes paths through the network and associated switch configur… ▽ More

    Submitted 3 May, 2017; v1 submitted 5 March, 2017; originally announced March 2017.