Showing 1–26 of 26 results for author: Deligiannidis, G

Searching in archive cs.
  1. arXiv:2406.08399  [pdf, other]

    stat.ML cs.LG

    Differentiable Cost-Parameterized Monge Map Estimators

    Authors: Samuel Howard, George Deligiannidis, Patrick Rebeschini, James Thornton

    Abstract: Within the field of optimal transport (OT), the choice of ground cost is crucial to ensuring that the optimality of a transport map corresponds to usefulness in real-world applications. It is therefore desirable to use known information to tailor cost functions and hence learn OT maps which are adapted to the problem at hand. By considering a class of neural ground costs whose Monge maps have a kn…

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2404.17442  [pdf, ps, other]

    stat.ML cs.LG

    Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets

    Authors: Benjamin Dupuis, Paul Viallard, George Deligiannidis, Umut Simsekli

    Abstract: We propose data-dependent uniform generalization bounds by approaching the problem from a PAC-Bayesian perspective. We first apply the PAC-Bayesian framework on 'random sets' in a rigorous way, where the training algorithm is assumed to output a data-dependent hypothesis set after observing the training data. This approach allows us to prove data-dependent bounds, which can be applicable in numero…

    Submitted 26 April, 2024; originally announced April 2024.

  3. arXiv:2402.06320  [pdf, other]

    stat.ML cs.LG stat.CO

    Particle Denoising Diffusion Sampler

    Authors: Angus Phillips, Hai-Dang Dau, Michael John Hutchinson, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet

    Abstract: Denoising diffusion models have become ubiquitous for generative modeling. The core idea is to transport the data distribution to a Gaussian by using a diffusion. Approximate samples from the data distribution are then obtained by estimating the time-reversal of this diffusion using score matching ideas. We follow here a similar strategy to sample from unnormalized probability densities and comput…

    Submitted 15 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: To be published in ICML 2024. 37 pages, 20 figures, 3 tables, 5 algorithms

  4. arXiv:2308.03686  [pdf, other]

    stat.ML cs.LG

    Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization

    Authors: Joe Benton, Valentin De Bortoli, Arnaud Doucet, George Deligiannidis

    Abstract: Denoising diffusions are a powerful method to generate approximate samples from high-dimensional data distributions. Recent results provide polynomial bounds on their convergence rate, assuming $L^2$-accurate scores. Until now, the tightest bounds were either superlinear in the data dimension or required strong smoothness assumptions. We provide the first convergence bounds which are linear in the…

    Submitted 5 March, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

  5. arXiv:2306.07254  [pdf, other]

    stat.ML cs.LG

    On the Expected Size of Conformal Prediction Sets

    Authors: Guneet S. Dhillon, George Deligiannidis, Tom Rainforth

    Abstract: While conformal predictors reap the benefits of rigorous statistical guarantees on their error frequency, the size of their corresponding prediction sets is critical to their practical utility. Unfortunately, there is currently a lack of finite-sample analysis and guarantees for their prediction set sizes. To address this shortfall, we theoretically quantify the expected size of the prediction set…

    Submitted 8 March, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: International Conference on Artificial Intelligence and Statistics (AISTATS), 2024

  6. arXiv:2305.19638  [pdf, other]

    stat.ML cs.CV cs.LG eess.IV

    A Unified Framework for U-Net Design and Analysis

    Authors: Christopher Williams, Fabian Falck, George Deligiannidis, Chris Holmes, Arnaud Doucet, Saifuddin Syed

    Abstract: U-Nets are a go-to, state-of-the-art neural architecture across numerous tasks for continuous signals on a square such as images and Partial Differential Equations (PDE), however their design and architecture is understudied. In this paper, we provide a framework for designing and analysing general U-Net architectures. We present theoretical results which characterise the role of the encoder and d…

    Submitted 10 January, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

  7. arXiv:2305.16860  [pdf, ps, other]

    stat.ML cs.LG

    Error Bounds for Flow Matching Methods

    Authors: Joe Benton, George Deligiannidis, Arnaud Doucet

    Abstract: Score-based generative models are a popular class of generative modelling techniques relying on stochastic differential equations (SDE). From their inception, it was realized that it was also possible to perform generation using ordinary differential equations (ODE) rather than SDE. This led to the introduction of the probability flow ODE approach and denoising diffusion implicit models. Flow matc…

    Submitted 11 February, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  8. arXiv:2302.02766  [pdf, other]

    stat.ML cs.LG

    Generalization Bounds with Data-dependent Fractal Dimensions

    Authors: Benjamin Dupuis, George Deligiannidis, Umut Şimşekli

    Abstract: Providing generalization guarantees for modern neural networks has been a crucial task in statistical learning. Recently, several studies have attempted to analyze the generalization error in such settings by using tools from fractal geometry. While these works have successfully introduced new mathematical tools to apprehend generalization, they heavily rely on a Lipschitz continuity assumption, w…

    Submitted 10 July, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Journal ref: International Conference on Machine Learning (ICML 2023)

  9. arXiv:2301.08187  [pdf, other]

    stat.ML cs.CV cs.LG eess.SP

    A Multi-Resolution Framework for U-Nets with Applications to Hierarchical VAEs

    Authors: Fabian Falck, Christopher Williams, Dominic Danks, George Deligiannidis, Christopher Yau, Chris Holmes, Arnaud Doucet, Matthew Willetts

    Abstract: U-Net architectures are ubiquitous in state-of-the-art deep learning, however their regularisation properties and relationship to wavelets are understudied. In this paper, we formulate a multi-resolution framework which identifies U-Nets as finite-dimensional truncations of models on an infinite-dimensional function space. We provide theoretical results which prove that average pooling corresponds…

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: NeurIPS 2022 (selected as oral)

  10. arXiv:2211.03595  [pdf, other]

    stat.ML cs.LG

    From Denoising Diffusions to Denoising Markov Models

    Authors: Joe Benton, Yuyang Shi, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet

    Abstract: Denoising diffusions are state-of-the-art generative models exhibiting remarkable empirical performance. They work by diffusing the data distribution into a Gaussian distribution and then learning to reverse this noising process to obtain synthetic datapoints. The denoising diffusion relies on approximations of the logarithmic derivatives of the noised data densities using score matching. Such mod…

    Submitted 18 February, 2024; v1 submitted 7 November, 2022; originally announced November 2022.

  11. arXiv:2209.02525  [pdf, other]

    stat.ML cs.LG

    Generalisation under gradient descent via deterministic PAC-Bayes

    Authors: Eugenio Clerico, Tyler Farghly, George Deligiannidis, Benjamin Guedj, Arnaud Doucet

    Abstract: We establish disintegrated PAC-Bayesian generalisation bounds for models trained with gradient descent methods or continuous gradient flows. Contrary to standard practice in the PAC-Bayesian setting, our result applies to optimisation algorithms that are deterministic, without requiring any de-randomisation step. Our bounds are fully computable, depending on the density of the initial distribution…

    Submitted 4 April, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

  12. arXiv:2207.00109  [pdf, other]

    stat.ML cs.IR cs.LG math.OC

    Ranking In Generalized Linear Bandits

    Authors: Amitis Shidani, George Deligiannidis, Arnaud Doucet

    Abstract: We study the ranking problem in generalized linear bandits. At each time, the learning agent selects an ordered list of items and observes stochastic outcomes. In recommendation systems, displaying an ordered list of the most attractive items is not always optimal as both position and item dependencies result in a complex reward function. A very naive example is the lack of diversity when all the…

    Submitted 1 January, 2024; v1 submitted 30 June, 2022; originally announced July 2022.

    Journal ref: AAAI 2024 Workshop on Recommendation Ecosystems: Modeling, Optimization and Incentive Design

  13. arXiv:2205.14987  [pdf, other]

    stat.ML cs.LG

    A Continuous Time Framework for Discrete Denoising Models

    Authors: Andrew Campbell, Joe Benton, Valentin De Bortoli, Tom Rainforth, George Deligiannidis, Arnaud Doucet

    Abstract: We provide the first complete continuous time framework for denoising diffusion models of discrete data. This is achieved by formulating the forward noising process and corresponding reverse time generative process as Continuous Time Markov Chains (CTMCs). The model can be efficiently trained using a continuous time version of the ELBO. We simulate the high dimensional CTMC using techniques develo…

    Submitted 14 October, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: 44 pages, 15 figures; NeurIPS 2022

  14. arXiv:2203.00977  [pdf, ps, other]

    stat.ML cs.IT cs.LG

    Chained Generalisation Bounds

    Authors: Eugenio Clerico, Amitis Shidani, George Deligiannidis, Arnaud Doucet

    Abstract: This work discusses how to derive upper bounds for the expected generalisation error of supervised learning algorithms by means of the chaining technique. By developing a general theoretical framework, we establish a duality between generalisation bounds based on the regularity of the loss function, and their chained counterparts, which can be obtained by lifting the regularity assumption from the…

    Submitted 30 June, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Journal ref: Proceedings of the 35th Conference on Learning Theory, PMLR 178:4212-4257, 2022

  15. arXiv:2203.00554  [pdf, other]

    stat.ML cs.LG

    Neural Score Matching for High-Dimensional Causal Inference

    Authors: Oscar Clivio, Fabian Falck, Brieuc Lehmann, George Deligiannidis, Chris Holmes

    Abstract: Traditional methods for matching in causal inference are impractical for high-dimensional datasets. They suffer from the curse of dimensionality: exact matching and coarsened exact matching find exponentially fewer matches as the input dimension grows, and propensity score matching may match highly unrelated units together. To overcome this problem, we develop theoretical results which motivate th…

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: To appear in AISTATS 2022

  16. arXiv:2202.13460  [pdf, other]

    stat.ML cs.LG

    Conditional Simulation Using Diffusion Schrödinger Bridges

    Authors: Yuyang Shi, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet

    Abstract: Denoising diffusion models have recently emerged as a powerful class of generative models. They provide state-of-the-art results, not only for unconditional simulation, but also when used to solve conditional simulation problems arising in a wide range of inverse problems. A limitation of these models is that they are computationally intensive at generation time as they require simulating a diffus…

    Submitted 26 June, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: 29 pages, 15 figures. UAI 2022 camera-ready version

  17. arXiv:2112.00565  [pdf, other]

    stat.ML cs.LG stat.CO

    On Mixing Times of Metropolized Algorithm With Optimization Step (MAO): A New Framework

    Authors: EL Mahdi Khribch, George Deligiannidis, Daniel Paulin

    Abstract: In this paper, we consider sampling from a class of distributions with thin tails supported on $\mathbb{R}^d$ and make two primary contributions. First, we propose a new Metropolized Algorithm With Optimization Step (MAO), which is well suited for such targets. Our algorithm is capable of sampling from distributions where the Metropolis-adjusted Langevin algorithm (MALA) is not converging or lacki…

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: 24 pages, 27 Figures, 4 Tables

    MSC Class: 65C05; 65C40

    ACM Class: G.3

  18. arXiv:2110.11886  [pdf, other]

    cs.LG stat.ML

    Conditionally Gaussian PAC-Bayes

    Authors: Eugenio Clerico, George Deligiannidis, Arnaud Doucet

    Abstract: Recent studies have empirically investigated different methods to train stochastic neural networks on a classification task by optimising a PAC-Bayesian bound via stochastic gradient descent. Most of these procedures need to replace the misclassification error with a surrogate loss, leading to a mismatch between the optimisation objective and the actual generalisation bound. The present paper prop…

    Submitted 24 February, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:2311-2329, 2022

  19. arXiv:2108.08129  [pdf, ps, other]

    stat.ML cs.LG math.OC math.PR

    Quantitative Uniform Stability of the Iterative Proportional Fitting Procedure

    Authors: George Deligiannidis, Valentin De Bortoli, Arnaud Doucet

    Abstract: We establish the uniform in time stability, w.r.t. the marginals, of the Iterative Proportional Fitting Procedure, also known as Sinkhorn algorithm, used to solve entropy-regularised Optimal Transport problems. Our result is quantitative and stated in terms of the 1-Wasserstein metric. As a corollary we establish a quantitative stability result for Schrödinger bridges.

    Submitted 22 October, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: 14 pages

  20. arXiv:2106.09798  [pdf, other]

    stat.ML cs.LG

    Wide stochastic networks: Gaussian limit and PAC-Bayesian training

    Authors: Eugenio Clerico, George Deligiannidis, Arnaud Doucet

    Abstract: The limit of infinite width allows for substantial simplifications in the analytical study of over-parameterised neural networks. With a suitable random initialisation, an extremely large network exhibits an approximately Gaussian behaviour. In the present work, we establish a similar result for a simple stochastic architecture whose parameters are random variables, holding both before and during…

    Submitted 13 February, 2023; v1 submitted 17 June, 2021; originally announced June 2021.

    Journal ref: The 34th International Conference on Algorithmic Learning Theory (ALT 2023)

  21. arXiv:2106.04881  [pdf, other]

    stat.ML cs.LG

    Fractal Structure and Generalization Properties of Stochastic Optimization Algorithms

    Authors: Alexander Camuto, George Deligiannidis, Murat A. Erdogdu, Mert Gürbüzbalaban, Umut Şimşekli, Lingjiong Zhu

    Abstract: Understanding generalization in deep learning has been one of the major challenges in statistical learning theory over the last decade. While recent work has illustrated that the dataset and the training algorithm must be taken into account in order to obtain meaningful generalization bounds, it is still theoretically not clear which properties of the data and the algorithm determine the generaliz…

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: 34 pages including Supplement, 4 Figures

  22. arXiv:2102.07850  [pdf, other]

    stat.ML cs.LG stat.CO

    Differentiable Particle Filtering via Entropy-Regularized Optimal Transport

    Authors: Adrien Corenflos, James Thornton, George Deligiannidis, Arnaud Doucet

    Abstract: Particle Filtering (PF) methods are an established class of procedures for performing inference in non-linear state-space models. Resampling is a key ingredient of PF, necessary to obtain low variance likelihood and states estimates. However, traditional resampling methods result in PF-based loss functions being non-differentiable with respect to model and PF parameters. In a variational inference…

    Submitted 30 June, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: 9 pages of content + 11 pages supplementary, accepted for oral at ICML 2021

  23. arXiv:2010.12859  [pdf, other]

    cs.LG stat.ML

    Stable ResNet

    Authors: Soufiane Hayou, Eugenio Clerico, Bobby He, George Deligiannidis, Arnaud Doucet, Judith Rousseau

    Abstract: Deep ResNet architectures have achieved state of the art performance on many tasks. While they solve the problem of gradient vanishing, they might suffer from gradient exploding as the depth becomes large (Yang et al. 2017). Moreover, recent results have shown that ResNet might lose expressivity as the depth goes to infinity (Yang et al. 2017, Hayou et al. 2019). To resolve these issues, we introd…

    Submitted 18 March, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: 43 pages, 4 figures

  24. Hausdorff Dimension, Heavy Tails, and Generalization in Neural Networks

    Authors: Umut Şimşekli, Ozan Sener, George Deligiannidis, Murat A. Erdogdu

    Abstract: Despite its success in a wide range of applications, characterizing the generalization properties of stochastic gradient descent (SGD) in non-convex deep learning problems is still an important challenge. While modeling the trajectories of SGD via stochastic differential equations (SDE) under heavy-tailed gradient noise has recently shed light over several peculiar characteristics of SGD, a rigoro…

    Submitted 22 May, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Published at NeurIPS 2020 (Spotlight) -- an imprecision in Definition 2 and a mistake in the statement and the proof of Theorem 2 are fixed

  25. arXiv:1909.13833  [pdf, other]

    stat.ML cs.LG

    Relaxing Bijectivity Constraints with Continuously Indexed Normalising Flows

    Authors: Rob Cornish, Anthony L. Caterini, George Deligiannidis, Arnaud Doucet

    Abstract: We show that normalising flows become pathological when used to model targets whose supports have complicated topologies. In this scenario, we prove that a flow must become arbitrarily numerically noninvertible in order to approximate the target closely. This result has implications for all flow-based models, and especially Residual Flows (ResFlows), which explicitly control the Lipschitz constant…

    Submitted 23 April, 2021; v1 submitted 30 September, 2019; originally announced September 2019.

    Comments: Minor revision

  26. arXiv:1901.09881  [pdf, other]

    stat.ML cs.LG

    Scalable Metropolis-Hastings for Exact Bayesian Inference with Large Datasets

    Authors: Robert Cornish, Paul Vanetti, Alexandre Bouchard-Côté, George Deligiannidis, Arnaud Doucet

    Abstract: Bayesian inference via standard Markov Chain Monte Carlo (MCMC) methods is too computationally intensive to handle large datasets, since the cost per step usually scales like $\Theta(n)$ in the number of data points $n$. We propose the Scalable Metropolis-Hastings (SMH) kernel that exploits Gaussian concentration of the posterior to require processing on average only $O(1)$ or even $O(1/\sqrt{n})$ data…

    Submitted 10 June, 2019; v1 submitted 28 January, 2019; originally announced January 2019.