Zum Hauptinhalt springen

Showing 1–24 of 24 results for author: Godichon-Baggioni, A

.
  1. arXiv:2405.14459  [pdf, other

    stat.ML

    Semi-Discrete Optimal Transport: Nearly Minimax Estimation With Stochastic Gradient Descent and Adaptive Entropic Regularization

    Authors: Ferdinand Genans, Antoine Godichon-Baggioni, François-Xavier Vialard, Olivier Wintenberger

    Abstract: Optimal Transport (OT) based distances are powerful tools for machine learning to compare probability measures and manipulate them using OT maps. In this field, a setting of interest is semi-discrete OT, where the source measure $μ$ is continuous, while the target $ν$ is discrete. Recent works have shown that the minimax rate for the OT map is $\mathcal{O}(t^{-1/2})$ when using $t$ i.i.d. subsampl… ▽ More

    Submitted 24 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2405.01908  [pdf, other

    math.ST stat.ML

    A Full Adagrad algorithm with O(Nd) operations

    Authors: Antoine Godichon-Baggioni, Wei Lu, Bruno Portier

    Abstract: A novel approach is given to overcome the computational challenges of the full-matrix Adaptive Gradient algorithm (Full AdaGrad) in stochastic optimization. By developing a recursive method that estimates the inverse of the square root of the covariance of the gradient, alongside a streaming variant for parameter updates, the study offers efficient and practical algorithms for large-scale applicat… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2404.19496  [pdf, other

    math.ST stat.ML

    Online and Offline Robust Multivariate Linear Regression

    Authors: Antoine Godichon-Baggioni, Stephane S. Robin, Laure Sansonnet

    Abstract: We consider the robust estimation of the parameters of multivariate Gaussian linear regression models. To this aim we consider robust version of the usual (Mahalanobis) least-square criterion, with or without Ridge regularization. We introduce two methods each considered contrast: (i) online stochastic gradient descent algorithms and their averaged versions and (ii) offline fix-point algorithms… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  4. arXiv:2402.02857  [pdf, other

    stat.ML cs.LG

    Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation

    Authors: Sobihan Surendran, Antoine Godichon-Baggioni, Adeline Fermanian, Sylvain Le Corff

    Abstract: Stochastic Gradient Descent (SGD) with adaptive steps is now widely used for training deep neural networks. Most theoretical results assume access to unbiased gradient estimators, which is not the case in several recent deep learning and reinforcement learning applications that use Monte Carlo methods. This paper provides a comprehensive non-asymptotic analysis of SGD with biased gradients and ada… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  5. arXiv:2401.10923  [pdf, other

    math.OC stat.ML

    Online estimation of the inverse of the Hessian for stochastic optimization with application to universal stochastic Newton algorithms

    Authors: Antoine Godichon-Baggioni, Wei Lu, Bruno Portier

    Abstract: This paper addresses second-order stochastic optimization for estimating the minimizer of a convex function written as an expectation. A direct recursive estimation technique for the inverse Hessian matrix using a Robbins-Monro procedure is introduced. This approach enables to drastically reduces computational complexity. Above all, it allows to develop universal stochastic Newton methods and inve… ▽ More

    Submitted 4 July, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

  6. arXiv:2312.09633  [pdf, other

    stat.ME

    Natural Gradient Variational Bayes without Fisher Matrix Analytic Calculation and Its Inversion

    Authors: A. Godichon-Baggioni, D. Nguyen, M-N Tran

    Abstract: This paper introduces a method for efficiently approximating the inverse of the Fisher information matrix, a crucial step in achieving effective variational Bayes inference. A notable aspect of our approach is the avoidance of analytically computing the Fisher information matrix and its explicit inversion. Instead, we introduce an iterative procedure for generating a sequence of matrices that conv… ▽ More

    Submitted 26 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: 43 pages

  7. arXiv:2311.17753  [pdf, other

    math.ST

    On Adaptive Stochastic Optimization for Streaming Data: A Newton's Method with O(dN) Operations

    Authors: Antoine Godichon-Baggioni, Nicklas Werge

    Abstract: Stochastic optimization methods encounter new challenges in the realm of streaming, characterized by a continuous flow of large, high-dimensional data. While first-order methods, like stochastic gradient descent, are the natural choice, they often struggle with ill-conditioned problems. In contrast, second-order methods, such as Newton's methods, offer a potential solution, but their computational… ▽ More

    Submitted 1 February, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  8. arXiv:2309.11916  [pdf, other

    stat.ME math.ST

    A mixture of ellipsoidal densities for 3D data modelling

    Authors: Denis Brazey, Antoine Godichon-Baggioni, Bruno Portier

    Abstract: In this paper, we propose a new ellipsoidal mixture model. This model is based a new probability density function belonging to the family of elliptical distributions and designed to model points spread around an ellipsoidal surface. Then, we consider a mixture model based on this density, whose parameters are estimated with the help of an EM algorithm. The properties of the estimates are studied t… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  9. arXiv:2304.00770  [pdf, other

    stat.ML

    Online stochastic Newton methods for estimating the geometric median and applications

    Authors: Antoine Godichon-Baggioni, Wei Lu

    Abstract: In the context of large samples, a small number of individuals might spoil basic statistical indicators like the mean. It is difficult to detect automatically these atypical individuals, and an alternative strategy is using robust approaches. This paper focuses on estimating the geometric median of a random variable, which is a robust indicator of central tendency. In order to deal with large samp… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  10. arXiv:2303.01370  [pdf, ps, other

    math.OC math.PR math.ST stat.ML

    Non asymptotic analysis of Adaptive stochastic gradient algorithms and applications

    Authors: Antoine Godichon-Baggioni, Pierre Tarrago

    Abstract: In stochastic optimization, a common tool to deal sequentially with large sample is to consider the well-known stochastic gradient algorithm. Nevertheless, since the stepsequence is the same for each direction, this can lead to bad results in practice in case of ill-conditionned problem. To overcome this, adaptive gradient algorithms such that Adagrad or Stochastic Newton algorithms should be pref… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

  11. arXiv:2211.08131  [pdf, other

    stat.ME math.ST

    A robust model-based clustering based on the geometric median and the Median Covariation Matrix

    Authors: Antoine Godichon-Baggioni, Stéphane Robin

    Abstract: Grouping observations into homogeneous groups is a recurrent task in statistical data analysis. We consider Gaussian Mixture Models, which are the most famous parametric model-based clustering method. We propose a new robust approach for model-based clustering, which consists in a modification of the EM algorithm (more specifically, the M-step) by replacing the estimates of the mean and the varian… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  12. arXiv:2209.03597  [pdf, other

    math.ST

    A penalized criterion for selecting the number of clusters for K-medians

    Authors: Antoine Godichon-Baggioni, Sobihan Surendran

    Abstract: Clustering is a usual unsupervised machine learning technique for grouping the data points into groups based upon similar features. We focus here on unsupervised clustering for contaminated data, i.e in the case where K-medians should be preferred to K-means because of its robustness. More precisely, we concentrate on a common question in clustering: how to chose the number of clusters? The answer… ▽ More

    Submitted 27 February, 2024; v1 submitted 8 September, 2022; originally announced September 2022.

  13. arXiv:2205.12549  [pdf, other

    cs.LG math.OC stat.ML

    Learning from time-dependent streaming data with online stochastic algorithms

    Authors: Antoine Godichon-Baggioni, Nicklas Werge, Olivier Wintenberger

    Abstract: This paper addresses stochastic optimization in a streaming setting with time-dependent and biased gradient estimates. We analyze several first-order methods, including Stochastic Gradient Descent (SGD), mini-batch SGD, and time-varying mini-batch SGD, along with their Polyak-Ruppert averages. Our non-asymptotic analysis establishes novel heuristics that link dependence, biases, and convexity leve… ▽ More

    Submitted 18 July, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

  14. arXiv:2109.07117  [pdf, other

    cs.LG math.OC stat.ML

    Non-Asymptotic Analysis of Stochastic Approximation Algorithms for Streaming Data

    Authors: Antoine Godichon-Baggioni, Nicklas Werge, Olivier Wintenberger

    Abstract: We introduce a streaming framework for analyzing stochastic approximation/optimization problems. This streaming framework is analogous to solving optimization problems using time-varying mini-batches that arrive sequentially. We provide non-asymptotic convergence rates of various gradient-based algorithms; this includes the famous Stochastic Gradient (SG) descent (a.k.a. Robbins-Monro algorithm),… ▽ More

    Submitted 24 April, 2023; v1 submitted 15 September, 2021; originally announced September 2021.

  15. arXiv:2107.12058  [pdf, ps, other

    math.ST

    Convergence in quadratic mean of averaged stochastic gradient algorithms without strong convexity nor bounded gradient

    Authors: Antoine Godichon-Baggioni

    Abstract: Online averaged stochastic gradient algorithms are more and more studied since (i) they can deal quickly with large sample taking values in high dimensional spaces, (ii) they enable to treat data sequentially, (iii) they are known to be asymptotically efficient. In this paper, we focus on giving explicit bounds of the quadratic mean error of the estimates, and this, with very weak assumptions, i.e… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  16. On the asymptotic rate of convergence of Stochastic Newton algorithms and their Weighted Averaged versions

    Authors: Claire Boyer, Antoine Godichon-Baggioni

    Abstract: The majority of machine learning methods can be regarded as the minimization of an unavailable risk function. To optimize the latter, given samples provided in a streaming fashion, we define a general stochastic Newton algorithm and its weighted average version. In several use cases, both implementations will be shown not to require the inversion of a Hessian estimate at each iteration, but a dire… ▽ More

    Submitted 29 June, 2023; v1 submitted 19 November, 2020; originally announced November 2020.

    Comments: Computational Optimization and Applications, 2022

  17. arXiv:2006.12920  [pdf, other

    math.ST

    An efficient Averaged Stochastic Gauss-Newton algorithm for estimating parameters of non linear regressions models

    Authors: Peggy Cénac, Antoine Godichon-Baggioni, Bruno Portier

    Abstract: Non linear regression models are a standard tool for modeling real phenomena, with several applications in machine learning, ecology, econometry... Estimating the parameters of the model has garnered a lot of attention during many years. We focus here on a recursive method for estimating parameters of non linear regressions. Indeed, these kinds of methods, whose most famous are probably the stocha… ▽ More

    Submitted 16 September, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

  18. arXiv:1904.07908  [pdf, other

    math.ST

    An efficient stochastic Newton algorithm for parameter estimation in logistic regressions

    Authors: Bernard Bercu, Antoine Godichon-Baggioni, Bruno Portier

    Abstract: Logistic regression is a well-known statistical model which is commonly used in the situation where the output is a binary random variable. It has a wide range of applications including machine learning, public health, social sciences, ecology and econometry. In order to estimate the unknown parameters of logistic regression with data streams arriving sequentially and at high speed, we focus our a… ▽ More

    Submitted 16 April, 2019; originally announced April 2019.

  19. arXiv:1710.07926  [pdf, other

    math.ST

    On the rates of convergence of Parallelized Averaged Stochastic Gradient Algorithms

    Authors: Antoine Godichon-Baggioni, Sofiane Saadane

    Abstract: The growing interest for high dimensional and functional data analysis led in the last decade to an important research developing a consequent amount of techniques. Parallelized algorithms, which consist in distributing and treat the data into different machines, for example, are a good answer to deal with large samples taking values in high dimensional spaces. We introduce here a parallelized ave… ▽ More

    Submitted 22 October, 2017; originally announced October 2017.

  20. Clustering transformed compositional data using K-means, with applications in gene expression and bicycle sharing system data

    Authors: Antoine Godichon-Baggioni, Cathy Maugis-Rabusseau, Andrea Rau

    Abstract: Although there is no shortage of clustering algorithms proposed in the literature, the question of the most relevant strategy for clustering compositional data (i.e., data made up of profiles, whose rows belong to the simplex) remains largely unexplored in cases where the observed value of an observation is equal or close to zero for one or more samples. This work is motivated by the analysis of t… ▽ More

    Submitted 20 April, 2017; originally announced April 2017.

    MSC Class: 62H30; 62P10

  21. arXiv:1702.00931  [pdf, other

    math.ST

    Online estimation of the asymptotic variance for averaged stochastic gradient algorithms

    Authors: Antoine Godichon-Baggioni

    Abstract: Stochastic gradient algorithms are more and more studied since they can deal efficiently and online with large samples in high dimensional spaces. In this paper, we first establish a Central Limit Theorem for these estimates as well as for their averaged version in general Hilbert spaces. Moreover, since having the asymptotic normality of estimates is often unusable without an estimation of the as… ▽ More

    Submitted 16 October, 2017; v1 submitted 3 February, 2017; originally announced February 2017.

  22. arXiv:1609.05479  [pdf, other

    math.ST

    Lp and almost sure rates of convergence of averaged stochastic gradient algorithms: locally strongly convex objective

    Authors: Antoine Godichon-Baggioni

    Abstract: An usual problem in statistics consists in estimating the minimizer of a convex function. When we have to deal with large samples taking values in high dimensional spaces, stochastic gradient algorithms and their averaged versions are efficient candidates. Indeed, (1) they do not need too much computational efforts, (2) they do not need to store all the data, which is crucial when we deal with big… ▽ More

    Submitted 11 January, 2022; v1 submitted 18 September, 2016; originally announced September 2016.

  23. arXiv:1606.04276  [pdf, other

    math.ST

    An averaged projected Robbins-Monro algorithm for estimating the parameters of a truncated spherical distribution

    Authors: Antoine Godichon-Baggioni, Bruno Portier

    Abstract: The objective of this work is to propose a new algorithm to fit a sphere on a noisy 3D point cloud distributed around a complete or a truncated sphere. More precisely, we introduce a projected Robbins-Monro algorithm and its averaged version for estimating the center and the radius of the sphere. We give asymptotic results such as the almost sure convergence of these algorithms as well as the asym… ▽ More

    Submitted 14 June, 2016; originally announced June 2016.

  24. arXiv:1504.02852  [pdf, other

    math.ST

    Fast Estimation of the Median Covariation Matrix with Application to Online Robust Principal Components Analysis

    Authors: Hervé Cardot, Antoine Godichon-Baggioni

    Abstract: The geometric median covariation matrix is a robust multivariate indicator of dispersion which can be extended without any difficulty to functional data. We define estimators, based on recursive algorithms, that can be simply updated at each new observation and are able to deal rapidly with large samples of high dimensional data without being obliged to store all the data in memory. Asymptotic con… ▽ More

    Submitted 9 July, 2016; v1 submitted 11 April, 2015; originally announced April 2015.