Showing 1–10 of 10 results for author: Kavis, A

Searching in archive cs.
  1. arXiv:2406.02016  [pdf, other]

    math.OC cs.LG stat.ML

    Adaptive and Optimal Second-order Optimistic Methods for Minimax Optimization

    Authors: Ruichen Jiang, Ali Kavis, Qiujiang Jin, Sujay Sanghavi, Aryan Mokhtari

    Abstract: We propose adaptive, line search-free second-order methods with optimal rate of convergence for solving convex-concave min-max problems. By means of an adaptive step size, our algorithms feature a simple update rule that requires solving only one linear system per iteration, eliminating the need for line search or backtracking mechanisms. Specifically, we base our algorithms on the optimistic meth…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 33 pages, 2 figures

  2. arXiv:2211.01851  [pdf, other]

    math.OC cs.LG

    Adaptive Stochastic Variance Reduction for Non-convex Finite-Sum Minimization

    Authors: Ali Kavis, Stratis Skoulakis, Kimon Antonakopoulos, Leello Tadesse Dadi, Volkan Cevher

    Abstract: We propose an adaptive variance-reduction method, called AdaSpider, for minimization of $L$-smooth, non-convex functions with a finite-sum structure. In essence, AdaSpider combines an AdaGrad-inspired [Duchi et al., 2011, McMahan & Streeter, 2010], but a fairly distinct, adaptive step-size schedule with the recursive stochastic path integrated estimator proposed in [Fang et al., 2018]. To our know…

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: 23 pages, 2 figures, accepted at NeurIPS 2022

  3. arXiv:2211.01832  [pdf, other]

    math.OC cs.LG stat.ML

    Extra-Newton: A First Approach to Noise-Adaptive Accelerated Second-Order Methods

    Authors: Kimon Antonakopoulos, Ali Kavis, Volkan Cevher

    Abstract: This work proposes a universal and adaptive second-order method for minimizing second-order smooth, convex functions. Our algorithm achieves $O(σ/ \sqrt{T})$ convergence when the oracle feedback is stochastic with variance $σ^2$, and improves its convergence to $O( 1 / T^3)$ with deterministic oracles, where $T$ is the number of iterations. Our method also interpolates these rates without knowing…

    Submitted 12 December, 2022; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: 32 pages, 4 figures, accepted at NeurIPS 2022

  4. arXiv:2204.02833  [pdf, ps, other]

    math.OC cs.LG

    High Probability Bounds for a Class of Nonconvex Algorithms with AdaGrad Stepsize

    Authors: Ali Kavis, Kfir Yehuda Levy, Volkan Cevher

    Abstract: In this paper, we propose a new, simplified high probability analysis of AdaGrad for smooth, non-convex problems. More specifically, we focus on a particular accelerated gradient (AGD) template (Lan, 2020), through which we recover the original AdaGrad and its variant with averaging, and prove a convergence rate of $\mathcal O (1/ \sqrt{T})$ with high probability without the knowledge of smoothnes…

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: 27 pages, accepted to ICLR 2022

  5. arXiv:2111.01040  [pdf, other]

    math.OC cs.LG

    STORM+: Fully Adaptive SGD with Momentum for Nonconvex Optimization

    Authors: Kfir Y. Levy, Ali Kavis, Volkan Cevher

    Abstract: In this work we investigate stochastic non-convex optimization problems where the objective is an expectation over smooth loss functions, and the goal is to find an approximate stationary point. The most popular approach to handling such problems is variance reduction techniques, which are also known to obtain tight convergence rates, matching the lower bounds in this case. Nevertheless, these tec…

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 25 pages, 1 figure, accepted to NeurIPS 2021

  6. arXiv:2007.01147  [pdf, ps, other]

    math.ST cs.CC

    Double-Loop Unadjusted Langevin Algorithm

    Authors: Paul Rolland, Armin Eftekhari, Ali Kavis, Volkan Cevher

    Abstract: A well-known first-order method for sampling from log-concave probability distributions is the Unadjusted Langevin Algorithm (ULA). This work proposes a new annealing step-size schedule for ULA, which allows to prove new convergence guarantees for sampling from a smooth log-concave distribution, which are not covered by existing state-of-the-art convergence guarantees. To establish this result, we…

    Submitted 2 July, 2020; originally announced July 2020.

  7. arXiv:2006.11144  [pdf, other]

    math.OC cs.LG math.PR stat.ML

    On the Almost Sure Convergence of Stochastic Gradient Descent in Non-Convex Problems

    Authors: Panayotis Mertikopoulos, Nadav Hallak, Ali Kavis, Volkan Cevher

    Abstract: This paper analyzes the trajectories of stochastic gradient descent (SGD) to help understand the algorithm's convergence properties in non-convex problems. We first show that the sequence of iterates generated by SGD remains bounded and converges with probability $1$ under a very broad range of step-size schedules. Subsequently, going beyond existing positive probability guarantees, we show that S…

    Submitted 19 June, 2020; originally announced June 2020.

    Comments: 32 pages, 8 figures

    MSC Class: Primary 90C26; 62L20; secondary 90C30; 90C15; 37N40

  8. arXiv:1910.13857  [pdf, other]

    math.OC cs.LG

    UniXGrad: A Universal, Adaptive Algorithm with Optimal Guarantees for Constrained Optimization

    Authors: Ali Kavis, Kfir Y. Levy, Francis Bach, Volkan Cevher

    Abstract: We propose a novel adaptive, accelerated algorithm for the stochastic constrained convex optimization setting. Our method, which is inspired by the Mirror-Prox method, \emph{simultaneously} achieves the optimal rates for smooth/non-smooth problems with either deterministic/stochastic first-order oracles. This is done without any prior knowledge of the smoothness nor the noise properties of the pro…

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019

  9. arXiv:1812.04428  [pdf, other]

    cs.LG stat.ML

    Efficient learning of smooth probability functions from Bernoulli tests with guarantees

    Authors: Paul Rolland, Ali Kavis, Alex Immer, Adish Singla, Volkan Cevher

    Abstract: We study the fundamental problem of learning an unknown, smooth probability function via pointwise Bernoulli tests. We provide a scalable algorithm for efficiently solving this problem with rigorous guarantees. In particular, we prove the convergence rate of our posterior update rule to the true probability function in L2-norm. Moreover, we allow the Bernoulli tests to depend on contextual feature…

    Submitted 23 August, 2019; v1 submitted 11 December, 2018; originally announced December 2018.

  10. arXiv:1802.10174  [pdf, other]

    cs.LG math.OC

    Mirrored Langevin Dynamics

    Authors: Ya-Ping Hsieh, Ali Kavis, Paul Rolland, Volkan Cevher

    Abstract: We consider the problem of sampling from constrained distributions, which has posed significant challenges to both non-asymptotic analysis and algorithmic design. We propose a unified framework, which is inspired by the classical mirror descent, to derive novel first-order sampling schemes. We prove that, for a general target distribution with strongly convex potential, our framework implies the e…

    Submitted 30 December, 2020; v1 submitted 27 February, 2018; originally announced February 2018.