Showing 1–37 of 37 results for author: Andoni, A

Searching in archive cs.
  1. arXiv:2310.05315

    cs.CG cs.DS

    Sub-quadratic $(1+\epsilon)$-approximate Euclidean Spanners, with Applications

    Authors: Alexandr Andoni, Hengjie Zhang

    Abstract: We study graph spanners for point sets in high-dimensional Euclidean space. On the one hand, we prove that spanners with stretch $<\sqrt{2}$ and subquadratic size are not possible, even if we add Steiner points. On the other hand, if we add extra nodes to the graph (non-metric Steiner points), then we can obtain $(1+\epsilon)$-approximate spanners of subquadratic size. We show how to construct a spanne…

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: 27 pages

  2. arXiv:2306.11312

    cs.DS cs.LG stat.ML

    Data Structures for Density Estimation

    Authors: Anders Aamand, Alexandr Andoni, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal

    Abstract: We study statistical/computational tradeoffs for the following density estimation problem: given $k$ distributions $v_1, \ldots, v_k$ over a discrete domain of size $n$, and sampling access to a distribution $p$, identify $v_i$ that is "close" to $p$. Our main result is the first data structure that, given a sublinear (in $n$) number of samples from $p$, identifies $v_i$ in time sublinear in $k$.…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: To appear at ICML'23

  3. arXiv:2211.13473

    cs.CC

    Communication Complexity of Inner Product in Symmetric Normed Spaces

    Authors: Alexandr Andoni, Jarosław Błasiok, Arnold Filtser

    Abstract: We introduce and study the communication complexity of computing the inner product of two vectors, where the input is restricted w.r.t. a norm $N$ on the space $\mathbb{R}^n$. Here, Alice and Bob hold two vectors $v,u$ such that $\|v\|_N\le 1$ and $\|u\|_{N^*}\le 1$, where $N^*$ is the dual norm. They want to compute their inner product $\langle v,u \rangle$ up to an $\varepsilon$ additive term. T…

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted to ITCS 2023

  4. arXiv:2112.05106

    cs.DS

    Estimating the Longest Increasing Subsequence in Nearly Optimal Time

    Authors: Alexandr Andoni, Negev Shekel Nosatzki, Sandip Sinha, Clifford Stein

    Abstract: Longest Increasing Subsequence (LIS) is a fundamental statistic of a sequence, and has been studied for decades. While the LIS of a sequence of length $n$ can be computed exactly in time $O(n\log n)$, the complexity of estimating the (length of the) LIS in sublinear time, especially when LIS $\ll n$, is still open. We show that for any integer $n$ and any $λ= o(1)$, there exists a (randomized) n…

    Submitted 1 November, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: Full version of FOCS 2022 paper

    ACM Class: F.2.0

  5. arXiv:2108.05433

    cs.DS cs.LG

    Learning to Hash Robustly, Guaranteed

    Authors: Alexandr Andoni, Daniel Beaglehole

    Abstract: The indexing algorithms for the high-dimensional nearest neighbor search (NNS) with the best worst-case guarantees are based on the randomized Locality Sensitive Hashing (LSH), and its derivatives. In practice, many heuristic approaches exist to "learn" the best indexing method in order to speed up NNS, crucially adapting to the structure of the given dataset. Oftentimes, these heuristics outper…

    Submitted 7 July, 2022; v1 submitted 11 August, 2021; originally announced August 2021.

  6. arXiv:2105.05761

    cs.DS

    From Average Embeddings To Nearest Neighbor Search

    Authors: Alexandr Andoni, David Cheikhi

    Abstract: In this note, we show that one can use average embeddings, introduced recently in [Naor'20, arXiv:1905.01280], to obtain efficient algorithms for approximate nearest neighbor search. In particular, a metric $X$ embeds into $\ell_2$ on average, with distortion $D$, if, for any distribution $μ$ on $X$, the embedding is $D$-Lipschitz and the (square of) distance does not decrease on average (wrt $μ$)…

    Submitted 12 May, 2021; originally announced May 2021.

  7. arXiv:2007.03633

    cs.DS cs.CC cs.CG cs.LG

    Streaming Complexity of SVMs

    Authors: Alexandr Andoni, Collin Burns, Yi Li, Sepideh Mahabadi, David P. Woodruff

    Abstract: We study the space complexity of solving the bias-regularized SVM problem in the streaming model. This is a classic supervised learning problem that has drawn lots of attention, including for developing fast algorithms for solving the problem approximately. One of the most widely used algorithms for approximately optimizing the SVM objective is Stochastic Gradient Descent (SGD), which requires onl…

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: APPROX 2020

  8. arXiv:2005.07678

    cs.DS

    Edit Distance in Near-Linear Time: it's a Constant Factor

    Authors: Alexandr Andoni, Negev Shekel Nosatzki

    Abstract: We present an algorithm for approximating the edit distance between two strings of length $n$ in time $n^{1+\varepsilon}$ up to a constant factor, for any $\varepsilon>0$. Our result completes a research direction set forth in the recent breakthrough paper [Chakraborty-Das-Goldenberg-Koucky-Saks, FOCS'18], which showed the first constant-factor approximation algorithm with a (strongly) sub-quadrat…

    Submitted 14 July, 2022; v1 submitted 15 May, 2020; originally announced May 2020.

  9. arXiv:1911.01956

    cs.DS cs.DC

    Parallel Approximate Undirected Shortest Paths Via Low Hop Emulators

    Authors: Alexandr Andoni, Clifford Stein, Peilin Zhong

    Abstract: We present a $(1+\varepsilon)$-approximate parallel algorithm for computing shortest paths in undirected graphs, achieving $\mathrm{poly}(\log n)$ depth and $m\,\mathrm{poly}(\log n)$ work for $n$-node, $m$-edge graphs. Although sequential algorithms with (nearly) optimal running time have been known for several decades, near-optimal parallel algorithms have turned out to be a much tougher challeng…

    Submitted 5 November, 2019; originally announced November 2019.

  10. arXiv:1905.00850

    cs.DS cs.DC

    Log Diameter Rounds Algorithms for $2$-Vertex and $2$-Edge Connectivity

    Authors: Alexandr Andoni, Clifford Stein, Peilin Zhong

    Abstract: Many modern parallel systems, such as MapReduce, Hadoop and Spark, can be modeled well by the MPC model. The MPC model captures coarse-grained computation on large data well: data is distributed to processors, each of which has a sublinear (in the input data) amount of memory, and we alternate between rounds of computation and rounds of communication, where each machine can communicate an amount…

    Submitted 2 May, 2019; originally announced May 2019.

    Comments: ICALP 2019

  11. arXiv:1811.04065

    cs.DS

    Two Party Distribution Testing: Communication and Security

    Authors: Alexandr Andoni, Tal Malkin, Negev Shekel Nosatzki

    Abstract: We study the problem of discrete distribution testing in the two-party setting. For example, in the standard closeness testing problem, Alice and Bob each have $t$ samples from, respectively, distributions $a$ and $b$ over $[n]$, and they need to test whether $a=b$ or $a,b$ are $ε$-far for some fixed $ε>0$. This is in contrast to the well-studied one-party case, where the tester has unrestricted a…

    Submitted 9 November, 2018; originally announced November 2018.

  12. arXiv:1809.02995

    cs.DS

    On Solving Linear Systems in Sublinear Time

    Authors: Alexandr Andoni, Robert Krauthgamer, Yosef Pogrow

    Abstract: We study sublinear algorithms that solve linear systems locally. In the classical version of this problem the input is a matrix $S\in \mathbb{R}^{n\times n}$ and a vector $b\in\mathbb{R}^n$ in the range of $S$, and the goal is to output $x\in \mathbb{R}^n$ satisfying $Sx=b$. For the case when the matrix $S$ is symmetric diagonally dominant (SDD), the breakthrough algorithm of Spielman and T…

    Submitted 9 September, 2018; originally announced September 2018.

  13. arXiv:1807.08478

    cs.DS

    Batch Sparse Recovery, or How to Leverage the Average Sparsity

    Authors: Alexandr Andoni, Lior Kamma, Robert Krauthgamer, Eric Price

    Abstract: We introduce a batch version of sparse recovery, where the goal is to report a sequence of vectors $A_1',\ldots,A_m' \in \mathbb{R}^n$ that estimate unknown signals $A_1,\ldots,A_m \in \mathbb{R}^n$ using a few linear measurements, each involving exactly one signal vector, under an assumption of average sparsity. More precisely, we want to have…

    Submitted 23 July, 2018; originally announced July 2018.

  14. arXiv:1806.09823

    cs.DS cs.CG cs.DB stat.ML

    Approximate Nearest Neighbor Search in High Dimensions

    Authors: Alexandr Andoni, Piotr Indyk, Ilya Razenshteyn

    Abstract: The nearest neighbor problem is defined as follows: Given a set $P$ of $n$ points in some metric space $(X,D)$, build a data structure that, given any point $q$, returns a point in $P$ that is closest to $q$ (its "nearest neighbor" in $P$). The data structure stores additional information about the set $P$, which is then used to find the nearest neighbor without computing all distances between…

    Submitted 26 June, 2018; originally announced June 2018.

    Comments: 27 pages, no figures; to appear in the proceedings of ICM 2018 (accompanying the talk by P. Indyk)

  15. arXiv:1806.06430

    cs.DS cs.LG

    Subspace Embedding and Linear Regression with Orlicz Norm

    Authors: Alexandr Andoni, Chengyu Lin, Ying Sheng, Peilin Zhong, Ruiqi Zhong

    Abstract: We consider a generalization of the classic linear regression problem to the case when the loss is an Orlicz norm. An Orlicz norm is parameterized by a non-negative convex function $G:\mathbb{R}_+\rightarrow\mathbb{R}_+$ with $G(0)=0$: the Orlicz norm of a vector $x\in\mathbb{R}^n$ is defined as $\|x\|_G=\inf\left\{α>0 \mid \sum_{i=1}^n G(|x_i|/α)\leq 1\right\}$. We consider the cases where…

    Submitted 17 June, 2018; originally announced June 2018.

    Comments: ICML 2018

  16. arXiv:1805.03055

    cs.DS cs.DC

    Parallel Graph Connectivity in Log Diameter Rounds

    Authors: Alexandr Andoni, Clifford Stein, Zhao Song, Zhengyu Wang, Peilin Zhong

    Abstract: We study the graph connectivity problem in the MPC model. On an undirected graph with $n$ nodes and $m$ edges, $O(\log n)$-round connectivity algorithms have been known for over 35 years. However, no algorithms with better complexity bounds were known. In this work, we give fully scalable, faster algorithms for the connectivity problem, by parameterizing the time complexity as a function of the diameter o…

    Submitted 8 May, 2018; originally announced May 2018.

  17. arXiv:1707.04875

    cs.DS cs.IT cs.NI

    Coding sets with asymmetric information

    Authors: Alexandr Andoni, Javad Ghaderi, Daniel Hsu, Dan Rubenstein, Omri Weinstein

    Abstract: We study the following one-way asymmetric transmission problem, also a variant of model-based compressed sensing: a resource-limited encoder has to report a small set $S$ from a universe of $N$ items to a more powerful decoder (server). The distinguishing feature is asymmetric information: the subset $S$ is comprised of i.i.d. samples from a prior distribution $μ$, and $μ$ is only known to the dec…

    Submitted 26 July, 2018; v1 submitted 16 July, 2017; originally announced July 2017.

  18. arXiv:1611.06222

    cs.DS cs.CG cs.LG math.MG

    Approximate Near Neighbors for General Symmetric Norms

    Authors: Alexandr Andoni, Huy L. Nguyen, Aleksandar Nikolov, Ilya Razenshteyn, Erik Waingarten

    Abstract: We show that every symmetric normed space admits an efficient nearest neighbor search data structure with doubly-logarithmic approximation. Specifically, for every $n$, $d = n^{o(1)}$, and every $d$-dimensional symmetric norm $\|\cdot\|$, there exists a data structure for $\mathrm{poly}(\log \log n)$-approximate nearest neighbor search over $\|\cdot\|$ for $n$-point datasets achieving $n^{o(1)}$ q…

    Submitted 24 July, 2017; v1 submitted 18 November, 2016; originally announced November 2016.

    Comments: 27 pages, 1 figure

  19. arXiv:1608.03580

    cs.DS cs.CC cs.CG cs.IR

    Optimal Hashing-based Time-Space Trade-offs for Approximate Near Neighbors

    Authors: Alexandr Andoni, Thijs Laarhoven, Ilya Razenshteyn, Erik Waingarten

    Abstract: [See the paper for the full abstract.] We show tight upper and lower bounds for time-space trade-offs for the $c$-Approximate Near Neighbor Search problem. For the $d$-dimensional Euclidean space and $n$-point datasets, we develop a data structure with space $n^{1 + ρ_u + o(1)} + O(dn)$ and query time $n^{ρ_q + o(1)} + d n^{o(1)}$ for every $ρ_u, ρ_q \geq 0$ such that: \begin{equation} c^2 \sqrt…

    Submitted 21 May, 2017; v1 submitted 11 August, 2016; originally announced August 2016.

    Comments: 62 pages, 5 figures; a merger of arXiv:1511.07527 [cs.DS] and arXiv:1605.02701 [cs.DS], which subsumes both of the preprints. New version contains more elaborated proofs and fixed some typos

  20. arXiv:1605.02701

    cs.DS cs.CC cs.CG cs.IT

    Lower Bounds on Time-Space Trade-Offs for Approximate Near Neighbors

    Authors: Alexandr Andoni, Thijs Laarhoven, Ilya Razenshteyn, Erik Waingarten

    Abstract: We show tight lower bounds for the entire trade-off between space and query time for the Approximate Near Neighbor search problem. Our lower bounds hold in a restricted model of computation, which captures all hashing-based approaches. In particular, our lower bound matches the upper bound recently shown in [Laarhoven 2015] for the random instance on a Euclidean sphere (which we show in fact extend…

    Submitted 18 August, 2016; v1 submitted 9 May, 2016; originally announced May 2016.

    Comments: 47 pages, 2 figures; v2: substantially revised introduction, lots of small corrections; subsumed by arXiv:1608.03580 [cs.DS] (along with arXiv:1511.07527 [cs.DS])

  21. arXiv:1511.06099

    cs.DS

    On Sketching Quadratic Forms

    Authors: Alexandr Andoni, Jiecao Chen, Robert Krauthgamer, Bo Qin, David P. Woodruff, Qin Zhang

    Abstract: We undertake a systematic study of sketching a quadratic form: given an $n \times n$ matrix $A$, create a succinct sketch $\textbf{sk}(A)$ which can produce (without further access to $A$) a multiplicative $(1+ε)$-approximation to $x^T A x$ for any desired query $x \in \mathbb{R}^n$. While a general matrix does not admit non-trivial sketches, positive semi-definite (PSD) matrices admit sketches of…

    Submitted 19 November, 2015; originally announced November 2015.

    Comments: 46 pages; merging of arXiv:1403.7058 and arXiv:1412.8225

  22. arXiv:1509.02897

    cs.DS cs.CG cs.IR

    Practical and Optimal LSH for Angular Distance

    Authors: Alexandr Andoni, Piotr Indyk, Thijs Laarhoven, Ilya Razenshteyn, Ludwig Schmidt

    Abstract: We show the existence of a Locality-Sensitive Hashing (LSH) family for the angular distance that yields an approximate Near Neighbor Search algorithm with the asymptotically optimal running time exponent. Unlike earlier algorithms with this property (e.g., Spherical LSH [Andoni, Indyk, Nguyen, Razenshteyn 2014], [Andoni, Razenshteyn 2015]), our algorithm is also practical, improving upon the well-…

    Submitted 9 September, 2015; originally announced September 2015.

    Comments: 22 pages, an extended abstract is to appear in the proceedings of the 29th Annual Conference on Neural Information Processing Systems (NIPS 2015)

  23. arXiv:1507.04299

    cs.DS cs.CC cs.CG

    Tight Lower Bounds for Data-Dependent Locality-Sensitive Hashing

    Authors: Alexandr Andoni, Ilya Razenshteyn

    Abstract: We prove a tight lower bound for the exponent $ρ$ for data-dependent Locality-Sensitive Hashing schemes, recently used to design efficient solutions for the $c$-approximate nearest neighbor search. In particular, our lower bound matches the bound of $ρ\le \frac{1}{2c-1}+o(1)$ for the $\ell_1$ space, obtained via the recent algorithm from [Andoni-Razenshteyn, STOC'15]. In recent years it emerged…

    Submitted 15 July, 2015; originally announced July 2015.

    Comments: 16 pages, no figures

  24. arXiv:1501.01062

    cs.DS

    Optimal Data-Dependent Hashing for Approximate Near Neighbors

    Authors: Alexandr Andoni, Ilya Razenshteyn

    Abstract: We show an optimal data-dependent hashing scheme for the approximate near neighbor problem. For an $n$-point data set in a $d$-dimensional space our data structure achieves query time $O(d n^{ρ+o(1)})$ and space $O(n^{1+ρ+o(1)} + dn)$, where $ρ=\tfrac{1}{2c^2-1}$ for the Euclidean space and approximation $c>1$. For the Hamming space, we obtain an exponent of $ρ=\tfrac{1}{2c-1}$. Our result compl…

    Submitted 15 July, 2015; v1 submitted 5 January, 2015; originally announced January 2015.

    Comments: 36 pages, 5 figures, an extended abstract appeared in the proceedings of the 47th ACM Symposium on Theory of Computing (STOC 2015)

  25. arXiv:1411.2577

    cs.DS cs.CC math.FA

    Sketching and Embedding are Equivalent for Norms

    Authors: Alexandr Andoni, Robert Krauthgamer, Ilya Razenshteyn

    Abstract: An outstanding open question posed by Guha and Indyk in 2006 asks to characterize metric spaces in which distances can be estimated using efficient sketches. Specifically, we say that a sketching algorithm is efficient if it achieves constant approximation using constant sketch size. A well-known result of Indyk (J. ACM, 2006) implies that a metric that admits a constant-distortion embedding into…

    Submitted 15 February, 2017; v1 submitted 10 November, 2014; originally announced November 2014.

    Comments: 33 pages, an extended abstract appeared in the proceedings of the 47th ACM Symposium on Theory of Computing (STOC 2015); changes in v2: added quantitative bounds for the main results, preliminaries section with necessary definitions and facts has been added; v3: several clarifications, including a section on the basics of communication complexity

  26. arXiv:1408.0751

    cs.DS

    Spectral Approaches to Nearest Neighbor Search

    Authors: Amirali Abdullah, Alexandr Andoni, Ravindran Kannan, Robert Krauthgamer

    Abstract: We study spectral algorithms for the high-dimensional Nearest Neighbor Search problem (NNS). In particular, we consider a semi-random setting where a dataset $P$ in $\mathbb{R}^d$ is chosen arbitrarily from an unknown subspace of low dimension $k\ll d$, and then perturbed by fully $d$-dimensional Gaussian noise. We design spectral NNS algorithms whose query time depends polynomially on $d$ and…

    Submitted 4 August, 2014; originally announced August 2014.

    Comments: Accepted in the proceedings of FOCS 2014. 30 pages and 4 figures

  27. arXiv:1403.7058

    cs.DS

    The Sketching Complexity of Graph Cuts

    Authors: Alexandr Andoni, Robert Krauthgamer, David P. Woodruff

    Abstract: We study the problem of sketching an input graph, so that given the sketch, one can estimate the weight of any cut in the graph within factor $1+ε$. We present lower and upper bounds on the size of a randomized sketch, focusing on the dependence on the accuracy parameter $ε>0$. First, we prove that for every $ε> 1/\sqrt n$, every sketch that succeeds (with constant probability) in estimating the…

    Submitted 10 November, 2014; v1 submitted 27 March, 2014; originally announced March 2014.

    Comments: The current version differs slightly from an earlier one (arXiv:1403.7058v1). First, the lower bound for the number of edges in $(1+ε)$-cut sparsifiers is improved, and now our bound is tight. Second, we retract our earlier claim that the sparsification algorithm can be performed in two passes of streaming over the graph edges

  28. arXiv:1401.0042

    cs.DS cs.DC

    Parallel Algorithms for Geometric Graph Problems

    Authors: Alexandr Andoni, Aleksandar Nikolov, Krzysztof Onak, Grigory Yaroslavtsev

    Abstract: We give algorithms for geometric graph problems in the modern parallel models inspired by MapReduce. For example, for the Minimum Spanning Tree (MST) problem over a set of points in the two-dimensional space, our algorithm computes a $(1+ε)$-approximate MST. Our algorithms work in a constant number of rounds of communication, while using total space and communication proportional to the size of th…

    Submitted 4 January, 2014; v1 submitted 30 December, 2013; originally announced January 2014.

  29. arXiv:1310.3252

    cs.DS math.CO

    Towards (1+ε)-Approximate Flow Sparsifiers

    Authors: Alexandr Andoni, Anupam Gupta, Robert Krauthgamer

    Abstract: A useful approach to "compress" a large network $G$ is to represent it with a flow-sparsifier, i.e., a small network $H$ that supports the same flows as $G$, up to a factor $q \geq 1$ called the quality of the sparsifier. Specifically, we assume the network $G$ contains a set of $k$ terminals $T$, shared with the network $H$, i.e., $T\subseteq V(G)\cap V(H)$, and we want $H$ to preserve all mult…

    Submitted 11 October, 2013; originally announced October 2013.

    Comments: Full version of a paper accepted to SODA 2014

  30. arXiv:1306.6295

    cs.DS cs.IT math.ST

    Tight Lower Bound for Linear Sketches of Moments

    Authors: Alexandr Andoni, Huy L. Nguyen, Yury Polyanskiy, Yihong Wu

    Abstract: The problem of estimating frequency moments of a data stream has attracted a lot of attention since the onset of streaming algorithms [AMS99]. While the space complexity for approximately computing the $p^{\rm th}$ moment, for $p\in(0,2]$ has been settled [KNW10], for $p>2$ the exact complexity remains open. For $p>2$ the current best algorithm uses $O(n^{1-2/p}\log n)$ words of space [AKO11,BO10]…

    Submitted 26 June, 2013; originally announced June 2013.

    Comments: In Proceedings of the 40th International Colloquium on Automata, Languages and Programming (ICALP), Riga, Latvia, July 2013

  31. arXiv:1306.1547

    cs.DS cs.CG

    Beyond Locality-Sensitive Hashing

    Authors: Alexandr Andoni, Piotr Indyk, Huy L. Nguyen, Ilya Razenshteyn

    Abstract: We present a new data structure for the c-approximate near neighbor problem (ANN) in the Euclidean space. For n points in R^d, our algorithm achieves O(n^ρ + d log n) query time and O(n^{1 + ρ} + d log n) space, where ρ<= 7/(8c^2) + O(1 / c^3) + o(1). This is the first improvement over the result by Andoni and Indyk (FOCS 2006) and the first data structure that bypasses a locality-sensitive hashin…

    Submitted 8 October, 2013; v1 submitted 6 June, 2013; originally announced June 2013.

    Comments: 17 pages, many corrections, added some intuition for the main ingredients, Section 4 has been rewritten completely; to appear at ACM-SIAM Symposium on Discrete Algorithms (SODA 2014)

  32. arXiv:1305.1359

    cs.LG

    A Differential Equations Approach to Optimizing Regret Trade-offs

    Authors: Alexandr Andoni, Rina Panigrahy

    Abstract: We consider the classical question of predicting binary sequences and study the optimal algorithms for obtaining the best possible regret and payoff functions for this problem. The question also turns out to be equivalent to the problem of optimal trade-offs between the regrets of two experts in an "experts problem", studied before by \cite{kearns-regret}. While, say, a regret of…

    Submitted 6 May, 2013; originally announced May 2013.

  33. arXiv:1109.5635

    cs.DS

    Approximating Edit Distance in Near-Linear Time

    Authors: Alexandr Andoni, Krzysztof Onak

    Abstract: We show how to compute the edit distance between two strings of length n up to a factor of 2^{Õ(sqrt(log n))} in n^(1+o(1)) time. This is the first sub-polynomial approximation algorithm for this problem that runs in near-linear time, improving on the state-of-the-art n^(1/3+o(1)) approximation. Previously, approximation of 2^{Õ(sqrt(log n))} was known only for embedding edit distance into l_1, an…

    Submitted 26 September, 2011; originally announced September 2011.

    Comments: Preliminary version appeared in STOC 2009

  34. arXiv:1011.1263

    cs.DS cs.CG

    Streaming Algorithms from Precision Sampling

    Authors: Alexandr Andoni, Robert Krauthgamer, Krzysztof Onak

    Abstract: A technique introduced by Indyk and Woodruff [STOC 2005] has inspired several recent advances in data-stream algorithms. We show that a number of these results follow easily from the application of a single probabilistic method called Precision Sampling. Using this method, we obtain simple data-stream algorithms that maintain a randomized sketch of an input vector $x=(x_1,\ldots,x_n)$, which is useful…

    Submitted 22 April, 2011; v1 submitted 4 November, 2010; originally announced November 2010.

  35. arXiv:1005.4033

    cs.DS

    Polylogarithmic Approximation for Edit Distance and the Asymmetric Query Complexity

    Authors: Alexandr Andoni, Robert Krauthgamer, Krzysztof Onak

    Abstract: We present a near-linear time algorithm that approximates the edit distance between two strings within a polylogarithmic factor; specifically, for strings of length n and every fixed epsilon>0, it can compute a (log n)^O(1/epsilon) approximation in n^(1+epsilon) time. This is an exponential improvement over the previously known factor, 2^(O (sqrt(log n))), with a comparable running time (Ostrovsky…

    Submitted 21 May, 2010; originally announced May 2010.

  36. arXiv:0912.2577

    math.PR cs.DS math.ST q-bio.PE q-bio.QM

    Global Alignment of Molecular Sequences via Ancestral State Reconstruction

    Authors: Alexandr Andoni, Constantinos Daskalakis, Avinatan Hassidim, Sebastien Roch

    Abstract: Molecular phylogenetic techniques do not generally account for such common evolutionary events as site insertions and deletions (known as indels). Instead, tree-building algorithms and ancestral state inference procedures typically rely on substitution-only models of sequence evolution. In practice these methods are extended beyond this simplified setting with the use of heuristics that produce g…

    Submitted 14 December, 2009; originally announced December 2009.

  37. arXiv:0808.2222

    cs.DS

    Better Bounds for Frequency Moments in Random-Order Streams

    Authors: Alexandr Andoni, Andrew McGregor, Krzysztof Onak, Rina Panigrahy

    Abstract: Estimating frequency moments of data streams is a very well studied problem and tight bounds are known on the amount of space that is necessary and sufficient when the stream is adversarially ordered. Recently, motivated by various practical considerations and applications in learning and statistics, there has been growing interest in studying streams that are randomly ordered. In the paper we…

    Submitted 15 August, 2008; originally announced August 2008.

    Comments: 4 pages