Zum Hauptinhalt springen

Showing 1–50 of 82 results for author: Krauthgamer, R

Searching in archive cs. Search in all archives.
.
  1. Moderate Dimension Reduction for $k$-Center Clustering

    Authors: Shaofeng H. -C. Jiang, Robert Krauthgamer, Shay Sapir

    Abstract: The Johnson-Lindenstrauss (JL) Lemma introduced the concept of dimension reduction via a random linear map, which has become a fundamental technique in many computational settings. For a set of $n$ points in $\mathbb{R}^d$ and any fixed $ε>0$, it reduces the dimension $d$ to $O(\log n)$ while preserving, with high probability, all the pairwise Euclidean distances within factor $1+ε$. Perhaps surpr… ▽ More

    Submitted 6 August, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

    Comments: 24 pages, appeared in SoCG 2024. v3: minor corrections in page 8 and in section 5. v4: added references on robust coresets in section 3

  2. arXiv:2307.09110  [pdf, ps, other

    cs.DS

    Cut Sparsification and Succinct Representation of Submodular Hypergraphs

    Authors: Yotam Kenneth, Robert Krauthgamer

    Abstract: In cut sparsification, all cuts of a hypergraph $H=(V,E,w)$ are approximated within $1\pmε$ factor by a small hypergraph $H'$. This widely applied method was generalized recently to a setting where the cost of cutting each hyperedge $e$ is provided by a splitting function $g_e: 2^e\to\mathbb{R}_+$. This generalization is called a submodular hypergraph when the functions $\{g_e\}_{e\in E}$ are subm… ▽ More

    Submitted 18 February, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

  3. arXiv:2307.07848  [pdf, ps, other

    cs.DS cs.DC

    Fully Scalable MPC Algorithms for Clustering in High Dimension

    Authors: Artur Czumaj, Guichen Gao, Shaofeng H. -C. Jiang, Robert Krauthgamer, Pavel Veselý

    Abstract: We design new parallel algorithms for clustering in high-dimensional Euclidean spaces. These algorithms run in the Massively Parallel Computation (MPC) model, and are fully scalable, meaning that the local memory in each machine may be $n^σ$ for arbitrarily small fixed $σ>0$. Importantly, the local memory may be substantially smaller than the number of clusters $k$, yet all our algorithms are fast… ▽ More

    Submitted 6 July, 2024; v1 submitted 15 July, 2023; originally announced July 2023.

  4. arXiv:2303.16287  [pdf, ps, other

    cs.DS

    Lower Bounds for Pseudo-Deterministic Counting in a Stream

    Authors: Vladimir Braverman, Robert Krauthgamer, Aditya Krishnan, Shay Sapir

    Abstract: Many streaming algorithms provide only a high-probability relative approximation. These two relaxations, of allowing approximation and randomization, seem necessary -- for many streaming problems, both relaxations must be employed simultaneously, to avoid an exponentially larger (and often trivial) space complexity. A common drawback of these randomized approximate algorithms is that independent e… ▽ More

    Submitted 15 May, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: 14 pages, ICALP2023

  5. arXiv:2212.01821  [pdf, ps, other

    cs.DS

    Clustering Permutations: New Techniques with Streaming Applications

    Authors: Diptarka Chakraborty, Debarati Das, Robert Krauthgamer

    Abstract: We study the classical metric $k$-median clustering problem over a set of input rankings (i.e., permutations), which has myriad applications, from social-choice theory to web search and databases. A folklore algorithm provides a $2$-approximate solution in polynomial time for all $k=O(1)$, and works irrespective of the underlying distance measure, so long it is a metric; however, going below the… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    ACM Class: F.2.0

  6. arXiv:2211.12496  [pdf, other

    cs.DS

    An Algorithmic Bridge Between Hamming and Levenshtein Distances

    Authors: Elazar Goldenberg, Tomasz Kociumaka, Robert Krauthgamer, Barna Saha

    Abstract: The edit distance between strings classically assigns unit cost to every character insertion, deletion, and substitution, whereas the Hamming distance only allows substitutions. In many real-life scenarios, insertions and deletions (abbreviated indels) appear frequently but significantly less so than substitutions. To model this, we consider substitutions being cheaper than indels, with cost… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: The full version of a paper accepted to ITCS 2023; abstract shortened to meet arXiv requirements

    ACM Class: F.2.2

  7. arXiv:2211.05293  [pdf, ps, other

    cs.DS

    Streaming Euclidean Max-Cut: Dimension vs Data Reduction

    Authors: Xiaoyu Chen, Shaofeng H. -C. Jiang, Robert Krauthgamer

    Abstract: Max-Cut is a fundamental problem that has been studied extensively in various settings. We design an algorithm for Euclidean Max-Cut, where the input is a set of points in $\mathbb{R}^d$, in the model of dynamic geometric streams, where the input $X\subseteq [Δ]^d$ is presented as a sequence of point insertions and deletions. Previously, Frahling and Sohler [STOC 2005] designed a $(1+ε)$-approxima… ▽ More

    Submitted 29 March, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  8. arXiv:2209.07230  [pdf, ps, other

    stat.ML cs.LG

    Recovery Guarantees for Distributed-OMP

    Authors: Chen Amiraz, Robert Krauthgamer, Boaz Nadler

    Abstract: We study distributed schemes for high-dimensional sparse linear regression, based on orthogonal matching pursuit (OMP). Such schemes are particularly suited for settings where a central fusion center is connected to end machines, that have both computation and communication limitations. We prove that under suitable assumptions, distributed-OMP schemes recover the support of the regression vector w… ▽ More

    Submitted 31 October, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: 47 pages, 4 figures

  9. arXiv:2209.01901  [pdf, ps, other

    cs.DS

    The Power of Uniform Sampling for Coresets

    Authors: Vladimir Braverman, Vincent Cohen-Addad, Shaofeng H. -C. Jiang, Robert Krauthgamer, Chris Schwiegelshohn, Mads Bech Toftrup, Xuan Wu

    Abstract: Motivated by practical generalizations of the classic $k$-median and $k$-means objectives, such as clustering with size constraints, fair clustering, and Wasserstein barycenter, we introduce a meta-theorem for designing coresets for constrained-clustering problems. The meta-theorem reduces the task of coreset construction to one on a bounded number of ring instances with a much-relaxed additive er… ▽ More

    Submitted 17 September, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

  10. arXiv:2207.07363  [pdf, other

    cs.DS

    Exact Flow Sparsification Requires Unbounded Size

    Authors: Robert Krauthgamer, Ron Mosenzon

    Abstract: Given a large edge-capacitated network $G$ and a subset of $k$ vertices called terminals, an (exact) flow sparsifier is a small network $G'$ that preserves (exactly) all multicommodity flows that can be routed between the terminals. Flow sparsifiers were introduced by Leighton and Moitra [STOC 2010], and have been studied and used in many algorithmic contexts. A fundamental question that remaine… ▽ More

    Submitted 3 August, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: 17 pages, 5 figures. The first version of this paper proved a weaker statement of Theorem 1.7 with 4 commodities. The current statement has only 3 commodities, and now fully refutes Seymour's conjectures

  11. arXiv:2207.01459  [pdf, ps, other

    cs.DS

    Optimal Vertex-Cut Sparsification of Quasi-Bipartite Graphs

    Authors: Itai Boneh, Robert Krauthgamer

    Abstract: In vertex-cut sparsification, given a graph $G=(V,E)$ with a terminal set $T\subseteq V$, we wish to construct a graph $G'=(V',E')$ with $T\subseteq V'$, such that for every two sets of terminals $A,B\subseteq T$, the size of a minimum $(A,B)$-vertex-cut in $G'$ is the same as in $G$. In the most basic setting, $G$ is unweighted and undirected, and we wish to bound the size of $G'$ by a function o… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 12 pages, 3 figures

  12. arXiv:2206.10681  [pdf, other

    cs.DS cs.CG cs.DM

    Near-Linear $\varepsilon$-Emulators for Planar Graphs

    Authors: Hsien-Chih Chang, Robert Krauthgamer, Zihan Tan

    Abstract: We study vertex sparsification for distances, in the setting of planar graphs with distortion: Given a planar graph $G$ (with edge weights) and a subset of $k$ terminal vertices, the goal is to construct an $\varepsilon$-emulator, which is a small planar graph $G'$ that contains the terminals and preserves the distances between the terminals up to factor $1+\varepsilon$. We construct the first… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: Conference version appeared in STOC 2022

  13. arXiv:2204.02095  [pdf, other

    cs.DS

    Streaming Facility Location in High Dimension via Geometric Hashing

    Authors: Artur Czumaj, Arnold Filtser, Shaofeng H. -C. Jiang, Robert Krauthgamer, Pavel Veselý, Mingwei Yang

    Abstract: In Euclidean Uniform Facility Location (UFL), the input is a set of clients in $\mathbb{R}^d$ and the goal is to place facilities to serve them, so as to minimize the total cost of opening facilities plus connecting the clients. We study the setting of dynamic geometric streams, where the clients are presented as a sequence of insertions and deletions of points in the grid $\{1,\ldots,Δ\}^d$, and… ▽ More

    Submitted 28 January, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: The abstract is shortened to meet the length constraint of arXiv

  14. arXiv:2201.12874  [pdf, ps, other

    math.NA cs.DS

    Comparison of Matrix Norm Sparsification

    Authors: Robert Krauthgamer, Shay Sapir

    Abstract: A well-known approach in the design of efficient algorithms, called matrix sparsification, approximates a matrix $A$ with a sparse matrix $A'$. Achlioptas and McSherry [2007] initiated a long line of work on spectral-norm sparsification, which aims to guarantee that $\|A'-A\|\leq ε\|A\|$ for error parameter $ε>0$. Various forms of matrix approximation motivate considering this problem with a guara… ▽ More

    Submitted 9 September, 2023; v1 submitted 30 January, 2022; originally announced January 2022.

  15. arXiv:2112.06916  [pdf, ps, other

    cs.DS

    Flow Metrics on Graphs

    Authors: Lior Kalman, Robert Krauthgamer

    Abstract: Given a graph with non-negative edge weights, there are various ways to interpret the edge weights and induce a metric on the vertices of the graph. A few examples are shortest-path, when interpreting the weights as lengths; resistance distance, when thinking of the graph as an electrical network and the weights are resistances; and the inverse of minimum $st$-cut, when thinking of the weights as… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: MSc thesis of Lior Kalman at the Weizmann Institute

  16. arXiv:2111.12706  [pdf, ps, other

    cs.DS

    Gap Edit Distance via Non-Adaptive Queries: Simple and Optimal

    Authors: Elazar Goldenberg, Tomasz Kociumaka, Robert Krauthgamer, Barna Saha

    Abstract: We study the problem of approximating edit distance in sublinear time. This is formalized as the $(k,k^c)$-Gap Edit Distance problem, where the input is a pair of strings $X,Y$ and parameters $k,c>1$, and the goal is to return YES if $ED(X,Y)\leq k$, NO if $ED(X,Y)> k^c$, and an arbitrary answer when $k < ED(X,Y) \le k^c$. Recent years have witnessed significant interest in designing sublinear-tim… ▽ More

    Submitted 2 October, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: Accepted to FOCS 2022

  17. arXiv:2111.04958  [pdf, other

    cs.DS

    Breaking the Cubic Barrier for All-Pairs Max-Flow: Gomory-Hu Tree in Nearly Quadratic Time

    Authors: Amir Abboud, Robert Krauthgamer, Jason Li, Debmalya Panigrahi, Thatchaphol Saranurak, Ohad Trabelsi

    Abstract: In 1961, Gomory and Hu showed that the All-Pairs Max-Flow problem of computing the max-flow between all $n\choose 2$ pairs of vertices in an undirected graph can be solved using only $n-1$ calls to any (single-pair) max-flow algorithm. Even assuming a linear-time max-flow algorithm, this yields a running time of $O(mn)$, which is $O(n^3)$ when $m = Θ(n^2)$. While subsequent work has improved this… ▽ More

    Submitted 3 August, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

  18. arXiv:2110.15891  [pdf, ps, other

    cs.DS

    Friendly Cut Sparsifiers and Faster Gomory-Hu Trees

    Authors: Amir Abboud, Robert Krauthgamer, Ohad Trabelsi

    Abstract: We devise new cut sparsifiers that are related to the classical sparsification of Nagamochi and Ibaraki [Algorithmica, 1992], which is an algorithm that, given an unweighted graph $G$ on $n$ nodes and a parameter $k$, computes a subgraph with $O(nk)$ edges that preserves all cuts of value up to $k$. We put forward the notion of a friendly cut sparsifier, which is a minor of $G$ that preserves all… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

  19. arXiv:2110.02898  [pdf, other

    cs.DS

    Coresets for Kernel Clustering

    Authors: Shaofeng H. -C. Jiang, Robert Krauthgamer, Jianing Lou, Yubo Zhang

    Abstract: We devise coresets for kernel $k$-Means with a general kernel, and use them to obtain new, more efficient, algorithms. Kernel $k$-Means has superior clustering capability compared to classical $k$-Means, particularly when clusters are non-linearly separable, but it also introduces significant computational challenges. We address this computational issue by constructing a coreset, which is a reduce… ▽ More

    Submitted 6 April, 2024; v1 submitted 6 October, 2021; originally announced October 2021.

  20. arXiv:2107.09497  [pdf, other

    cs.DS

    Approximate Trace Reconstruction via Median String (in Average-Case)

    Authors: Diptarka Chakraborty, Debarati Das, Robert Krauthgamer

    Abstract: We consider an \emph{approximate} version of the trace reconstruction problem, where the goal is to recover an unknown string $s\in\{0,1\}^n$ from $m$ traces (each trace is generated independently by passing $s$ through a probabilistic insertion-deletion channel with rate $p$). We present a deterministic near-linear time algorithm for the average-case model, where $s$ is random, that uses only \em… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

  21. arXiv:2106.16112  [pdf, other

    cs.DS

    Coresets for Clustering with Missing Values

    Authors: Vladimir Braverman, Shaofeng H. -C. Jiang, Robert Krauthgamer, Xuan Wu

    Abstract: We provide the first coreset for clustering points in $\mathbb{R}^d$ that have multiple missing values (coordinates). Previous coreset constructions only allow one missing coordinate. The challenge in this setting is that objective functions, like $k$-Means, are evaluated only on the set of available (non-missing) coordinates, which varies across points. Recall that an $ε$-coreset of a large datas… ▽ More

    Submitted 11 November, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

  22. arXiv:2106.02981  [pdf, other

    cs.DS

    APMF < APSP? Gomory-Hu Tree for Unweighted Graphs in Almost-Quadratic Time

    Authors: Amir Abboud, Robert Krauthgamer, Ohad Trabelsi

    Abstract: We design an $n^{2+o(1)}$-time algorithm that constructs a cut-equivalent (Gomory-Hu) tree of a simple graph on $n$ nodes. This bound is almost-optimal in terms of $n$, and it improves on the recent $\tilde{O}(n^{2.5})$ bound by the authors (STOC 2021), which was the first to break the cubic barrier. Consequently, the All-Pairs Maximum-Flow (APMF) problem has time complexity $n^{2+o(1)}$, and for… ▽ More

    Submitted 5 June, 2021; originally announced June 2021.

  23. arXiv:2106.02353  [pdf, ps, other

    cs.DS

    Spectral Hypergraph Sparsifiers of Nearly Linear Size

    Authors: Michael Kapralov, Robert Krauthgamer, Jakab Tardos, Yuichi Yoshida

    Abstract: Graph sparsification has been studied extensively over the past two decades, culminating in spectral sparsifiers of optimal size (up to constant factors). Spectral hypergraph sparsification is a natural analogue of this problem, for which optimal bounds on the sparsifier size are not known, mainly because the hypergraph Laplacian is non-linear, and thus lacks the linear-algebraic structure and too… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  24. arXiv:2103.08211  [pdf, ps, other

    cs.DS

    Smoothness of Schatten Norms and Sliding-Window Matrix Streams

    Authors: Robert Krauthgamer, Shay Sapir

    Abstract: Large matrices are often accessed as a row-order stream. We consider the setting where rows are time-sensitive (i.e. they expire), which can be described by the sliding-window row-order model, and provide the first $(1+ε)$-approximation of Schatten $p$-norms in this setting. Our main technical contribution is a proof that Schatten $p$-norms in row-order streams are smooth, and thus fit the smooth-… ▽ More

    Submitted 23 November, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

    Comments: 10 pages, 1 table

  25. arXiv:2102.03060  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Distributed Sparse Normal Means Estimation with Sublinear Communication

    Authors: Chen Amiraz, Robert Krauthgamer, Boaz Nadler

    Abstract: We consider the problem of sparse normal means estimation in a distributed setting with communication constraints. We assume there are $M$ machines, each holding $d$-dimensional observations of a $K$-sparse vector $μ$ corrupted by additive Gaussian noise. The $M$ machines are connected in a star topology to a fusion center, whose goal is to estimate the vector $μ$ with a low communication budget.… ▽ More

    Submitted 14 February, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: 36 pages, 2 figures

  26. arXiv:2012.10281  [pdf, other

    cs.DS

    Subcubic Algorithms for Gomory-Hu Tree in Unweighted Graphs

    Authors: Amir Abboud, Robert Krauthgamer, Ohad Trabelsi

    Abstract: Every undirected graph $G$ has a (weighted) cut-equivalent tree $T$, commonly named after Gomory and Hu who discovered it in 1961. Both $T$ and $G$ have the same node set, and for every node pair $s,t$, the minimum $(s,t)$-cut in $T$ is also an exact minimum $(s,t)$-cut in $G$. We give the first subcubic-time algorithm that constructs such a tree for a simple graph $G$ (unweighted with no parall… ▽ More

    Submitted 14 April, 2021; v1 submitted 18 December, 2020; originally announced December 2020.

  27. arXiv:2011.06530  [pdf, ps, other

    cs.DS

    Towards Tight Bounds for Spectral Sparsification of Hypergraphs

    Authors: Michael Kapralov, Robert Krauthgamer, Jakab Tardos, Yuichi Yoshida

    Abstract: Cut and spectral sparsification of graphs have numerous applications, including e.g. speeding up algorithms for cuts and Laplacian solvers. These powerful notions have recently been extended to hypergraphs, which are much richer and may offer new applications. However, the current bounds on the size of hypergraph sparsifiers are not as tight as the corresponding bounds for graphs. Our first resu… ▽ More

    Submitted 12 April, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

  28. arXiv:2011.04324  [pdf, other

    cs.DS

    Streaming Algorithms for Geometric Steiner Forest

    Authors: Artur Czumaj, Shaofeng H. -C. Jiang, Robert Krauthgamer, Pavel Veselý

    Abstract: We consider an important generalization of the Steiner tree problem, the \emph{Steiner forest problem}, in the Euclidean plane: the input is a multiset $X \subseteq \mathbb{R}^2$, partitioned into $k$ color classes $C_1, C_2, \ldots, C_k \subseteq X$. The goal is to find a minimum-cost Euclidean graph $G$ such that every color class $C_i$ is connected in $G$. We study this Steiner forest problem i… ▽ More

    Submitted 10 May, 2024; v1 submitted 9 November, 2020; originally announced November 2020.

  29. arXiv:2011.01777  [pdf, ps, other

    cs.DS

    Near-Optimal Entrywise Sampling of Numerically Sparse Matrices

    Authors: Vladimir Braverman, Robert Krauthgamer, Aditya Krishnan, Shay Sapir

    Abstract: Many real-world data sets are sparse or almost sparse. One method to measure this for a matrix $A\in \mathbb{R}^{n\times n}$ is the \emph{numerical sparsity}, denoted $\mathsf{ns}(A)$, defined as the minimum $k\geq 1$ such that $\|a\|_1/\|a\|_2 \leq \sqrt{k}$ for every row and every column $a$ of $A$. This measure of $a$ is smooth and is clearly only smaller than the number of non-zeros in the row… ▽ More

    Submitted 5 July, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: 20 pages. To appear in COLT 2021

  30. arXiv:2011.00868  [pdf, ps, other

    cs.DS

    Approximating the Median under the Ulam Metric

    Authors: Diptarka Chakraborty, Debarati Das, Robert Krauthgamer

    Abstract: We study approximation algorithms for variants of the \emph{median string} problem, which asks for a string that minimizes the sum of edit distances from a given set of $m$ strings of length $n$. Only the straightforward $2$-approximation is known for this NP-hard problem. This problem is motivated e.g.~by computational biology, and belongs to the class of median problems (over different metric sp… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    MSC Class: 68Q25; 68W25; 68W32; 68W40 ACM Class: F.2.2

  31. arXiv:2009.06090  [pdf, other

    cs.DS

    Cut-Equivalent Trees are Optimal for Min-Cut Queries

    Authors: Amir Abboud, Robert Krauthgamer, Ohad Trabelsi

    Abstract: Min-Cut queries are fundamental: Preprocess an undirected edge-weighted graph, to quickly report a minimum-weight cut that separates a query pair of nodes $s,t$. The best data structure known for this problem simply builds a cut-equivalent tree, discovered 60 years ago by Gomory and Hu, who also showed how to construct it using $n-1$ minimum $st$-cut computations. Using state-of-the-art algorithms… ▽ More

    Submitted 13 September, 2020; originally announced September 2020.

  32. arXiv:2004.07718  [pdf, ps, other

    cs.DS

    Coresets for Clustering in Excluded-minor Graphs and Beyond

    Authors: Vladimir Braverman, Shaofeng H. -C. Jiang, Robert Krauthgamer, Xuan Wu

    Abstract: Coresets are modern data-reduction tools that are widely used in data analysis to improve efficiency in terms of running time, space and communication complexity. Our main result is a fast algorithm to construct a small coreset for k-Median in (the shortest-path metric of) an excluded-minor graph. Specifically, we give the first coreset of size that depends only on $k$, $ε$ and the excluded-minor… ▽ More

    Submitted 15 July, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

  33. arXiv:2002.07727  [pdf, other

    cs.DS cs.CG

    Faster Algorithms for Orienteering and $k$-TSP

    Authors: Lee-Ad Gottlieb, Robert Krauthgamer, Havana Rika

    Abstract: We consider the rooted orienteering problem in Euclidean space: Given $n$ points $P$ in $\mathbb R^d$, a root point $s\in P$ and a budget $\mathcal B>0$, find a path that starts from $s$, has total length at most $\mathcal B$, and visits as many points of $P$ as possible. This problem is known to be NP-hard, hence we study $(1-δ)$-approximation algorithms. The previous Polynomial-Time Approximatio… ▽ More

    Submitted 21 April, 2022; v1 submitted 18 February, 2020; originally announced February 2020.

  34. arXiv:1910.00901  [pdf, ps, other

    cs.CC cs.DS

    Sublinear Algorithms for Gap Edit Distance

    Authors: Elazar Goldenberg, Robert Krauthgamer, Barna Saha

    Abstract: The edit distance is a way of quantifying how similar two strings are to one another by counting the minimum number of character insertions, deletions, and substitutions required to transform one string into the other. A simple dynamic programming computes the edit distance between two strings of length $n$ in $O(n^2)$ time, and a more sophisticated algorithm runs in time $O(n+t^2)$ when the edit… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

  35. arXiv:1907.06857  [pdf, other

    cs.DS cs.CG

    Labelings vs. Embeddings: On Distributed Representations of Distances

    Authors: Arnold Filtser, Lee-Ad Gottlieb, Robert Krauthgamer

    Abstract: We investigate for which metric spaces the performance of distance labeling and of $\ell_\infty$-embeddings differ, and how significant can this difference be. Recall that a distance labeling is a distributed representation of distances in a metric space $(X,d)$, where each point $x\in X$ is assigned a succinct label, such that the distance between any two points $x,y \in X$ can be approximated gi… ▽ More

    Submitted 20 September, 2023; v1 submitted 16 July, 2019; originally announced July 2019.

  36. arXiv:1907.05457  [pdf, ps, other

    cs.DS

    Schatten Norms in Matrix Streams: Hello Sparsity, Goodbye Dimension

    Authors: Vladimir Braverman, Robert Krauthgamer, Aditya Krishnan, Roi Sinoff

    Abstract: Spectral functions of large matrices contains important structural information about the underlying data, and is thus becoming increasingly important. Many times, large matrices representing real-world data are \emph{sparse} or \emph{doubly sparse} (i.e., sparse in both rows and columns), and are accessed as a \emph{stream} of updates, typically organized in \emph{row-order}. In this setting, wher… ▽ More

    Submitted 27 February, 2020; v1 submitted 11 July, 2019; originally announced July 2019.

    Comments: 39 pages

  37. arXiv:1907.04733  [pdf, other

    cs.DS

    Coresets for Clustering in Graphs of Bounded Treewidth

    Authors: Daniel Baker, Vladimir Braverman, Lingxiao Huang, Shaofeng H. -C. Jiang, Robert Krauthgamer, Xuan Wu

    Abstract: We initiate the study of coresets for clustering in graph metrics, i.e., the shortest-path metric of edge-weighted graphs. Such clustering problems are essential to data analysis and used for example in road networks and data visualization. A coreset is a compact summary of the data that approximately preserves the clustering objective for every possible center set, and it offers significant effic… ▽ More

    Submitted 12 December, 2022; v1 submitted 10 July, 2019; originally announced July 2019.

  38. arXiv:1904.07957  [pdf, ps, other

    cs.DS

    Almost-Smooth Histograms and Sliding-Window Graph Algorithms

    Authors: Robert Krauthgamer, David Reitblat

    Abstract: We study algorithms for the sliding-window model, an important variant of the data-stream model, in which the goal is to compute some function of a fixed-length suffix of the stream. We extend the smooth-histogram framework of Braverman and Ostrovsky (FOCS 2007) to almost-smooth functions, which includes all subadditive functions. Specifically, we show that if a subadditive function can be… ▽ More

    Submitted 24 May, 2022; v1 submitted 16 April, 2019; originally announced April 2019.

  39. arXiv:1903.04351  [pdf, other

    cs.DS

    Coresets for Ordered Weighted Clustering

    Authors: Vladimir Braverman, Shaofeng H. -C. Jiang, Robert Krauthgamer, Xuan Wu

    Abstract: We design coresets for Ordered k-Median, a generalization of classical clustering problems such as k-Median and k-Center, that offers a more flexible data analysis, like easily combining multiple objectives (e.g., to increase fairness or for Pareto optimization). Its objective function is defined via the Ordered Weighted Averaging (OWA) paradigm of Yager (1988), where data points are weighted acco… ▽ More

    Submitted 11 March, 2019; originally announced March 2019.

    Comments: 23 pages, 3 figures, 2 tables

  40. arXiv:1901.01412  [pdf, other

    cs.DS

    New Algorithms and Lower Bounds for All-Pairs Max-Flow in Undirected Graphs

    Authors: Amir Abboud, Robert Krauthgamer, Ohad Trabelsi

    Abstract: We investigate the time-complexity of the All-Pairs Max-Flow problem: Given a graph with $n$ nodes and $m$ edges, compute for all pairs of nodes the maximum-flow value between them. If Max-Flow (the version with a given source-sink pair $s,t$) can be solved in time $T(m)$, then an $O(n^2) \cdot T(m)$ is a trivial upper bound. But can we do better? For directed graphs, recent results in fine-grai… ▽ More

    Submitted 9 July, 2019; v1 submitted 5 January, 2019; originally announced January 2019.

  41. arXiv:1812.00241  [pdf, ps, other

    cs.DS

    Universal Streaming of Subset Norms

    Authors: Vladimir Braverman, Robert Krauthgamer, Lin F. Yang

    Abstract: Most known algorithms in the streaming model of computation aim to approximate a single function such as an $\ell_p$-norm. In 2009, Nelson [\url{https://sublinear.info}, Open Problem 30] asked if it possible to design \emph{universal algorithms}, that simultaneously approximate multiple functions of the stream. In this paper we answer the question of Nelson for the class of \emph{subset $\ell_0$-n… ▽ More

    Submitted 4 April, 2020; v1 submitted 1 December, 2018; originally announced December 2018.

  42. arXiv:1811.02685  [pdf, other

    cs.DS math.MG

    Flow-Cut Gaps and Face Covers in Planar Graphs

    Authors: Robert Krauthgamer, James R. Lee, Havana Rika

    Abstract: The relationship between the sparsest cut and the maximum concurrent multi-flow in graphs has been studied extensively. For general graphs with $k$ terminal pairs, the flow-cut gap is $O(\log k)$, and this is tight. But when topological restrictions are placed on the flow network, the situation is far less clear. In particular, it has been conjectured that the flow-cut gap in planar networks is… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

  43. arXiv:1809.02995  [pdf, ps, other

    cs.DS

    On Solving Linear Systems in Sublinear Time

    Authors: Alexandr Andoni, Robert Krauthgamer, Yosef Pogrow

    Abstract: We study \emph{sublinear} algorithms that solve linear systems locally. In the classical version of this problem the input is a matrix $S\in \mathbb{R}^{n\times n}$ and a vector $b\in\mathbb{R}^n$ in the range of $S$, and the goal is to output $x\in \mathbb{R}^n$ satisfying $Sx=b$. For the case when the matrix $S$ is symmetric diagonally dominant (SDD), the breakthrough algorithm of Spielman and T… ▽ More

    Submitted 9 September, 2018; originally announced September 2018.

  44. arXiv:1809.00942  [pdf, other

    cs.DS

    Relaxed Voronoi: a Simple Framework for Terminal-Clustering Problems

    Authors: Arnold Filtser, Robert Krauthgamer, Ohad Trabelsi

    Abstract: We reprove three known algorithmic bounds for terminal-clustering problems, using a single framework that leads to simpler proofs. In this genre of problems, the input is a metric space $(X,d)$ (possibly arising from a graph) and a subset of terminals $K\subset X$, and the goal is to partition the points $X$ such that each part, called a cluster, contains exactly one terminal (possibly with connec… ▽ More

    Submitted 6 November, 2018; v1 submitted 4 September, 2018; originally announced September 2018.

  45. arXiv:1807.08478  [pdf, ps, other

    cs.DS

    Batch Sparse Recovery, or How to Leverage the Average Sparsity

    Authors: Alexandr Andoni, Lior Kamma, Robert Krauthgamer, Eric Price

    Abstract: We introduce a \emph{batch} version of sparse recovery, where the goal is to report a sequence of vectors $A_1',\ldots,A_m' \in \mathbb{R}^n$ that estimate unknown signals $A_1,\ldots,A_m \in \mathbb{R}^n$ using a few linear measurements, each involving exactly one signal vector, under an assumption of \emph{average sparsity}. More precisely, we want to have \newline… ▽ More

    Submitted 23 July, 2018; originally announced July 2018.

  46. arXiv:1807.05803  [pdf, other

    cs.DS

    Faster Algorithms for All-Pairs Bounded Min-Cuts

    Authors: Amir Abboud, Loukas Georgiadis, Giuseppe F. Italiano, Robert Krauthgamer, Nikos Parotsidis, Ohad Trabelsi, Przemysław Uznański, Daniel Wolleb-Graf

    Abstract: The All-Pairs Min-Cut problem (aka All-Pairs Max-Flow) asks to compute a minimum $s$-$t$ cut (or just its value) for all pairs of vertices $s,t$. We study this problem in directed graphs with unit edge/vertex capacities (corresponding to edge/vertex connectivity). Our focus is on the $k$-bounded case, where the algorithm has to find all pairs with min-cut value less than $k$, and report only those… ▽ More

    Submitted 21 February, 2019; v1 submitted 16 July, 2018; originally announced July 2018.

  47. arXiv:1711.08041  [pdf, other

    cs.DS

    The Set Cover Conjecture and Subgraph Isomorphism with a Tree Pattern

    Authors: Robert Krauthgamer, Ohad Trabelsi

    Abstract: In the Set Cover problem, the input is a ground set of $n$ elements and a collection of $m$ sets, and the goal is to find the smallest sub-collection of sets whose union is the entire ground set. The fastest algorithm known runs in time $O(mn2^n)$ [Fomin et al., WG 2004], and the Set Cover Conjecture (SeCoCo) [Cygan et al., TALG 2016] asserts that for every fixed $\varepsilon>0$, no algorithm can… ▽ More

    Submitted 16 January, 2019; v1 submitted 21 November, 2017; originally announced November 2017.

    Comments: Merged works

  48. arXiv:1708.07591  [pdf, other

    cs.DS

    Conditional Lower Bound for Subgraph Isomorphism with a Tree Pattern

    Authors: Robert Krauthgamer, Ohad Trabelsi

    Abstract: The kTree problem is a special case of Subgraph Isomorphism where the pattern graph is a tree, that is, the input is an $n$-node graph $G$ and a $k$-node tree $T$, and the goal is to determine whether $G$ has a subgraph isomorphic to $T$. We provide evidence that this problem cannot be computed significantly faster than $2^{k} \textsf{poly}(n)$, which matches the fastest algorithm known for this p… ▽ More

    Submitted 9 April, 2018; v1 submitted 24 August, 2017; originally announced August 2017.

    Comments: A merged work containing the results in this paper is available at arXiv:1711.08041

  49. arXiv:1702.05951  [pdf, other

    cs.DS

    Refined Vertex Sparsifiers of Planar Graphs

    Authors: Robert Krauthgamer, Havana, Rika

    Abstract: We study the following version of cut sparsification. Given a large edge-weighted network $G$ with $k$ terminal vertices, compress it into a smaller network $H$ with the same terminals, such that every minimum terminal cut in $H$ approximates the corresponding one in $G$, up to a factor $q\geq 1$ that is called the quality. (The case $q=1$ is known also as a mimicking network). We provide new insi… ▽ More

    Submitted 4 October, 2019; v1 submitted 20 February, 2017; originally announced February 2017.

  50. arXiv:1702.05805  [pdf, other

    cs.DS

    Conditional Lower Bounds for All-Pairs Max-Flow

    Authors: Robert Krauthgamer, Ohad Trabelsi

    Abstract: We provide evidence that computing the maximum flow value between every pair of nodes in a directed graph on $n$ nodes, $m$ edges,and capacities in the range $[1..n]$, which we call the All-Pairs Max-Flow problem, cannot be solved in time that is significantly faster (i.e., by a polynomial factor) than $O(n^3)$ even for sparse graphs. Since a single maximum $st$-flow can be solved in time… ▽ More

    Submitted 21 November, 2022; v1 submitted 19 February, 2017; originally announced February 2017.