Zum Hauptinhalt springen

Showing 1–16 of 16 results for author: Dahlgaard, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:1711.08797  [pdf, other

    stat.ML cs.DS cs.LG

    Practical Hash Functions for Similarity Estimation and Dimensionality Reduction

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Mikkel Thorup

    Abstract: Hashing is a basic tool for dimensionality reduction employed in several aspects of machine learning. However, the perfomance analysis is often carried out under the abstract assumption that a truly random unit cost hash function is used, without concern for which concrete hash function is employed. The concrete hash function may work fine on sufficiently random input. The question is if it can be… ▽ More

    Submitted 23 November, 2017; originally announced November 2017.

    Comments: Preliminary version of this paper will appear at NIPS 2017

  2. arXiv:1709.01960  [pdf, other

    cs.DS

    Constructing Light Spanners Deterministically in Near-Linear Time

    Authors: Stephen Alstrup, Søren Dahlgaard, Arnold Filtser, Morten Stöckel, Christian Wulff-Nilsen

    Abstract: Graph spanners are well-studied and widely used both in theory and practice. In a recent breakthrough, Chechik and Wulff-Nilsen [CW18] improved the state-of-the-art for light spanners by constructing a $(2k-1)(1+ε)$-spanner with $O(n^{1+1/k})$ edges and $O_ε(n^{1/k})$ lightness. Soon after, Filtser and Solomon [FS19] showed that the classic greedy spanner construction achieves the same bounds The… ▽ More

    Submitted 19 January, 2022; v1 submitted 6 September, 2017; originally announced September 2017.

  3. arXiv:1704.04370  [pdf, other

    cs.DS

    Fast Similarity Sketching

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Langhede, Jakob Bæk Tejs Houen, Mikkel Thorup

    Abstract: We consider the $\textit{Similarity Sketching}$ problem: Given a universe $[u] = \{0,\ldots, u-1\}$ we want a random function $S$ mapping subsets $A\subseteq [u]$ into vectors $S(A)$ of size $t$, such that the Jaccard similarity $J(A,B) = |A\cap B|/|A\cup B|$ between sets $A$ and $B$ is preserved. More precisely, define $X_i = [S(A)[i] = S(B)[i]]$ and $X = \sum_{i\in [t]} X_i$. We want… ▽ More

    Submitted 5 May, 2024; v1 submitted 14 April, 2017; originally announced April 2017.

    Comments: The original version was directly based on a conference paper of the same title from FOCS'17. This new version is substantially revised with some cleaner and stronger theorems, particularly concerning the high probability domain. Moreover, there is one more author, Jakob Houen. In addition, one of the old authors, Mathias, has changed surname from Knudsen to Langhede

  4. arXiv:1704.02178  [pdf, other

    cs.DS

    New Subquadratic Approximation Algorithms for the Girth

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Morten Stöckel

    Abstract: We consider the problem of approximating the girth, $g$, of an unweighted and undirected graph $G=(V,E)$ with $n$ nodes and $m$ edges. A seminal result of Itai and Rodeh [SICOMP'78] gave an additive $1$-approximation in $O(n^2)$ time, and the main open question is thus how well we can do in subquadratic time. In this paper we present two main results. The first is a $(1+\varepsilon,O(1))$-approx… ▽ More

    Submitted 7 April, 2017; originally announced April 2017.

  5. arXiv:1703.10380  [pdf, other

    cs.DS

    Finding Even Cycles Faster via Capped k-Walks

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Morten Stöckel

    Abstract: In this paper, we consider the problem of finding a cycle of length $2k$ (a $C_{2k}$) in an undirected graph $G$ with $n$ nodes and $m$ edges for constant $k\ge2$. A classic result by Bondy and Simonovits [J.Comb.Th.'74] implies that if $m \ge100k n^{1+1/k}$, then $G$ contains a $C_{2k}$, further implying that one needs to consider only graphs with $m = O(n^{1+1/k})$. Previously the best known a… ▽ More

    Submitted 30 March, 2017; originally announced March 2017.

    Comments: To appear at STOC'17

  6. arXiv:1702.03259  [pdf, other

    cs.DS

    Fast and Compact Exact Distance Oracle for Planar Graphs

    Authors: Vincent Cohen-Addad, Søren Dahlgaard, Christian Wulff-Nilsen

    Abstract: For a given a graph, a distance oracle is a data structure that answers distance queries between pairs of vertices. We introduce an $O(n^{5/3})$-space distance oracle which answers exact distance queries in $O(\log n)$ time for $n$-vertex planar edge-weighted digraphs. All previous distance oracles for planar graphs with truly subquadratic space i.e., space $O(n^{2 - ε})$ for some constant $ε> 0$)… ▽ More

    Submitted 2 May, 2017; v1 submitted 10 February, 2017; originally announced February 2017.

  7. arXiv:1609.08403  [pdf, other

    cs.DS cs.CC

    Tight Hardness Results for Distance and Centrality Problems in Constant Degree Graphs

    Authors: Søren Dahlgaard, Jacob Evald

    Abstract: Finding important nodes in a graph and measuring their importance is a fundamental problem in the analysis of social networks, transportation networks, biological systems, etc. Among popular such metrics are graph centrality, betweenness centrality (BC), and reach centrality (RC). These measures are also very related to classic notions like diameter and radius. Roditty and Vassilevska Williams~[ST… ▽ More

    Submitted 20 April, 2017; v1 submitted 27 September, 2016; originally announced September 2016.

    Comments: 14 pages, 4 figures, 2 tables

  8. arXiv:1605.03797  [pdf, other

    cs.DS cs.CC

    Popular Conjectures as a Barrier for Dynamic Planar Graph Algorithms

    Authors: Amir Abboud, Søren Dahlgaard

    Abstract: The dynamic shortest paths problem on planar graphs asks us to preprocess a planar graph $G$ such that we may support insertions and deletions of edges in $G$ as well as distance queries between any two nodes $u,v$ subject to the constraint that the graph remains planar at all times. This problem has been extensively studied in both the theory and experimental communities over the past decades and… ▽ More

    Submitted 12 May, 2016; originally announced May 2016.

    Comments: 20 pages, 4 figures. Abstract has been truncated to fit arXiv limits

  9. arXiv:1602.06705  [pdf, other

    cs.DS

    On the Hardness of Partially Dynamic Graph Problems and Connections to Diameter

    Authors: Søren Dahlgaard

    Abstract: Conditional lower bounds for dynamic graph problems has received a great deal of attention in recent years. While many results are now known for the fully-dynamic case and such bounds often imply worst-case bounds for the partially dynamic setting, it seems much more difficult to prove amortized bounds for incremental and decremental algorithms. In this paper we consider partially dynamic versions… ▽ More

    Submitted 3 May, 2016; v1 submitted 22 February, 2016; originally announced February 2016.

    Comments: To appear at ICALP'16. Abstract truncated to fit arXiv limits

  10. arXiv:1507.02618  [pdf, other

    cs.DS

    Sublinear Distance Labeling

    Authors: Stephen Alstrup, Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Ely Porat

    Abstract: A distance labeling scheme labels the $n$ nodes of a graph with binary strings such that, given the labels of any two nodes, one can determine the distance in the graph between the two nodes by looking only at the labels. A $D$-preserving distance labeling scheme only returns precise distances between pairs of nodes that are at distance at least $D$ from each other. In this paper we consider dista… ▽ More

    Submitted 8 September, 2016; v1 submitted 9 July, 2015; originally announced July 2015.

    Comments: A preliminary version of this paper appeared at ESA'16

  11. arXiv:1504.02306  [pdf, other

    cs.DS

    Optimal induced universal graphs and adjacency labeling for trees

    Authors: Stephen Alstrup, Søren Dahlgaard, Mathias Bæk Tejs Knudsen

    Abstract: We show that there exists a graph $G$ with $O(n)$ nodes, where any forest of $n$ nodes is a node-induced subgraph of $G$. Furthermore, for constant arboricity $k$, the result implies the existence of a graph with $O(n^k)$ nodes that contains all $n$-node graphs as node-induced subgraphs, matching a $Ω(n^k)$ lower bound. The lower bound and previously best upper bounds were presented in Alstrup and… ▽ More

    Submitted 15 February, 2016; v1 submitted 9 April, 2015; originally announced April 2015.

    Comments: A preliminary version of this paper appeared at FOCS'15

  12. arXiv:1411.7191  [pdf, ps, other

    cs.DS

    Hashing for statistics over k-partitions

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Eva Rotenberg, Mikkel Thorup

    Abstract: In this paper we analyze a hash function for $k$-partitioning a set into bins, obtaining strong concentration bounds for standard algorithms combining statistics from each bin. This generic method was originally introduced by Flajolet and Martin~[FOCS'83] in order to save a factor $Ω(k)$ of time per element over $k$ independent samples when estimating the number of distinct elements in a data st… ▽ More

    Submitted 15 February, 2016; v1 submitted 26 November, 2014; originally announced November 2014.

    Comments: Appear at FOCS'15

  13. arXiv:1407.6846  [pdf, other

    cs.DS

    The Power of Two Choices with Simple Tabulation

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Eva Rotenberg, Mikkel Thorup

    Abstract: The power of two choices is a classic paradigm for load balancing when assigning $m$ balls to $n$ bins. When placing a ball, we pick two bins according to two hash functions $h_0$ and $h_1$, and place the ball in the least loaded bin. Assuming fully random hash functions, when $m=O(n)$, Azar et al.~[STOC'94] proved that the maximum load is $\lg \lg n + O(1)$ with high probability. In this paper,… ▽ More

    Submitted 25 January, 2016; v1 submitted 25 July, 2014; originally announced July 2014.

    Comments: SODA'16

  14. arXiv:1407.5011  [pdf, other

    cs.DS

    A simple and optimal ancestry labeling scheme for trees

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Noy Rotbart

    Abstract: We present a $\lg n + 2 \lg \lg n+3$ ancestry labeling scheme for trees. The problem was first presented by Kannan et al. [STOC 88'] along with a simple $2 \lg n$ solution. Motivated by applications to XML files, the label size was improved incrementally over the course of more than 20 years by a series of papers. The last, due to Fraigniaud and Korman [STOC 10'], presented an asymptotically optim… ▽ More

    Submitted 26 April, 2015; v1 submitted 18 July, 2014; originally announced July 2014.

    Comments: 12 pages, 1 figure. To appear at ICALP'15

  15. arXiv:1404.6724  [pdf, other

    cs.DS

    Approximately Minwise Independence with Twisted Tabulation

    Authors: Søren Dahlgaard, Mikkel Thorup

    Abstract: A random hash function $h$ is $\varepsilon$-minwise if for any set $S$, $|S|=n$, and element $x\in S$, $\Pr[h(x)=\min h(S)]=(1\pm\varepsilon)/n$. Minwise hash functions with low bias $\varepsilon$ have widespread applications within similarity estimation. Hashing from a universe $[u]$, the twisted tabulation hashing of Pǎtraşcu and Thorup [SODA'13] makes $c=O(1)$ lookups in tables of size… ▽ More

    Submitted 1 May, 2014; v1 submitted 27 April, 2014; originally announced April 2014.

    Comments: To appear in Proceedings of SWAT 2014

  16. arXiv:1404.4982  [pdf, other

    cs.DS cs.DC

    Dynamic and Multi-functional Labeling Schemes

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Noy Rotbart

    Abstract: We investigate labeling schemes supporting adjacency, ancestry, sibling, and connectivity queries in forests. In the course of more than 20 years, the existence of $\log n + O(\log \log)$ labeling schemes supporting each of these functions was proven, with the most recent being ancestry [Fraigniaud and Korman, STOC '10]. Several multi-functional labeling schemes also enjoy lower or upper bounds of… ▽ More

    Submitted 19 April, 2014; originally announced April 2014.

    Comments: 17 pages, 5 figures