Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Fahrbach, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18754  [pdf, other

    cs.DS cs.LG

    GIST: Greedy Independent Set Thresholding for Diverse Data Summarization

    Authors: Matthew Fahrbach, Srikumar Ramalingam, Morteza Zadimoghaddam, Sara Ahmadian, Gui Citovsky, Giulia DeSalvo

    Abstract: We propose a novel subset selection task called min-distance diverse data summarization ($\textsf{MDDS}$), which has a wide variety of applications in machine learning, e.g., data sampling and feature selection. Given a set of points in a metric space, the goal is to maximize an objective that combines the total utility of the points and a diversity term that captures the minimum distance between… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 15 pages, 1 figure

  2. arXiv:2402.04987  [pdf, other

    cs.LG cs.DS

    PriorBoost: An Adaptive Algorithm for Learning from Aggregate Responses

    Authors: Adel Javanmard, Matthew Fahrbach, Vahab Mirrokni

    Abstract: This work studies algorithms for learning from aggregate responses. We focus on the construction of aggregation sets (called bags in the literature) for event-level loss functions. We prove for linear regression and generalized linear models (GLMs) that the optimal bagging problem reduces to one-dimensional size-constrained $k$-means clustering. Further, we theoretically quantify the advantage of… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 29 pages, 4 figures

  3. arXiv:2311.06192  [pdf, other

    cs.LG cs.AI stat.ML

    Greedy PIG: Adaptive Integrated Gradients

    Authors: Kyriakos Axiotis, Sami Abu-al-haija, Lin Chen, Matthew Fahrbach, Gang Fu

    Abstract: Deep learning has become the standard approach for most machine learning tasks. While its impact is undeniable, interpreting the predictions of deep learning models from a human perspective remains a challenge. In contrast to model training, model interpretability is harder to quantify and pose as an explicit optimization problem. Inspired by the AUC softmax information curve (AUC SIC) metric for… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  4. arXiv:2311.03703  [pdf, other

    cs.LG cs.DC

    Practical Performance Guarantees for Pipelined DNN Inference

    Authors: Aaron Archer, Matthew Fahrbach, Kuikui Liu, Prakash Prabhu

    Abstract: We optimize pipeline parallelism for deep neural network (DNN) inference by partitioning model graphs into $k$ stages and minimizing the running time of the bottleneck stage, including communication. We give practical and effective algorithms for this NP-hard problem, but our emphasis is on tackling the practitioner's dilemma of deciding when a solution is good enough. To this end, we design novel… ▽ More

    Submitted 4 June, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: 17 pages, 5 figures

  5. arXiv:2305.12102  [pdf, other

    cs.LG cs.IR

    Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems

    Authors: Benjamin Coleman, Wang-Cheng Kang, Matthew Fahrbach, Ruoxi Wang, Lichan Hong, Ed H. Chi, Derek Zhiyuan Cheng

    Abstract: Learning high-quality feature embeddings efficiently and effectively is critical for the performance of web-scale machine learning systems. A typical model ingests hundreds of features with vocabularies on the order of millions to billions of tokens. The standard approach is to represent each feature value as a d-dimensional embedding, introducing hundreds of billions of parameters for extremely h… ▽ More

    Submitted 14 November, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: NeurIPS'23 Spotlight

    Journal ref: Proceedings of the 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023) 56234-56255

  6. arXiv:2303.15634  [pdf, other

    cs.LG math.OC stat.ML

    Learning Rate Schedules in the Presence of Distribution Shift

    Authors: Matthew Fahrbach, Adel Javanmard, Vahab Mirrokni, Pratik Worah

    Abstract: We design learning rate schedules that minimize regret for SGD-based online learning in the presence of a changing data distribution. We fully characterize the optimal learning rate schedule for online linear regression via a novel analysis with stochastic differential equations. For general convex loss functions, we propose new learning rate schedules that are robust to distribution shift and we… ▽ More

    Submitted 20 August, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 33 pages, 6 figures

    Journal ref: Proceedings of the 40th International Conference on Machine Learning (ICML 2023) 9523-9546

  7. arXiv:2302.03886  [pdf, other

    cs.DS cs.LG math.CO

    Approximately Optimal Core Shapes for Tensor Decompositions

    Authors: Mehrdad Ghadiri, Matthew Fahrbach, Gang Fu, Vahab Mirrokni

    Abstract: This work studies the combinatorial optimization problem of finding an optimal core tensor shape, also called multilinear rank, for a size-constrained Tucker decomposition. We give an algorithm with provable approximation guarantees for its reconstruction error via connections to higher-order singular values. Specifically, we introduce a novel Tucker packing problem, which we prove is NP-hard, and… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: 18 pages, 4 figures

    Journal ref: Proceedings of the 40th International Conference on Machine Learning (ICML 2023) 11237-11254

  8. arXiv:2209.14881  [pdf, other

    cs.LG stat.ML

    Sequential Attention for Feature Selection

    Authors: Taisuke Yasuda, MohammadHossein Bateni, Lin Chen, Matthew Fahrbach, Gang Fu, Vahab Mirrokni

    Abstract: Feature selection is the problem of selecting a subset of features for a machine learning model that maximizes model quality subject to a budget constraint. For neural networks, prior methods, including those based on $\ell_1$ regularization, attention, and other techniques, typically select the entire feature subset in one evaluation round, ignoring the residual value of features during selection… ▽ More

    Submitted 25 April, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted to ICLR 2023

    Journal ref: Proceedings of the 11th International Conference on Learning Representations (ICLR 2023)

  9. arXiv:2209.04876  [pdf, other

    cs.DS cs.LG math.ST

    Subquadratic Kronecker Regression with Applications to Tensor Decomposition

    Authors: Matthew Fahrbach, Thomas Fu, Mehrdad Ghadiri

    Abstract: Kronecker regression is a highly-structured least squares problem $\min_{\mathbf{x}} \lVert \mathbf{K}\mathbf{x} - \mathbf{b} \rVert_{2}^2$, where the design matrix $\mathbf{K} = \mathbf{A}^{(1)} \otimes \cdots \otimes \mathbf{A}^{(N)}$ is a Kronecker product of factor matrices. This regression problem arises in each step of the widely-used alternating least squares (ALS) algorithm for computing t… ▽ More

    Submitted 12 May, 2023; v1 submitted 11 September, 2022; originally announced September 2022.

    Comments: 36 pages, 1 figure, 12 tables. arXiv admin note: text overlap with arXiv:2107.10654

    MSC Class: 62J05; 62J07; 65F10 ACM Class: F.2.1; G.1.3; G.1.6

    Journal ref: Advances in Neural Information Processing Systems 35 (2022): 28776-28789

  10. arXiv:2107.10654  [pdf, other

    cs.DS cs.LG math.ST

    Fast Low-Rank Tensor Decomposition by Ridge Leverage Score Sampling

    Authors: Matthew Fahrbach, Mehrdad Ghadiri, Thomas Fu

    Abstract: Low-rank tensor decomposition generalizes low-rank matrix approximation and is a powerful technique for discovering low-dimensional structure in high-dimensional data. In this paper, we study Tucker decompositions and use tools from randomized numerical linear algebra called ridge leverage scores to accelerate the core tensor update step in the widely-used alternating least squares (ALS) algorithm… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: 29 pages, 1 figure

  11. arXiv:2007.02817  [pdf, other

    cs.LG cs.DS stat.ML

    Faster Graph Embeddings via Coarsening

    Authors: Matthew Fahrbach, Gramoz Goranci, Richard Peng, Sushant Sachdeva, Chi Wang

    Abstract: Graph embeddings are a ubiquitous tool for machine learning tasks, such as node classification and link prediction, on graph-structured data. However, computing the embeddings for large-scale graphs is prohibitively inefficient even if we are interested only in a small subset of relevant vertices. To address this, we present an efficient graph coarsening approach, based on Schur complements, for c… ▽ More

    Submitted 22 October, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: 18 pages, 2 figures, to appear in the Proceedings of the 37th International Conference on Machine Learning (ICML 2020)

    Journal ref: Proceedings of the 37th International Conference on Machine Learning (ICML 2020) 2953-2963

  12. arXiv:2005.01929  [pdf, other

    cs.DS cs.GT

    Edge-Weighted Online Bipartite Matching

    Authors: Matthew Fahrbach, Zhiyi Huang, Runzhou Tao, Morteza Zadimoghaddam

    Abstract: Online bipartite matching and its variants are among the most fundamental problems in the online algorithms literature. Karp, Vazirani, and Vazirani (STOC 1990) introduced an elegant algorithm for the unweighted problem that achieves an optimal competitive ratio of $1-1/e$. Later, Aggarwal et al. (SODA 2011) generalized their algorithm and analysis to the vertex-weighted case. Little is known, how… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: 36 pages, 5 figures. This work merges and refines the results in arXiv:1704.05384, arXiv:1910.02569, and arXiv:1910.03287. In particular, we fix a bug in arXiv:1910.03287 and have a smaller competitive ratio as a result

    Journal ref: Journal of the ACM 69(6): 45:1-45:35 (2022)

  13. A Fast Minimum Degree Algorithm and Matching Lower Bound

    Authors: Robert Cummings, Matthew Fahrbach, Animesh Fatehpuria

    Abstract: The minimum degree algorithm is one of the most widely-used heuristics for reducing the cost of solving large sparse systems of linear equations. It has been studied for nearly half a century and has a rich history of bridging techniques from data structures, graph algorithms, and scientific computing. In this paper, we present a simple but novel combinatorial algorithm for computing an exact mini… ▽ More

    Submitted 22 July, 2020; v1 submitted 28 July, 2019; originally announced July 2019.

    Comments: 17 pages

    Journal ref: Proceedings of the 32nd Annual ACM-SIAM Symposium on Discrete Algorithms (2021) 724-734

  14. arXiv:1904.01495  [pdf, other

    cs.DS math-ph math.PR

    Slow Mixing of Glauber Dynamics for the Six-Vertex Model in the Ordered Phases

    Authors: Matthew Fahrbach, Dana Randall

    Abstract: The six-vertex model in statistical physics is a weighted generalization of the ice model on $\mathbb{Z}^2$ (i.e., Eulerian orientations) and the zero-temperature three-state Potts model (i.e., proper three-colorings). The phase diagram of the model depicts its physical properties and suggests where local Markov chains will be efficient. In this paper, we analyze the mixing time of Glauber dynamic… ▽ More

    Submitted 22 December, 2020; v1 submitted 2 April, 2019; originally announced April 2019.

    Comments: 28 pages, 6 figures, Proceedings of the 23rd International Conference on Randomization and Computation (RANDOM 2019)

  15. arXiv:1808.06932  [pdf, other

    cs.DS

    Non-monotone Submodular Maximization with Nearly Optimal Adaptivity and Query Complexity

    Authors: Matthew Fahrbach, Vahab Mirrokni, Morteza Zadimoghaddam

    Abstract: Submodular maximization is a general optimization problem with a wide range of applications in machine learning (e.g., active learning, clustering, and feature selection). In large-scale optimization, the parallel running time of an algorithm is governed by its adaptivity, which measures the number of sequential rounds needed if the algorithm can execute polynomially-many independent oracle querie… ▽ More

    Submitted 7 April, 2023; v1 submitted 19 August, 2018; originally announced August 2018.

    Comments: 19 pages, 8 figures. This version fixes a bug in the threshold sampling algorithm that implicitly assumed monotonicity. All original results hold

    Journal ref: Proceedings of the 36th International Conference on Machine Learning (ICML 2019) 1833-1842

  16. Submodular Maximization with Nearly Optimal Approximation, Adaptivity and Query Complexity

    Authors: Matthew Fahrbach, Vahab Mirrokni, Morteza Zadimoghaddam

    Abstract: Submodular optimization generalizes many classic problems in combinatorial optimization and has recently found a wide range of applications in machine learning (e.g., feature engineering and active learning). For many large-scale optimization problems, we are often concerned with the adaptivity complexity of an algorithm, which quantifies the number of sequential rounds where polynomially-many ind… ▽ More

    Submitted 7 April, 2023; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: 30 pages. This version fixes minor bugs with the definition of $I_t$ and the termination condition of Algorithm 5. We also update all theorem statements to explicitly assume monotone submodular functions

    Journal ref: Proceedings of the 30th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2019) 255-273

  17. Graph Sketching Against Adaptive Adversaries Applied to the Minimum Degree Algorithm

    Authors: Matthew Fahrbach, Gary L. Miller, Richard Peng, Saurabh Sawlani, Junxing Wang, Shen Chen Xu

    Abstract: Motivated by the study of matrix elimination orderings in combinatorial scientific computing, we utilize graph sketching and local sampling to give a data structure that provides access to approximate fill degrees of a matrix undergoing elimination in $O(\text{polylog}(n))$ time per elimination and query. We then study the problem of using this data structure in the minimum degree algorithm, which… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: 58 pages, 3 figures. This is a substantially revised version of arXiv:1711.08446 with an emphasis on the underlying theoretical problems

    Journal ref: Proceedings of the 59th Annual IEEE Symposium on Foundations of Computer Science (2018) 101-112

  18. arXiv:1711.08446  [pdf, ps, other

    cs.DS

    On Computing Min-Degree Elimination Orderings

    Authors: Matthew Fahrbach, Gary L. Miller, Richard Peng, Saurabh Sawlani, Junxing Wang, Shen Chen Xu

    Abstract: We study faster algorithms for producing the minimum degree ordering used to speed up Gaussian elimination. This ordering is based on viewing the non-zero elements of a symmetric positive definite matrix as edges of an undirected graph, and aims at reducing the additional non-zeros (fill) in the matrix by repeatedly removing the vertex of minimum degree. It is one of the most widely used primitive… ▽ More

    Submitted 22 November, 2017; originally announced November 2017.

    Comments: 57 pages

  19. Analyzing Boltzmann Samplers for Bose-Einstein Condensates with Dirichlet Generating Functions

    Authors: Megan Bernstein, Matthew Fahrbach, Dana Randall

    Abstract: Boltzmann sampling is commonly used to uniformly sample objects of a particular size from large combinatorial sets. For this technique to be effective, one needs to prove that (1) the sampling procedure is efficient and (2) objects of the desired size are generated with sufficiently high probability. We use this approach to give a provably efficient sampling algorithm for a class of weighted integ… ▽ More

    Submitted 13 November, 2017; v1 submitted 7 August, 2017; originally announced August 2017.

    Comments: 20 pages, 1 figure

    Journal ref: Proceedings of the 15th Workshop on Analytic Algorithmics and Combinatorics (ANALCO 2018) 107-117

  20. arXiv:1704.05384  [pdf, other

    cs.DS

    Online Weighted Matching: Breaking the $\frac{1}{2}$ Barrier

    Authors: Matthew Fahrbach, Morteza Zadimoghaddam

    Abstract: Online matching and its variants are some of the most fundamental problems in the online algorithms literature. In this paper, we study the online weighted bipartite matching problem. Karp et al. (STOC 1990) gave an elegant algorithm in the unweighted case that achieves a tight competitive ratio of $1-1/e$. In the weighted case, however, we can easily show that no competitive ratio is obtainable w… ▽ More

    Submitted 21 November, 2019; v1 submitted 18 April, 2017; originally announced April 2017.

    Comments: 28 pages, 1 figure. This is substantially revised version that simplifies the presentation and fixes some minor problems

  21. arXiv:1704.04830  [pdf, other

    cs.DS cs.DM

    Nearly Tight Bounds for Sandpile Transience on the Grid

    Authors: David Durfee, Matthew Fahrbach, Yu Gao, Tao Xiao

    Abstract: We use techniques from the theory of electrical networks to give nearly tight bounds for the transience class of the Abelian sandpile model on the two-dimensional grid up to polylogarithmic factors. The Abelian sandpile model is a discrete process on graphs that is intimately related to the phenomenon of self-organized criticality. In this process, vertices receive grains of sand, and once the num… ▽ More

    Submitted 14 November, 2017; v1 submitted 16 April, 2017; originally announced April 2017.

    Comments: 36 pages, 4 figures

    Journal ref: Proceedings of the 29th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2018) 605-624

  22. Approximately Sampling Elements with Fixed Rank in Graded Posets

    Authors: Prateek Bhakta, Ben Cousins, Matthew Fahrbach, Dana Randall

    Abstract: Graded posets frequently arise throughout combinatorics, where it is natural to try to count the number of elements of a fixed rank. These counting problems are often $\#\textbf{P}$-complete, so we consider approximation algorithms for counting and uniform sampling. We show that for certain classes of posets, biased Markov chains that walk along edges of their Hasse diagrams allow us to approximat… ▽ More

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: 23 pages, 12 figures

    Journal ref: Proceedings of the 28th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2017) 1823-1838