Zum Hauptinhalt springen

Showing 1–50 of 59 results for author: Moseley, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.01473  [pdf

    cs.PF

    gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

    Authors: Erik D. Huckvale, Hunter N. B. Moseley

    Abstract: Determining the maximum usage of random-access memory (RAM) on both the motherboard and on a graphical processing unit (GPU) over the lifetime of a computing task can be extremely useful for troubleshooting points of failure as well as optimizing memory utilization, especially within a high-performance computing (HPC) setting. While there are tools for tracking compute time and RAM, including by j… ▽ More

    Submitted 24 June, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  2. arXiv:2404.00346  [pdf, other

    cs.PF cs.DC

    Asymptotically Optimal Scheduling of Multiple Parallelizable Job Classes

    Authors: Benjamin Berg, Benjamin Moseley, Weina Wang, Mor Harchol-Balter

    Abstract: Many modern computing workloads are composed of parallelizable jobs. A single parallelizable job can be completed more quickly if it is run on additional servers, however each job is typically limited in the number of servers it can run on (its parallelizability level). A job's parallelizability level is determined by the type of computation the job performs and how it was implemented. As a result… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  3. arXiv:2402.13530  [pdf, other

    math.OC cs.LG

    Best of Many in Both Worlds: Online Resource Allocation with Predictions under Unknown Arrival Model

    Authors: Lin An, Andrew A. Li, Benjamin Moseley, Gabriel Visotsky

    Abstract: Online decision-makers often obtain predictions on future variables, such as arrivals, demands, inventories, and so on. These predictions can be generated from simple forecasting algorithms for univariate time-series, all the way to state-of-the-art machine learning models that leverage multiple time-series and additional feature information. However, the prediction accuracy is unknown to decision… ▽ More

    Submitted 22 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  4. arXiv:2402.11028  [pdf, other

    cs.DS

    Incremental Topological Ordering and Cycle Detection with Predictions

    Authors: Samuel McCauley, Benjamin Moseley, Aidin Niaparast, Shikha Singh

    Abstract: This paper leverages the framework of algorithms-with-predictions to design data structures for two fundamental dynamic graph problems: incremental topological ordering and cycle detection. In these problems, the input is a directed graph on $n$ nodes, and the $m$ edges arrive one by one. The data structure must maintain a topological ordering of the vertices at all times and detect if the newly i… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  5. arXiv:2312.14063  [pdf, other

    cs.DB cs.DS

    Polynomial Time Convergence of the Iterative Evaluation of Datalogo Programs

    Authors: Sungjin Im, Benjamin Moseley, Hung Q. Ngo, Kirk Pruhs

    Abstract: Datalogo is an extension of Datalog that allows for aggregation and recursion over an arbitrary commutative semiring. Like Datalog, Datalogo programs can be evaluated via the natural iterative algorithm until a fixed point is reached. However unlike Datalog, the natural iterative evaluation of some Datalogo programs over some semirings may not converge. It is known that the commutative semirings f… ▽ More

    Submitted 21 February, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

  6. arXiv:2311.17664  [pdf, other

    cs.DB

    On the Convergence Rate of Linear Datalogo over Stable Semirings

    Authors: Sungjin Im, Benjamin Moseley, Hung Ngo, Kirk Pruhs

    Abstract: Datalogo is an extension of Datalog, where instead of a program being a collection of union of conjunctive queries over the standard Boolean semiring, a program may now be a collection of sum-sum-product queries over an arbitrary commutative partially ordered pre-semiring. Datalogo is more powerful than Datalog in that its additional algebraic structure alows for supporting recursion with aggregat… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  7. arXiv:2308.05067  [pdf, other

    cs.DS

    Controlling Tail Risk in Online Ski-Rental

    Authors: Michael Dinitz, Sungjin Im, Thomas Lavastida, Benjamin Moseley, Sergei Vassilvitskii

    Abstract: The classical ski-rental problem admits a textbook 2-competitive deterministic algorithm, and a simple randomized algorithm that is $\frac{e}{e-1}$-competitive in expectation. The randomized algorithm, while optimal in expectation, has a large variance in its performance: it has more than a 37% chance of competitive ratio exceeding 2, and a $Θ(1/n)$ chance of the competitive ratio exceeding $n$!… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 28 pages, 2 figures

  8. arXiv:2308.01534  [pdf, other

    cs.DS cs.DM

    Simultaneously Approximating All $\ell_p$-norms in Correlation Clustering

    Authors: Sami Davies, Benjamin Moseley, Heather Newman

    Abstract: This paper considers correlation clustering on unweighted complete graphs. We give a combinatorial algorithm that returns a single clustering solution that is simultaneously $O(1)$-approximate for all $\ell_p$-norms of the disagreement vector; in other words, a combinatorial $O(1)$-approximation of the all-norms objective for correlation clustering. This is the first proof that minimal sacrifice i… ▽ More

    Submitted 9 March, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: 27 pages, 2 figures

  9. arXiv:2305.18227  [pdf, other

    cs.DS cs.LG

    Online Dynamic Acknowledgement with Learned Predictions

    Authors: Sungjin Im, Benjamin Moseley, Chenyang Xu, Ruilong Zhang

    Abstract: We revisit the online dynamic acknowledgment problem. In the problem, a sequence of requests arrive over time to be acknowledged, and all outstanding requests can be satisfied simultaneously by one acknowledgement. The goal of the problem is to minimize the total request delay plus acknowledgement cost. This elegant model studies the trade-off between acknowledgement cost and waiting experienced b… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: To appear in INFOCOM 2023

  10. arXiv:2305.10536  [pdf, other

    cs.DS cs.LG

    Online List Labeling with Predictions

    Authors: Samuel McCauley, Benjamin Moseley, Aidin Niaparast, Shikha Singh

    Abstract: A growing line of work shows how learned predictions can be used to break through worst-case barriers to improve the running time of an algorithm. However, incorporating predictions into data structures with strong theoretical guarantees remains underdeveloped. This paper takes a step in this direction by showing that predictions can be leveraged in the fundamental online list labeling problem. In… ▽ More

    Submitted 20 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  11. arXiv:2303.15379  [pdf, other

    cs.DS

    Online $k$-Median with Consistent Clusters

    Authors: Benjamin Moseley, Heather Newman, Kirk Pruhs

    Abstract: We consider the online $k$-median clustering problem in which $n$ points arrive online and must be irrevocably assigned to a cluster on arrival. As there are lower bound instances that show that an online algorithm cannot achieve a competitive ratio that is a function of $n$ and $k$, we consider a beyond worst-case analysis model in which the algorithm is provided a priori with a predicted budget… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: 28 pages, 7 figures

  12. arXiv:2303.00837  [pdf, other

    cs.DS

    Predictive Flows for Faster Ford-Fulkerson

    Authors: Sami Davies, Benjamin Moseley, Sergei Vassilvitskii, Yuyan Wang

    Abstract: Recent work has shown that leveraging learned predictions can improve the running time of algorithms for bipartite matching and similar combinatorial problems. In this work, we build on this idea to improve the performance of the widely used Ford-Fulkerson algorithm for computing maximum flows by seeding Ford-Fulkerson with predicted flows. Our proposed method offers strong theoretical performance… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

  13. arXiv:2301.13079  [pdf, other

    cs.DS cs.DM

    Fast Combinatorial Algorithms for Min Max Correlation Clustering

    Authors: Sami Davies, Benjamin Moseley, Heather Newman

    Abstract: We introduce fast algorithms for correlation clustering with respect to the Min Max objective that provide constant factor approximations on complete graphs. Our algorithms are the first purely combinatorial approximation algorithms for this problem. We construct a novel semi-metric on the set of vertices, which we call the correlation metric, that indicates to our clustering algorithms whether pa… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

  14. arXiv:2212.07682  [pdf, other

    cs.DS

    Min-max Submodular Ranking for Multiple Agents

    Authors: Qingyun Chen, Sungjin Im, Benjamin Moseley, Chenyang Xu, Ruilong Zhang

    Abstract: In the submodular ranking (SR) problem, the input consists of a set of submodular functions defined on a ground set of elements. The goal is to order elements for all the functions to have value above a certain threshold as soon on average as possible, assuming we choose one element per time. The problem is flexible enough to capture various applications in machine learning, including decision tre… ▽ More

    Submitted 27 March, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: To appear in AAAI 2023

  15. arXiv:2211.08381  [pdf, ps, other

    cs.DS

    Optimizing Polymatroid Functions

    Authors: Sungjin Im, Benjamin Moseley, Hung Q. Ngo, Kirk Pruhs, Alireza Samadian

    Abstract: We consider a class of optimization problems that involve determining the maximum value that a function in a particular class can attain subject to a collection of difference constraints. We show that a particular linear programming technique, based on duality and projections, can be used to rederive some structural results that were previously established using more ad hoc methods. We then show t… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  16. arXiv:2210.12438  [pdf, ps, other

    cs.LG cs.DS

    Algorithms with Prediction Portfolios

    Authors: Michael Dinitz, Sungjin Im, Thomas Lavastida, Benjamin Moseley, Sergei Vassilvitskii

    Abstract: The research area of algorithms with predictions has seen recent success showing how to incorporate machine learning into algorithm design to improve performance when the predictions are correct, while retaining worst-case guarantees when they are not. Most previous work has assumed that the algorithm has access to a single predictor. However, in practice, there are many machine learning methods a… ▽ More

    Submitted 2 December, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: 24 pages. Appears at NeurIPS 2022

  17. arXiv:2208.13702  [pdf, ps, other

    cs.DS

    Configuration Balancing for Stochastic Requests

    Authors: Franziska Eberle, Anupam Gupta, Nicole Megow, Benjamin Moseley, Rudy Zhou

    Abstract: The configuration balancing problem with stochastic requests generalizes many well-studied resource allocation problems such as load balancing and virtual circuit routing. In it, we have $m$ resources and $n$ requests. Each request has multiple possible configurations, each of which increases the load of each resource by some amount. The goal is to select one configuration for each request to mini… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

  18. arXiv:2208.13696  [pdf, other

    cs.DS

    Minimizing Completion Times for Stochastic Jobs via Batched Free Times

    Authors: Anupam Gupta, Benjamin Moseley, Rudy Zhou

    Abstract: We study the classic problem of minimizing the expected total completion time of jobs on $m$ identical machines in the setting where the sizes of the jobs are stochastic. Specifically, the size of each job is a random variable whose distribution is known to the algorithm, but whose realization is revealed only after the job is scheduled. While minimizing the total completion time is easy in the de… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

  19. arXiv:2206.12896  [pdf, ps, other

    cs.DS math.CO

    On the Impossibility of Decomposing Binary Matroids

    Authors: Marilena Leichter, Benjamin Moseley, Kirk Pruhs

    Abstract: We show that there exist $k$-colorable matroids that are not $(b,c)$-decomposable when $b$ and $c$ are constants. A matroid is $(b,c)$-decomposable, if its ground set of elements can be partitioned into sets $X_1, X_2, \ldots, X_l$ with the following two properties. Each set $X_i$ has size at most $ck$. Moreover, for all sets $Y$ such that $|Y \cap X_i| \leq 1$ it is the case that $Y$ is $b$-color… ▽ More

    Submitted 29 June, 2022; v1 submitted 26 June, 2022; originally announced June 2022.

  20. arXiv:2112.05353  [pdf, other

    cs.LG

    Learning-Augmented Algorithms for Online Steiner Tree

    Authors: Chenyang Xu, Benjamin Moseley

    Abstract: This paper considers the recently popular beyond-worst-case algorithm analysis model which integrates machine-learned predictions with online algorithm design. We consider the online Steiner tree problem in this model for both directed and undirected graphs. Steiner tree is known to have strong lower bounds in the online setting and any algorithm's worst-case guarantee is far from desirable. This… ▽ More

    Submitted 18 March, 2023; v1 submitted 10 December, 2021; originally announced December 2021.

    Comments: To appear in AAAI 2022

  21. arXiv:2111.06564  [pdf, ps, other

    cs.DS

    A Competitive Algorithm for Throughput Maximization on Identical Machines

    Authors: Benjamin Moseley, Kirk Pruhs, Clifford Stein, Rudy Zhou

    Abstract: This paper considers the basic problem of scheduling jobs online with preemption to maximize the number of jobs completed by their deadline on $m$ identical machines. The main result is an $O(1)$ competitive deterministic algorithm for any number of machines $m >1$.

    Submitted 23 August, 2024; v1 submitted 11 November, 2021; originally announced November 2021.

  22. arXiv:2107.09770  [pdf, other

    cs.LG cs.DS

    Faster Matchings via Learned Duals

    Authors: Michael Dinitz, Sungjin Im, Thomas Lavastida, Benjamin Moseley, Sergei Vassilvitskii

    Abstract: A recent line of research investigates how algorithms can be augmented with machine-learned predictions to overcome worst case lower bounds. This area has revealed interesting algorithmic insights into problems, with particular success in the design of competitive online algorithms. However, the question of improving algorithm running times with predictions has largely been unexplored. We take a… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: 27 pages, 7 figures

  23. arXiv:2107.07871  [pdf, other

    physics.comp-ph cs.LG

    Finite Basis Physics-Informed Neural Networks (FBPINNs): a scalable domain decomposition approach for solving differential equations

    Authors: Ben Moseley, Andrew Markham, Tarje Nissen-Meyer

    Abstract: Recently, physics-informed neural networks (PINNs) have offered a powerful new paradigm for solving problems relating to differential equations. Compared to classical numerical methods PINNs have several advantages, for example their ability to provide mesh-free solutions of differential equations and their ability to carry out forward and inverse modelling within the same optimisation problem. Wh… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 27 pages, 13 figures

  24. arXiv:2107.03795  [pdf, other

    cs.DS math.CO

    An Efficient Reduction of a Gammoid to a Partition Matroid

    Authors: Marilena Leichter, Benjamin Moseley, Kirk Pruhs

    Abstract: Our main contribution is a polynomial-time algorithm to reduce a $k$-colorable gammoid to a $(2k-2)$-colorable partition matroid. It is known that there are gammoids that can not be reduced to any $(2k-3)$-colorable partition matroid, so this result is tight. We then discuss how such a reduction can be used to obtain polynomial-time algorithms with better approximation ratios for various natural p… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: Full version of a paper accepted at ESA 2021

  25. arXiv:2106.01079  [pdf, other

    cs.DS

    Using Predicted Weights for Ad Delivery

    Authors: Thomas Lavastida, Benjamin Moseley, R. Ravi, Chenyang Xu

    Abstract: We study the performance of a proportional weights algorithm for online capacitated bipartite matching modeling the delivery of impression ads. The algorithm uses predictions on the advertiser nodes to match arriving impression nodes fractionally in proportion to the weights of its neighbors. This paper gives a thorough empirical study of the performance of the algorithm on a data-set of ad impres… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: 15 pages, 10 figures. To appear in ACDA 2021

  26. arXiv:2012.08083  [pdf, other

    cs.DS cs.DB

    Instance Optimal Join Size Estimation

    Authors: Mahmoud Abo-Khamis, Sungjin Im, Benjamin Moseley, Kirk Pruhs, Alireza Samadian

    Abstract: We consider the problem of efficiently estimating the size of the inner join of a collection of preprocessed relational tables from the perspective of instance optimality analysis. The run time of instance optimal algorithms is comparable to the minimum time needed to verify the correctness of a solution. Previously instance optimal algorithms were only known when the size of the join was small (a… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

  27. arXiv:2011.11743  [pdf, other

    cs.LG cs.DS

    Learnable and Instance-Robust Predictions for Online Matching, Flows and Load Balancing

    Authors: Thomas Lavastida, Benjamin Moseley, R. Ravi, Chenyang Xu

    Abstract: We propose a new model for augmenting algorithms with predictions by requiring that they are formally learnable and instance robust. Learnability ensures that predictions can be efficiently constructed from a reasonable amount of past data. Instance robustness ensures that the prediction is robust to modest changes in the problem input, where the measure of the change may be problem specific. Inst… ▽ More

    Submitted 1 July, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: To appear in ESA 2021

  28. arXiv:2009.00808  [pdf, other

    cs.DS

    Structural Iterative Rounding for Generalized $k$-Median Problems

    Authors: Anupam Gupta, Benjamin Moseley, Rudy Zhou

    Abstract: This paper considers approximation algorithms for generalized $k$-median problems. This class of problems can be informally described as $k$-median with a constant number of extra constraints, and includes $k$-median with outliers, and knapsack median. Our first contribution is a pseudo-approximation algorithm for generalized $k$-median that outputs a $6.387$-approximate solution, with a constant… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.

  29. arXiv:2008.13235  [pdf, other

    cs.LG stat.ML

    An Objective for Hierarchical Clustering in Euclidean Space and its Connection to Bisecting K-means

    Authors: Benjamin Moseley, Yuyan Wang

    Abstract: This paper explores hierarchical clustering in the case where pairs of points have dissimilarity scores (e.g. distances) as a part of the input. The recently introduced objective for points with dissimilarity scores results in every tree being a 1/2 approximation if the distances form a metric. This shows the objective does not make a significant distinction between a good and poor hierarchical cl… ▽ More

    Submitted 30 August, 2020; originally announced August 2020.

  30. arXiv:2008.00358  [pdf, other

    cs.DS cs.DB cs.LG

    Relational Algorithms for k-means Clustering

    Authors: Benjamin Moseley, Kirk Pruhs, Alireza Samadian, Yuyan Wang

    Abstract: This paper gives a k-means approximation algorithm that is efficient in the relational algorithms model. This is an algorithm that operates directly on a relational database without performing a join to convert it to a matrix whose rows represent the data points. The running time is potentially exponentially smaller than $N$, the number of data points to be clustered that the relational database r… ▽ More

    Submitted 20 May, 2021; v1 submitted 1 August, 2020; originally announced August 2020.

  31. arXiv:2006.10221  [pdf, other

    cs.DS cs.LG stat.ML

    Fair Hierarchical Clustering

    Authors: Sara Ahmadian, Alessandro Epasto, Marina Knittel, Ravi Kumar, Mohammad Mahdian, Benjamin Moseley, Philip Pham, Sergei Vassilvitskii, Yuyan Wang

    Abstract: As machine learning has become more prevalent, researchers have begun to recognize the necessity of ensuring machine learning systems are fair. Recently, there has been an interest in defining a notion of fairness that mitigates over-representation in traditional clustering. In this paper we extend this notion to hierarchical clustering, where the goal is to recursively partition the data to opt… ▽ More

    Submitted 18 June, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

  32. arXiv:2005.11912  [pdf, ps, other

    cs.DS cs.DM

    Symmetric Linear Programming Formulations for Minimum Cut with Applications to TSP

    Authors: Robert D. Carr, Jennifer Iglesias, Giuseppe Lanciac, Benjamin Moseley

    Abstract: We introduce multiple symmetric LP relaxations for minimum cut problems. The relaxations give optimal and approximate solutions when the input is a Hamiltonian cycle. We show that this leads to one of two interesting results. In one case, these LPs always give optimal and near optimal solutions, and then they would be the smallest known symmetric LPs for the problems considered. Otherwise, these L… ▽ More

    Submitted 24 May, 2020; originally announced May 2020.

    Comments: Submitted to a journal

    MSC Class: 68R10 ACM Class: G.2.2

  33. arXiv:2005.09745  [pdf, ps, other

    cs.PF

    Optimal Resource Allocation for Elastic and Inelastic Jobs

    Authors: Benjamin Berg, Mor Harchol-Balter, Benjamin Moseley, Weina Wang, Justin Whitehouse

    Abstract: Modern data centers are tasked with processing heterogeneous workloads consisting of various classes of jobs. These classes differ in their arrival rates, size distributions, and job parallelizability. With respect to paralellizability, some jobs are elastic, meaning they can parallelize linearly across many servers. Other jobs are inelastic, meaning they can only run on a single server. Although… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

  34. arXiv:2005.05325  [pdf, ps, other

    cs.DS cs.LG

    A Relational Gradient Descent Algorithm For Support Vector Machine Training

    Authors: Mahmoud Abo-Khamis, Sungjin Im, Benjamin Moseley, Kirk Pruhs, Alireza Samadian

    Abstract: We consider gradient descent like algorithms for Support Vector Machine (SVM) training when the data is in relational form. The gradient of the SVM objective can not be efficiently computed by known techniques as it suffers from the ``subtraction problem''. We first show that the subtraction problem can not be surmounted by showing that computing any constant approximation of the gradient of the S… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

  35. arXiv:2003.10588  [pdf, ps, other

    cs.DS cs.DB cs.LG

    Approximate Aggregate Queries Under Additive Inequalities

    Authors: Mahmoud Abo-Khamis, Sungjin Im, Benjamin Moseley, Kirk Pruhs, Alireza Samadian

    Abstract: We consider the problem of evaluating certain types of functional aggregation queries on relational data subject to additive inequalities. Such aggregation queries, with a smallish number of additive inequalities, arise naturally/commonly in many applications, particularly in learning applications. We give a relatively complete categorization of the computational complexity of such problems. We fi… ▽ More

    Submitted 30 April, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

  36. arXiv:2003.02433  [pdf, ps, other

    cs.DS

    Fast Noise Removal for $k$-Means Clustering

    Authors: Sungjin Im, Mahshid Montazer Qaem, Benjamin Moseley, Xiaorui Sun, Rudy Zhou

    Abstract: This paper considers $k$-means clustering in the presence of noise. It is known that $k$-means clustering is highly sensitive to noise, and thus noise should be removed to obtain a quality solution. A popular formulation of this problem is called $k$-means clustering with outliers. The goal of $k$-means clustering with outliers is to discard up to a specified number $z$ of points as noise/outliers… ▽ More

    Submitted 13 April, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: Published in AISTATS 2020

  37. arXiv:2001.07784  [pdf, ps, other

    cs.DS cs.NI

    Scheduling for Weighted Flow and Completion Times in Reconfigurable Networks

    Authors: Michael Dinitz, Benjamin Moseley

    Abstract: New optical technologies offer the ability to reconfigure network topologies dynamically, rather than setting them once and for all. This is true in both optical wide area networks (optical WANs) and in datacenters, despite the many differences between these two settings. Because of these new technologies, there has been a surge of both practical and theoretical research on algorithms to take adva… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

    Comments: 10 pages. Appears in INFOCOM 2020

  38. arXiv:2001.06841  [pdf, other

    cs.DS cs.PF

    Dynamic Weighted Fairness with Minimal Disruptions

    Authors: Sungjin Im, Benjamin Moseley, Kamesh Munagala, Kirk Pruhs

    Abstract: In this paper, we consider the following dynamic fair allocation problem: Given a sequence of job arrivals and departures, the goal is to maintain an approximately fair allocation of the resource against a target fair allocation policy, while minimizing the total number of disruptions, which is the number of times the allocation of any job is changed. We consider a rich class of fair allocation po… ▽ More

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: To appear in Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS) 2020 (SIGMETRICS)

  39. arXiv:1910.04939  [pdf, ps, other

    cs.LG cs.DB stat.ML

    Rk-means: Fast Clustering for Relational Data

    Authors: Ryan Curtin, Ben Moseley, Hung Q. Ngo, XuanLong Nguyen, Dan Olteanu, Maximilian Schleich

    Abstract: Conventional machine learning algorithms cannot be applied until a data matrix is available to process. When the data matrix needs to be obtained from a relational database via a feature extraction query, the computation cost can be prohibitive, as the data matrix may be (much) larger than the total input relation size. This paper introduces Rk-means, or relational k -means algorithm, for clusteri… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

  40. arXiv:1905.10845  [pdf, ps, other

    cs.LG stat.ML

    On Coresets for Regularized Loss Minimization

    Authors: Ryan R. Curtin, Sungjin Im, Ben Moseley, Kirk Pruhs, Alireza Samadian

    Abstract: We design and mathematically analyze sampling-based algorithms for regularized loss minimization problems that are implementable in popular computational models for large data, in which the access to the data is restricted in some way. Our main result is that if the regularizer's effect does not become negligible as the norm of the hypothesis scales, and as the data scales, then a uniform sample o… ▽ More

    Submitted 31 May, 2019; v1 submitted 26 May, 2019; originally announced May 2019.

  41. arXiv:1904.09667  [pdf, other

    cs.DS

    Scheduling to Approximate Minimization Objectives on Identical Machines

    Authors: Benjamin Moseley

    Abstract: This paper considers scheduling on identical machines. The scheduling objective considered in this paper generalizes most scheduling minimization problems. In the problem, there are $n$ jobs and each job $j$ is associated with a monotonically increasing function $g_j$. The goal is to design a schedule that minimizes $\sum_{j \in [n]} g_{j}(C_j)$ where $C_j$ is the completion time of job $j$ in the… ▽ More

    Submitted 21 April, 2019; originally announced April 2019.

  42. arXiv:1904.08954  [pdf, other

    cs.CC cs.DS

    A Conditional Lower Bound on Graph Connectivity in MapReduce

    Authors: Sungjin Im, Benjamin Moseley

    Abstract: MapReduce (and its open source implementation Hadoop) has become the de facto platform for processing large data sets. MapReduce offers a streamlined computational framework by interleaving sequential and parallel computation while hiding underlying system issues from the programmer. Due to the popularity of MapReduce, there have been attempts in the theoretical computer science community to under… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: Preprint from 2015

  43. arXiv:1901.07988  [pdf, other

    cs.LG stat.ML

    Backprop with Approximate Activations for Memory-efficient Network Training

    Authors: Ayan Chakrabarti, Benjamin Moseley

    Abstract: Training convolutional neural network models is memory intensive since back-propagation requires storing activations of all intermediate layers. This presents a practical concern when seeking to deploy very deep architectures in production, especially when models need to be frequently re-trained on updated datasets. In this paper, we propose a new implementation for back-propagation that significa… ▽ More

    Submitted 28 October, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: Project page at http://projects.ayanc.org/blpa/

  44. arXiv:1812.09526  [pdf, ps, other

    cs.DB cs.DS cs.IT cs.LG

    Functional Aggregate Queries with Additive Inequalities

    Authors: Mahmoud Abo Khamis, Ryan R. Curtin, Benjamin Moseley, Hung Q. Ngo, XuanLong Nguyen, Dan Olteanu, Maximilian Schleich

    Abstract: Motivated by fundamental applications in databases and relational machine learning, we formulate and study the problem of answering functional aggregate queries (FAQ) in which some of the input factors are defined by a collection of additive inequalities between variables. We refer to these queries as FAQ-AI for short. To answer FAQ-AI in the Boolean semiring, we define relaxed tree decompositio… ▽ More

    Submitted 15 September, 2020; v1 submitted 22 December, 2018; originally announced December 2018.

  45. arXiv:1811.08871  [pdf, other

    cs.LG stat.ML

    Efficient nonmyopic active search with applications in drug and materials discovery

    Authors: Shali Jiang, Gustavo Malkomes, Benjamin Moseley, Roman Garnett

    Abstract: Active search is a learning paradigm for actively identifying as many members of a given class as possible. A critical target scenario is high-throughput screening for scientific discovery, such as drug or materials discovery. In this paper, we approach this problem in Bayesian decision framework. We first derive the Bayesian optimal policy under a natural utility, and establish a theoretical hard… ▽ More

    Submitted 23 November, 2018; v1 submitted 21 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Molecules and Materials (NeurIPS 2018 Workshop)

  46. arXiv:1810.03199  [pdf, other

    cs.NE q-bio.NC

    Pre-Synaptic Pool Modification (PSPM): A Supervised Learning Procedure for Spiking Neural Networks

    Authors: Bryce Bagley, Blake Bordelon, Benjamin Moseley, Ralf Wessel

    Abstract: Learning synaptic weights of spiking neural network (SNN) models that can reproduce target spike trains from provided neural firing data is a central problem in computational neuroscience and spike-based computing. The discovery of the optimal weight values can be posed as a supervised learning task wherein the weights of the model network are chosen to maximize the similarity between the target s… ▽ More

    Submitted 8 October, 2019; v1 submitted 7 October, 2018; originally announced October 2018.

    Comments: 24 pages, 8 figures, Code and data can be found at https://github.com/blakebordelon/Spiking-Neural-Network-Optimization

  47. arXiv:1804.08317  [pdf, ps, other

    cs.DS

    Online Non-Preemptive Scheduling to Minimize Weighted Flow-time on Unrelated Machines

    Authors: Giorgio Lucarelli, Benjamin Moseley, Nguyen Kim Thang, Abhinav Srivastav, Denis Trystram

    Abstract: In this paper, we consider the online problem of scheduling independent jobs \emph{non-preemptively} so as to minimize the weighted flow-time on a set of unrelated machines. There has been a considerable amount of work on this problem in the preemptive setting where several competitive algorithms are known in the classical competitive model. %Using the speed augmentation model, Anand et al. showed… ▽ More

    Submitted 23 April, 2018; originally announced April 2018.

  48. arXiv:1802.10309  [pdf, ps, other

    cs.DS

    Online Non-preemptive Scheduling on Unrelated Machines with Rejections

    Authors: Giorgio Lucarelli, Benjamin Moseley, Nguyen Kim Thang, Abhinav Srivastav, Denis Trystram

    Abstract: When a computer system schedules jobs there is typically a significant cost associated with preempting a job during execution. This cost can be from the expensive task of saving the memory's state and loading data into and out of memory. It is desirable to schedule jobs non-preemptively to avoid the costs of preemption. There is a need for non-preemptive system schedulers on desktops, servers and… ▽ More

    Submitted 28 February, 2018; originally announced February 2018.

  49. arXiv:1708.09046  [pdf, ps, other

    cs.DS

    An O(log log m)-competitive Algorithm for Online Machine Minimization

    Authors: Sungjin Im, Benjamin Moseley, Kirk Pruhs, Clifford Stein

    Abstract: This paper considers the online machine minimization problem, a basic real time scheduling problem. The setting for this problem consists of n jobs that arrive over time, where each job has a deadline by which it must be completed. The goal is to design an online scheduler that feasibly schedules the jobs on a nearly minimal number of machines. An algorithm is c-machine optimal if the algorithm wi… ▽ More

    Submitted 29 January, 2018; v1 submitted 29 August, 2017; originally announced August 2017.

  50. Greed Works -- Online Algorithms For Unrelated Machine Stochastic Scheduling

    Authors: Varun Gupta, Benjamin Moseley, Marc Uetz, Qiaomin Xie

    Abstract: This paper establishes performance guarantees for online algorithms that schedule stochastic, nonpreemptive jobs on unrelated machines to minimize the expected total weighted completion time. Prior work on unrelated machine scheduling with stochastic jobs was restricted to the offline case, and required linear or convex programming relaxations for the assignment of jobs to machines. The algorithms… ▽ More

    Submitted 13 May, 2020; v1 submitted 5 March, 2017; originally announced March 2017.

    Comments: Preliminary version appeared in IPCO 2017

    MSC Class: 90B36 ACM Class: F.2.2

    Journal ref: Mathematics of Operations Research 44(2), 2020, 497-516