Zum Hauptinhalt springen

Showing 1–19 of 19 results for author: Williamson, S A

.
  1. arXiv:2211.00080  [pdf, other

    cs.LG eess.SP stat.AP

    Denoising neural networks for magnetic resonance spectroscopy

    Authors: Natalie Klein, Amber J. Day, Harris Mason, Michael W. Malone, Sinead A. Williamson

    Abstract: In many scientific applications, measured time series are corrupted by noise or distortions. Traditional denoising techniques often fail to recover the signal of interest, particularly when the signal-to-noise ratio is low or when certain assumptions on the signal and noise are violated. In this work, we demonstrate that deep learning-based denoising methods can outperform traditional techniques w… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 5 pages with appendix

  2. arXiv:2006.14621  [pdf, other

    stat.ME cs.LG stat.ML

    Understanding collections of related datasets using dependent MMD coresets

    Authors: Sinead A. Williamson, Jette Henderson

    Abstract: Understanding how two datasets differ can help us determine whether one dataset under-represents certain sub-populations, and provides insights into how well models will generalize across datasets. Representative points selected by a maximum mean discrepency (MMD) coreset can provide interpretable summaries of a single dataset, but are not easily compared across datasets. In this paper we introduc… ▽ More

    Submitted 4 August, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

  3. arXiv:2006.08795  [pdf, other

    cs.LG stat.ML

    Balance is key: Private median splits yield high-utility random trees

    Authors: Shorya Consul, Sinead A. Williamson

    Abstract: Random forests are a popular method for classification and regression due to their versatility. However, this flexibility can come at the cost of user privacy, since training random forests requires multiple data queries, often on small, identifiable subsets of the training data. Privatizing these queries typically comes at a high utility cost, in large part because we are privatizing queries on s… ▽ More

    Submitted 19 February, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: 17 pages

  4. arXiv:2001.05591  [pdf, other

    stat.ML cs.LG

    Distributed, partially collapsed MCMC for Bayesian Nonparametrics

    Authors: Avinava Dubey, Michael Minyi Zhang, Eric P. Xing, Sinead A. Williamson

    Abstract: Bayesian nonparametric (BNP) models provide elegant methods for discovering underlying latent features within a data set, but inference in such models can be slow. We exploit the fact that completely random measures, which commonly used models like the Dirichlet process and the beta-Bernoulli process can be expressed as, are decomposable into independent sub-measures. We use this decomposition to… ▽ More

    Submitted 4 March, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: To appear in the 23rd International Conference on Artificial Intelligence and Statistics

    Journal ref: Artificial Intelligence and Statistics, 108:3685-3695, 2020

  5. arXiv:1910.05098  [pdf, other

    cs.LG stat.ME stat.ML

    A Nonparametric Bayesian Model for Sparse Dynamic Multigraphs

    Authors: Elahe Ghalebi, Hamidreza Mahyar, Radu Grosu, Graham W. Taylor, Sinead A. Williamson

    Abstract: As the availability and importance of temporal interaction data--such as email communication--increases, it becomes increasingly important to understand the underlying structure that underpins these interactions. Often these interactions form a multigraph, where we might have multiple interactions between two entities. Such multigraphs tend to be sparse yet structured, and their distribution often… ▽ More

    Submitted 14 June, 2020; v1 submitted 11 October, 2019; originally announced October 2019.

  6. arXiv:1909.01251  [pdf, other

    stat.ML cs.AI cs.LG

    Avoiding Resentment Via Monotonic Fairness

    Authors: Guy W. Cole, Sinead A. Williamson

    Abstract: Classifiers that achieve demographic balance by explicitly using protected attributes such as race or gender are often politically or culturally controversial due to their lack of individual fairness, i.e. individuals with similar qualifications will receive different outcomes. Individually and group fair decision criteria can produce counter-intuitive results, e.g. that the optimal constrained bo… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

  7. arXiv:1905.11724  [pdf, other

    cs.LG stat.ML

    Sequential Edge Clustering in Temporal Multigraphs

    Authors: Elahe Ghalebi, Hamidreza Mahyar, Radu Grosu, Graham W. Taylor, Sinead A. Williamson

    Abstract: Interaction graphs, such as those recording emails between individuals or transactions between institutions, tend to be sparse yet structured, and often grow in an unbounded manner. Such behavior can be well-captured by structured, nonparametric edge-exchangeable graphs. However, such exchangeable models necessarily ignore temporal dynamics in the network. We propose a dynamic nonparametric model… ▽ More

    Submitted 13 October, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

  8. Sequential Gaussian Processes for Online Learning of Nonstationary Functions

    Authors: Michael Minyi Zhang, Bianca Dumitrascu, Sinead A. Williamson, Barbara E. Engelhardt

    Abstract: Many machine learning problems can be framed in the context of estimating functions, and often these are time-dependent functions that are estimated in real-time as observations arrive. Gaussian processes (GPs) are an attractive choice for modeling real-valued nonlinear functions due to their flexibility and uncertainty quantification. However, the typical GP regression model suffers from several… ▽ More

    Submitted 6 May, 2023; v1 submitted 23 May, 2019; originally announced May 2019.

    Journal ref: IEEE Transactions on Signal Processing, vol. 71, pp. 1539-1550, 2023

  9. arXiv:1904.08548  [pdf, other

    stat.ML cs.LG

    A New Class of Time Dependent Latent Factor Models with Applications

    Authors: Sinead A. Williamson, Michael Minyi Zhang, Paul Damien

    Abstract: In many applications, observed data are influenced by some combination of latent causes. For example, suppose sensors are placed inside a building to record responses such as temperature, humidity, power consumption and noise levels. These random, observed responses are typically affected by many unobserved, latent factors (or features) within the building such as the number of individuals, the tu… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

    Journal ref: Journal of Machine Learning Research 21(27):1-24, 2020

  10. arXiv:1904.02016  [pdf, other

    cs.SI cs.LG stat.ML

    Stochastic Blockmodels with Edge Information

    Authors: Guy W. Cole, Sinead A. Williamson

    Abstract: Stochastic blockmodels allow us to represent networks in terms of a latent community structure, often yielding intuitions about the underlying social structure. Typically, this structure is inferred based only on a binary network representing the presence or absence of interactions between nodes, which limits the amount of information that can be extracted from the data. In practice, many interact… ▽ More

    Submitted 3 April, 2019; originally announced April 2019.

  11. arXiv:1901.04321  [pdf, other

    cs.IR cs.LG cs.NE stat.ML

    Large-scale Collaborative Filtering with Product Embeddings

    Authors: Thom Lake, Sinead A. Williamson, Alexander T. Hawk, Christopher C. Johnson, Benjamin P. Wing

    Abstract: The application of machine learning techniques to large-scale personalized recommendation problems is a challenging task. Such systems must make sense of enormous amounts of implicit feedback in order to understand user preferences across numerous product categories. This paper presents a deep learning based solution to this problem within the collaborative filtering with implicit feedback framewo… ▽ More

    Submitted 11 January, 2019; originally announced January 2019.

    Comments: 15 pages, 5 figures

  12. arXiv:1810.06738  [pdf, other

    stat.ME

    Random clique covers for graphs with local density and global sparsity

    Authors: Sinead A. Williamson, Mauricio Tec

    Abstract: Large real-world graphs tend to be sparse, but they often contain many densely connected subgraphs and exhibit high clustering coefficients. While recent random graph models can capture this sparsity, they ignore the local density, or vice versa. We develop a Bayesian nonparametric graph model based on random edge clique covers, and show that this model can capture power law degree distribution, g… ▽ More

    Submitted 17 July, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: Appears in UAI 2019. This version includes appendices

  13. arXiv:1806.02512  [pdf, other

    stat.ML cs.LG

    Importance Weighted Generative Networks

    Authors: Maurice Diesendruck, Ethan R. Elenberg, Rajat Sen, Guy W. Cole, Sanjay Shakkottai, Sinead A. Williamson

    Abstract: Deep generative networks can simulate from a complex target distribution, by minimizing a loss with respect to samples from that distribution. However, often we do not have direct access to our target distribution - our data may be subject to sample selection bias, or may be from a different but related distribution. We present methods based on importance weighting that can estimate the loss with… ▽ More

    Submitted 6 September, 2020; v1 submitted 7 June, 2018; originally announced June 2018.

  14. Accelerated Parallel Non-conjugate Sampling for Bayesian Non-parametric Models

    Authors: Michael Minyi Zhang, Sinead A. Williamson, Fernando Perez-Cruz

    Abstract: Inference of latent feature models in the Bayesian nonparametric setting is generally difficult, especially in high dimensional settings, because it usually requires proposing features from some prior distribution. In special cases, where the integration is tractable, we can sample new feature assignments according to a predictive likelihood. We present a novel method to accelerate the mixing of l… ▽ More

    Submitted 29 April, 2022; v1 submitted 19 May, 2017; originally announced May 2017.

    Comments: To appear in Statistics & Computing

    Journal ref: Statistics and Computing, Vol. 32, Num. 50, 2022

  15. arXiv:1703.03457  [pdf, other

    stat.ML

    Parallel Markov Chain Monte Carlo for the Indian Buffet Process

    Authors: Michael M. Zhang, Avinava Dubey, Sinead A. Williamson

    Abstract: Indian Buffet Process based models are an elegant way for discovering underlying features within a data set, but inference in such models can be slow. Inferring underlying features using Markov chain Monte Carlo either relies on an uncollapsed representation, which leads to poor mixing, or on a collapsed representation, which leads to a quadratic increase in computational complexity. Existing atte… ▽ More

    Submitted 9 March, 2017; originally announced March 2017.

    Comments: Workshop paper in Bayesian Nonparametrics: The Next Generation, NIPS 2015

  16. arXiv:1702.08420  [pdf, other

    stat.ML

    Embarrassingly Parallel Inference for Gaussian Processes

    Authors: Michael Minyi Zhang, Sinead A. Williamson

    Abstract: Training Gaussian process-based models typically involves an $ O(N^3)$ computational bottleneck due to inverting the covariance matrix. Popular methods for overcoming this matrix inversion problem cannot adequately model all types of latent functions, and are often not parallelizable. However, judicious choice of model structure can ameliorate this problem. A mixture-of-experts model that uses a m… ▽ More

    Submitted 3 March, 2020; v1 submitted 27 February, 2017; originally announced February 2017.

    Journal ref: Journal of Machine Learning Research 20, no. 169 (2019): 1-26

  17. arXiv:1508.06303  [pdf, other

    stat.ME

    Restricted Indian Buffet Processes

    Authors: Finale Doshi-Velez, Sinead A. Williamson

    Abstract: Latent feature models are a powerful tool for modeling data with globally-shared features. Nonparametric exchangeable models such as the Indian Buffet Process offer modeling flexibility by letting the number of latent features be unbounded. However, current models impose implicit distributions over the number of latent features per data point, and these implicit distributions may not match our kno… ▽ More

    Submitted 25 August, 2015; originally announced August 2015.

  18. arXiv:1211.7120  [pdf, other

    stat.ML

    Exact and Efficient Parallel Inference for Nonparametric Mixture Models

    Authors: Sinead A. Williamson, Avinava Dubey, Eric P. Xing

    Abstract: Nonparametric mixture models based on the Dirichlet process are an elegant alternative to finite models when the number of underlying components is unknown, but inference in such models can be slow. Existing attempts to parallelize inference in such models have relied on introducing approximations, which can lead to inaccuracies in the posterior estimate. In this paper, we describe auxiliary varia… ▽ More

    Submitted 29 November, 2012; originally announced November 2012.

  19. arXiv:1202.3705  [pdf

    cs.GT cs.AI

    Filtered Fictitious Play for Perturbed Observation Potential Games and Decentralised POMDPs

    Authors: Archie C. Chapman, Simon A. Williamson, Nicholas R. Jennings

    Abstract: Potential games and decentralised partially observable MDPs (Dec-POMDPs) are two commonly used models of multi-agent interaction, for static optimisation and sequential decisionmaking settings, respectively. In this paper we introduce filtered fictitious play for solving repeated potential games in which each player's observations of others' actions are perturbed by random noise, and use this algo… ▽ More

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-77-85