Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Flaherty, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.17514  [pdf, other

    cs.LG stat.ME

    Cost-aware Generalized $α$-investing for Multiple Hypothesis Testing

    Authors: Thomas Cook, Harsh Vardhan Dubey, Ji Ah Lee, Guangyu Zhu, Tingting Zhao, Patrick Flaherty

    Abstract: We consider the problem of sequential multiple hypothesis testing with nontrivial data collection costs. This problem appears, for example, when conducting biological experiments to identify differentially expressed genes of a disease process. This work builds on the generalized $α$-investing framework which enables control of the false discovery rate in a sequential testing setting. We make a the… ▽ More

    Submitted 3 November, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 26 pages, 5 figures, 8 tables

    MSC Class: 62L05; 62C10

  2. arXiv:2106.06691  [pdf, other

    stat.ML cs.LG q-bio.GN stat.AP

    Doubly Non-Central Beta Matrix Factorization for DNA Methylation Data

    Authors: Aaron Schein, Anjali Nagulpally, Hanna Wallach, Patrick Flaherty

    Abstract: We present a new non-negative matrix factorization model for $(0,1)$ bounded-support data based on the doubly non-central beta (DNCB) distribution, a generalization of the beta distribution. The expressiveness of the DNCB distribution is particularly useful for modeling DNA methylation datasets, which are typically highly dispersed and multi-modal; however, the model structure is sufficiently gene… ▽ More

    Submitted 12 June, 2021; originally announced June 2021.

    Comments: To appear in the Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI) 2021

  3. arXiv:2104.07061  [pdf, other

    cs.LG cs.DS physics.data-an stat.ML

    Exact and Approximate Hierarchical Clustering Using A*

    Authors: Craig S. Greenberg, Sebastian Macaluso, Nicholas Monath, Avinava Dubey, Patrick Flaherty, Manzil Zaheer, Amr Ahmed, Kyle Cranmer, Andrew McCallum

    Abstract: Hierarchical clustering is a critical task in numerous domains. Many approaches are based on heuristics and the properties of the resulting clusterings are studied post hoc. However, in several applications, there is a natural cost function that can be used to characterize the quality of the clustering. In those cases, hierarchical clustering can be seen as a combinatorial optimization problem. To… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: 30 pages, 9 figures

  4. arXiv:2002.11661  [pdf, other

    cs.DS cs.LG physics.data-an stat.ML

    Data Structures & Algorithms for Exact Inference in Hierarchical Clustering

    Authors: Craig S. Greenberg, Sebastian Macaluso, Nicholas Monath, Ji-Ah Lee, Patrick Flaherty, Kyle Cranmer, Andrew McGregor, Andrew McCallum

    Abstract: Hierarchical clustering is a fundamental task often used to discover meaningful structures in data, such as phylogenetic trees, taxonomies of concepts, subtypes of cancer, and cascades of particle decays in particle physics. Typically approximate algorithms are used for inference due to the combinatorial number of possible hierarchical clusterings. In contrast to existing methods, we present novel… ▽ More

    Submitted 22 October, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: 27 pages, 12 figures

  5. arXiv:1911.04285  [pdf, other

    stat.ML cs.LG math.OC stat.ME

    MAP Clustering under the Gaussian Mixture Model via Mixed Integer Nonlinear Optimization

    Authors: Patrick Flaherty, Pitchaya Wiratchotisatian, Ji Ah Lee, Zhou Tang, Andrew C. Trapp

    Abstract: We present a global optimization approach for solving the maximum a-posteriori (MAP) clustering problem under the Gaussian mixture model.Our approach can accommodate side constraints and it preserves the combinatorial structure of the MAP clustering problem by formulating it asa mixed-integer nonlinear optimization problem (MINLP). We approximate the MINLP through a mixed-integer quadratic program… ▽ More

    Submitted 16 March, 2020; v1 submitted 8 November, 2019; originally announced November 2019.