Showing 1–22 of 22 results for author: Papamarkou, T

Searching in archive cs.
  1. arXiv:2406.06642

    cs.LG

    TopoBenchmarkX: A Framework for Benchmarking Topological Deep Learning

    Authors: Lev Telyatnikov, Guillermo Bernardez, Marco Montagna, Pavlo Vasylenko, Ghada Zamzmi, Mustafa Hajij, Michael T Schaub, Nina Miolane, Simone Scardapane, Theodore Papamarkou

    Abstract: This work introduces TopoBenchmarkX, a modular open-source library designed to standardize benchmarking and accelerate research in Topological Deep Learning (TDL). TopoBenchmarkX maps the TDL pipeline into a sequence of independent and modular components for data loading and processing, as well as model training, optimization, and evaluation. This modular organization provides flexibility for modi…

    Submitted 9 June, 2024; originally announced June 2024.

  2. arXiv:2402.08871

    cs.LG stat.ML

    Position: Topological Deep Learning is the New Frontier for Relational Learning

    Authors: Theodore Papamarkou, Tolga Birdal, Michael Bronstein, Gunnar Carlsson, Justin Curry, Yue Gao, Mustafa Hajij, Roland Kwitt, Pietro Liò, Paolo Di Lorenzo, Vasileios Maroulas, Nina Miolane, Farzana Nasrin, Karthikeyan Natesan Ramamurthy, Bastian Rieck, Simone Scardapane, Michael T. Schaub, Petar Veličković, Bei Wang, Yusu Wang, Guo-Wei Wei, Ghada Zamzmi

    Abstract: Topological deep learning (TDL) is a rapidly evolving field that uses topological features to understand and design deep learning models. This paper posits that TDL is the new frontier for relational learning. TDL may complement graph representation learning and geometric deep learning by incorporating topological concepts, and can thus provide a natural choice for various machine learning setting…

    Submitted 6 August, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  3. arXiv:2402.02441

    cs.LG cs.AI cs.MS stat.CO

    TopoX: A Suite of Python Packages for Machine Learning on Topological Domains

    Authors: Mustafa Hajij, Mathilde Papillon, Florian Frantzen, Jens Agerberg, Ibrahem AlJabea, Ruben Ballester, Claudio Battiloro, Guillermo Bernárdez, Tolga Birdal, Aiden Brent, Peter Chin, Sergio Escalera, Simone Fiorellino, Odin Hoff Gardaa, Gurusankar Gopalakrishnan, Devendra Govil, Josef Hoppe, Maneel Reddy Karri, Jude Khouja, Manuel Lecha, Neal Livesay, Jan Meißner, Soham Mukherjee, Alexander Nikitin, Theodore Papamarkou, et al. (18 additional authors not shown)

    Abstract: We introduce TopoX, a Python software suite that provides reliable and user-friendly building blocks for computing and machine learning on topological domains that extend graphs: hypergraphs, simplicial, cellular, path and combinatorial complexes. TopoX consists of three packages: TopoNetX facilitates constructing and computing on these domains, including working with nodes, edges and higher-order…

    Submitted 17 February, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  4. arXiv:2402.01484

    cs.LG stat.CO stat.ML

    Connecting the Dots: Is Mode-Connectedness the Key to Feasible Sample-Based Inference in Bayesian Neural Networks?

    Authors: Emanuel Sommer, Lisa Wimmer, Theodore Papamarkou, Ludwig Bothmann, Bernd Bischl, David Rügamer

    Abstract: A major challenge in sample-based inference (SBI) for Bayesian neural networks is the size and structure of the networks' parameter space. Our work shows that successful SBI is possible by embracing the characteristic relationship between weight and function space, uncovering a systematic link between overparameterization and the difficulty of the sampling problem. Through extensive experiments, w…

    Submitted 27 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  5. arXiv:2402.00809

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni…

    Submitted 6 August, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  6. arXiv:2312.09504

    cs.LG cs.SI math.AT math.CO stat.ML

    Combinatorial Complexes: Bridging the Gap Between Cell Complexes and Hypergraphs

    Authors: Mustafa Hajij, Ghada Zamzmi, Theodore Papamarkou, Aldo Guzmán-Sáenz, Tolga Birdal, Michael T. Schaub

    Abstract: Graph-based signal processing techniques have become essential for handling data in non-Euclidean spaces. However, there is a growing awareness that these graph models might need to be expanded into 'higher-order' domains to effectively represent the complex relations found in high-dimensional data. Such higher-order domains are typically modeled either as hypergraphs, or as simplicial, cubical or…

    Submitted 14 December, 2023; originally announced December 2023.

    Journal ref: 57th Asilomar Conference on Signals, Systems, and Computers, 2023

  7. arXiv:2310.12842

    stat.ML cs.LG

    Model-agnostic variable importance for predictive uncertainty: an entropy-based approach

    Authors: Danny Wood, Theodore Papamarkou, Matt Benatan, Richard Allmendinger

    Abstract: In order to trust the predictions of a machine learning algorithm, it is necessary to understand the factors that contribute to those predictions. In the case of probabilistic and uncertainty-aware models, it is necessary to understand not only the reasons for the predictions themselves, but also the reasons for the model's level of confidence in those predictions. In this paper, we show how exist…

    Submitted 16 August, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Data Mining and Knowledge Discovery. Springer

  8. ICML 2023 Topological Deep Learning Challenge: Design and Results

    Authors: Mathilde Papillon, Mustafa Hajij, Helen Jenne, Johan Mathe, Audun Myers, Theodore Papamarkou, Tolga Birdal, Tamal Dey, Tim Doster, Tegan Emerson, Gurusankar Gopalakrishnan, Devendra Govil, Aldo Guzmán-Sáenz, Henry Kvinge, Neal Livesay, Soham Mukherjee, Shreyas N. Samaga, Karthikeyan Natesan Ramamurthy, Maneel Reddy Karri, Paul Rosen, Sophia Sanborn, Robin Walters, Jens Agerberg, Sadrodin Barikbin, Claudio Battiloro, et al. (31 additional authors not shown)

    Abstract: This paper presents the computational challenge on topological deep learning that was hosted within the ICML 2023 Workshop on Topology and Geometry in Machine Learning. The competition asked participants to provide open-source implementations of topological neural networks from the literature by contributing to the Python packages TopoNetX (data processing) and TopoModelX (deep learning). The chal…

    Submitted 18 January, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  9. arXiv:2304.02902

    stat.ML cs.LG

    Towards Efficient MCMC Sampling in Bayesian Neural Networks by Exploiting Symmetry

    Authors: Jonas Gregor Wiese, Lisa Wimmer, Theodore Papamarkou, Bernd Bischl, Stephan Günnemann, David Rügamer

    Abstract: Bayesian inference in deep neural networks is challenging due to the high-dimensional, strongly multi-modal parameter posterior density landscape. Markov chain Monte Carlo approaches asymptotically recover the true posterior but are considered prohibitively expensive for large modern architectures. Local methods, which have emerged as a popular alternative, focus on specific parameter regions that…

    Submitted 6 April, 2023; originally announced April 2023.

  10. arXiv:2208.11389

    stat.ML cs.LG stat.CO stat.ME

    Approximate blocked Gibbs sampling for Bayesian neural networks

    Authors: Theodore Papamarkou

    Abstract: In this work, minibatch MCMC sampling for feedforward neural networks is made more feasible. To this end, it is proposed to sample subgroups of parameters via a blocked Gibbs sampling scheme. By partitioning the parameter space, sampling is possible irrespective of layer width. It is also possible to alleviate vanishing acceptance rates for increasing depth by reducing the proposal variance in dee…

    Submitted 24 July, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

  11. arXiv:2206.00606

    cs.LG cs.CV cs.SI math.AT stat.ML

    Topological Deep Learning: Going Beyond Graph Data

    Authors: Mustafa Hajij, Ghada Zamzmi, Theodore Papamarkou, Nina Miolane, Aldo Guzmán-Sáenz, Karthikeyan Natesan Ramamurthy, Tolga Birdal, Tamal K. Dey, Soham Mukherjee, Shreyas N. Samaga, Neal Livesay, Robin Walters, Paul Rosen, Michael T. Schaub

    Abstract: Topological deep learning is a rapidly growing field that pertains to the development of deep learning models for data supported on topological domains such as simplicial complexes, cell complexes, and hypergraphs, which generalize many domains encountered in scientific computations. In this paper, we present a unifying deep learning framework built upon a richer data structure that includes widel…

    Submitted 19 May, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

  12. arXiv:2112.00365

    stat.ML cs.LG

    Probability-Generating Function Kernels for Spherical Data

    Authors: Theodore Papamarkou, Alexey Lindo

    Abstract: Probability-generating function (PGF) kernels are introduced, which constitute a class of kernels supported on the unit hypersphere, for the purposes of spherical data analysis. PGF kernels generalize RBF kernels in the context of spherical data. The properties of PGF kernels are studied. A semi-parametric learning algorithm is introduced to enable the use of PGF kernels with spherical data.

    Submitted 1 February, 2024; v1 submitted 1 December, 2021; originally announced December 2021.

  13. arXiv:2104.07737

    stat.ML cs.LG math.AT

    A Random Persistence Diagram Generator

    Authors: Theodore Papamarkou, Farzana Nasrin, Austin Lawson, Na Gong, Orlando Rios, Vasileios Maroulas

    Abstract: Topological data analysis (TDA) studies the shape patterns of data. Persistent homology is a widely used method in TDA that summarizes homological features of data at multiple scales and stores them in persistence diagrams (PDs). In this paper, we propose a random persistence diagram generator (RPDG) method that generates a sequence of random PDs from the ones produced by the data. RPDG is underpi…

    Submitted 14 September, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: 17 pages, 6 figures and 3 tables

    MSC Class: 62R40; 55N31; 60G55

  14. arXiv:2103.04046

    cs.LG cs.CG cs.CV math.AT stat.ML

    Simplicial Complex Representation Learning

    Authors: Mustafa Hajij, Ghada Zamzmi, Theodore Papamarkou, Vasileios Maroulas, Xuanting Cai

    Abstract: Simplicial complexes form an important class of topological spaces that are frequently used in many application areas such as computer-aided design, computer graphics, and simulation. Representation learning on graphs, which are just 1-d simplicial complexes, has witnessed a great attention in recent years. However, there has not been enough effort to extend representation learning to higher dimen…

    Submitted 1 February, 2022; v1 submitted 6 March, 2021; originally announced March 2021.

    Comments: MACHINE LEARNING ON GRAPHS, MLoG Workshop at WSDM'22

  15. arXiv:2008.08044

    stat.ML cs.LG stat.CO

    Bayesian neural networks and dimensionality reduction

    Authors: Deborshee Sen, Theodore Papamarkou, David Dunson

    Abstract: In conducting non-linear dimensionality reduction and feature learning, it is common to suppose that the data lie near a lower-dimensional manifold. A class of model-based approaches for such problems includes latent variables in an unknown non-linear regression function; this includes Gaussian process latent variable models and variational auto-encoders (VAEs) as special cases. VAEs are artificia…

    Submitted 19 August, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: 29 pages, 13 figures

  16. arXiv:2006.03151

    stat.ML cs.LG stat.AP stat.ME

    Hidden Markov models as recurrent neural networks: an application to Alzheimer's disease

    Authors: Matt Baucum, Anahita Khojandi, Theodore Papamarkou

    Abstract: Hidden Markov models (HMMs) are commonly used for disease progression modeling when the true patient health state is not fully known. Since HMMs typically have multiple local optima, incorporating additional patient covariates can improve parameter estimation and predictive performance. To allow for this, we develop hidden Markov recurrent neural networks (HMRNNs), a special case of recurrent neur…

    Submitted 1 October, 2021; v1 submitted 4 June, 2020; originally announced June 2020.

  17. arXiv:2005.01699

    cs.LG cs.IT stat.ML

    Depth-2 Neural Networks Under a Data-Poisoning Attack

    Authors: Sayar Karmakar, Anirbit Mukherjee, Theodore Papamarkou

    Abstract: In this work, we study the possibility of defending against data-poisoning attacks while training a shallow neural network in a regression setup. We focus on doing supervised learning for a class of depth-2 finite-width neural networks, which includes single-filter convolutional networks. In this class of networks, we attempt to learn the network weights in the presence of a malicious oracle doing…

    Submitted 29 June, 2022; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: 32 pages, 7 figures

    MSC Class: 90C15 68W40 68T05

  18. arXiv:2003.03241

    cs.CV stat.AP stat.ML

    Automated detection of corrosion in used nuclear fuel dry storage canisters using residual neural networks

    Authors: Theodore Papamarkou, Hayley Guy, Bryce Kroencke, Jordan Miller, Preston Robinette, Daniel Schultz, Jacob Hinkle, Laura Pullum, Catherine Schuman, Jeremy Renshaw, Stylianos Chatzidakis

    Abstract: Nondestructive evaluation methods play an important role in ensuring component integrity and safety in many industries. Operator fatigue can play a critical role in the reliability of such methods. This is important for inspecting high value assets or assets with a high consequence of failure, such as aerospace and nuclear components. Recent advances in convolution neural networks can support and…

    Submitted 13 July, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

  19. arXiv:2001.00921

    stat.ML cs.LG

    Wide Neural Networks with Bottlenecks are Deep Gaussian Processes

    Authors: Devanshu Agrawal, Theodore Papamarkou, Jacob Hinkle

    Abstract: There has recently been much work on the "wide limit" of neural networks, where Bayesian neural networks (BNNs) are shown to converge to a Gaussian process (GP) as all hidden layers are sent to infinite width. However, these results do not apply to architectures that require one or more of the hidden layers to remain narrow. In this paper, we consider the wide limit of BNNs where some hidden layer…

    Submitted 6 July, 2020; v1 submitted 3 January, 2020; originally announced January 2020.

  20. arXiv:1910.06539

    stat.ML cs.LG stat.CO stat.ME

    Challenges in Markov chain Monte Carlo for Bayesian neural networks

    Authors: Theodore Papamarkou, Jacob Hinkle, M. Todd Young, David Womble

    Abstract: Markov chain Monte Carlo (MCMC) methods have not been broadly adopted in Bayesian neural networks (BNNs). This paper initially reviews the main challenges in sampling from the parameter posterior of a neural network via MCMC. Such challenges culminate to lack of convergence to the parameter posterior. Nevertheless, this paper shows that a non-converged Markov chain, generated via MCMC sampling fro…

    Submitted 1 October, 2021; v1 submitted 15 October, 2019; originally announced October 2019.

  21. Distributions.jl: Definition and Modeling of Probability Distributions in the JuliaStats Ecosystem

    Authors: Mathieu Besançon, Theodore Papamarkou, David Anthoff, Alex Arslan, Simon Byrne, Dahua Lin, John Pearson

    Abstract: Random variables and their distributions are a central part in many areas of statistical methods. The Distributions.jl package provides Julia users and developers tools for working with probability distributions, leveraging Julia features for their intuitive and flexible manipulation, while remaining highly efficient through zero-cost abstractions.

    Submitted 12 July, 2021; v1 submitted 19 July, 2019; originally announced July 2019.

  22. arXiv:1607.07892

    cs.MS

    Forward-Mode Automatic Differentiation in Julia

    Authors: Jarrett Revels, Miles Lubin, Theodore Papamarkou

    Abstract: We present ForwardDiff, a Julia package for forward-mode automatic differentiation (AD) featuring performance competitive with low-level languages like C++. Unlike recently developed AD tools in other popular high-level languages such as Python and MATLAB, ForwardDiff takes advantage of just-in-time (JIT) compilation to transparently recompile AD-unaware user code, enabling efficient support for h…

    Submitted 26 July, 2016; originally announced July 2016.

    Comments: 4 pages