Showing 1–37 of 37 results for author: Vergari, A

  1. arXiv:2408.11778  [pdf, other]

    cs.LG cs.AI cs.CC math.AG

    Sum of Squares Circuits

    Authors: Lorenzo Loconte, Stefan Mengel, Antonio Vergari

    Abstract: Designing expressive generative models that support exact and efficient inference is a core question in probabilistic ML. Probabilistic circuits (PCs) offer a framework where this tractability-vs-expressiveness trade-off can be analyzed theoretically. Recently, squared PCs encoding subtractive mixtures via negative parameters have emerged as tractable models that can be exponentially more expressi…

    Submitted 21 August, 2024; originally announced August 2024.

  2. arXiv:2408.11081  [pdf, other]

    cs.SE cs.AI cs.CL cs.LG

    What can Large Language Models Capture about Code Functional Equivalence?

    Authors: Nickil Maveli, Antonio Vergari, Shay B. Cohen

    Abstract: Code-LLMs, LLMs pre-trained on large code corpora, have shown great progress in learning rich representations of the structure and syntax of code, successfully using it to generate or classify code fragments. At the same time, understanding if they are able to do so because they capture code semantics, and how well, is still an open question. In this paper, we tackle this problem by introducing Se…

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 37 pages

  3. arXiv:2406.10368  [pdf, other]

    cs.LG cs.AI

    A Benchmark Suite for Systematically Evaluating Reasoning Shortcuts

    Authors: Samuele Bortolotti, Emanuele Marconato, Tommaso Carraro, Paolo Morettin, Emile van Krieken, Antonio Vergari, Stefano Teso, Andrea Passerini

    Abstract: The advent of powerful neural classifiers has increased interest in problems that require both learning and reasoning. These problems are critical for understanding important properties of models, such as trustworthiness, generalization, interpretability, and compliance to safety and structural constraints. However, recent research observed that tasks requiring both learning and reasoning on backg…

    Submitted 14 June, 2024; originally announced June 2024.

  4. arXiv:2406.06494  [pdf, other]

    cs.LG cs.AI

    Scaling Continuous Latent Variable Models as Probabilistic Integral Circuits

    Authors: Gennaro Gala, Cassio de Campos, Antonio Vergari, Erik Quaeghebeur

    Abstract: Probabilistic integral circuits (PICs) have been recently introduced as probabilistic models enjoying the key ingredient behind expressive generative models: continuous latent variables (LVs). PICs are symbolic computational graphs defining continuous LV models as hierarchies of functions that are summed and multiplied together, or integrated over some LVs. They are tractable if LVs can be analyti…

    Submitted 10 June, 2024; originally announced June 2024.

  5. arXiv:2404.12843  [pdf, other]

    cs.LG cs.CL

    Towards Logically Consistent Language Models via Probabilistic Reasoning

    Authors: Diego Calanzone, Stefano Teso, Antonio Vergari

    Abstract: Large language models (LLMs) are a promising venue for natural language understanding and generation tasks. However, current LLMs are far from reliable: they are prone to generate non-factual information and, more crucially, to contradict themselves when prompted to reason about beliefs of the world. These problems are currently addressed with large scale fine-tuning or by delegating consistent re…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Accepted at ICLR 2024 Workshop on Reliable and Responsible Foundation Models

  6. arXiv:2404.08458  [pdf, other]

    stat.ML cs.AI cs.LG

    On the Independence Assumption in Neurosymbolic Learning

    Authors: Emile van Krieken, Pasquale Minervini, Edoardo M. Ponti, Antonio Vergari

    Abstract: State-of-the-art neurosymbolic learning systems use probabilistic reasoning to guide neural networks towards predictions that conform to logical constraints over symbols. Many such systems assume that the probabilities of the considered symbols are conditionally independent given the input to simplify learning and reasoning. We study and criticise this assumption, highlighting how it can hinder op…

    Submitted 7 June, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted at ICML 2024

  7. arXiv:2402.12240  [pdf, other]

    cs.LG cs.AI

    BEARS Make Neuro-Symbolic Models Aware of their Reasoning Shortcuts

    Authors: Emanuele Marconato, Samuele Bortolotti, Emile van Krieken, Antonio Vergari, Andrea Passerini, Stefano Teso

    Abstract: Neuro-Symbolic (NeSy) predictors that conform to symbolic knowledge - encoding, e.g., safety constraints - can be affected by Reasoning Shortcuts (RSs): They learn concepts consistent with the symbolic knowledge by exploiting unintended semantics. RSs compromise reliability and generalization and, as we show in this paper, they are linked to NeSy models being overconfident about the predicted conc…

    Submitted 19 February, 2024; originally announced February 2024.

  8. arXiv:2401.03321  [pdf, other]

    cs.CL

    PIXAR: Auto-Regressive Language Modeling in Pixel Space

    Authors: Yintao Tai, Xiyang Liao, Alessandro Suglia, Antonio Vergari

    Abstract: Recent work showed the possibility of building open-vocabulary large language models (LLMs) that directly operate on pixel representations. These models are implemented as autoencoders that reconstruct masked patches of rendered text. However, these pixel-based LLMs are limited to discriminative tasks (e.g., classification) and, similar to BERT, cannot be used to generate text. Therefore, they can…

    Submitted 23 February, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

  9. arXiv:2311.04215  [pdf, other]

    cs.LG cs.AI eess.SP

    Wearable data from subjects playing Super Mario, sitting university exams, or performing physical exercise help detect acute mood episodes via self-supervised learning

    Authors: Filippo Corponi, Bryan M. Li, Gerard Anmella, Clàudia Valenzuela-Pascual, Ariadna Mas, Isabella Pacchiarotti, Marc Valentí, Iria Grande, Antonio Benabarre, Marina Garriga, Eduard Vieta, Allan H Young, Stephen M. Lawrie, Heather C. Whalley, Diego Hidalgo-Mazzei, Antonio Vergari

    Abstract: Personal sensing, leveraging data passively and near-continuously collected with wearables from patients in their ecological environment, is a promising paradigm to monitor mood disorders (MDs), a major determinant of worldwide disease burden. However, collecting and annotating wearable data is very resource-intensive. Studies of this kind can thus typically afford to recruit only a couple dozens…

    Submitted 7 November, 2023; originally announced November 2023.

  10. arXiv:2310.16986  [pdf, other]

    cs.LG

    Probabilistic Integral Circuits

    Authors: Gennaro Gala, Cassio de Campos, Robert Peharz, Antonio Vergari, Erik Quaeghebeur

    Abstract: Continuous latent variables (LVs) are a key ingredient of many generative models, as they allow modelling expressive mixtures with an uncountable number of components. In contrast, probabilistic circuits (PCs) are hierarchical discrete mixtures represented as computational graphs composed of input, sum and product units. Unlike continuous LV models, PCs provide tractable inference but are limited…

    Submitted 25 October, 2023; originally announced October 2023.

  11. arXiv:2310.10443  [pdf, other]

    cs.LG

    Taming the Sigmoid Bottleneck: Provably Argmaxable Sparse Multi-Label Classification

    Authors: Andreas Grivas, Antonio Vergari, Adam Lopez

    Abstract: Sigmoid output layers are widely used in multi-label classification (MLC) tasks, in which multiple labels can be assigned to any input. In many practical MLC tasks, the number of possible labels is in the thousands, often exceeding the number of input features and resulting in a low-rank output layer. In multi-class classification, it is known that such a low-rank output layer is a bottleneck that…

    Submitted 29 January, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: Published at AAAI24

  12. arXiv:2310.00724  [pdf, other]

    cs.LG cs.AI

    Subtractive Mixture Models via Squaring: Representation and Learning

    Authors: Lorenzo Loconte, Aleksanteri M. Sladek, Stefan Mengel, Martin Trapp, Arno Solin, Nicolas Gillis, Antonio Vergari

    Abstract: Mixture models are traditionally represented and learned by adding several distributions as components. Allowing mixtures to subtract probability mass or density can drastically reduce the number of components needed to model complex distributions. However, learning such subtractive mixtures while ensuring they still encode a non-negative function is challenging. We investigate how to learn and pe…

    Submitted 26 April, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

  13. arXiv:2305.19979  [pdf, other]

    cs.LG cs.AI

    Knowledge Graph Embeddings in the Biomedical Domain: Are They Useful? A Look at Link Prediction, Rule Learning, and Downstream Polypharmacy Tasks

    Authors: Aryo Pradipta Gema, Dominik Grabarczyk, Wolf De Wulf, Piyush Borole, Javier Antonio Alfaro, Pasquale Minervini, Antonio Vergari, Ajitha Rajan

    Abstract: Knowledge graphs are powerful tools for representing and organising complex biomedical data. Several knowledge graph embedding algorithms have been proposed to learn from and complete knowledge graphs. However, a recent study demonstrates the limited efficacy of these embedding algorithms when applied to biomedical knowledge graphs, raising the question of whether knowledge graph embeddings have l…

    Submitted 31 August, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

  14. arXiv:2305.19951  [pdf, other]

    cs.LG stat.ML

    Not All Neuro-Symbolic Concepts Are Created Equal: Analysis and Mitigation of Reasoning Shortcuts

    Authors: Emanuele Marconato, Stefano Teso, Antonio Vergari, Andrea Passerini

    Abstract: Neuro-Symbolic (NeSy) predictive models hold the promise of improved compliance with given constraints, systematic generalization, and interpretability, as they allow to infer labels that are consistent with some prior knowledge by reasoning over high-level concepts extracted from sub-symbolic inputs. It was recently shown that NeSy predictors are affected by reasoning shortcuts: they can attain h…

    Submitted 18 December, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  15. arXiv:2305.15944  [pdf, other]

    cs.LG cs.AI

    How to Turn Your Knowledge Graph Embeddings into Generative Models

    Authors: Lorenzo Loconte, Nicola Di Mauro, Robert Peharz, Antonio Vergari

    Abstract: Some of the most successful knowledge graph embedding (KGE) models for link prediction -- CP, RESCAL, TuckER, ComplEx -- can be interpreted as energy-based models. Under this perspective they are not amenable for exact maximum-likelihood estimation (MLE), sampling and struggle to integrate logical constraints. This work re-interprets the score functions of these KGEs as circuits -- constrained com…

    Submitted 16 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  16. arXiv:2303.11076  [pdf, other]

    cs.LG cs.AI

    From MNIST to ImageNet and Back: Benchmarking Continual Curriculum Learning

    Authors: Kamil Faber, Dominik Zurek, Marcin Pietron, Nathalie Japkowicz, Antonio Vergari, Roberto Corizzo

    Abstract: Continual learning (CL) is one of the most promising trends in recent machine learning research. Its goal is to go beyond classical assumptions in machine learning and develop models and learning strategies that present high robustness in dynamic environments. The landscape of CL research is fragmented into several learning evaluation protocols, comprising different learning tasks, datasets, and e…

    Submitted 16 March, 2023; originally announced March 2023.

  17. arXiv:2210.02095  [pdf, other]

    cs.LG physics.chem-ph q-bio.QM

    ChemAlgebra: Algebraic Reasoning on Chemical Reactions

    Authors: Andrea Valenti, Davide Bacciu, Antonio Vergari

    Abstract: While showing impressive performance on various kinds of learning tasks, it is yet unclear whether deep learning models have the ability to robustly tackle reasoning tasks. than by learning the underlying reasoning process that is actually required to solve the tasks. Measuring the robustness of reasoning in machine learning models is challenging as one needs to provide a task that cannot be easil…

    Submitted 5 October, 2022; originally announced October 2022.

  18. arXiv:2206.00426  [pdf, other]

    cs.LG cs.AI

    Semantic Probabilistic Layers for Neuro-Symbolic Learning

    Authors: Kareem Ahmed, Stefano Teso, Kai-Wei Chang, Guy Van den Broeck, Antonio Vergari

    Abstract: We design a predictive layer for structured-output prediction (SOP) that can be plugged into any neural network guaranteeing its predictions are consistent with a set of predefined symbolic constraints. Our Semantic Probabilistic Layer (SPL) can model intricate correlations, and hard constraints, over a structured output space all while being amenable to end-to-end learning via maximum likelihood.…

    Submitted 1 June, 2022; originally announced June 2022.

  19. arXiv:2202.08566  [pdf, ps, other]

    cs.LG stat.ML

    Efficient and Reliable Probabilistic Interactive Learning with Structured Outputs

    Authors: Stefano Teso, Antonio Vergari

    Abstract: In this position paper, we study interactive learning for structured output spaces, with a focus on active learning, in which labels are unknown and must be acquired, and on skeptical learning, in which the labels are noisy and may need relabeling. These scenarios require expressive models that guarantee reliable and efficient computation of probabilistic quantities to measure uncertainty. We iden…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: Accepted at the AAAI-22 Workshop on Interactive Machine Learning

  20. arXiv:2102.10562  [pdf, other]

    cs.LG cs.AI

    Tractable Computation of Expected Kernels

    Authors: Wenzhe Li, Zhe Zeng, Antonio Vergari, Guy Van den Broeck

    Abstract: Computing the expectation of kernel functions is a ubiquitous task in machine learning, with applications from classical support vector machines to exploiting kernel embeddings of distributions in probabilistic modeling, statistical inference, causal discovery, and deep learning. In all these scenarios, we tend to resort to Monte Carlo estimates as expectations of kernels are intractable in genera…

    Submitted 22 July, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

  21. arXiv:2102.06137  [pdf, other]

    stat.ML cs.AI cs.DS cs.LG

    A Compositional Atlas of Tractable Circuit Operations: From Simple Transformations to Complex Information-Theoretic Queries

    Authors: Antonio Vergari, YooJung Choi, Anji Liu, Stefano Teso, Guy Van den Broeck

    Abstract: Circuit representations are becoming the lingua franca to express and reason about tractable generative and discriminative models. In this paper, we show how complex inference scenarios for these models that commonly arise in machine learning -- from computing the expectations of decision tree ensembles to information-theoretic divergences of deep mixture models -- can be represented in terms of t…

    Submitted 11 February, 2021; originally announced February 2021.

    ACM Class: G.3; I.2.4; I.2.6

  22. arXiv:2102.00424  [pdf, other]

    cs.CL cs.CV cs.LG

    An Empirical Study on the Generalization Power of Neural Representations Learned via Visual Guessing Games

    Authors: Alessandro Suglia, Yonatan Bisk, Ioannis Konstas, Antonio Vergari, Emanuele Bastianelli, Andrea Vanzo, Oliver Lemon

    Abstract: Guessing games are a prototypical instance of the "learning by interacting" paradigm. This work investigates how well an artificial agent can benefit from playing guessing games when later asked to perform on novel NLP downstream tasks such as Visual Question Answering (VQA). We propose two ways to exploit playing guessing games: 1) a supervised learning scenario in which the agent learns to mimic…

    Submitted 31 January, 2021; originally announced February 2021.

    Comments: Accepted paper for the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021)

  23. arXiv:2011.02917  [pdf, other]

    cs.CL cs.CV cs.LG

    Imagining Grounded Conceptual Representations from Perceptual Information in Situated Guessing Games

    Authors: Alessandro Suglia, Antonio Vergari, Ioannis Konstas, Yonatan Bisk, Emanuele Bastianelli, Andrea Vanzo, Oliver Lemon

    Abstract: In visual guessing games, a Guesser has to identify a target object in a scene by asking questions to an Oracle. An effective strategy for the players is to learn conceptual representations of objects that are both discriminative and expressive enough to ask questions and guess correctly. However, as shown by Suglia et al. (2020), existing models fail to learn truly multi-modal representations, re…

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: Accepted to the International Conference on Computational Linguistics (COLING) 2020

  24. arXiv:2007.09331  [pdf, other]

    cs.LG cs.AI

    Strudel: Learning Structured-Decomposable Probabilistic Circuits

    Authors: Meihua Dang, Antonio Vergari, Guy Van den Broeck

    Abstract: Probabilistic circuits (PCs) represent a probability distribution as a computational graph. Enforcing structural properties on these graphs guarantees that several inference scenarios become tractable. Among these properties, structured decomposability is a particularly appealing one: it enables the efficient and exact computations of the probability of complex logical formulas, and can be used to…

    Submitted 2 September, 2020; v1 submitted 18 July, 2020; originally announced July 2020.

    Comments: 12 pages, 3 figures, to be published on PGM2020 (The 10th International Conference on Probabilistic Graphical Models)

    ACM Class: I.2.6

  25. arXiv:2006.16341  [pdf, other]

    cs.LG cs.AI stat.ML

    Handling Missing Data in Decision Trees: A Probabilistic Approach

    Authors: Pasha Khosravi, Antonio Vergari, YooJung Choi, Yitao Liang, Guy Van den Broeck

    Abstract: Decision trees are a popular family of models due to their attractive properties such as interpretability and ability to handle heterogeneous data. Concurrently, missing data is a prevalent occurrence that hinders performance of machine learning models. As such, handling missing data in decision trees is a well studied problem. In this paper, we tackle this problem by taking a probabilistic approa…

    Submitted 29 June, 2020; originally announced June 2020.

  26. arXiv:2004.06231  [pdf, other]

    cs.LG stat.ML

    Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits

    Authors: Robert Peharz, Steven Lang, Antonio Vergari, Karl Stelzner, Alejandro Molina, Martin Trapp, Guy Van den Broeck, Kristian Kersting, Zoubin Ghahramani

    Abstract: Probabilistic circuits (PCs) are a promising avenue for probabilistic modeling, as they permit a wide range of exact and efficient inference routines. Recent "deep-learning-style" implementations of PCs strive for a better scalability, but are still difficult to train on real-world data, due to their sparsely connected computational graphs. In this paper, we propose Einsum Networks (EiNets), a n…

    Submitted 13 April, 2020; originally announced April 2020.

  27. arXiv:2003.00126  [pdf, other]

    cs.AI

    Scaling up Hybrid Probabilistic Inference with Logical and Arithmetic Constraints via Message Passing

    Authors: Zhe Zeng, Paolo Morettin, Fanqi Yan, Antonio Vergari, Guy Van den Broeck

    Abstract: Weighted model integration (WMI) is a very appealing framework for probabilistic inference: it allows to express the complex dependencies of real-world problems where variables are both continuous and discrete, via the language of Satisfiability Modulo Theories (SMT), as well as to compute probabilistic queries with complex logical and arithmetic constraints. Yet, existing WMI solvers are not read…

    Submitted 19 August, 2020; v1 submitted 28 February, 2020; originally announced March 2020.

  28. arXiv:1910.02182  [pdf, other]

    cs.LG cs.AI stat.ML

    On Tractable Computation of Expected Predictions

    Authors: Pasha Khosravi, YooJung Choi, Yitao Liang, Antonio Vergari, Guy Van den Broeck

    Abstract: Computing expected predictions of discriminative models is a fundamental task in machine learning that appears in many interesting applications such as fairness, handling missing values, and data analysis. Unfortunately, computing expectations of a discriminative model with respect to a probability distribution defined by an arbitrary generative model has been proven to be hard in general. In fact…

    Submitted 31 October, 2019; v1 submitted 4 October, 2019; originally announced October 2019.

  29. arXiv:1909.09362  [pdf, other]

    cs.AI

    Hybrid Probabilistic Inference with Logical Constraints: Tractability and Message Passing

    Authors: Zhe Zeng, Fanqi Yan, Paolo Morettin, Antonio Vergari, Guy Van den Broeck

    Abstract: Weighted model integration (WMI) is a very appealing framework for probabilistic inference: it allows to express the complex dependencies of real-world hybrid scenarios where variables are heterogeneous in nature (both continuous and discrete) via the language of Satisfiability Modulo Theories (SMT); as well as computing probabilistic queries with arbitrarily complex logical constraints. Recent wo…

    Submitted 30 September, 2019; v1 submitted 20 September, 2019; originally announced September 2019.

  30. arXiv:1905.08550  [pdf, other]

    cs.LG stat.ML

    Conditional Sum-Product Networks: Imposing Structure on Deep Probabilistic Architectures

    Authors: Xiaoting Shao, Alejandro Molina, Antonio Vergari, Karl Stelzner, Robert Peharz, Thomas Liebig, Kristian Kersting

    Abstract: Probabilistic graphical models are a central tool in AI; however, they are generally not as expressive as deep neural models, and inference is notoriously hard and slow. In contrast, deep probabilistic models such as sum-product networks (SPNs) capture joint distributions in a tractable fashion, but still lack the expressive power of intractable models based on deep neural networks. Therefore, we…

    Submitted 29 September, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: 13 pages, 6 figures

  31. arXiv:1903.12436  [pdf, other]

    cs.LG stat.ML

    From Variational to Deterministic Autoencoders

    Authors: Partha Ghosh, Mehdi S. M. Sajjadi, Antonio Vergari, Michael Black, Bernhard Schölkopf

    Abstract: Variational Autoencoders (VAEs) provide a theoretically-backed and popular framework for deep generative models. However, learning a VAE from data poses still unanswered theoretical questions and considerable practical challenges. In this work, we propose an alternative framework for generative modeling that is simpler, easier to train, and deterministic, yet has many of the advantages of VAEs. We…

    Submitted 29 May, 2020; v1 submitted 29 March, 2019; originally announced March 2019.

    Comments: Partha Ghosh and Mehdi S. M. Sajjadi contributed equally to this work

  32. arXiv:1901.03704  [pdf, other]

    cs.LG stat.ML

    SPFlow: An Easy and Extensible Library for Deep Probabilistic Learning using Sum-Product Networks

    Authors: Alejandro Molina, Antonio Vergari, Karl Stelzner, Robert Peharz, Pranav Subramani, Nicola Di Mauro, Pascal Poupart, Kristian Kersting

    Abstract: We introduce SPFlow, an open-source Python library providing a simple interface to inference, learning and manipulation routines for deep and tractable probabilistic models called Sum-Product Networks (SPNs). The library allows one to quickly create SPNs both from data and through a domain specific language (DSL). It efficiently implements several probabilistic inference routines like computing ma…

    Submitted 11 January, 2019; originally announced January 2019.

    Comments: 4 pages, 1 figure, code

  33. arXiv:1807.09306  [pdf, other]

    stat.ML cs.LG

    Automatic Bayesian Density Analysis

    Authors: Antonio Vergari, Alejandro Molina, Robert Peharz, Zoubin Ghahramani, Kristian Kersting, Isabel Valera

    Abstract: Making sense of a dataset in an automatic and unsupervised fashion is a challenging problem in statistics and AI. Classical approaches for exploratory data analysis are usually not flexible enough to deal with the uncertainty inherent to real-world data: they are often restricted to fixed latent interaction models and homogeneous likelihoods; they are sensitive to missing, corrupt and anomalous…

    Submitted 10 February, 2019; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: In proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19)

  34. arXiv:1806.01910  [pdf, other]

    cs.LG cs.AI stat.ML

    Probabilistic Deep Learning using Random Sum-Product Networks

    Authors: Robert Peharz, Antonio Vergari, Karl Stelzner, Alejandro Molina, Martin Trapp, Kristian Kersting, Zoubin Ghahramani

    Abstract: The need for consistent treatment of uncertainty has recently triggered increased interest in probabilistic deep learning methods. However, most current approaches have severe limitations when it comes to inference, since many of these models do not even permit to evaluate exact data likelihoods. Sum-product networks (SPNs), on the other hand, are an excellent architecture in that regard, as they…

    Submitted 22 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  35. arXiv:1710.03297  [pdf, other]

    cs.LG stat.ML

    Sum-Product Networks for Hybrid Domains

    Authors: Alejandro Molina, Antonio Vergari, Nicola Di Mauro, Sriraam Natarajan, Floriana Esposito, Kristian Kersting

    Abstract: While all kinds of mixed data -from personal data, over panel and scientific data, to public and commercial data- are collected and stored, building probabilistic graphical models for these hybrid domains becomes more difficult. Users spend significant amounts of time in identifying the parametric form of the random variables (Gaussian, Poisson, Logit, etc.) involved and learning the mixed models.…

    Submitted 6 November, 2017; v1 submitted 9 October, 2017; originally announced October 2017.

    Comments: 16 Pages, 5 Figures

  36. Visualizing and Understanding Sum-Product Networks

    Authors: Antonio Vergari, Nicola Di Mauro, Floriana Esposito

    Abstract: Sum-Product Networks (SPNs) are recently introduced deep tractable probabilistic models by which several kinds of inference queries can be answered exactly and in a tractable time. Up to now, they have been largely used as black box density estimators, assessed only by comparing their likelihood scores. In this paper we explore and exploit the inner representations learned by SPNs. We do this…

    Submitted 24 August, 2018; v1 submitted 29 August, 2016; originally announced August 2016.

    Comments: Machine Learning Journal paper (First Online), 24 pages

  37. arXiv:1608.02341  [pdf, other]

    cs.LG cs.AI stat.ML

    Towards Representation Learning with Tractable Probabilistic Models

    Authors: Antonio Vergari, Nicola Di Mauro, Floriana Esposito

    Abstract: Probabilistic models learned as density estimators can be exploited in representation learning beside being toolboxes used to answer inference queries only. However, how to extract useful representations highly depends on the particular model involved. We argue that tractable inference, i.e. inference that can be computed in polynomial time, can enable general schemes to extract features from blac…

    Submitted 8 August, 2016; originally announced August 2016.

    Comments: 10 pages, submitted to ECML-PKDD 2016 Doctoral Consortium