
Showing 1–38 of 38 results for author: Filippone, M

Searching in archive cs.
  1. arXiv:2406.01494  [pdf, other]

    cs.CV cs.LG stat.ML

    Robust Classification by Coupling Data Mollification with Label Smoothing

    Authors: Markus Heinonen, Ba-Hien Tran, Michael Kampffmeyer, Maurizio Filippone

    Abstract: Introducing training-time augmentations is a key technique to enhance generalization and prepare deep neural networks against test-time corruptions. Inspired by the success of generative diffusion models, we propose a novel approach coupling data augmentation, in the form of image noising and blurring, with label smoothing to align predicted label confidences with image degradation. The method is…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Under review

  2. arXiv:2402.03146  [pdf, other]

    cs.LG stat.ML

    A Multi-step Loss Function for Robust Learning of the Dynamics in Model-based Reinforcement Learning

    Authors: Abdelhakim Benechehab, Albert Thomas, Giuseppe Paolo, Maurizio Filippone, Balázs Kégl

    Abstract: In model-based reinforcement learning, most algorithms rely on simulating trajectories from one-step models of the dynamics learned on data. A critical challenge of this approach is the compounding of one-step prediction errors as the length of the trajectory grows. In this paper we tackle this issue by using a multi-step objective to train one-step models. Our objective is a weighted sum of the m…

    Submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2402.02644  [pdf, other]

    cs.LG stat.ML

    Variational DAG Estimation via State Augmentation With Stochastic Permutations

    Authors: Edwin V. Bonilla, Pantelis Elinas, He Zhao, Maurizio Filippone, Vassili Kitsios, Terry O'Kane

    Abstract: Estimating the structure of a Bayesian network, in the form of a directed acyclic graph (DAG), from observational data is a statistically and computationally hard problem with essential applications in areas such as causal discovery. Bayesian approaches are a promising direction for solving this task, as they allow for uncertainty quantification and deal with well-known identifiability issues. Fro…

    Submitted 28 May, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  4. arXiv:2402.00809  [pdf, other]

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni…

    Submitted 6 August, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  5. arXiv:2311.09491  [pdf, other]

    stat.ML cs.LG

    Spatial Bayesian Neural Networks

    Authors: Andrew Zammit-Mangion, Michael D. Kaminski, Ba-Hien Tran, Maurizio Filippone, Noel Cressie

    Abstract: … interpretable, and well understood models that are routinely employed even though, as is revealed through prior and posterior predictive checks, these can poorly characterise the spatial heterogeneity in the underlying process of interest. Here, we propose a new, flexible class of spatial-process models, which we refer to as spatial Bayesian neural networks (SBNNs). An SBNN leverages the represent…

    Submitted 4 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 35 pages, 21 figures

  6. arXiv:2310.05672  [pdf, other]

    cs.LG stat.ML

    Multi-timestep models for Model-based Reinforcement Learning

    Authors: Abdelhakim Benechehab, Giuseppe Paolo, Albert Thomas, Maurizio Filippone, Balázs Kégl

    Abstract: In model-based reinforcement learning (MBRL), most algorithms rely on simulating trajectories from one-step dynamics models learned on data. A critical challenge of this approach is the compounding of one-step prediction errors as the length of the trajectory grows. In this paper we tackle this issue by using a multi-timestep objective to train one-step models. Our objective is a weighted sum of a los…

    Submitted 11 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  7. arXiv:2305.18900  [pdf, other]

    cs.LG

    One-Line-of-Code Data Mollification Improves Optimization of Likelihood-based Generative Models

    Authors: Ba-Hien Tran, Giulio Franzese, Pietro Michiardi, Maurizio Filippone

    Abstract: Generative Models (GMs) have attracted considerable attention due to their tremendous success in various domains, such as computer vision, where they are capable of generating impressive, realistic-looking images. Likelihood-based GMs are attractive due to the possibility to generate new data by a single model evaluation. However, they typically achieve lower sample quality compared to state-of-the-ar…

    Submitted 21 December, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  8. arXiv:2303.04020  [pdf, other]

    stat.ML cs.LG

    When is Importance Weighting Correction Needed for Covariate Shift Adaptation?

    Authors: Davit Gogolashvili, Matteo Zecchin, Motonobu Kanagawa, Marios Kountouris, Maurizio Filippone

    Abstract: This paper investigates when the importance weighting (IW) correction is needed to address covariate shift, a common situation in supervised learning where the input distributions of training and test data differ. Classic results show that the IW correction is needed when the model is parametric and misspecified. In contrast, recent results indicate that the IW correction may not be necessary when…

    Submitted 7 March, 2023; originally announced March 2023.

  9. arXiv:2303.00800  [pdf, other]

    cs.LG stat.ML

    Continuous-Time Functional Diffusion Processes

    Authors: Giulio Franzese, Giulio Corallo, Simone Rossi, Markus Heinonen, Maurizio Filippone, Pietro Michiardi

    Abstract: We introduce Functional Diffusion Processes (FDPs), which generalize score-based diffusion models to infinite-dimensional function spaces. FDPs require a new mathematical framework to describe the forward and backward dynamics, and several extensions to derive practical training objectives. These include infinite-dimensional versions of the Girsanov theorem, in order to be able to compute an ELBO, and…

    Submitted 18 December, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Under review

  10. arXiv:2302.04534  [pdf, other]

    cs.LG stat.ML

    Fully Bayesian Autoencoders with Latent Sparse Gaussian Processes

    Authors: Ba-Hien Tran, Babak Shahbaba, Stephan Mandt, Maurizio Filippone

    Abstract: Autoencoders and their variants are among the most widely used models in representation learning and generative modeling. However, autoencoder-based models usually assume that the learned representations are i.i.d. and fail to capture the correlations between the data samples. To address this issue, we propose a novel Sparse Gaussian Process Bayesian Autoencoder (SGPBAE) model in which we impose f…

    Submitted 9 February, 2023; originally announced February 2023.

  11. arXiv:2210.09998  [pdf, other]

    stat.ML cs.LG

    Locally Smoothed Gaussian Process Regression

    Authors: Davit Gogolashvili, Bogdan Kozyrskiy, Maurizio Filippone

    Abstract: We develop a novel framework to accelerate Gaussian process regression (GPR). In particular, we consider localization kernels at each data point to down-weigh the contributions from other data points that are far away, and we derive the GPR model stemming from the application of such localization operation. Through a set of experiments, we demonstrate the competitive performance of the proposed ap…

    Submitted 18 October, 2022; originally announced October 2022.

  12. arXiv:2206.05173  [pdf, other]

    stat.ML cs.LG

    How Much is Enough? A Study on Diffusion Times in Score-based Generative Models

    Authors: Giulio Franzese, Simone Rossi, Lixuan Yang, Alessandro Finamore, Dario Rossi, Maurizio Filippone, Pietro Michiardi

    Abstract: Score-based diffusion models are a class of generative models whose dynamics is described by stochastic differential equations that map noise into data. While recent works have started to lay down a theoretical foundation for these models, an analytical understanding of the role of the diffusion time T is still lacking. Current best practice advocates for a large T to ensure that the forward dynam…

    Submitted 10 June, 2022; originally announced June 2022.

  13. arXiv:2204.05667  [pdf, other]

    stat.ML cs.LG stat.CO

    Local Random Feature Approximations of the Gaussian Kernel

    Authors: Jonas Wacker, Maurizio Filippone

    Abstract: A fundamental drawback of kernel-based statistical models is their limited scalability to large data sets, which requires resorting to approximations. In this work, we focus on the popular Gaussian kernel and on techniques to linearize kernel-based models by means of random feature approximations. In particular, we do so by studying a less explored random feature approximation based on Maclaurin e…

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 11 pages

  14. arXiv:2202.02031  [pdf, other]

    stat.ML cs.LG stat.CO

    Complex-to-Real Sketches for Tensor Products with Applications to the Polynomial Kernel

    Authors: Jonas Wacker, Ruben Ohana, Maurizio Filippone

    Abstract: Randomized sketches of a tensor product of $p$ vectors follow a tradeoff between statistical efficiency and computational acceleration. Commonly used approaches avoid computing the high-dimensional tensor product explicitly, resulting in a suboptimal dependence of $\mathcal{O}(3^p)$ in the embedding dimension. We propose a simple Complex-to-Real (CtR) modification of well-known sketches that repla…

    Submitted 30 April, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: 32 pages

  15. arXiv:2201.08712  [pdf, other]

    stat.ML cs.LG stat.CO

    Improved Random Features for Dot Product Kernels

    Authors: Jonas Wacker, Motonobu Kanagawa, Maurizio Filippone

    Abstract: Dot product kernels, such as polynomial and exponential (softmax) kernels, are among the most widely used kernels in machine learning, as they enable modeling the interactions between input features, which is crucial in applications like computer vision, natural language processing, and recommender systems. We make several novel contributions for improving the efficiency of random feature approxim…

    Submitted 13 August, 2024; v1 submitted 21 January, 2022; originally announced January 2022.

    Comments: To appear in Journal of Machine Learning Research (JMLR)

  16. arXiv:2106.16200  [pdf, other]

    cs.LG stat.CO

    Revisiting the Effects of Stochasticity for Hamiltonian Samplers

    Authors: Giulio Franzese, Dimitrios Milios, Maurizio Filippone, Pietro Michiardi

    Abstract: We revisit the theoretical properties of Hamiltonian stochastic differential equations (SDEs) for Bayesian posterior sampling, and we study the two types of errors that arise from numerical SDE simulation: the discretization error and the error due to noisy gradient estimates in the context of data subsampling. Our main result is a novel analysis for the effect of mini-batches through the lens of…

    Submitted 4 November, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

  17. arXiv:2106.06245  [pdf, other]

    stat.ML cs.LG

    Model Selection for Bayesian Autoencoders

    Authors: Ba-Hien Tran, Simone Rossi, Dimitrios Milios, Pietro Michiardi, Edwin V. Bonilla, Maurizio Filippone

    Abstract: We develop a novel method for carrying out model selection for Bayesian autoencoders (BAEs) by means of prior hyper-parameter optimization. Inspired by the common practice of type-II maximum likelihood optimization and its equivalence to Kullback-Leibler divergence minimization, we propose to optimize the distributional sliced-Wasserstein distance (DSWD) between the output of the autoencoder and t…

    Submitted 11 June, 2021; originally announced June 2021.

  18. arXiv:2011.12829  [pdf, other]

    stat.ML cs.LG

    All You Need is a Good Functional Prior for Bayesian Deep Learning

    Authors: Ba-Hien Tran, Simone Rossi, Dimitrios Milios, Maurizio Filippone

    Abstract: The Bayesian treatment of neural networks dictates that a prior distribution is specified over their weight and bias parameters. This poses a challenge because modern neural networks are characterized by a large number of parameters, and the choice of these priors has an uncontrolled effect on the induced functional prior, which is the distribution of the functions obtained by sampling the paramet…

    Submitted 25 April, 2022; v1 submitted 25 November, 2020; originally announced November 2020.

  19. arXiv:2011.05041  [pdf, other]

    stat.ML cs.LG

    Sparse within Sparse Gaussian Processes using Neighbor Information

    Authors: Gia-Lac Tran, Dimitrios Milios, Pietro Michiardi, Maurizio Filippone

    Abstract: Approximations to Gaussian processes based on inducing variables, combined with variational inference techniques, enable state-of-the-art sparse approaches to infer GPs at scale through mini batch-based learning. In this work, we address one limitation of sparse GPs, which is due to the challenge in dealing with a large number of inducing variables without imposing a special structure on the induc…

    Submitted 20 July, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: 10 pages

  20. arXiv:2010.09360  [pdf, other]

    cs.LG stat.ML

    An Identifiable Double VAE For Disentangled Representations

    Authors: Graziano Mita, Maurizio Filippone, Pietro Michiardi

    Abstract: A large part of the literature on learning disentangled representations focuses on variational autoencoders (VAE). Recent developments demonstrate that disentanglement cannot be obtained in a fully unsupervised setting without inductive biases on models and data. However, Khemakhem et al., AISTATS, 2020 suggest that employing a particular form of factorized prior, conditionally dependent on auxili…

    Submitted 10 February, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

  21. arXiv:2006.05087  [pdf, other]

    cs.LG stat.ML

    Isotropic SGD: a Practical Approach to Bayesian Posterior Sampling

    Authors: Giulio Franzese, Rosa Candela, Dimitrios Milios, Maurizio Filippone, Pietro Michiardi

    Abstract: In this work we define a unified mathematical framework to deepen our understanding of the role of stochastic gradient (SG) noise on the behavior of Markov chain Monte Carlo sampling (SGMCMC) algorithms. Our formulation unlocks the design of a novel, practical approach to posterior sampling, which makes the SG noise isotropic using a fixed learning rate that we determine analytically, and that r…

    Submitted 9 June, 2020; originally announced June 2020.

    MSC Class: 65C05; ACM Class: G.3

  22. arXiv:2006.04548  [pdf, other]

    cs.LG stat.ML

    A Variational View on Bootstrap Ensembles as Bayesian Inference

    Authors: Dimitrios Milios, Pietro Michiardi, Maurizio Filippone

    Abstract: In this paper, we employ variational arguments to establish a connection between ensemble methods for Neural Networks and Bayesian inference. We consider an ensemble-based scheme where each model/particle corresponds to a perturbation of the data by means of parametric bootstrap and a perturbation of the prior. We derive conditions under which any optimization step of the particles makes the asso…

    Submitted 8 June, 2020; originally announced June 2020.

  23. arXiv:2003.03080  [pdf, other]

    stat.ML cs.LG

    Sparse Gaussian Processes Revisited: Bayesian Approaches to Inducing-Variable Approximations

    Authors: Simone Rossi, Markus Heinonen, Edwin V. Bonilla, Zheyang Shen, Maurizio Filippone

    Abstract: Variational inference techniques based on inducing variables provide an elegant framework for scalable posterior estimation in Gaussian process (GP) models. Besides enabling scalability, one of their main advantages over sparse approximations using direct marginal likelihood maximization is that they provide a robust alternative for point estimation of the inducing inputs, i.e. the location of the…

    Submitted 23 February, 2021; v1 submitted 6 March, 2020; originally announced March 2020.

  24. arXiv:1912.00015  [pdf, other]

    stat.ML cs.LG

    Efficient Approximate Inference with Walsh-Hadamard Variational Inference

    Authors: Simone Rossi, Sebastien Marmin, Maurizio Filippone

    Abstract: Variational inference offers scalable and flexible tools to tackle intractable Bayesian inference of modern statistical models like Bayesian neural networks and Gaussian processes. For largely over-parameterized models, however, the over-regularization property of the variational objective makes the application of variational inference challenging. Inspired by the literature on kernel methods, and…

    Submitted 29 November, 2019; originally announced December 2019.

    Comments: Paper accepted at the 4th Workshop on Bayesian Deep Learning (NeurIPS 2019), Vancouver, Canada. arXiv admin note: substantial text overlap with arXiv:1905.11248

  25. arXiv:1911.06537  [pdf, other]

    cs.LG cs.AI stat.ML

    LIBRE: Learning Interpretable Boolean Rule Ensembles

    Authors: Graziano Mita, Paolo Papotti, Maurizio Filippone, Pietro Michiardi

    Abstract: We present a novel method - LIBRE - to learn an interpretable classifier, which materializes as a set of Boolean rules. LIBRE uses an ensemble of bottom-up weak learners operating on a random subset of features, which allows for the learning of rules that generalize well on unseen data even in imbalanced settings. Weak learners are combined with a simple union so that the final ensemble is also in…

    Submitted 15 November, 2019; originally announced November 2019.

  26. Kernel computations from large-scale random features obtained by Optical Processing Units

    Authors: Ruben Ohana, Jonas Wacker, Jonathan Dong, Sébastien Marmin, Florent Krzakala, Maurizio Filippone, Laurent Daudet

    Abstract: Approximating kernel functions with random features (RFs) has been a successful application of random projections for nonparametric estimation. However, performing random projections presents computational challenges for large-scale problems. Recently, a new optical hardware called Optical Processing Unit (OPU) has been developed for fast and energy-efficient computation of large-scale RFs in the a…

    Submitted 2 December, 2019; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: 5 pages, 3 figures, submitted to ICASSP 2020

    Journal ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  27. arXiv:1910.09466  [pdf, ps, other]

    cs.LG stat.ML

    Sparsification as a Remedy for Staleness in Distributed Asynchronous SGD

    Authors: Rosa Candela, Giulio Franzese, Maurizio Filippone, Pietro Michiardi

    Abstract: Large scale machine learning is increasingly relying on distributed optimization, whereby several machines contribute to the training process of a statistical model. In this work we study the performance of asynchronous, distributed settings, when applying sparsification, a technique used to reduce communication overheads. In particular, for the first time in an asynchronous, non-convex setting, w…

    Submitted 18 January, 2021; v1 submitted 21 October, 2019; originally announced October 2019.

  28. arXiv:1905.11248  [pdf, other]

    stat.ML cs.LG

    Walsh-Hadamard Variational Inference for Bayesian Deep Learning

    Authors: Simone Rossi, Sebastien Marmin, Maurizio Filippone

    Abstract: Over-parameterized models, such as DeepNets and ConvNets, form a class of models that are routinely adopted in a wide variety of applications, and for which Bayesian inference is desirable but extremely challenging. Variational inference offers the tools to tackle this challenge in a scalable way and with some degree of flexibility on the approximation, but for over-parameterized models this is ch…

    Submitted 23 November, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

  29. A comparative evaluation of novelty detection algorithms for discrete sequences

    Authors: Rémi Domingues, Pietro Michiardi, Jérémie Barlet, Maurizio Filippone

    Abstract: The identification of anomalies in temporal data is a core component of numerous research areas such as intrusion detection, fault prevention, genomics and fraud detection. This article provides an experimental comparison of the novelty detection problem applied to discrete sequences. The objective of this study is to identify which state-of-the-art methods are efficient and appropriate candidates…

    Submitted 29 November, 2019; v1 submitted 28 February, 2019; originally announced February 2019.

    Comments: Submitted to Artificial Intelligence Review journal; 24 pages, 4 tables, 11 figures

    MSC Class: I.2.6; ACM Class: I.2.6

    Journal ref: Artificial Intelligence Review (2019)

  30. arXiv:1810.12177  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    Variational Calibration of Computer Models

    Authors: Sébastien Marmin, Maurizio Filippone

    Abstract: Bayesian calibration of black-box computer models offers an established framework to obtain a posterior distribution over model parameters. Traditional Bayesian calibration involves the emulation of the computer model and an additive model discrepancy term using Gaussian processes; inference is then carried out using MCMC. These choices pose computational and statistical challenges and limitations…

    Submitted 29 October, 2018; originally announced October 2018.

  31. arXiv:1810.08083  [pdf, other]

    stat.ML cs.LG

    Good Initializations of Variational Bayes for Deep Models

    Authors: Simone Rossi, Pietro Michiardi, Maurizio Filippone

    Abstract: Stochastic variational inference is an established way to carry out approximate Bayesian inference for deep models. While there have been effective proposals for good initializations for loss minimization in deep learning, far less attention has been devoted to the issue of initialization of stochastic variational inference. We address this by proposing a novel layer-wise initialization strategy b…

    Submitted 25 January, 2019; v1 submitted 18 October, 2018; originally announced October 2018.

    Comments: 8 pages of main paper (+3 for references and +6 of supplement material)

  32. arXiv:1805.10915  [pdf, other]

    cs.LG stat.ML

    Dirichlet-based Gaussian Processes for Large-scale Calibrated Classification

    Authors: Dimitrios Milios, Raffaello Camoriano, Pietro Michiardi, Lorenzo Rosasco, Maurizio Filippone

    Abstract: In this paper, we study the problem of deriving fast and accurate classification algorithms with uncertainty quantification. Gaussian process classification provides a principled approach, but the corresponding computational burden is hardly sustainable in large-scale problems and devising efficient alternatives is a challenge. In this work, we investigate if and how Gaussian process regression di…

    Submitted 28 May, 2018; originally announced May 2018.

  33. arXiv:1805.10522  [pdf, other]

    stat.ML cs.LG

    Calibrating Deep Convolutional Gaussian Processes

    Authors: Gia-Lac Tran, Edwin V. Bonilla, John P. Cunningham, Pietro Michiardi, Maurizio Filippone

    Abstract: The wide adoption of Convolutional Neural Networks (CNNs) in applications where decision-making under uncertainty is fundamental has brought a great deal of attention to the ability of these models to accurately quantify the uncertainty in their predictions. Previous work on combining CNNs with Gaussian processes (GPs) has been developed under the assumption that the predictive probabilities of t…

    Submitted 26 May, 2018; originally announced May 2018.

    Comments: 12 pages

  34. arXiv:1711.00625  [pdf, other]

    cs.IT

    Decentralized Deep Scheduling for Interference Channels

    Authors: Paul de Kerret, David Gesbert, Maurizio Filippone

    Abstract: In this paper, we study the problem of decentralized scheduling in Interference Channels (IC). In this setting, each Transmitter (TX) receives an arbitrary amount of feedback regarding the global multi-user channel state based on which it decides whether to transmit or to stay silent without any form of communication with the other TXs. While many methods have been proposed to tackle the problem o…

    Submitted 2 November, 2017; originally announced November 2017.

    Comments: Submitted to the 2018 IEEE International Conference on Communications (ICC)

  35. arXiv:1704.07223  [pdf, other]

    math.NA cs.IT stat.CO stat.ML

    Entropic Trace Estimates for Log Determinants

    Authors: Jack Fitzsimons, Diego Granziol, Kurt Cutajar, Michael Osborne, Maurizio Filippone, Stephen Roberts

    Abstract: The scalable calculation of matrix determinants has been a bottleneck to the widespread application of many machine learning methods such as determinantal point processes, Gaussian processes, generalised Markov random fields, graph models and many others. In this work, we estimate log determinants under the framework of maximum entropy, given information in the form of moment constraints from stoc…

    Submitted 24 April, 2017; originally announced April 2017.

    Comments: 16 pages, 4 figures, 2 tables, 2 algorithms

  36. arXiv:1607.02024  [pdf, other]

    stat.ML cs.LG

    Mini-Batch Spectral Clustering

    Authors: Yufei Han, Maurizio Filippone

    Abstract: The cost of computing the spectrum of Laplacian matrices hinders the application of spectral clustering to large data sets. While approximations recover computational tractability, they can potentially affect clustering performance. This paper proposes a practical approach to learn spectral clustering based on adaptive stochastic gradient optimization. Crucially, the proposed approach recovers the…

    Submitted 12 August, 2016; v1 submitted 7 July, 2016; originally announced July 2016.

  37. On User Availability Prediction and Network Applications

    Authors: Matteo Dell'Amico, Maurizio Filippone, Pietro Michiardi, Yves Roudier

    Abstract: User connectivity patterns in network applications are known to be heterogeneous, and to follow periodic (daily and weekly) patterns. In many cases, the regularity and the correlation of those patterns is problematic: for network applications, many connected users create peaks of demand; in contrast, in peer-to-peer scenarios, having few users online results in a scarcity of available resources. O…

    Submitted 30 April, 2014; originally announced April 2014.

    Comments: Accepted for publication in IEEE/ACM Transactions on Networking

  38. arXiv:1310.0740  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    Pseudo-Marginal Bayesian Inference for Gaussian Processes

    Authors: Maurizio Filippone, Mark Girolami

    Abstract: The main challenges that arise when adopting Gaussian Process priors in probabilistic modeling are how to carry out exact Bayesian inference and how to account for uncertainty on model parameters when making model-based predictions on out-of-sample data. Using probit regression as an illustrative working example, this paper presents a general and effective methodology based on the pseudo-marginal…

    Submitted 7 April, 2014; v1 submitted 2 October, 2013; originally announced October 2013.

    Comments: 14 pages double column