Zum Hauptinhalt springen

Showing 1–14 of 14 results for author: Colombo, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.17377  [pdf, other

    cs.LG

    Entropy Reweighted Conformal Classification

    Authors: Rui Luo, Nicolo Colombo

    Abstract: Conformal Prediction (CP) is a powerful framework for constructing prediction sets with guaranteed coverage. However, recent studies have shown that integrating confidence calibration with CP can lead to a degradation in efficiency. In this paper, We propose an adaptive approach that considers the classifier's uncertainty and employs entropy-based reweighting to enhance the efficiency of predictio… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  2. arXiv:2406.08281  [pdf, other

    cs.LG stat.ML

    Conformal Load Prediction with Transductive Graph Autoencoders

    Authors: Rui Luo, Nicolo Colombo

    Abstract: Predicting edge weights on graphs has various applications, from transportation systems to social networks. This paper describes a Graph Neural Network (GNN) approach for edge weight prediction with guaranteed coverage. We leverage conformal prediction to calibrate the GNN outputs and produce valid prediction intervals. We handle data heteroscedasticity through error reweighting and Conformalized… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2406.05745  [pdf, other

    stat.ML cs.AI cs.LG

    Structured Learning of Compositional Sequential Interventions

    Authors: Jialin Yu, Andreas Koukorinis, Nicolò Colombo, Yuchen Zhu, Ricardo Silva

    Abstract: We consider sequential treatment regimes where each unit is exposed to combinations of interventions over time. When interventions are described by qualitative labels, such as ``close schools for a month due to a pandemic'' or ``promote this podcast to this user during this week'', it is unclear which appropriate structural assumptions allow us to generalize behavioral predictions to previously un… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  4. arXiv:2406.03346  [pdf, other

    cs.LG math.PR stat.ML

    Normalizing Flows for Conformal Regression

    Authors: Nicolo Colombo

    Abstract: Conformal Prediction (CP) algorithms estimate the uncertainty of a prediction model by calibrating its outputs on labeled data. The same calibration scheme usually applies to any model and data without modifications. The obtained prediction intervals are valid by construction but could be inefficient, i.e. unnecessarily big, if the prediction errors are not uniformly distributed over the input spa… ▽ More

    Submitted 26 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: To be presented at the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024). Changes from v1: improved Section 1.2, two figures replaced, minor typos fixed

  5. arXiv:2306.04648  [pdf, other

    cs.LG cs.AI stat.ML

    On training locally adaptive CP

    Authors: Nicolo Colombo

    Abstract: We address the problem of making Conformal Prediction (CP) intervals locally adaptive. Most existing methods focus on approximating the object-conditional validity of the intervals by partitioning or re-weighting the calibration set. Our strategy is new and conceptually different. Instead of re-weighting the calibration data, we redefine the conformity measure through a trainable change of variabl… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 15 pages, 1 table, 1 figure

  6. arXiv:2107.03375  [pdf, other

    cs.LG cs.CV stat.ML

    Differentiable Architecture Pruning for Transfer Learning

    Authors: Nicolo Colombo, Yang Gao

    Abstract: We propose a new gradient-based approach for extracting sub-architectures from a given large model. Contrarily to existing pruning methods, which are unable to disentangle the network architecture and the corresponding weights, our architecture-pruning scheme produces transferable new structures that can be successfully retrained to solve different tasks. We focus on a transfer-learning setup wher… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: 19 pages (main + appendix), 7 figures and 1 table, Workshop @ ICML 2021, 24th July 2021

  7. arXiv:2105.03343  [pdf, other

    cs.LG cs.CL

    Adapting by Pruning: A Case Study on BERT

    Authors: Yang Gao, Nicolo Colombo, Wei Wang

    Abstract: Adapting pre-trained neural models to downstream tasks has become the standard practice for obtaining high-quality models. In this work, we propose a novel model adaptation paradigm, adapting by pruning, which prunes neural connections in the pre-trained model to optimise the performance on the target task; all remaining connections have their weights intact. We formulate adapting-by-pruning as an… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

  8. arXiv:2009.05346  [pdf, other

    cs.LG cs.CV stat.ML

    Disentangling Neural Architectures and Weights: A Case Study in Supervised Classification

    Authors: Nicolo Colombo, Yang Gao

    Abstract: The history of deep learning has shown that human-designed problem-specific networks can greatly improve the classification performance of general neural models. In most practical cases, however, choosing the optimal architecture for a given task remains a challenging problem. Recent architecture-search methods are able to automatically build neural models with strong performance but fail to fully… ▽ More

    Submitted 11 September, 2020; originally announced September 2020.

    Comments: 22 pages and 10 figures

  9. arXiv:2005.07037  [pdf, other

    cs.LG stat.ML

    Training conformal predictors

    Authors: Nicolo Colombo, Vladimir Vovk

    Abstract: Efficiency criteria for conformal prediction, such as \emph{observed fuzziness} (i.e., the sum of p-values associated with false labels), are commonly used to \emph{evaluate} the performance of given conformal predictors. Here, we investigate whether it is possible to exploit efficiency criteria to \emph{learn} classifiers, both conformal predictors and point classifiers, by using such criteria as… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: 8 pages, 2 figures, 4 tables

  10. arXiv:2002.05747  [pdf, other

    cs.LG math.MG stat.AP stat.ML

    Multiple Metric Learning for Structured Data

    Authors: Nicolo Colombo

    Abstract: We address the problem of merging graph and feature-space information while learning a metric from structured data. Existing algorithms tackle the problem in an asymmetric way, by either extracting vectorized summaries of the graph structure or adding hard constraints to feature-space algorithms. Following a different path, we define a metric regression scheme where we train metric-constrained lin… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

    Comments: 24 pages, 6 figures

  11. arXiv:1908.07193  [pdf, other

    cs.LG stat.AP stat.ME stat.ML

    Counterfactual Distribution Regression for Structured Inference

    Authors: Nicolo Colombo, Ricardo Silva, Soong M Kang, Arthur Gretton

    Abstract: We consider problems in which a system receives external \emph{perturbations} from time to time. For instance, the system can be a train network in which particular lines are repeatedly disrupted without warning, having an effect on passenger behavior. The goal is to predict changes in the behavior of the system at particular points of interest, such as passenger traffic around stations at the aff… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: 24 pages, 5 figures

  12. arXiv:1809.04379  [pdf, other

    cs.LG cs.SI stat.ML

    Bayesian Semi-supervised Learning with Graph Gaussian Processes

    Authors: Yin Cheng Ng, Nicolo Colombo, Ricardo Silva

    Abstract: We propose a data-efficient Gaussian process-based Bayesian approach to the semi-supervised learning problem on graphs. The proposed model shows extremely competitive performance when compared to the state-of-the-art graph neural networks on semi-supervised learning benchmark experiments, and outperforms the neural networks in active learning experiments where labels are scarce. Furthermore, the m… ▽ More

    Submitted 12 October, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: To appear in NIPS 2018 Fixed an error in Figure 2. The previous arxiv version contains two identical sub-figures

  13. arXiv:1607.00514  [pdf, ps, other

    math.NA cs.LG stat.ML

    Approximate Joint Matrix Triangularization

    Authors: Nicolo Colombo, Nikos Vlassis

    Abstract: We consider the problem of approximate joint triangularization of a set of noisy jointly diagonalizable real matrices. Approximate joint triangularizers are commonly used in the estimation of the joint eigenstructure of a set of matrices, with applications in signal processing, linear algebra, and tensor decomposition. By assuming the input matrices to be perturbations of noise-free, simultaneousl… ▽ More

    Submitted 2 July, 2016; originally announced July 2016.

    Comments: 19 pages

    MSC Class: 15A23; 15A42; 15A45; 15B10

  14. arXiv:1407.6125  [pdf, other

    q-bio.QM cs.CE

    Spectral Sequence Motif Discovery

    Authors: Nicolò Colombo, Nikos Vlassis

    Abstract: Sequence discovery tools play a central role in several fields of computational biology. In the framework of Transcription Factor binding studies, motif finding algorithms of increasingly high performance are required to process the big datasets produced by new high-throughput sequencing technologies. Most existing algorithms are computationally demanding and often cannot support the large size of… ▽ More

    Submitted 26 August, 2014; v1 submitted 23 July, 2014; originally announced July 2014.

    Comments: 20 pages, 3 figures, 1 table