Zum Hauptinhalt springen

Showing 1–19 of 19 results for author: Manoel, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.11765  [pdf, other

    cs.LG cs.CR

    Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation

    Authors: Xinyu Tang, Richard Shin, Huseyin A. Inan, Andre Manoel, Fatemehsadat Mireshghallah, Zinan Lin, Sivakanth Gopi, Janardhan Kulkarni, Robert Sim

    Abstract: We study the problem of in-context learning (ICL) with large language models (LLMs) on private datasets. This scenario poses privacy risks, as LLMs may leak or regurgitate the private examples demonstrated in the prompt. We propose a novel algorithm that generates synthetic few-shot demonstrations from the private dataset with formal differential privacy (DP) guarantees, and show empirically that… ▽ More

    Submitted 27 January, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

  2. arXiv:2307.11899  [pdf, other

    cs.LG cs.DC cs.SE

    Project Florida: Federated Learning Made Easy

    Authors: Daniel Madrigal Diaz, Andre Manoel, Jialei Chen, Nalin Singal, Robert Sim

    Abstract: We present Project Florida, a system architecture and software development kit (SDK) enabling deployment of large-scale Federated Learning (FL) solutions across a heterogeneous device ecosystem. Federated learning is an approach to machine learning based on a strong data sovereignty principle, i.e., that privacy and security of data is best enabled by storing it at its origin, whether on end-user… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  3. arXiv:2301.02344  [pdf, other

    cs.CR cs.LG

    TrojanPuzzle: Covertly Poisoning Code-Suggestion Models

    Authors: Hojjat Aghakhani, Wei Dai, Andre Manoel, Xavier Fernandes, Anant Kharkar, Christopher Kruegel, Giovanni Vigna, David Evans, Ben Zorn, Robert Sim

    Abstract: With tools like GitHub Copilot, automatic code suggestion is no longer a dream in software engineering. These tools, based on large language models, are typically trained on massive corpora of code mined from unvetted public sources. As a result, these models are susceptible to data poisoning attacks where an adversary manipulates the model's training by injecting malicious data. Poisoning attacks… ▽ More

    Submitted 24 January, 2024; v1 submitted 5 January, 2023; originally announced January 2023.

  4. arXiv:2211.09722  [pdf, other

    cs.CL cs.LG

    Federated Multilingual Models for Medical Transcript Analysis

    Authors: Andre Manoel, Mirian Hipolito Garcia, Tal Baumel, Shize Su, Jialei Chen, Dan Miller, Danny Karmon, Robert Sim, Dimitrios Dimitriadis

    Abstract: Federated Learning (FL) is a novel machine learning approach that allows the model trainer to access more data samples, by training the model across multiple decentralized data sources, while data access constraints are in place. Such trained models can achieve significantly higher performance beyond what can be done when trained on a single data source. As part of FL's promises, none of the train… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  5. arXiv:2204.12703  [pdf, other

    cs.LG

    Heterogeneous Ensemble Knowledge Transfer for Training Large Models in Federated Learning

    Authors: Yae Jee Cho, Andre Manoel, Gauri Joshi, Robert Sim, Dimitrios Dimitriadis

    Abstract: Federated learning (FL) enables edge-devices to collaboratively learn a model without disclosing their private data to a central aggregating server. Most existing FL algorithms require models of identical architecture to be deployed across the clients and server, making it infeasible to train large models due to clients' limited system resources. In this work, we propose a novel ensemble knowledge… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: To appear in the proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI 2022)

  6. arXiv:2203.13789  [pdf, other

    cs.LG

    FLUTE: A Scalable, Extensible Framework for High-Performance Federated Learning Simulations

    Authors: Mirian Hipolito Garcia, Andre Manoel, Daniel Madrigal Diaz, Fatemehsadat Mireshghallah, Robert Sim, Dimitrios Dimitriadis

    Abstract: In this paper we introduce "Federated Learning Utilities and Tools for Experimentation" (FLUTE), a high-performance open-source platform for federated learning research and offline simulations. The goal of FLUTE is to enable rapid prototyping and simulation of new federated learning algorithms at scale, including novel optimization, privacy, and communications strategies. We describe the architect… ▽ More

    Submitted 14 November, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: 14 Pages, 3 Figures, 11 Tables

  7. arXiv:2110.06500  [pdf, other

    cs.LG cs.CL cs.CR stat.ML

    Differentially Private Fine-tuning of Language Models

    Authors: Da Yu, Saurabh Naik, Arturs Backurs, Sivakanth Gopi, Huseyin A. Inan, Gautam Kamath, Janardhan Kulkarni, Yin Tat Lee, Andre Manoel, Lukas Wutschitz, Sergey Yekhanin, Huishuai Zhang

    Abstract: We give simpler, sparser, and faster algorithms for differentially private fine-tuning of large-scale pre-trained language models, which achieve the state-of-the-art privacy versus utility tradeoffs on many standard NLP tasks. We propose a meta-framework for this problem, inspired by the recent success of highly parameter-efficient methods for fine-tuning. Our experiments show that differentially… ▽ More

    Submitted 14 July, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: ICLR 2022. Code available at https://github.com/huseyinatahaninan/Differentially-Private-Fine-tuning-of-Language-Models

  8. arXiv:2006.08997  [pdf, other

    cs.LG stat.ML

    Federated Survival Analysis with Discrete-Time Cox Models

    Authors: Mathieu Andreux, Andre Manoel, Romuald Menuet, Charlie Saillard, Chloé Simpson

    Abstract: Building machine learning models from decentralized datasets located in different centers with federated learning (FL) is a promising approach to circumvent local data scarcity while preserving privacy. However, the prominent Cox proportional hazards (PH) model, used for survival analysis, does not fit the FL framework, as its loss function is non-separable with respect to the samples. The naïve m… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: 21 pages, 6 figures

    Journal ref: International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with ICML 2020 (FL-ICML'20)

  9. arXiv:1912.06015  [pdf, other

    cs.LG cs.CV stat.ML

    Efficient Per-Example Gradient Computations in Convolutional Neural Networks

    Authors: Gaspar Rochette, Andre Manoel, Eric W. Tramel

    Abstract: Deep learning frameworks leverage GPUs to perform massively-parallel computations over batches of many training examples efficiently. However, for certain tasks, one may be interested in performing per-example computations, for instance using per-example gradients to evaluate a quantity of interest unique to each example. One notable application comes from the field of differential privacy, where… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Journal ref: Theory and Practice of Differential Privacy (TPDP) workshop at CCS 2020

  10. arXiv:1809.06304  [pdf, other

    stat.ML cs.IT cs.LG

    Approximate message-passing for convex optimization with non-separable penalties

    Authors: Andre Manoel, Florent Krzakala, Gaël Varoquaux, Bertrand Thirion, Lenka Zdeborová

    Abstract: We introduce an iterative optimization scheme for convex objectives consisting of a linear loss and a non-separable penalty, based on the expectation-consistent approximation and the vector approximate message-passing (VAMP) algorithm. Specifically, the penalties we approach are convex on a linear transformation of the variable to be determined, a notable example being total variation (TV). We des… ▽ More

    Submitted 17 September, 2018; originally announced September 2018.

    Comments: 18 pages, 6 figures

  11. arXiv:1805.09785  [pdf, other

    cs.LG cond-mat.dis-nn cs.IT stat.ML

    Entropy and mutual information in models of deep neural networks

    Authors: Marylou Gabrié, Andre Manoel, Clément Luneau, Jean Barbier, Nicolas Macris, Florent Krzakala, Lenka Zdeborová

    Abstract: We examine a class of deep learning models with a tractable method to compute information-theoretic quantities. Our contributions are three-fold: (i) We show how entropies and mutual informations can be derived from heuristic statistical physics methods, under the assumption that weight matrices are independent and orthogonally-invariant. (ii) We extend particular cases in which this result is kno… ▽ More

    Submitted 29 October, 2018; v1 submitted 24 May, 2018; originally announced May 2018.

    Journal ref: J. Stat. Mech. (2019) 124014. & NeurIPS 2018

  12. arXiv:1706.00705  [pdf, other

    stat.ML cond-mat.stat-mech cs.IT

    Streaming Bayesian inference: theoretical limits and mini-batch approximate message-passing

    Authors: Andre Manoel, Florent Krzakala, Eric W. Tramel, Lenka Zdeborová

    Abstract: In statistical learning for real-world large-scale data problems, one must often resort to "streaming" algorithms which operate sequentially on small batches of data. In this work, we present an analysis of the information-theoretic limits of mini-batch inference in the context of generalized linear models and low-rank matrix factorization. In a controlled Bayes-optimal setting, we characterize th… ▽ More

    Submitted 2 June, 2017; originally announced June 2017.

    Comments: 19 pages, 4 figures

    Journal ref: 2017 55th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 2017, pp. 1048-1055

  13. arXiv:1702.03260  [pdf, other

    cs.LG cond-mat.dis-nn cs.NE stat.ML

    A Deterministic and Generalized Framework for Unsupervised Learning with Restricted Boltzmann Machines

    Authors: Eric W. Tramel, Marylou Gabrié, Andre Manoel, Francesco Caltagirone, Florent Krzakala

    Abstract: Restricted Boltzmann machines (RBMs) are energy-based neural-networks which are commonly used as the building blocks for deep architectures neural architectures. In this work, we derive a deterministic framework for the training, evaluation, and use of RBMs based upon the Thouless-Anderson-Palmer (TAP) mean-field approximation of widely-connected systems with weak interactions coming from spin-gla… ▽ More

    Submitted 9 October, 2018; v1 submitted 10 February, 2017; originally announced February 2017.

    Journal ref: Phys. Rev. X 8, 041006 (2018)

  14. arXiv:1701.06981  [pdf, other

    cs.IT cond-mat.stat-mech stat.ML

    Multi-Layer Generalized Linear Estimation

    Authors: Andre Manoel, Florent Krzakala, Marc Mézard, Lenka Zdeborová

    Abstract: We consider the problem of reconstructing a signal from multi-layered (possibly) non-linear measurements. Using non-rigorous but standard methods from statistical physics we present the Multi-Layer Approximate Message Passing (ML-AMP) algorithm for computing marginal probabilities of the corresponding estimation problem and derive the associated state evolution equations to analyze its performance… ▽ More

    Submitted 24 January, 2017; originally announced January 2017.

    Comments: 5 pages, 1 figure

    Journal ref: IEEE International Symposium on Information Theory (ISIT), pages 2098-2102 (2017)

  15. arXiv:1609.04167  [pdf, other

    math.NA cs.CV cs.IT cs.LG math.OC

    Proceedings of the third "international Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST'16)

    Authors: V. Abrol, O. Absil, P. -A. Absil, S. Anthoine, P. Antoine, T. Arildsen, N. Bertin, F. Bleichrodt, J. Bobin, A. Bol, A. Bonnefoy, F. Caltagirone, V. Cambareri, C. Chenot, V. Crnojević, M. Daňková, K. Degraux, J. Eisert, J. M. Fadili, M. Gabrié, N. Gac, D. Giacobello, A. Gonzalez, C. A. Gomez Gonzalez, A. González , et al. (36 additional authors not shown)

    Abstract: The third edition of the "international - Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST) took place in Aalborg, the 4th largest city in Denmark situated beautifully in the northern part of the country, from the 24th to 26th of August 2016. The workshop venue was at the Aalborg University campus. One implicit objective of this biennial workshop is to foster collab… ▽ More

    Submitted 14 September, 2016; originally announced September 2016.

    Comments: 69 pages, 22 extended abstracts, iTWIST'16 website: http://www.itwist16.es.aau.dk

  16. arXiv:1606.03956  [pdf, other

    cs.IT cond-mat.dis-nn cs.LG stat.ML

    Inferring Sparsity: Compressed Sensing using Generalized Restricted Boltzmann Machines

    Authors: Eric W. Tramel, Andre Manoel, Francesco Caltagirone, Marylou Gabrié, Florent Krzakala

    Abstract: In this work, we consider compressed sensing reconstruction from $M$ measurements of $K$-sparse structured signals which do not possess a writable correlation model. Assuming that a generative statistical model, such as a Boltzmann machine, can be trained in an unsupervised manner on example signals, we demonstrate how this signal model can be used within a Bayesian framework of signal reconstruct… ▽ More

    Submitted 13 June, 2016; originally announced June 2016.

    Comments: IEEE Information Theory Workshop, 2016

    Journal ref: 2016 IEEE Information Theory Workshop (ITW), Pages: 265 - 269

  17. arXiv:1406.4311  [pdf, other

    cs.IT cond-mat.dis-nn physics.data-an stat.ML

    Sparse Estimation with the Swept Approximated Message-Passing Algorithm

    Authors: Andre Manoel, Florent Krzakala, Eric W. Tramel, Lenka Zdeborová

    Abstract: Approximate Message Passing (AMP) has been shown to be a superior method for inference problems, such as the recovery of signals from sets of noisy, lower-dimensionality measurements, both in terms of reconstruction accuracy and in computational efficiency. However, AMP suffers from serious convergence issues in contexts that do not exactly match its assumptions. We propose a new approach to stabi… ▽ More

    Submitted 17 June, 2014; originally announced June 2014.

    Comments: 11 pages, 3 figures, implementation available at https://github.com/eric-tramel/SwAMP-Demo

    Journal ref: Proceedings of the 32nd International Conference on Machine Learning (ICML), 2015, 1123-1132

  18. arXiv:1402.1384  [pdf, ps, other

    cs.IT cond-mat.stat-mech

    Variational Free Energies for Compressed Sensing

    Authors: Florent Krzakala, Andre Manoel, Eric W. Tramel, Lenka Zdeborova

    Abstract: We consider the variational free energy approach for compressed sensing. We first show that the naïve mean field approach performs remarkably well when coupled with a noise learning procedure. We also notice that it leads to the same equations as those used for iterative thresholding. We then discuss the Bethe free energy and how it corresponds to the fixed points of the approximate message passin… ▽ More

    Submitted 6 February, 2014; originally announced February 2014.

    Comments: 5 pages, 3 figures

    Journal ref: Information Theory Proceedings (ISIT), 2014 IEEE International Symposium on, page(s) 1499 - 1503

  19. arXiv:1211.6462  [pdf, ps, other

    cond-mat.dis-nn cs.SI physics.soc-ph

    Statistical mechanics of reputation systems in autonomous networks

    Authors: Andre Manoel, Renato Vicente

    Abstract: Reputation systems seek to infer which members of a community can be trusted based on ratings they issue about each other. We construct a Bayesian inference model and simulate approximate estimates using belief propagation (BP). The model is then mapped onto computing equilibrium properties of a spin glass in a random field and analyzed by employing the replica symmetric cavity approach. Having th… ▽ More

    Submitted 11 April, 2013; v1 submitted 27 November, 2012; originally announced November 2012.

    Comments: 20 pages, 14 figures

    Journal ref: Journal of Statistical Mechanics (2013) P08002