Zum Hauptinhalt springen

Showing 1–17 of 17 results for author: Saade, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.15539  [pdf, other

    cs.CL cs.AI

    SteloCoder: a Decoder-Only LLM for Multi-Language to Python Code Translation

    Authors: Jialing Pan, Adrien Sadé, Jin Kim, Eric Soriano, Guillem Sole, Sylvain Flamant

    Abstract: With the recent focus on Large Language Models (LLMs), both StarCoder (Li et al., 2023) and Code Llama (Rozière et al., 2023) have demonstrated remarkable performance in code generation. However, there is still a need for improvement in code translation functionality with efficient training techniques. In response to this, we introduce SteloCoder, a decoder-only StarCoder-based LLM designed specif… ▽ More

    Submitted 15 December, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

  2. arXiv:2305.01521  [pdf, other

    cs.LG stat.ML

    Unlocking the Power of Representations in Long-term Novelty-based Exploration

    Authors: Alaa Saade, Steven Kapturowski, Daniele Calandriello, Charles Blundell, Pablo Sprechmann, Leopoldo Sarra, Oliver Groth, Michal Valko, Bilal Piot

    Abstract: We introduce Robust Exploration via Clustering-based Online Density Estimation (RECODE), a non-parametric method for novelty-based exploration that estimates visitation counts for clusters of states based on their similarity in a chosen embedding space. By adapting classical clustering to the nonstationary setting of Deep RL, RECODE can efficiently track state visitation counts over thousands of e… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  3. arXiv:2206.08332  [pdf, other

    cs.LG cs.AI stat.ML

    BYOL-Explore: Exploration by Bootstrapped Prediction

    Authors: Zhaohan Daniel Guo, Shantanu Thakoor, Miruna Pîslar, Bernardo Avila Pires, Florent Altché, Corentin Tallec, Alaa Saade, Daniele Calandriello, Jean-Bastien Grill, Yunhao Tang, Michal Valko, Rémi Munos, Mohammad Gheshlaghi Azar, Bilal Piot

    Abstract: We present BYOL-Explore, a conceptually simple yet general approach for curiosity-driven exploration in visually-complex environments. BYOL-Explore learns a world representation, the world dynamics, and an exploration policy all-together by optimizing a single prediction loss in the latent space with no additional auxiliary objective. We show that BYOL-Explore is effective in DM-HARD-8, a challeng… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  4. arXiv:2101.02055  [pdf, other

    cs.LG

    Geometric Entropic Exploration

    Authors: Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Alaa Saade, Shantanu Thakoor, Bilal Piot, Bernardo Avila Pires, Michal Valko, Thomas Mesnard, Tor Lattimore, Rémi Munos

    Abstract: Exploration is essential for solving complex Reinforcement Learning (RL) tasks. Maximum State-Visitation Entropy (MSVE) formulates the exploration problem as a well-defined policy optimization problem whose solution aims at visiting all states as uniformly as possible. This is in contrast to standard uncertainty-based approaches where exploration is transient and eventually vanishes. However, exis… ▽ More

    Submitted 7 January, 2021; v1 submitted 6 January, 2021; originally announced January 2021.

  5. arXiv:2011.09464  [pdf, other

    cs.LG

    Counterfactual Credit Assignment in Model-Free Reinforcement Learning

    Authors: Thomas Mesnard, Théophane Weber, Fabio Viola, Shantanu Thakoor, Alaa Saade, Anna Harutyunyan, Will Dabney, Tom Stepleton, Nicolas Heess, Arthur Guez, Éric Moulines, Marcus Hutter, Lars Buesing, Rémi Munos

    Abstract: Credit assignment in reinforcement learning is the problem of measuring an action's influence on future rewards. In particular, this requires separating skill from luck, i.e. disentangling the effect of an action on rewards from that of external factors and subsequent actions. To achieve this, we adapt the notion of counterfactuals from causality theory to a model-free RL setup. The key idea is to… ▽ More

    Submitted 14 December, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

  6. arXiv:1810.12735  [pdf, ps, other

    cs.CL cs.LG cs.SD eess.AS

    Spoken Language Understanding on the Edge

    Authors: Alaa Saade, Alice Coucke, Alexandre Caulier, Joseph Dureau, Adrien Ball, Théodore Bluche, David Leroy, Clément Doumouro, Thibault Gisselbrecht, Francesco Caltagirone, Thibaut Lavril, Maël Primet

    Abstract: We consider the problem of performing Spoken Language Understanding (SLU) on small devices typical of IoT applications. Our contributions are twofold. First, we outline the design of an embedded, private-by-design SLU system and show that it has performance on par with cloud-based commercial solutions. Second, we release the datasets used in our experiments in the interest of reproducibility and i… ▽ More

    Submitted 2 October, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: arXiv admin note: text overlap with arXiv:1805.10190

  7. arXiv:1805.10190  [pdf, other

    cs.CL cs.NE

    Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces

    Authors: Alice Coucke, Alaa Saade, Adrien Ball, Théodore Bluche, Alexandre Caulier, David Leroy, Clément Doumouro, Thibault Gisselbrecht, Francesco Caltagirone, Thibaut Lavril, Maël Primet, Joseph Dureau

    Abstract: This paper presents the machine learning architecture of the Snips Voice Platform, a software solution to perform Spoken Language Understanding on microprocessors typical of IoT devices. The embedded inference is fast and accurate while enforcing privacy by design, as no personal user data is ever collected. Focusing on Automatic Speech Recognition and Natural Language Understanding, we detail our… ▽ More

    Submitted 6 December, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: 29 pages, 9 figures, 17 tables

  8. arXiv:1803.09533  [pdf, other

    cs.CY cs.LG stat.ML

    Deep Representation for Patient Visits from Electronic Health Records

    Authors: Jean-Baptiste Escudié, Alaa Saade, Alice Coucke, Marc Lelarge

    Abstract: We show how to learn low-dimensional representations (embeddings) of patient visits from the corresponding electronic health record (EHR) where International Classification of Diseases (ICD) diagnosis codes are removed. We expect that these embeddings will be useful for the construction of predictive statistical models anticipated to drive personalized medicine and improve healthcare quality. Thes… ▽ More

    Submitted 26 March, 2018; originally announced March 2018.

  9. arXiv:1610.04337  [pdf

    cond-mat.dis-nn cs.IT cs.LG

    Spectral Inference Methods on Sparse Graphs: Theory and Applications

    Authors: Alaa Saade

    Abstract: In an era of unprecedented deluge of (mostly unstructured) data, graphs are proving more and more useful, across the sciences, as a flexible abstraction to capture complex relationships between complex objects. One of the main challenges arising in the study of such networks is the inference of macroscopic, large-scale properties affecting a large number of objects, based solely on the microscopic… ▽ More

    Submitted 14 October, 2016; originally announced October 2016.

    Comments: PhD dissertation

  10. arXiv:1605.06422  [pdf, other

    cs.LG math.PR math.ST stat.ML

    Fast Randomized Semi-Supervised Clustering

    Authors: Alaa Saade, Florent Krzakala, Marc Lelarge, Lenka Zdeborová

    Abstract: We consider the problem of clustering partially labeled data from a minimal number of randomly chosen pairwise comparisons between the items. We introduce an efficient local algorithm based on a power iteration of the non-backtracking operator and study its performance on a simple model. For the case of two clusters, we give bounds on the classification error and show that a small error can be ach… ▽ More

    Submitted 9 October, 2016; v1 submitted 20 May, 2016; originally announced May 2016.

    Journal ref: Journal of Physics: Conf. Series 1036 (2018) 012015

  11. arXiv:1601.06683  [pdf, other

    cs.SI cond-mat.dis-nn cs.LG

    Clustering from Sparse Pairwise Measurements

    Authors: Alaa Saade, Marc Lelarge, Florent Krzakala, Lenka Zdeborová

    Abstract: We consider the problem of grouping items into clusters based on few random pairwise comparisons between the items. We introduce three closely related algorithms for this task: a belief propagation algorithm approximating the Bayes optimal solution, and two spectral algorithms based on the non-backtracking and Bethe Hessian operators. For the case of two symmetric clusters, we conjecture that thes… ▽ More

    Submitted 19 May, 2016; v1 submitted 25 January, 2016; originally announced January 2016.

    Journal ref: Proceedings of the 2016 IEEE International Symposium on Information Theory (ISIT) Pages: 780 - 784

  12. arXiv:1510.06664  [pdf, other

    cs.ET cs.LG physics.optics

    Random Projections through multiple optical scattering: Approximating kernels at the speed of light

    Authors: Alaa Saade, Francesco Caltagirone, Igor Carron, Laurent Daudet, Angélique Drémeau, Sylvain Gigan, Florent Krzakala

    Abstract: Random projections have proven extremely useful in many signal processing and machine learning applications. However, they often require either to store a very large random matrix, or to use a different, structured matrix to reduce the computational and memory costs. Here, we overcome this difficulty by proposing an analog, optical device, that performs the random projections literally at the spee… ▽ More

    Submitted 25 October, 2015; v1 submitted 22 October, 2015; originally announced October 2015.

    Journal ref: Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) pages: 6215 - 6219

  13. arXiv:1506.03498  [pdf, other

    cond-mat.dis-nn cs.LG stat.ML

    Matrix Completion from Fewer Entries: Spectral Detectability and Rank Estimation

    Authors: Alaa Saade, Florent Krzakala, Lenka Zdeborová

    Abstract: The completion of low rank matrices from few entries is a task with many practical applications. We consider here two aspects of this problem: detectability, i.e. the ability to estimate the rank $r$ reliably from the fewest possible random entries, and performance in achieving small reconstruction error. We propose a spectral algorithm for these two tasks called MaCBetH (for Matrix Completion wit… ▽ More

    Submitted 28 January, 2016; v1 submitted 10 June, 2015; originally announced June 2015.

    Comments: NIPS Conference 2015

    Journal ref: Advances in Neural Information Processing Systems (NIPS 2015) 28, pages 1261--1269

  14. arXiv:1502.00163  [pdf, other

    cs.SI cond-mat.dis-nn cs.LG math.PR

    Spectral Detection in the Censored Block Model

    Authors: Alaa Saade, Florent Krzakala, Marc Lelarge, Lenka Zdeborová

    Abstract: We consider the problem of partially recovering hidden binary variables from the observation of (few) censored edge weights, a problem with applications in community detection, correlation clustering and synchronization. We describe two spectral algorithms for this task based on the non-backtracking and the Bethe Hessian operators. These algorithms are shown to be asymptotically optimal for the pa… ▽ More

    Submitted 10 June, 2015; v1 submitted 31 January, 2015; originally announced February 2015.

    Comments: ISIT 2015

    Journal ref: IEEE International Symposium on Information Theory (ISIT), pp.1184-1188 (2015)

  15. arXiv:1409.2290  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.CC physics.soc-ph

    Computational Complexity, Phase Transitions, and Message-Passing for Community Detection

    Authors: Aurélien Decelle, Janina Hüttel, Alaa Saade, Cristopher Moore

    Abstract: We take a whirlwind tour of problems and techniques at the boundary of computer science and statistical physics. We start with a brief description of P, NP, and NP-completeness. We then discuss random graphs, including the emergence of the giant component and the k-core, using techniques from branching processes and differential equations. Using these tools as well as the second moment method, we… ▽ More

    Submitted 8 September, 2014; originally announced September 2014.

    Comments: Chapter of "Statistical Physics, Optimization, Inference, and Message-Passing Algorithms", Eds.: F. Krzakala, F. Ricci-Tersenghi, L. Zdeborova, R. Zecchina, E. W. Tramel, L. F. Cugliandolo (Oxford University Press, to appear)

  16. arXiv:1406.1880  [pdf, other

    cond-mat.dis-nn cs.SI physics.soc-ph stat.ML

    Spectral Clustering of Graphs with the Bethe Hessian

    Authors: Alaa Saade, Florent Krzakala, Lenka Zdeborová

    Abstract: Spectral clustering is a standard approach to label nodes on a graph by studying the (largest or lowest) eigenvalues of a symmetric real matrix such as e.g. the adjacency or the Laplacian. Recently, it has been argued that using instead a more complicated, non-symmetric and higher dimensional operator, related to the non-backtracking walk on the graph, leads to improved performance in detecting cl… ▽ More

    Submitted 8 September, 2014; v1 submitted 7 June, 2014; originally announced June 2014.

    Comments: 8 pages, 2 figures

    Journal ref: Advances in Neural Information Processing Systems 27 (NIPS 2014) pp 406-414

  17. arXiv:1404.7787  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.SI

    Spectral density of the non-backtracking operator

    Authors: Alaa Saade, Florent Krzakala, Lenka Zdeborová

    Abstract: The non-backtracking operator was recently shown to provide a significant improvement when used for spectral clustering of sparse networks. In this paper we analyze its spectral density on large random sparse graphs using a mapping to the correlation functions of a certain interacting quantum disordered system on the graph. On sparse, tree-like graphs, this can be solved efficiently by the cavity… ▽ More

    Submitted 30 April, 2014; originally announced April 2014.

    Comments: 6 pages, 6 figures, submitted to EPL

    Journal ref: 2014 EPL 107 50005