Skip to main content

Showing 1–50 of 73 results for author: Abbe, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06467  [pdf, other

    cs.LG cs.AI stat.ML

    How Far Can Transformers Reason? The Locality Barrier and Inductive Scratchpad

    Authors: Emmanuel Abbe, Samy Bengio, Aryo Lotfi, Colin Sandon, Omid Saremi

    Abstract: Can Transformers predict new syllogisms by composing established ones? More generally, what type of targets can be learned by such models from scratch? Recent works show that Transformers can be Turing-complete in terms of expressivity, but this does not address the learnability objective. This paper puts forward the notion of 'distribution locality' to capture when weak learning is efficiently ac… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 38 pages, 11 figures

  2. arXiv:2406.06354  [pdf, other

    cs.LG

    On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions

    Authors: Denys Pushkin, Raphaël Berthier, Emmanuel Abbe

    Abstract: We investigate the out-of-domain generalization of random feature (RF) models and Transformers. We first prove that in the `generalization on the unseen (GOTU)' setting, where training data is fully seen in some part of the domain but testing is made on another part, and for RF models in the small feature regime, the convergence takes place to interpolators of minimal degree as in the Boolean case… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 9 pages of main body, 24 pages in total. 7 figures Proceedings of the 41-st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  3. arXiv:2312.04329  [pdf, other

    cs.IT cs.DM math.CO

    Reed-Muller codes have vanishing bit-error probability below capacity: a simple tighter proof via camellia boosting

    Authors: Emmanuel Abbe, Colin Sandon

    Abstract: This paper shows that a class of codes such as Reed-Muller (RM) codes have vanishing bit-error probability below capacity on symmetric channels. The proof relies on the notion of `camellia codes': a class of symmetric codes decomposable into `camellias', i.e., set systems that differ from sunflowers by allowing for scattered petal overlaps. The proof then follows from a boosting argument on the ca… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  4. arXiv:2310.09753  [pdf, other

    cs.CL cs.AI cs.LG

    When can transformers reason with abstract symbols?

    Authors: Enric Boix-Adsera, Omid Saremi, Emmanuel Abbe, Samy Bengio, Etai Littwin, Joshua Susskind

    Abstract: We investigate the capabilities of transformer models on relational reasoning tasks. In these tasks, models are trained on a set of strings encoding abstract relations, and are then tested out-of-distribution on data that contains symbols that did not appear in the training dataset. We prove that for any relational reasoning task in a large family of tasks, transformers learn the abstract relation… ▽ More

    Submitted 16 April, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: 25 figures

  5. arXiv:2309.12207  [pdf, other

    cs.LG cs.LO

    Boolformer: Symbolic Regression of Logic Functions with Transformers

    Authors: Stéphane d'Ascoli, Samy Bengio, Josh Susskind, Emmanuel Abbé

    Abstract: In this work, we introduce Boolformer, the first Transformer architecture trained to perform end-to-end symbolic regression of Boolean functions. First, we show that it can predict compact formulas for complex functions which were not seen during training, when provided a clean truth table. Then, we demonstrate its ability to find approximate expressions when provided incomplete and noisy observat… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  6. arXiv:2306.16921  [pdf, other

    cs.LG stat.ML

    Provable Advantage of Curriculum Learning on Parity Targets with Mixed Inputs

    Authors: Emmanuel Abbe, Elisabetta Cornacchia, Aryo Lotfi

    Abstract: Experimental results have shown that curriculum learning, i.e., presenting simpler examples before more complex ones, can improve the efficiency of learning. Some recent theoretical results also showed that changing the sampling distribution can help neural networks learn parities, with formal results only for large learning rates and one-step arguments. Here we show a separation result in the num… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: 34 pages, 8 figures

  7. arXiv:2306.07042  [pdf, other

    cs.LG

    Transformers learn through gradual rank increase

    Authors: Enric Boix-Adsera, Etai Littwin, Emmanuel Abbe, Samy Bengio, Joshua Susskind

    Abstract: We identify incremental learning dynamics in transformers, where the difference between trained and initial weights progressively increases in rank. We rigorously prove this occurs under the simplifying assumptions of diagonal weight matrices and small initialization. Our experiments support the theory and also show that phenomenon can occur in practice without the simplifying assumptions.

    Submitted 10 December, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: 39 pages, to appear in NeurIPS 2023

  8. arXiv:2304.02509  [pdf, other

    cs.IT cs.DM

    A proof that Reed-Muller codes achieve Shannon capacity on symmetric channels

    Authors: Emmanuel Abbe, Colin Sandon

    Abstract: Reed-Muller codes were introduced in 1954, with a simple explicit construction based on polynomial evaluations, and have long been conjectured to achieve Shannon capacity on symmetric channels. Major progress was made towards a proof over the last decades; using combinatorial weight enumerator bounds, a breakthrough on the erasure channel from sharp thresholds, hypercontractivity arguments, and po… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  9. arXiv:2302.11055  [pdf, other

    cs.LG stat.ML

    SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics

    Authors: Emmanuel Abbe, Enric Boix-Adsera, Theodor Misiakiewicz

    Abstract: We investigate the time complexity of SGD learning on fully-connected neural networks with isotropic data. We put forward a complexity measure -- the leap -- which measures how "hierarchical" target functions are. For $d$-dimensional uniform Boolean or isotropic Gaussian data, our main conjecture states that the time complexity to learn a function $f$ with low-dimensional support is… ▽ More

    Submitted 31 August, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  10. arXiv:2301.13105  [pdf, other

    cs.LG stat.ML

    Generalization on the Unseen, Logic Reasoning and Degree Curriculum

    Authors: Emmanuel Abbe, Samy Bengio, Aryo Lotfi, Kevin Rizk

    Abstract: This paper considers the learning of logical (Boolean) functions with focus on the generalization on the unseen (GOTU) setting, a strong case of out-of-distribution generalization. This is motivated by the fact that the rich combinatorial nature of data in certain reasoning tasks (e.g., arithmetic/logic) makes representative data sampling challenging, and learning successfully under GOTU gives a f… ▽ More

    Submitted 28 June, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: To appear in ICML 2023

  11. arXiv:2208.03113  [pdf, ps, other

    cs.LG

    On the non-universality of deep learning: quantifying the cost of symmetry

    Authors: Emmanuel Abbe, Enric Boix-Adsera

    Abstract: We prove limitations on what neural networks trained by noisy gradient descent (GD) can efficiently learn. Our results apply whenever GD training is equivariant, which holds for many standard architectures and initializations. As applications, (i) we characterize the functions that fully-connected networks can weak-learn on the binary hypercube and unit sphere, demonstrating that depth-2 is as pow… ▽ More

    Submitted 14 October, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: Improved exposition, to appear in NeurIPS'22

  12. arXiv:2205.13647  [pdf, other

    cs.LG stat.ML

    Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures

    Authors: Emmanuel Abbe, Samy Bengio, Elisabetta Cornacchia, Jon Kleinberg, Aryo Lotfi, Maithra Raghu, Chiyuan Zhang

    Abstract: This paper considers the Pointer Value Retrieval (PVR) benchmark introduced in [ZRKB21], where a 'reasoning' function acts on a string of digits to produce the label. More generally, the paper considers the learning of logical functions with gradient descent (GD) on neural networks. It is first shown that in order to learn logical functions with gradient descent on symmetric neural networks, the g… ▽ More

    Submitted 20 October, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: To appear in NeurIPS 2022

  13. arXiv:2202.12846  [pdf, other

    cs.LG

    An initial alignment between neural network and target is needed for gradient descent to learn

    Authors: Emmanuel Abbe, Elisabetta Cornacchia, Jan Hązła, Christopher Marquis

    Abstract: This paper introduces the notion of ``Initial Alignment'' (INAL) between a neural network at initialization and a target function. It is proved that if a network and a Boolean target function do not have a noticeable INAL, then noisy gradient descent on a fully connected network with normalized i.i.d. initialization will not learn in polynomial time. Thus a certain amount of knowledge about the ta… ▽ More

    Submitted 16 August, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of the International Conference on Machine Learning, 2022

  14. arXiv:2202.08658  [pdf, other

    cs.LG cs.DS stat.ML

    The merged-staircase property: a necessary and nearly sufficient condition for SGD learning of sparse functions on two-layer neural networks

    Authors: Emmanuel Abbe, Enric Boix-Adsera, Theodor Misiakiewicz

    Abstract: It is currently known how to characterize functions that neural networks can learn with SGD for two extremal parameterizations: neural networks in the linear regime, and neural networks with no structural constraints. However, for the main parametrization of interest (non-linear but regular networks) no tight characterization has yet been achieved, despite significant developments. We take a ste… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  15. arXiv:2108.10573  [pdf, other

    cs.LG cs.DS cs.NE stat.ML

    The staircase property: How hierarchical structure can guide deep learning

    Authors: Emmanuel Abbe, Enric Boix-Adsera, Matthew Brennan, Guy Bresler, Dheeraj Nagaraj

    Abstract: This paper identifies a structural property of data distributions that enables deep neural networks to learn hierarchically. We define the "staircase" property for functions over the Boolean hypercube, which posits that high-order Fourier coefficients are reachable from lower-order Fourier coefficients along increasing chains. We prove that functions satisfying this property can be learned in poly… ▽ More

    Submitted 23 November, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

    Comments: 60 pages, accepted to NeurIPS '21

  16. arXiv:2108.04190  [pdf, ps, other

    cs.LG stat.ML

    On the Power of Differentiable Learning versus PAC and SQ Learning

    Authors: Emmanuel Abbe, Pritish Kamath, Eran Malach, Colin Sandon, Nathan Srebro

    Abstract: We study the power of learning via mini-batch stochastic gradient descent (SGD) on the population loss, and batch Gradient Descent (GD) on the empirical loss, of a differentiable model or neural network, and ask what learning problems can be learnt using these paradigms. We show that SGD and GD can always simulate learning with statistical queries (SQ), but their ability to go beyond that depends… ▽ More

    Submitted 5 February, 2022; v1 submitted 9 August, 2021; originally announced August 2021.

  17. arXiv:2103.01210  [pdf, ps, other

    cs.LG stat.ML

    Quantifying the Benefit of Using Differentiable Learning over Tangent Kernels

    Authors: Eran Malach, Pritish Kamath, Emmanuel Abbe, Nathan Srebro

    Abstract: We study the relative power of learning with gradient descent on differentiable models, such as neural networks, versus using the corresponding tangent kernels. We show that under certain conditions, gradient descent achieves small error only if a related tangent kernel method achieves a non-trivial advantage over random guessing (a.k.a. weak learning), though this advantage might be very small ev… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

  18. arXiv:2101.12601  [pdf, other

    math.PR cs.IT

    Stochastic block model entropy and broadcasting on trees with survey

    Authors: Emmanuel Abbe, Elisabetta Cornacchia, Yuzhou Gu, Yury Polyanskiy

    Abstract: The limit of the entropy in the stochastic block model (SBM) has been characterized in the sparse regime for the special case of disassortative communities [COKPZ17] and for the classical case of assortative communities but in the dense regime [DAM16]. The problem has not been closed in the classical sparse and assortative case. This paper establishes the result in this case for any SNR besides fo… ▽ More

    Submitted 29 January, 2021; originally announced January 2021.

  19. arXiv:2006.14614  [pdf, other

    cs.LG cs.IT nlin.AO stat.ML

    Maximum Multiscale Entropy and Neural Network Regularization

    Authors: Amir R. Asadi, Emmanuel Abbe

    Abstract: A well-known result across information theory, machine learning, and statistical physics shows that the maximum entropy distribution under a mean constraint has an exponential form called the Gibbs-Boltzmann distribution. This is used for instance in density estimation or to achieve excess risk bounds derived from single-scale entropy regularizers (Xu-Raginsky '17). This paper investigates a gener… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: 27 pages, 2 figures

  20. arXiv:2006.14062  [pdf, other

    math.ST cs.LG math.PR stat.ML

    An $\ell_p$ theory of PCA and spectral clustering

    Authors: Emmanuel Abbe, Jianqing Fan, Kaizheng Wang

    Abstract: Principal Component Analysis (PCA) is a powerful tool in statistics and machine learning. While existing study of PCA focuses on the recovery of principal components and their associated eigenvalues, there are few precise characterizations of individual principal component scores that yield low-dimensional embedding of samples. That hinders the analysis of various spectral methods. In this paper,… ▽ More

    Submitted 9 April, 2022; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: 72 pages, 2 figures

    MSC Class: 62H25; 62H30; 60B20

  21. arXiv:2006.11248  [pdf, ps, other

    math.CO cs.DM

    An Alon-Boppana theorem for powered graphs and generalized Ramanujan graphs

    Authors: Emmanuel Abbe, Peter Ralli

    Abstract: The r-th power of a graph modifies a graph by connecting every vertex pair within distance r. This paper gives a generalization of the Alon-Boppana Theorem for the r-th power of graphs, including irregular graphs. This leads to a generalized notion of Ramanujan graphs, those for which the powered graph has a spectral gap matching the derived Alon-Boppana bound. In particular, we show that certain… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: text overlap with arXiv:1809.04818

  22. arXiv:2006.07695  [pdf, ps, other

    math.ST cs.LG math.PR

    Learning Sparse Graphons and the Generalized Kesten-Stigum Threshold

    Authors: Emmanuel Abbe, Shuangping Li, Allan Sly

    Abstract: The problem of learning graphons has attracted considerable attention across several scientific communities, with significant progress over the recent years in sparser regimes. Yet, the current techniques still require diverging degrees in order to succeed with efficient algorithms in the challenging cases where the local structure of the graph is homogeneous. This paper provides an efficient algo… ▽ More

    Submitted 13 June, 2020; originally announced June 2020.

    Comments: 32 pages

  23. arXiv:2006.05251  [pdf, other

    cs.MA math.PR

    Polarization in Attraction-Repulsion Models

    Authors: Elisabetta Cornacchia, Neta Singer, Emmanuel Abbe

    Abstract: This paper introduces a model for opinion dynamics, where at each time step, randomly selected agents see their opinions - modeled as scalars in [0,1] - evolve depending on a local interaction function. In the classical Bounded Confidence Model, agents opinions get attracted when they are close enough. The proposed model extends this by adding a repulsion component, which models the effect of opin… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

  24. arXiv:2004.09590  [pdf, other

    cs.IT

    Almost-Reed--Muller Codes Achieve Constant Rates for Random Errors

    Authors: Emmanuel Abbe, Jan Hązła, Ido Nachum

    Abstract: This paper considers '$δ$-almost Reed-Muller codes', i.e., linear codes spanned by evaluations of all but a $δ$ fraction of monomials of degree at most $d$. It is shown that for any $δ> 0$ and any $\varepsilon>0$, there exists a family of $δ$-almost Reed-Muller codes of constant rate that correct $1/2-\varepsilon$ fraction of random errors with high probability. For exact Reed-Muller codes, the an… ▽ More

    Submitted 5 October, 2021; v1 submitted 20 April, 2020; originally announced April 2020.

  25. arXiv:2002.03317  [pdf, other

    cs.IT cs.DM

    Reed-Muller Codes: Theory and Algorithms

    Authors: Emmanuel Abbe, Amir Shpilka, Min Ye

    Abstract: Reed-Muller (RM) codes are among the oldest, simplest and perhaps most ubiquitous family of codes. They are used in many areas of coding theory in both electrical engineering and computer science. Yet, many of their important properties are still under investigation. This paper covers some of the recent developments regarding the weight enumerator and the capacity-achieving properties of RM codes,… ▽ More

    Submitted 10 June, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

  26. arXiv:2001.02992  [pdf, other

    cs.LG cs.CC cs.IT stat.ML

    Poly-time universality and limitations of deep learning

    Authors: Emmanuel Abbe, Colin Sandon

    Abstract: The goal of this paper is to characterize function distributions that deep learning can or cannot learn in poly-time. A universality result is proved for SGD-based deep learning and a non-universality result is proved for GD-based deep learning; this also gives a separation between SGD-based deep learning and statistical query algorithms: (1) {\it Deep learning with SGD is efficiently universal.… ▽ More

    Submitted 7 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1812.06369

  27. arXiv:1909.12175  [pdf, other

    cs.IT math.CO

    Entropic matroids and their representation

    Authors: Emmanuel Abbe, Sophie Spirkl

    Abstract: This paper investigates entropic matroids, that is, matroids whose rank function is given as the Shannon entropy of random variables. In particular, we consider $p$-entropic matroids, for which the random variables each have support of cardinality $p$. We draw connections between such entropic matroids and secret-sharing matroids and show that entropic matroids are linear matroids when $p = 2,3$ b… ▽ More

    Submitted 26 September, 2019; originally announced September 2019.

  28. arXiv:1906.11148  [pdf, other

    cs.LG cs.IT stat.ML

    Chaining Meets Chain Rule: Multilevel Entropic Regularization and Training of Neural Nets

    Authors: Amir R. Asadi, Emmanuel Abbe

    Abstract: We derive generalization and excess risk bounds for neural nets using a family of complexity measures based on a multilevel relative entropy. The bounds are obtained by introducing the notion of generated hierarchical coverings of neural nets and by using the technique of chaining mutual information introduced in Asadi et al. NeurIPS'18. The resulting bounds are algorithm-dependent and exploit the… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

    Comments: 30 pages, 3 figures

  29. arXiv:1902.02431  [pdf, ps, other

    math.PR cs.IT

    Subadditivity Beyond Trees and the Chi-Squared Mutual Information

    Authors: Emmanuel Abbe, Enric Boix-Adserà

    Abstract: In 2000, Evans et al. [Eva+00] proved the subadditivity of the mutual information in the broadcasting on tree model with binary vertex labels and symmetric channels. They raised the question of whether such subadditivity extends to loopy graphs in some appropriate way. We recently proposed such an extension that applies to general graphs and binary vertex labels [AB18], using synchronization model… ▽ More

    Submitted 6 February, 2019; originally announced February 2019.

    Comments: 16 pages

  30. arXiv:1902.01470  [pdf, other

    cs.IT

    Recursive projection-aggregation decoding of Reed-Muller codes

    Authors: Min Ye, Emmanuel Abbe

    Abstract: We propose a new class of efficient decoding algorithms for Reed-Muller (RM) codes over binary-input memoryless channels. The algorithms are based on projecting the code on its cosets, recursively decoding the projected codes (which are lower-order RM codes), and aggregating the reconstructions (e.g., using majority votes). We further provide extensions of the algorithms using list-decoding. We… ▽ More

    Submitted 26 February, 2020; v1 submitted 4 February, 2019; originally announced February 2019.

  31. arXiv:1901.11533  [pdf, other

    cs.IT

    Reed-Muller codes polarize

    Authors: Emmanuel Abbe, Min Ye

    Abstract: Reed-Muller (RM) codes and polar codes are generated by the same matrix $G_m= \bigl[\begin{smallmatrix}1 & 0 \\ 1 & 1 \\ \end{smallmatrix}\bigr]^{\otimes m}$ but using different subset of rows. RM codes select simply rows having largest weights. Polar codes select instead rows having the largest conditional mutual information proceeding top to down in $G_m$; while this is a more elaborate and chan… ▽ More

    Submitted 31 January, 2019; originally announced January 2019.

  32. arXiv:1812.06369  [pdf, other

    cs.LG cs.CC cs.IT stat.ML

    Provable limitations of deep learning

    Authors: Emmanuel Abbe, Colin Sandon

    Abstract: As the success of deep learning reaches more grounds, one would like to also envision the potential limits of deep learning. This paper gives a first set of results proving that certain deep learning algorithms fail at learning certain efficiently learnable functions. The results put forward a notion of cross-predictability that characterizes when such failures take place. Parity functions provide… ▽ More

    Submitted 29 April, 2019; v1 submitted 15 December, 2018; originally announced December 2018.

  33. arXiv:1809.04818  [pdf, other

    cs.DS cs.DM math.PR

    Graph powering and spectral robustness

    Authors: Emmanuel Abbe, Enric Boix, Peter Ralli, Colin Sandon

    Abstract: Spectral algorithms, such as principal component analysis and spectral clustering, typically require careful data transformations to be effective: upon observing a matrix $A$, one may look at the spectrum of $ψ(A)$ for a properly chosen $ψ$. The issue is that the spectrum of $A$ might be contaminated by non-informational top eigenvalues, e.g., due to scale` variations in the data, and the applicat… ▽ More

    Submitted 13 September, 2018; originally announced September 2018.

  34. arXiv:1806.03803  [pdf, other

    cs.LG cs.IT math.PR stat.ML

    Chaining Mutual Information and Tightening Generalization Bounds

    Authors: Amir R. Asadi, Emmanuel Abbe, Sergio Verdú

    Abstract: Bounding the generalization error of learning algorithms has a long history, which yet falls short in explaining various generalization successes including those of deep learning. Two important difficulties are (i) exploiting the dependencies between the hypotheses, (ii) exploiting the dependence between the algorithm's input and output. Progress on the first point was made with the chaining metho… ▽ More

    Submitted 1 July, 2019; v1 submitted 11 June, 2018; originally announced June 2018.

    Comments: 20 pages, 1 figure; published at the NeurIPS 2018 conference

  35. arXiv:1806.03227  [pdf, ps, other

    math.PR cs.IT math.ST

    An Information-Percolation Bound for Spin Synchronization on General Graphs

    Authors: Emmanuel Abbe, Enric Boix

    Abstract: This paper considers the problem of reconstructing $n$ independent uniform spins $X_1,\dots,X_n$ living on the vertices of an $n$-vertex graph $G$, by observing their interactions on the edges of the graph. This captures instances of models such as (i) broadcasting on trees, (ii) block models, (iii) synchronization on grids, (iv) spiked Wigner models. The paper gives an upper-bound on the mutual i… ▽ More

    Submitted 11 June, 2018; v1 submitted 8 June, 2018; originally announced June 2018.

    Comments: The results of this paper are from Enric Boix's undergraduate senior thesis, advised by Emmanuel Abbe. The results were presented at the Workshop on Combinatorial Statistics, Montreal, May 2018

  36. arXiv:1802.03475  [pdf, other

    stat.ML cs.DC cs.IT cs.LG

    Communication-Computation Efficient Gradient Coding

    Authors: Min Ye, Emmanuel Abbe

    Abstract: This paper develops coding techniques to reduce the running time of distributed learning tasks. It characterizes the fundamental tradeoff to compute gradients (and more generally vector summations) in terms of three parameters: computation load, straggler tolerance and communication cost. It further gives an explicit coding scheme that achieves the optimal tradeoff based on recursive polynomial co… ▽ More

    Submitted 9 February, 2018; originally announced February 2018.

  37. arXiv:1801.04366  [pdf, other

    cs.IT

    Estimation in the group action channel

    Authors: Emmanuel Abbe, João M. Pereira, Amit Singer

    Abstract: We analyze the problem of estimating a signal from multiple measurements on a $\mbox{group action channel}$ that linearly transforms a signal by a random group action followed by a fixed projection and additive Gaussian noise. This channel is motivated by applications such as multi-reference alignment and cryo-electron microscopy. We focus on the large noise regime prevalent in these applications.… ▽ More

    Submitted 12 January, 2018; originally announced January 2018.

    Comments: 5 pages, conference

    MSC Class: 94A15; 62B10

  38. arXiv:1710.02793  [pdf, other

    cs.IT

    Multireference Alignment is Easier with an Aperiodic Translation Distribution

    Authors: Emmanuel Abbe, Tamir Bendory, William Leeb, João Pereira, Nir Sharon, Amit Singer

    Abstract: In the multireference alignment model, a signal is observed by the action of a random circular translation and the addition of Gaussian noise. The goal is to recover the signal's orbit by accessing multiple independent observations. Of particular interest is the sample complexity, i.e., the number of observations/samples needed in terms of the signal-to-noise ratio (the signal energy divided by th… ▽ More

    Submitted 3 November, 2018; v1 submitted 8 October, 2017; originally announced October 2017.

  39. arXiv:1706.09942  [pdf, other

    math.PR cs.IT cs.SI

    Community Detection on Euclidean Random Graphs

    Authors: Emmanuel Abbe, Francois Baccelli, Abishek Sankararaman

    Abstract: We study the problem of community detection (CD) on Euclidean random geometric graphs where each vertex has two latent variables: a binary community label and a $\mathbb{R}^d$ valued location label which forms the support of a Poisson point process of intensity $λ$. A random graph is then drawn with edge probabilities dependent on both the community and location labels. In contrast to the stochast… ▽ More

    Submitted 19 March, 2020; v1 submitted 29 June, 2017; originally announced June 2017.

    Comments: To appear, in Information and Inference : A journal of the IMA, 2020

  40. arXiv:1706.08561  [pdf, ps, other

    math.ST cs.IT

    Group Synchronization on Grids

    Authors: Emmanuel Abbe, Laurent Massoulie, Andrea Montanari, Allan Sly, Nikhil Srivastava

    Abstract: Group synchronization requires to estimate unknown elements $(θ_v)_{v\in V}$ of a compact group ${\mathfrak G}$ associated to the vertices of a graph $G=(V,E)$, using noisy observations of the group differences associated to the edges. This model is relevant to a variety of applications ranging from structure from motion in computer vision to graph localization and positioning, to certain families… ▽ More

    Submitted 26 June, 2017; originally announced June 2017.

    Comments: 21 pages

  41. arXiv:1706.05295  [pdf, other

    cs.SI cs.CC cs.IT math.PR stat.ML

    Nonbacktracking Bounds on the Influence in Independent Cascade Models

    Authors: Emmanuel Abbe, Sanjeev Kulkarni, Eun Jee Lee

    Abstract: This paper develops upper and lower bounds on the influence measure in a network, more precisely, the expected number of nodes that a seed set can influence in the independent cascade model. In particular, our bounds exploit nonbacktracking walks, Fortuin-Kasteleyn-Ginibre (FKG) type inequalities, and are computed by message passing implementation. Nonbacktracking walks have recently allowed for h… ▽ More

    Submitted 29 June, 2017; v1 submitted 23 May, 2017; originally announced June 2017.

  42. arXiv:1703.10146  [pdf, other

    math.PR cs.CC cs.IT cs.SI stat.ML

    Community Detection and Stochastic Block Models

    Authors: Emmanuel Abbe

    Abstract: The stochastic block model (SBM) is a random graph model with different group of vertices connecting differently. It is widely employed as a canonical model to study clustering and community detection, and provides a fertile ground to study the information-theoretic and computational tradeoffs that arise in combinatorial statistics and more generally data science. This monograph surveys the rece… ▽ More

    Submitted 24 October, 2023; v1 submitted 29 March, 2017; originally announced March 2017.

  43. arXiv:1701.07540  [pdf, ps, other

    cs.IT

    Sample Complexity of the Boolean Multireference Alignment Problem

    Authors: Emmanuel Abbe, Joao Pereira, Amit Singer

    Abstract: The Boolean multireference alignment problem consists in recovering a Boolean signal from multiple shifted and noisy observations. In this paper we obtain an expression for the error exponent of the maximum A posteriori decoder. This expression is used to characterize the number of measurements needed for signal recovery in the low SNR regime, in terms of higher order autocorrelations of the signa… ▽ More

    Submitted 2 February, 2017; v1 submitted 25 January, 2017; originally announced January 2017.

    Comments: 5 pages, submitted to ISIT

  44. arXiv:1512.09080  [pdf, other

    math.PR cs.CC cs.IT cs.LG cs.SI

    Detection in the stochastic block model with multiple clusters: proof of the achievability conjectures, acyclic BP, and the information-computation gap

    Authors: Emmanuel Abbe, Colin Sandon

    Abstract: In a paper that initiated the modern study of the stochastic block model, Decelle et al., backed by Mossel et al., made the following conjecture: Denote by $k$ the number of balanced communities, $a/n$ the probability of connecting inside communities and $b/n$ across, and set $\mathrm{SNR}=(a-b)^2/(k(a+(k-1)b)$; for any $k \geq 2$, it is possible to detect communities efficiently whenever… ▽ More

    Submitted 14 September, 2016; v1 submitted 30 December, 2015; originally announced December 2015.

    Comments: Extended version with further details on the algorithms and methods

  45. Entropies of weighted sums in cyclic groups and an application to polar codes

    Authors: Emmanuel Abbe, Jiange Li, Mokshay Madiman

    Abstract: In this note, the following basic question is explored: in a cyclic group, how are the Shannon entropies of the sum and difference of i.i.d. random variables related to each other? For the integer group, we show that they can differ by any real number additively, but not too much multiplicatively; on the other hand, for $\mathbb{Z}/3\mathbb{Z}$, the entropy of the difference is always at least as… ▽ More

    Submitted 21 April, 2016; v1 submitted 30 November, 2015; originally announced December 2015.

    Journal ref: Entropy, 19(9), 235, 2017

  46. arXiv:1509.05121  [pdf, other

    q-bio.GN cs.SI stat.AP

    Detecting Community Structures in Hi-C Genomic Data

    Authors: Irineo Cabreros, Emmanuel Abbe, Aristotelis Tsirigos

    Abstract: Community detection (CD) algorithms are applied to Hi-C data to discover new communities of loci in the 3D conformation of human and mouse DNA. We find that CD has some distinct advantages over pre-existing methods: (1) it is capable of finding a variable number of communities, (2) it can detect communities of DNA loci either adjacent or distant in the 1D sequence, and (3) it allows us to obtain a… ▽ More

    Submitted 17 September, 2015; originally announced September 2015.

  47. arXiv:1507.08685  [pdf, other

    cs.IT cond-mat.stat-mech math.ST

    Asymptotic Mutual Information for the Two-Groups Stochastic Block Model

    Authors: Yash Deshpande, Emmanuel Abbe, Andrea Montanari

    Abstract: We develop an information-theoretic view of the stochastic block model, a popular statistical model for the large-scale structure of complex networks. A graph $G$ from such a model is generated by first assigning vertex labels at random from a finite alphabet, and then connecting vertices with edge probabilities depending on the labels of the endpoints. In the case of the symmetric two-group model… ▽ More

    Submitted 30 July, 2015; originally announced July 2015.

    Comments: 41 pages, 3 pdf figures

  48. arXiv:1506.03729  [pdf, other

    math.PR cs.IT cs.LG cs.SI

    Recovering communities in the general stochastic block model without knowing the parameters

    Authors: Emmanuel Abbe, Colin Sandon

    Abstract: Most recent developments on the stochastic block model (SBM) rely on the knowledge of the model parameters, or at least on the number of communities. This paper introduces efficient algorithms that do not require such knowledge and yet achieve the optimal information-theoretic tradeoffs identified in [AS15] for linear size communities. The results are three-fold: (i) in the constant degree regime,… ▽ More

    Submitted 11 June, 2015; originally announced June 2015.

    Comments: arXiv admin note: substantial text overlap with arXiv:1503.00609

  49. arXiv:1504.08316  [pdf, other

    math.PR cs.CC

    Concentration of the number of solutions of random planted CSPs and Goldreich's one-way candidates

    Authors: Emmanuel Abbe, Katherine Edwards

    Abstract: This paper shows that the logarithm of the number of solutions of a random planted $k$-SAT formula concentrates around a deterministic $n$-independent threshold. Specifically, if $F^*_{k}(α,n)$ is a random $k$-SAT formula on $n$ variables, with clause density $α$ and with a uniformly drawn planted solution, there exists a function $φ_k(\cdot)$ such that, besides for some $α$ in a set of Lesbegue m… ▽ More

    Submitted 30 April, 2015; originally announced April 2015.

    Comments: 17 pages

  50. arXiv:1503.00609  [pdf, other

    math.PR cs.IT cs.SI

    Community detection in general stochastic block models: fundamental limits and efficient recovery algorithms

    Authors: Emmanuel Abbe, Colin Sandon

    Abstract: New phase transition phenomena have recently been discovered for the stochastic block model, for the special case of two non-overlapping symmetric communities. This gives raise in particular to new algorithmic challenges driven by the thresholds. This paper investigates whether a general phenomenon takes place for multiple communities, without imposing symmetry. In the general stochastic block m… ▽ More

    Submitted 4 April, 2015; v1 submitted 2 March, 2015; originally announced March 2015.