Search | arXiv e-print repository

Equivariant Machine Learning on Graphs with Nonlinear Spectral Filters

Authors: Ya-Wei Eileen Lin, Ronen Talmon, Ron Levie

Abstract: Equivariant machine learning is an approach for designing deep learning models that respect the symmetries of the problem, with the aim of reducing model complexity and improving generalization. In this paper, we focus on an extension of shift equivariance, which is the basis of convolution networks on images, to general graphs. Unlike images, graphs do not have a natural notion of domain translat… ▽ More Equivariant machine learning is an approach for designing deep learning models that respect the symmetries of the problem, with the aim of reducing model complexity and improving generalization. In this paper, we focus on an extension of shift equivariance, which is the basis of convolution networks on images, to general graphs. Unlike images, graphs do not have a natural notion of domain translation. Therefore, we consider the graph functional shifts as the symmetry group: the unitary operators that commute with the graph shift operator. Notably, such symmetries operate in the signal space rather than directly in the spatial space. We remark that each linear filter layer of a standard spectral graph neural network (GNN) commutes with graph functional shifts, but the activation function breaks this symmetry. Instead, we propose nonlinear spectral filters (NLSFs) that are fully equivariant to graph functional shifts and show that they have universal approximation properties. The proposed NLSFs are based on a new form of spectral domain that is transferable between graphs. We demonstrate the superior performance of NLSFs over existing spectral GNNs in node and graph classification benchmarks. △ Less

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2405.20724 [pdf, other]

Learning on Large Graphs using Intersecting Communities

Authors: Ben Finkelshtein, İsmail İlkan Ceylan, Michael Bronstein, Ron Levie

Abstract: Message Passing Neural Networks (MPNNs) are a staple of graph machine learning. MPNNs iteratively update each node's representation in an input graph by aggregating messages from the node's neighbors, which necessitates a memory complexity of the order of the number of graph edges. This complexity might quickly become prohibitive for large graphs provided they are not very sparse. In this paper, w… ▽ More Message Passing Neural Networks (MPNNs) are a staple of graph machine learning. MPNNs iteratively update each node's representation in an input graph by aggregating messages from the node's neighbors, which necessitates a memory complexity of the order of the number of graph edges. This complexity might quickly become prohibitive for large graphs provided they are not very sparse. In this paper, we propose a novel approach to alleviate this problem by approximating the input graph as an intersecting community graph (ICG) -- a combination of intersecting cliques. The key insight is that the number of communities required to approximate a graph does not depend on the graph size. We develop a new constructive version of the Weak Graph Regularity Lemma to efficiently construct an approximating ICG for any input graph. We then devise an efficient graph learning algorithm operating directly on ICG in linear memory and time with respect to the number of nodes (rather than edges). This offers a new and fundamentally different pipeline for learning on very large non-sparse graphs, whose applicability is demonstrated empirically on node classification tasks and spatio-temporal data processing. △ Less

Submitted 31 May, 2024; originally announced May 2024.

arXiv:2404.03473 [pdf, ps, other]

Generalization Bounds for Message Passing Networks on Mixture of Graphons

Authors: Sohir Maskey, Gitta Kutyniok, Ron Levie

Abstract: We study the generalization capabilities of Message Passing Neural Networks (MPNNs), a prevalent class of Graph Neural Networks (GNN). We derive generalization bounds specifically for MPNNs with normalized sum aggregation and mean aggregation. Our analysis is based on a data generation model incorporating a finite set of template graphons. Each graph within this framework is generated by sampling… ▽ More We study the generalization capabilities of Message Passing Neural Networks (MPNNs), a prevalent class of Graph Neural Networks (GNN). We derive generalization bounds specifically for MPNNs with normalized sum aggregation and mean aggregation. Our analysis is based on a data generation model incorporating a finite set of template graphons. Each graph within this framework is generated by sampling from one of the graphons with a certain degree of perturbation. In particular, we extend previous MPNN generalization results to a more realistic setting, which includes the following modifications: 1) we analyze simple random graphs with Bernoulli-distributed edges instead of weighted graphs; 2) we sample both graphs and graph signals from perturbed graphons instead of clean graphons; and 3) we analyze sparse graphs instead of dense graphs. In this more realistic and challenging scenario, we provide a generalization bound that decreases as the average number of nodes in the graphs increases. Our results imply that MPNNs with higher complexity than the size of the training set can still generalize effectively, as long as the graphs are sufficiently large. △ Less

Submitted 4 April, 2024; originally announced April 2024.

arXiv:2402.02287 [pdf, other]

Future Directions in the Theory of Graph Machine Learning

Authors: Christopher Morris, Fabrizio Frasca, Nadav Dym, Haggai Maron, İsmail İlkan Ceylan, Ron Levie, Derek Lim, Michael Bronstein, Martin Grohe, Stefanie Jegelka

Abstract: Machine learning on graphs, especially using graph neural networks (GNNs), has seen a surge in interest due to the wide availability of graph data across a broad spectrum of disciplines, from life to social and engineering sciences. Despite their practical success, our theoretical understanding of the properties of GNNs remains highly incomplete. Recent theoretical advancements primarily focus on… ▽ More Machine learning on graphs, especially using graph neural networks (GNNs), has seen a surge in interest due to the wide availability of graph data across a broad spectrum of disciplines, from life to social and engineering sciences. Despite their practical success, our theoretical understanding of the properties of GNNs remains highly incomplete. Recent theoretical advancements primarily focus on elucidating the coarse-grained expressive power of GNNs, predominantly employing combinatorial techniques. However, these studies do not perfectly align with practice, particularly in understanding the generalization behavior of GNNs when trained with stochastic first-order optimization techniques. In this position paper, we argue that the graph machine learning community needs to shift its attention to developing a balanced theory of graph machine learning, focusing on a more thorough understanding of the interplay of expressive power, generalization, and optimization. △ Less

Submitted 14 June, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

Comments: ICML 2024

arXiv:2310.07658 [pdf, other]

The First Pathloss Radio Map Prediction Challenge

Authors: Çağkan Yapar, Fabian Jaensch, Ron Levie, Gitta Kutyniok, Giuseppe Caire

Abstract: To foster research and facilitate fair comparisons among recently proposed pathloss radio map prediction methods, we have launched the ICASSP 2023 First Pathloss Radio Map Prediction Challenge. In this short overview paper, we briefly describe the pathloss prediction problem, the provided datasets, the challenge task and the challenge evaluation methodology. Finally, we present the results of the… ▽ More To foster research and facilitate fair comparisons among recently proposed pathloss radio map prediction methods, we have launched the ICASSP 2023 First Pathloss Radio Map Prediction Challenge. In this short overview paper, we briefly describe the pathloss prediction problem, the provided datasets, the challenge task and the challenge evaluation methodology. Finally, we present the results of the challenge. △ Less

Submitted 11 October, 2023; originally announced October 2023.

Comments: ICASSP 2023

arXiv:2308.10436 [pdf, other]

Approximately Equivariant Graph Networks

Authors: Ningyuan Huang, Ron Levie, Soledad Villar

Abstract: Graph neural networks (GNNs) are commonly described as being permutation equivariant with respect to node relabeling in the graph. This symmetry of GNNs is often compared to the translation equivariance of Euclidean convolution neural networks (CNNs). However, these two symmetries are fundamentally different: The translation equivariance of CNNs corresponds to symmetries of the fixed domain acting… ▽ More Graph neural networks (GNNs) are commonly described as being permutation equivariant with respect to node relabeling in the graph. This symmetry of GNNs is often compared to the translation equivariance of Euclidean convolution neural networks (CNNs). However, these two symmetries are fundamentally different: The translation equivariance of CNNs corresponds to symmetries of the fixed domain acting on the image signals (sometimes known as active symmetries), whereas in GNNs any permutation acts on both the graph signals and the graph domain (sometimes described as passive symmetries). In this work, we focus on the active symmetries of GNNs, by considering a learning setting where signals are supported on a fixed graph. In this case, the natural symmetries of GNNs are the automorphisms of the graph. Since real-world graphs tend to be asymmetric, we relax the notion of symmetries by formalizing approximate symmetries via graph coarsening. We present a bias-variance formula that quantifies the tradeoff between the loss in expressivity and the gain in the regularity of the learned estimator, depending on the chosen symmetry group. To illustrate our approach, we conduct extensive experiments on image inpainting, traffic flow prediction, and human pose estimation with different choices of symmetries. We show theoretically and empirically that the best generalization performance can be achieved by choosing a suitably larger group than the graph automorphism, but smaller than the permutation group. △ Less

Submitted 17 November, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

Comments: Accepted at NeurIPS 2023

arXiv:2306.03698 [pdf, other]

Fine-grained Expressivity of Graph Neural Networks

Authors: Jan Böker, Ron Levie, Ningyuan Huang, Soledad Villar, Christopher Morris

Abstract: Numerous recent works have analyzed the expressive power of message-passing graph neural networks (MPNNs), primarily utilizing combinatorial techniques such as the $1$-dimensional Weisfeiler-Leman test ($1$-WL) for the graph isomorphism problem. However, the graph isomorphism objective is inherently binary, not giving insights into the degree of similarity between two given graphs. This work resol… ▽ More Numerous recent works have analyzed the expressive power of message-passing graph neural networks (MPNNs), primarily utilizing combinatorial techniques such as the $1$-dimensional Weisfeiler-Leman test ($1$-WL) for the graph isomorphism problem. However, the graph isomorphism objective is inherently binary, not giving insights into the degree of similarity between two given graphs. This work resolves this issue by considering continuous extensions of both $1$-WL and MPNNs to graphons. Concretely, we show that the continuous variant of $1$-WL delivers an accurate topological characterization of the expressive power of MPNNs on graphons, revealing which graphs these networks can distinguish and the level of difficulty in separating them. We identify the finest topology where MPNNs separate points and prove a universal approximation theorem. Consequently, we provide a theoretical framework for graph and graphon similarity combining various topological variants of classical characterizations of the $1$-WL. In particular, we characterize the expressive power of MPNNs in terms of the tree distance, which is a graph distance based on the concept of fractional isomorphisms, and substructure counts via tree homomorphisms, showing that these concepts have the same expressive power as the $1$-WL and MPNNs on graphons. Empirically, we validate our theoretical findings by showing that randomly initialized MPNNs, without training, exhibit competitive performance compared to their trained counterparts. Moreover, we evaluate different MPNN architectures based on their ability to preserve graph distances, highlighting the significance of our continuous $1$-WL test in understanding MPNNs' expressivity. △ Less

Submitted 2 November, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

Comments: Accepted at NeurIPS 2023

arXiv:2305.15987 [pdf, other]

A graphon-signal analysis of graph neural networks

Authors: Ron Levie

Abstract: We present an approach for analyzing message passing graph neural networks (MPNNs) based on an extension of graphon analysis to a so called graphon-signal analysis. A MPNN is a function that takes a graph and a signal on the graph (a graph-signal) and returns some value. Since the input space of MPNNs is non-Euclidean, i.e., graphs can be of any size and topology, properties such as generalization… ▽ More We present an approach for analyzing message passing graph neural networks (MPNNs) based on an extension of graphon analysis to a so called graphon-signal analysis. A MPNN is a function that takes a graph and a signal on the graph (a graph-signal) and returns some value. Since the input space of MPNNs is non-Euclidean, i.e., graphs can be of any size and topology, properties such as generalization are less well understood for MPNNs than for Euclidean neural networks. We claim that one important missing ingredient in past work is a meaningful notion of graph-signal similarity measure, that endows the space of inputs to MPNNs with a regular structure. We present such a similarity measure, called the graphon-signal cut distance, which makes the space of all graph-signals a dense subset of a compact metric space -- the graphon-signal space. Informally, two deterministic graph-signals are close in cut distance if they ``look like'' they were sampled from the same random graph-signal model. Hence, our cut distance is a natural notion of graph-signal similarity, which allows comparing any pair of graph-signals of any size and topology. We prove that MPNNs are Lipschitz continuous functions over the graphon-signal metric space. We then give two applications of this result: 1) a generalization bound for MPNNs, and, 2) the stability of MPNNs to subsampling of graph-signals. Our results apply to any regular enough MPNN on any distribution of graph-signals, making the analysis rather universal. △ Less

Submitted 8 December, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

arXiv:2212.11777 [pdf, other]

Dataset of Pathloss and ToA Radio Maps With Localization Application

Authors: Çağkan Yapar, Ron Levie, Gitta Kutyniok, Giuseppe Caire

Abstract: In this article, we present a collection of radio map datasets in dense urban setting, which we generated and made publicly available. The datasets include simulated pathloss/received signal strength (RSS) and time of arrival (ToA) radio maps over a large collection of realistic dense urban setting in real city maps. The two main applications of the presented dataset are 1) learning methods that p… ▽ More In this article, we present a collection of radio map datasets in dense urban setting, which we generated and made publicly available. The datasets include simulated pathloss/received signal strength (RSS) and time of arrival (ToA) radio maps over a large collection of realistic dense urban setting in real city maps. The two main applications of the presented dataset are 1) learning methods that predict the pathloss from input city maps (namely, deep learning-based simulations), and, 2) wireless localization. The fact that the RSS and ToA maps are computed by the same simulations over the same city maps allows for a fair comparison of the RSS and ToA-based localization methods. △ Less

Submitted 11 September, 2024; v1 submitted 18 November, 2022; originally announced December 2022.

arXiv:2212.00728 [pdf, other]

On the Effective Usage of Priors in RSS-based Localization

Authors: Çağkan Yapar, Fabian Jaensch, Ron Levie, Giuseppe Caire

Abstract: In this paper, we study the localization problem in dense urban settings. In such environments, Global Navigation Satellite Systems fail to provide good accuracy due to low likelihood of line-of-sight (LOS) links between the receiver (Rx) to be located and the satellites, due to the presence of obstacles like the buildings. Thus, one has to resort to other technologies, which can reliably operate… ▽ More In this paper, we study the localization problem in dense urban settings. In such environments, Global Navigation Satellite Systems fail to provide good accuracy due to low likelihood of line-of-sight (LOS) links between the receiver (Rx) to be located and the satellites, due to the presence of obstacles like the buildings. Thus, one has to resort to other technologies, which can reliably operate under non-line-of-sight (NLOS) conditions. Recently, we proposed a Received Signal Strength (RSS) fingerprint and convolutional neural network-based algorithm, LocUNet, and demonstrated its state-of-the-art localization performance with respect to the widely adopted k-nearest neighbors (kNN) algorithm, and to state-of-the-art time of arrival (ToA) ranging-based methods. In the current work, we first recognize LocUNet's ability to learn the underlying prior distribution of the Rx position or Rx and transmitter (Tx) association preferences from the training data, and attribute its high performance to these. Conversely, we demonstrate that classical methods based on probabilistic approach, can greatly benefit from an appropriate incorporation of such prior information. Our studies also numerically prove LocUNet's close to optimal performance in many settings, by comparing it with the theoretically optimal formulations. △ Less

Submitted 27 November, 2022; originally announced December 2022.

arXiv:2211.12857 [pdf, other]

Explaining Image Classifiers with Multiscale Directional Image Representation

Authors: Stefan Kolek, Robert Windesheim, Hector Andrade Loarca, Gitta Kutyniok, Ron Levie

Abstract: Image classifiers are known to be difficult to interpret and therefore require explanation methods to understand their decisions. We present ShearletX, a novel mask explanation method for image classifiers based on the shearlet transform -- a multiscale directional image representation. Current mask explanation methods are regularized by smoothness constraints that protect against undesirable fine… ▽ More Image classifiers are known to be difficult to interpret and therefore require explanation methods to understand their decisions. We present ShearletX, a novel mask explanation method for image classifiers based on the shearlet transform -- a multiscale directional image representation. Current mask explanation methods are regularized by smoothness constraints that protect against undesirable fine-grained explanation artifacts. However, the smoothness of a mask limits its ability to separate fine-detail patterns, that are relevant for the classifier, from nearby nuisance patterns, that do not affect the classifier. ShearletX solves this problem by avoiding smoothness regularization all together, replacing it by shearlet sparsity constraints. The resulting explanations consist of a few edges, textures, and smooth parts of the original image, that are the most relevant for the decision of the classifier. To support our method, we propose a mathematical definition for explanation artifacts and an information theoretic score to evaluate the quality of mask explanations. We demonstrate the superiority of ShearletX over previous mask based explanation methods using these new metrics, and present exemplary situations where separating fine-detail patterns allows explaining phenomena that were not explainable before. △ Less

Submitted 28 April, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

Journal ref: CVPR 2023

arXiv:2210.08219 [pdf, other]

Unveiling the Sampling Density in Non-Uniform Geometric Graphs

Authors: Raffaele Paolino, Aleksandar Bojchevski, Stephan Günnemann, Gitta Kutyniok, Ron Levie

Abstract: A powerful framework for studying graphs is to consider them as geometric graphs: nodes are randomly sampled from an underlying metric space, and any pair of nodes is connected if their distance is less than a specified neighborhood radius. Currently, the literature mostly focuses on uniform sampling and constant neighborhood radius. However, real-world graphs are likely to be better represented b… ▽ More A powerful framework for studying graphs is to consider them as geometric graphs: nodes are randomly sampled from an underlying metric space, and any pair of nodes is connected if their distance is less than a specified neighborhood radius. Currently, the literature mostly focuses on uniform sampling and constant neighborhood radius. However, real-world graphs are likely to be better represented by a model in which the sampling density and the neighborhood radius can both vary over the latent space. For instance, in a social network communities can be modeled as densely sampled areas, and hubs as nodes with larger neighborhood radius. In this work, we first perform a rigorous mathematical analysis of this (more general) class of models, including derivations of the resulting graph shift operators. The key insight is that graph shift operators should be corrected in order to avoid potential distortions introduced by the non-uniform sampling. Then, we develop methods to estimate the unknown sampling density in a self-supervised fashion. Finally, we present exemplary applications in which the learnt density is used to 1) correct the graph shift operator and improve performance on a variety of tasks, 2) improve pooling, and 3) extract knowledge from networks. Our experimental findings support our theory and provide strong evidence for our model. △ Less

Submitted 25 November, 2022; v1 submitted 15 October, 2022; originally announced October 2022.

Comments: updated affiliations; improved references; more experiments; streamlined the paper; added justification for the geometric graph with hubs model

arXiv:2206.05530 [pdf, other]

Memorization-Dilation: Modeling Neural Collapse Under Label Noise

Authors: Duc Anh Nguyen, Ron Levie, Julian Lienen, Gitta Kutyniok, Eyke Hüllermeier

Abstract: The notion of neural collapse refers to several emergent phenomena that have been empirically observed across various canonical classification problems. During the terminal phase of training a deep neural network, the feature embedding of all examples of the same class tend to collapse to a single representation, and the features of different classes tend to separate as much as possible. Neural co… ▽ More The notion of neural collapse refers to several emergent phenomena that have been empirically observed across various canonical classification problems. During the terminal phase of training a deep neural network, the feature embedding of all examples of the same class tend to collapse to a single representation, and the features of different classes tend to separate as much as possible. Neural collapse is often studied through a simplified model, called the unconstrained feature representation, in which the model is assumed to have "infinite expressivity" and can map each data point to any arbitrary representation. In this work, we propose a more realistic variant of the unconstrained feature representation that takes the limited expressivity of the network into account. Empirical evidence suggests that the memorization of noisy data points leads to a degradation (dilation) of the neural collapse. Using a model of the memorization-dilation (M-D) phenomenon, we show one mechanism by which different losses lead to different performances of the trained network on noisy data. Our proofs reveal why label smoothing, a modification of cross-entropy empirically observed to produce a regularization effect, leads to improved generalization in classification tasks. △ Less

Submitted 4 April, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

Comments: to be published at ICLR 2023

arXiv:2202.00738 [pdf, ps, other]

LocUNet: Fast Urban Positioning Using Radio Maps and Deep Learning

Authors: Çağkan Yapar, Ron Levie, Gitta Kutyniok, Giuseppe Caire

Abstract: This paper deals with the problem of localization in a cellular network in a dense urban scenario. Global Navigation Satellite Systems (GNSS) typically perform poorly in urban environments, where the likelihood of line-of-sight conditions is low, and thus alternative localization methods are required for good accuracy. We present LocUNet: A deep learning method for localization, based merely on Re… ▽ More This paper deals with the problem of localization in a cellular network in a dense urban scenario. Global Navigation Satellite Systems (GNSS) typically perform poorly in urban environments, where the likelihood of line-of-sight conditions is low, and thus alternative localization methods are required for good accuracy. We present LocUNet: A deep learning method for localization, based merely on Received Signal Strength (RSS) from Base Stations (BSs), which does not require any increase in computation complexity at the user devices with respect to the device standard operations, unlike methods that rely on time of arrival or angle of arrival information. In the proposed method, the user to be localized reports the RSS from BSs to a Central Processing Unit (CPU), which may be located in the cloud. Alternatively, the localization can be performed locally at the user. Using estimated pathloss radio maps of the BSs, LocUNet can localize users with state-of-the-art accuracy and enjoys high robustness to inaccuracies in the radio maps. The proposed method does not require pre-sampling of the environment; and is suitable for real-time applications, thanks to the RadioUNet, a neural network-based radio map estimator. We also introduce two datasets that allow numerical comparisons of RSS and Time of Arrival (ToA) methods in realistic urban environments. △ Less

Submitted 2 February, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

Comments: To appear in ICASSP 2022. arXiv admin note: substantial text overlap with arXiv:2106.12556

arXiv:2202.00645 [pdf, other]

Generalization Analysis of Message Passing Neural Networks on Large Random Graphs

Authors: Sohir Maskey, Ron Levie, Yunseok Lee, Gitta Kutyniok

Abstract: Message passing neural networks (MPNN) have seen a steep rise in popularity since their introduction as generalizations of convolutional neural networks to graph-structured data, and are now considered state-of-the-art tools for solving a large variety of graph-focused problems. We study the generalization error of MPNNs in graph classification and regression. We assume that graphs of different cl… ▽ More Message passing neural networks (MPNN) have seen a steep rise in popularity since their introduction as generalizations of convolutional neural networks to graph-structured data, and are now considered state-of-the-art tools for solving a large variety of graph-focused problems. We study the generalization error of MPNNs in graph classification and regression. We assume that graphs of different classes are sampled from different random graph models. We show that, when training a MPNN on a dataset sampled from such a distribution, the generalization gap increases in the complexity of the MPNN, and decreases, not only with respect to the number of training samples, but also with the average number of nodes in the graphs. This shows how a MPNN with high complexity can generalize from a small dataset of graphs, as long as the graphs are large. The generalization bound is derived from a uniform convergence result, that shows that any MPNN, applied on a graph, approximates the MPNN applied on the geometric model that the graph discretizes. △ Less

Submitted 4 August, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

Comments: Preprint in Review

MSC Class: 68T07; 68R10

arXiv:2110.08252 [pdf, other]

A Rate-Distortion Framework for Explaining Black-box Model Decisions

Authors: Stefan Kolek, Duc Anh Nguyen, Ron Levie, Joan Bruna, Gitta Kutyniok

Abstract: We present the Rate-Distortion Explanation (RDE) framework, a mathematically well-founded method for explaining black-box model decisions. The framework is based on perturbations of the target input signal and applies to any differentiable pre-trained model such as neural networks. Our experiments demonstrate the framework's adaptability to diverse data modalities, particularly images, audio, and… ▽ More We present the Rate-Distortion Explanation (RDE) framework, a mathematically well-founded method for explaining black-box model decisions. The framework is based on perturbations of the target input signal and applies to any differentiable pre-trained model such as neural networks. Our experiments demonstrate the framework's adaptability to diverse data modalities, particularly images, audio, and physical simulations of urban environments. △ Less

Submitted 12 October, 2021; originally announced October 2021.

arXiv:2110.03485 [pdf, other]

Cartoon Explanations of Image Classifiers

Authors: Stefan Kolek, Duc Anh Nguyen, Ron Levie, Joan Bruna, Gitta Kutyniok

Abstract: We present CartoonX (Cartoon Explanation), a novel model-agnostic explanation method tailored towards image classifiers and based on the rate-distortion explanation (RDE) framework. Natural images are roughly piece-wise smooth signals -- also called cartoon-like images -- and tend to be sparse in the wavelet domain. CartoonX is the first explanation method to exploit this by requiring its explanat… ▽ More We present CartoonX (Cartoon Explanation), a novel model-agnostic explanation method tailored towards image classifiers and based on the rate-distortion explanation (RDE) framework. Natural images are roughly piece-wise smooth signals -- also called cartoon-like images -- and tend to be sparse in the wavelet domain. CartoonX is the first explanation method to exploit this by requiring its explanations to be sparse in the wavelet domain, thus extracting the relevant piece-wise smooth part of an image instead of relevant pixel-sparse regions. We demonstrate that CartoonX can reveal novel valuable explanatory information, particularly for misclassifications. Moreover, we show that CartoonX achieves a lower distortion with fewer coefficients than other state-of-the-art methods. △ Less

Submitted 20 October, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

Comments: ECCV 2022 (oral)

arXiv:2109.10096 [pdf, ps, other]

Transferability of Graph Neural Networks: an Extended Graphon Approach

Authors: Sohir Maskey, Ron Levie, Gitta Kutyniok

Abstract: We study spectral graph convolutional neural networks (GCNNs), where filters are defined as continuous functions of the graph shift operator (GSO) through functional calculus. A spectral GCNN is not tailored to one specific graph and can be transferred between different graphs. It is hence important to study the GCNN transferability: the capacity of the network to have approximately the same reper… ▽ More We study spectral graph convolutional neural networks (GCNNs), where filters are defined as continuous functions of the graph shift operator (GSO) through functional calculus. A spectral GCNN is not tailored to one specific graph and can be transferred between different graphs. It is hence important to study the GCNN transferability: the capacity of the network to have approximately the same repercussion on different graphs that represent the same phenomenon. Transferability ensures that GCNNs trained on certain graphs generalize if the graphs in the test set represent the same phenomena as the graphs in the training set. In this paper, we consider a model of transferability based on graphon analysis. Graphons are limit objects of graphs, and, in the graph paradigm, two graphs represent the same phenomenon if both approximate the same graphon. Our main contributions can be summarized as follows: 1) we prove that any fixed GCNN with continuous filters is transferable under graphs that approximate the same graphon, 2) we prove transferability for graphs that approximate unbounded graphon shift operators, which are defined in this paper, and, 3) we obtain non-asymptotic approximation results, proving linear stability of GCNNs. This extends current state-of-the-art results which show asymptotic transferability for polynomial filters under graphs that approximate bounded graphons. △ Less

Submitted 27 June, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

Comments: Preprint in Review

MSC Class: 68T07; 68R10; 47A60

arXiv:2108.12018 [pdf, ps, other]

doi 10.1002/mana.202100466

Existence of Uncertainty Minimizers for the Continuous Wavelet Transform

Authors: Simon Halvdansson, Jan-Fredrik Olsen, Nir Sochen, Ron Levie

Abstract: Continuous wavelet design is the endeavor to construct mother wavelets with desirable properties for the continuous wavelet transform (CWT). One class of methods for choosing a mother wavelet involves minimizing a functional, called the wavelet uncertainty functional. Recently, two new wavelet uncertainty functionals were derived from theoretical foundations. In both approaches, the uncertainty of… ▽ More Continuous wavelet design is the endeavor to construct mother wavelets with desirable properties for the continuous wavelet transform (CWT). One class of methods for choosing a mother wavelet involves minimizing a functional, called the wavelet uncertainty functional. Recently, two new wavelet uncertainty functionals were derived from theoretical foundations. In both approaches, the uncertainty of a mother wavelet describes its concentration, or accuracy, as a time-scale probe. While an uncertainty minimizing mother wavelet can be proven to have desirable localization properties, the existence of such a minimizer was never studied. In this paper, we prove the existence of minimizers for the two uncertainty functionals. △ Less

Submitted 15 November, 2022; v1 submitted 26 August, 2021; originally announced August 2021.

Comments: 15 pages, v2 fixed minor typos

arXiv:2106.12556 [pdf, other]

Real-time Outdoor Localization Using Radio Maps: A Deep Learning Approach

Authors: Çağkan Yapar, Ron Levie, Gitta Kutyniok, Giuseppe Caire

Abstract: Global Navigation Satellite Systems typically perform poorly in urban environments, where the likelihood of line-of-sight conditions between devices and satellites is low. Therefore, alternative location methods are required to achieve good accuracy. We present LocUNet: A convolutional, end-to-end trained neural network (NN) for the localization task, which is able to estimate the position of a us… ▽ More Global Navigation Satellite Systems typically perform poorly in urban environments, where the likelihood of line-of-sight conditions between devices and satellites is low. Therefore, alternative location methods are required to achieve good accuracy. We present LocUNet: A convolutional, end-to-end trained neural network (NN) for the localization task, which is able to estimate the position of a user from the received signal strength (RSS) of a small number of Base Stations (BS). Using estimations of pathloss radio maps of the BSs and the RSS measurements of the users to be localized, LocUNet can localize users with state-of-the-art accuracy and enjoys high robustness to inaccuracies in the estimations of radio maps. The proposed method does not require generating RSS fingerprints of each specific area where the localization task is performed and is suitable for real-time applications. Moreover, two novel datasets that allow for numerical evaluations of RSS and ToA methods in realistic urban environments are presented and made publicly available for the research community. By using these datasets, we also provide a fair comparison of state-of-the-art RSS and ToA-based methods in the dense urban scenario and show numerically that LocUNet outperforms all the compared methods. △ Less

Submitted 9 April, 2023; v1 submitted 23 June, 2021; originally announced June 2021.

Comments: Submitted to IEEE Transactions on Wireless Communications

arXiv:2104.01654 [pdf, other]

Wavelet Design with Optimally Localized Ambiguity Function: a Variational Approach

Authors: Ron Levie, Efrat Krimer Avraham, Nir Sochen

Abstract: In this paper, we design mother wavelets for the 1D continuous wavelet transform with some optimality properties. An optimal mother wavelet here is one that has an ambiguity function with minimal spread in the continuous coefficient space (also called phase space). Since the ambiguity function is the reproducing kernel of the coefficient space, optimal windows lead to phase space representations w… ▽ More In this paper, we design mother wavelets for the 1D continuous wavelet transform with some optimality properties. An optimal mother wavelet here is one that has an ambiguity function with minimal spread in the continuous coefficient space (also called phase space). Since the ambiguity function is the reproducing kernel of the coefficient space, optimal windows lead to phase space representations which are "optimally sharp." Namely, the wavelet coefficients have minimal correlations with each other. Such a construction also promotes sparsity in phase space. The spread of the ambiguity function is modeled as the sum of variances along the axes in phase space. In order to optimize the mother wavelet directly as a 1D signal, we pull-back the variances, defined on the 2D phase space, to the so called window-signal space. This is done using the recently developed wavelet-Plancharel theory. The approach allows formulating the optimization problem of the 2D ambiguity function as a minimization problem of the 1D mother wavelet. The resulting 1D formulation is more efficient and does not involve complicated constraints on the 2D ambiguity function. We optimize the mother wavelet using gradient descent, which yields a locally optimal mother wavelet. △ Less

Submitted 4 April, 2021; originally announced April 2021.

MSC Class: 42C40; 20C40; 65T60; 65K10; 47J30

arXiv:2011.02025 [pdf, other]

Quasi Monte Carlo Time-Frequency Analysis

Authors: Ron Levie, Haim Avron, Gitta Kutyniok

Abstract: We study signal processing tasks in which the signal is mapped via some generalized time-frequency transform to a higher dimensional time-frequency space, processed there, and synthesized to an output signal. We show how to approximate such methods using a quasi-Monte Carlo (QMC) approach. We consider cases where the time-frequency representation is redundant, having feature axes in addition to th… ▽ More We study signal processing tasks in which the signal is mapped via some generalized time-frequency transform to a higher dimensional time-frequency space, processed there, and synthesized to an output signal. We show how to approximate such methods using a quasi-Monte Carlo (QMC) approach. We consider cases where the time-frequency representation is redundant, having feature axes in addition to the time and frequency axes. The proposed QMC method allows sampling both efficiently and evenly such redundant time-frequency representations. Indeed, 1) the number of samples required for a certain accuracy is log-linear in the resolution of the signal space, and depends only weakly on the dimension of the redundant time-frequency space, and 2) the quasi-random samples have low discrepancy, so they are spread evenly in the redundant time-frequency space. One example of such redundant representation is the localizing time-frequency transform (LTFT), where the time-frequency plane is enhanced by a third axis. This higher dimensional time-frequency space improves the quality of some time-frequency signal processing tasks, like the phase vocoder (an audio signal processing effect). Since the computational complexity of the QMC is log-linear in the resolution of the signal space, this higher dimensional time-frequency space does not degrade the computation complexity of the proposed QMC method. The proposed QMC method is more efficient than standard Monte Carlo methods, since the deterministic QMC sample points are optimally spread in the time-frequency space, while random samples are not. △ Less

Submitted 4 September, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

arXiv:2009.10525 [pdf, ps, other]

Randomized Continuous Frames in Time-Frequency Analysis

Authors: Ron Levie, Haim Avron

Abstract: Recently, a Monte Carlo approach was proposed for processing highly redundant continuous frames. In this paper we present and analyze applications of this new theory. The computational complexity of the Monte Carlo method relies on the continuous frame being so called linear volume discretizable (LVD). The LVD property means that the number of samples in the coefficient space required by the Monte… ▽ More Recently, a Monte Carlo approach was proposed for processing highly redundant continuous frames. In this paper we present and analyze applications of this new theory. The computational complexity of the Monte Carlo method relies on the continuous frame being so called linear volume discretizable (LVD). The LVD property means that the number of samples in the coefficient space required by the Monte Carlo method is proportional to the resolution of the discrete signal. We show in this paper that the continuous wavelet transform (CWT) and the localizing time-frequency transform (LTFT) are LVD. The LTFT is a time-frequency representation based on a 3D time-frequency space with a richer class of time-frequency atoms than classical time-frequency transforms like the short time Fourier transform (STFT) and the CWT. Our analysis proves that performing signal processing with the LTFT has the same asymptotic complexity as signal processing with the STFT and CWT (based on FFT), even though the coefficient space of the LTFT is higher dimensional. △ Less

Submitted 7 October, 2021; v1 submitted 20 September, 2020; originally announced September 2020.

arXiv:2009.09398 [pdf, other]

Analysis of simultaneous inpainting and geometric separation based on sparse decomposition

Authors: Van Tiep Do, Ron Levie, Gitta Kutyniok

Abstract: Natural images are often the superposition of various parts of different geometric characteristics. For instance, an image might be a mixture of cartoon and texture structures. In addition, images are often given with missing data. In this paper, we develop a method for simultaneously decomposing an image to its two underlying parts and inpainting the missing data. Our separation inpainting method… ▽ More Natural images are often the superposition of various parts of different geometric characteristics. For instance, an image might be a mixture of cartoon and texture structures. In addition, images are often given with missing data. In this paper, we develop a method for simultaneously decomposing an image to its two underlying parts and inpainting the missing data. Our separation inpainting method is based on and $l_1$ minimization approach, using two dictionaries, each sparsifying one of the image parts but not the other. We introduce a comprehensive convergence analysis of our method, in a general setting, utilizing the concepts of joint concentration, clustered sparsity, and cluster coherence. As the main application of our theory, we consider the problem of separating and inpainting an image to a cartoon and texture parts. △ Less

Submitted 7 July, 2021; v1 submitted 20 September, 2020; originally announced September 2020.

MSC Class: 42C40; 42C15; 65J22; 65T60; 68U10; 90C25

arXiv:2007.00758 [pdf, other]

In-Distribution Interpretability for Challenging Modalities

Authors: Cosmas Heiß, Ron Levie, Cinjon Resnick, Gitta Kutyniok, Joan Bruna

Abstract: It is widely recognized that the predictions of deep neural networks are difficult to parse relative to simpler approaches. However, the development of methods to investigate the mode of operation of such models has advanced rapidly in the past few years. Recent work introduced an intuitive framework which utilizes generative models to improve on the meaningfulness of such explanations. In this wo… ▽ More It is widely recognized that the predictions of deep neural networks are difficult to parse relative to simpler approaches. However, the development of methods to investigate the mode of operation of such models has advanced rapidly in the past few years. Recent work introduced an intuitive framework which utilizes generative models to improve on the meaningfulness of such explanations. In this work, we display the flexibility of this method to interpret diverse and challenging modalities: music and physical simulations of urban environments. △ Less

Submitted 7 July, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

arXiv:2006.05397 [pdf, other]

Real-time Localization Using Radio Maps

Authors: Çağkan Yapar, Ron Levie, Gitta Kutyniok, Giuseppe Caire

Abstract: This paper deals with the problem of localization in a cellular network in a dense urban scenario. Global Navigation Satellite System typically performs poorly in urban environments when there is no line-of-sight between the devices and the satellites, and thus alternative localization methods are often required. We present a simple yet effective method for localization based on pathloss. In our a… ▽ More This paper deals with the problem of localization in a cellular network in a dense urban scenario. Global Navigation Satellite System typically performs poorly in urban environments when there is no line-of-sight between the devices and the satellites, and thus alternative localization methods are often required. We present a simple yet effective method for localization based on pathloss. In our approach, the user to be localized reports the received signal strength from a set of base stations with known locations. For each base station we have a good approximation of the pathloss at each location in the map, provided by RadioUNet, an efficient deep learning-based simulator of pathloss functions in urban environment, akin to ray-tracing. Using the approximations of the pathloss functions of all base stations and the reported signal strengths, we are able to extract a very accurate approximation of the location of the user. △ Less

Submitted 9 June, 2020; originally announced June 2020.

arXiv:1911.09002 [pdf, other]

RadioUNet: Fast Radio Map Estimation with Convolutional Neural Networks

Authors: Ron Levie, Çağkan Yapar, Gitta Kutyniok, Giuseppe Caire

Abstract: In this paper we propose a highly efficient and very accurate deep learning method for estimating the propagation pathloss from a point $x$ (transmitter location) to any point $y$ on a planar domain. For applications such as user-cell site association and device-to-device link scheduling, an accurate knowledge of the pathloss function for all pairs of transmitter-receiver locations is very importa… ▽ More In this paper we propose a highly efficient and very accurate deep learning method for estimating the propagation pathloss from a point $x$ (transmitter location) to any point $y$ on a planar domain. For applications such as user-cell site association and device-to-device link scheduling, an accurate knowledge of the pathloss function for all pairs of transmitter-receiver locations is very important. Commonly used statistical models approximate the pathloss as a decaying function of the distance between transmitter and receiver. However, in realistic propagation environments characterized by the presence of buildings, street canyons, and objects at different heights, such radial-symmetric functions yield very misleading results. In this paper we show that properly designed and trained deep neural networks are able to learn how to estimate the pathloss function, given an urban environment, in a very accurate and computationally efficient manner. Our proposed method, termed RadioUNet, learns from a physical simulation dataset, and generates pathloss estimations that are very close to the simulations, but are much faster to compute for real-time applications. Moreover, we propose methods for transferring what was learned from simulations to real-life. Numerical results show that our method significantly outperforms previously proposed methods. △ Less

Submitted 22 December, 2020; v1 submitted 17 November, 2019; originally announced November 2019.

arXiv:1907.12972 [pdf, other]

Transferability of Spectral Graph Convolutional Neural Networks

Authors: Ron Levie, Wei Huang, Lorenzo Bucci, Michael M. Bronstein, Gitta Kutyniok

Abstract: This paper focuses on spectral graph convolutional neural networks (ConvNets), where filters are defined as elementwise multiplication in the frequency domain of a graph. In machine learning settings where the dataset consists of signals defined on many different graphs, the trained ConvNet should generalize to signals on graphs unseen in the training set. It is thus important to transfer ConvNets… ▽ More This paper focuses on spectral graph convolutional neural networks (ConvNets), where filters are defined as elementwise multiplication in the frequency domain of a graph. In machine learning settings where the dataset consists of signals defined on many different graphs, the trained ConvNet should generalize to signals on graphs unseen in the training set. It is thus important to transfer ConvNets between graphs. Transferability, which is a certain type of generalization capability, can be loosely defined as follows: if two graphs describe the same phenomenon, then a single filter or ConvNet should have similar repercussions on both graphs. This paper aims at debunking the common misconception that spectral filters are not transferable. We show that if two graphs discretize the same "continuous" space, then a spectral filter or ConvNet has approximately the same repercussion on both graphs. Our analysis is more permissive than the standard analysis. Transferability is typically described as the robustness of the filter to small graph perturbations and re-indexing of the vertices. Our analysis accounts also for large graph perturbations. We prove transferability between graphs that can have completely different dimensions and topologies, only requiring that both graphs discretize the same underlying space in some generic sense. △ Less

Submitted 12 June, 2021; v1 submitted 30 July, 2019; originally announced July 2019.

arXiv:1901.10524 [pdf, other]

On the Transferability of Spectral Graph Filters

Authors: Ron Levie, Elvin Isufi, Gitta Kutyniok

Abstract: This paper focuses on spectral filters on graphs, namely filters defined as elementwise multiplication in the frequency domain of a graph. In many graph signal processing settings, it is important to transfer a filter from one graph to another. One example is in graph convolutional neural networks (ConvNets), where the dataset consists of signals defined on many different graphs, and the learned f… ▽ More This paper focuses on spectral filters on graphs, namely filters defined as elementwise multiplication in the frequency domain of a graph. In many graph signal processing settings, it is important to transfer a filter from one graph to another. One example is in graph convolutional neural networks (ConvNets), where the dataset consists of signals defined on many different graphs, and the learned filters should generalize to signals on new graphs, not present in the training set. A necessary condition for transferability (the ability to transfer filters) is stability. Namely, given a graph filter, if we add a small perturbation to the graph, then the filter on the perturbed graph is a small perturbation of the original filter. It is a common misconception that spectral filters are not stable, and this paper aims at debunking this mistake. We introduce a space of filters, called the Cayley smoothness space, that contains the filters of state-of-the-art spectral filtering methods, and whose filters can approximate any generic spectral filter. For filters in this space, the perturbation in the filter is bounded by a constant times the perturbation in the graph, and filters in the Cayley smoothness space are thus termed linearly stable. By combining stability with the known property of equivariance, we prove that graph spectral filters are transferable. △ Less

Submitted 29 January, 2019; originally announced January 2019.

arXiv:1808.08810 [pdf, ps, other]

Randomized Signal Processing with Continuous Frames

Authors: Ron Levie, Haim Avron

Abstract: This paper focuses on signal processing tasks in which the signal is transformed from the signal space to a higher dimensional coefficient space (also called phase space) using a continuous frame, processed in the coefficient space, and synthesized to an output signal. We show how to approximate such methods, termed phase space signal processing methods, using a Monte Carlo method. As opposed to s… ▽ More This paper focuses on signal processing tasks in which the signal is transformed from the signal space to a higher dimensional coefficient space (also called phase space) using a continuous frame, processed in the coefficient space, and synthesized to an output signal. We show how to approximate such methods, termed phase space signal processing methods, using a Monte Carlo method. As opposed to standard discretizations of continuous frames, based on sampling discrete frames from the continuous system, the proposed Monte Carlo method is directly a quadrature approximation of the continuous frame. We show that the Monte Carlo method allows working with highly redundant continuous frames, since the number of samples required for a certain accuracy is proportional to the dimension of the signal space, and not to the dimension of the phase space. Moreover, even though the continuous frame is highly redundant, the Monte Carlo samples are spread uniformly, and hence represent the coefficient space more faithfully than standard frame discretizations. △ Less

Submitted 12 September, 2021; v1 submitted 27 August, 2018; originally announced August 2018.

arXiv:1712.02770 [pdf, other]

A Wavelet Plancherel Theory with Application to Multipliers and Sparse Approximations

Authors: Ron Levie, Nir Sochen

Abstract: We introduce an extension of continuous wavelet theory that enables an efficient implementation of multiplicative operators in the coefficient space. In the new theory, the signal space is embedded in a larger abstract signal space -- the so called window-signal space. There is a canonical extension of the wavelet transform to an isometric isomorphism between the window-signal space and the coeffi… ▽ More We introduce an extension of continuous wavelet theory that enables an efficient implementation of multiplicative operators in the coefficient space. In the new theory, the signal space is embedded in a larger abstract signal space -- the so called window-signal space. There is a canonical extension of the wavelet transform to an isometric isomorphism between the window-signal space and the coefficient space. Hence, the new framework is called a wavelet-Plancherel theory, and the extended wavelet transform is called the wavelet-Plancherel transform. Since the wavelet-Plancherel transform is an isometric isomorphism, any operation in the coefficient space can be pulled-back to an operation in the window-signal space. It is then possible to improve the computational complexity of methods that involve a multiplicative operator in the coefficient space, by performing all computations directly in the window-signal space. As one example application, we show how continuous wavelet multipliers (also called Calderón-Toeplitz Operators), with polynomial symbols, can be implemented with linear complexity in the resolution of the 1D signal. As another example, we develop a framework for efficiently computing greedy sparse approximations to signals based on elements of continuous wavelet systems. △ Less

Submitted 6 July, 2021; v1 submitted 7 December, 2017; originally announced December 2017.

arXiv:1707.04863 [pdf, ps, other]

Uncertainty principles and optimally sparse wavelet transforms

Authors: Ron Levie, Nir Sochen

Abstract: In this paper we introduce a new localization framework for wavelet transforms, such as the 1D wavelet transform and the Shearlet transform. Our goal is to design nonadaptive window functions that promote sparsity in some sense. For that, we introduce a framework for analyzing localization aspects of window functions. Our localization theory diverges from the conventional theory in two ways. First… ▽ More In this paper we introduce a new localization framework for wavelet transforms, such as the 1D wavelet transform and the Shearlet transform. Our goal is to design nonadaptive window functions that promote sparsity in some sense. For that, we introduce a framework for analyzing localization aspects of window functions. Our localization theory diverges from the conventional theory in two ways. First, we distinguish between the group generators, and the operators that measure localization (called observables). Second, we define the uncertainty of a signal transform based on a window as a whole, instead of defining the uncertainty of an individual window. We show that the uncertainty of a window function, in the signal space, is closely related to the localization of the reproducing kernel of the wavelet transform, in phase space. As a result, we show that using uncertainty minimizing window functions, results in representations which are optimally sparse in some sense. △ Less

Submitted 9 July, 2018; v1 submitted 16 July, 2017; originally announced July 2017.

arXiv:1705.07664 [pdf, other]

CayleyNets: Graph Convolutional Neural Networks with Complex Rational Spectral Filters

Authors: Ron Levie, Federico Monti, Xavier Bresson, Michael M. Bronstein

Abstract: The rise of graph-structured data such as social networks, regulatory networks, citation graphs, and functional brain networks, in combination with resounding success of deep learning in various applications, has brought the interest in generalizing deep learning models to non-Euclidean domains. In this paper, we introduce a new spectral domain convolutional architecture for deep learning on graph… ▽ More The rise of graph-structured data such as social networks, regulatory networks, citation graphs, and functional brain networks, in combination with resounding success of deep learning in various applications, has brought the interest in generalizing deep learning models to non-Euclidean domains. In this paper, we introduce a new spectral domain convolutional architecture for deep learning on graphs. The core ingredient of our model is a new class of parametric rational complex functions (Cayley polynomials) allowing to efficiently compute spectral filters on graphs that specialize on frequency bands of interest. Our model generates rich spectral filters that are localized in space, scales linearly with the size of the input data for sparsely-connected graphs, and can handle different constructions of Laplacian operators. Extensive experimental results show the superior performance of our approach, in comparison to other spectral domain convolutional architectures, on spectral image classification, community detection, vertex classification and matrix completion tasks. △ Less

Submitted 31 October, 2018; v1 submitted 22 May, 2017; originally announced May 2017.

Showing 1–33 of 33 results for author: Levie, R