Search | arXiv e-print repository

Normalizing flows for lattice gauge theory in arbitrary space-time dimension

Authors: Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Gurtej Kanwar, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

Abstract: Applications of normalizing flows to the sampling of field configurations in lattice gauge theory have so far been explored almost exclusively in two space-time dimensions. We report new algorithmic developments of gauge-equivariant flow architectures facilitating the generalization to higher-dimensional lattice geometries. Specifically, we discuss masked autoregressive transformations with tracta… ▽ More Applications of normalizing flows to the sampling of field configurations in lattice gauge theory have so far been explored almost exclusively in two space-time dimensions. We report new algorithmic developments of gauge-equivariant flow architectures facilitating the generalization to higher-dimensional lattice geometries. Specifically, we discuss masked autoregressive transformations with tractable and unbiased Jacobian determinants, a key ingredient for scalable and asymptotically exact flow-based sampling algorithms. For concreteness, results from a proof-of-principle application to SU(3) lattice gauge theory in four space-time dimensions are reported. △ Less

Submitted 3 May, 2023; originally announced May 2023.

arXiv:2211.07541 [pdf, other]

Aspects of scaling and scalability for flow-based sampling of lattice QCD

Authors: Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

Abstract: Recent applications of machine-learned normalizing flows to sampling in lattice field theory suggest that such methods may be able to mitigate critical slowing down and topological freezing. However, these demonstrations have been at the scale of toy models, and it remains to be determined whether they can be applied to state-of-the-art lattice quantum chromodynamics calculations. Assessing the vi… ▽ More Recent applications of machine-learned normalizing flows to sampling in lattice field theory suggest that such methods may be able to mitigate critical slowing down and topological freezing. However, these demonstrations have been at the scale of toy models, and it remains to be determined whether they can be applied to state-of-the-art lattice quantum chromodynamics calculations. Assessing the viability of sampling algorithms for lattice field theory at scale has traditionally been accomplished using simple cost scaling laws, but as we discuss in this work, their utility is limited for flow-based approaches. We conclude that flow-based approaches to sampling are better thought of as a broad family of algorithms with different scaling properties, and that scalability must be assessed experimentally. △ Less

Submitted 14 November, 2022; originally announced November 2022.

Comments: 22 pages, 8 figures

Report number: MIT-CTP/5496

arXiv:2205.06175 [pdf, other]

A Generalist Agent

Authors: Scott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Alexander Novikov, Gabriel Barth-Maron, Mai Gimenez, Yury Sulsky, Jackie Kay, Jost Tobias Springenberg, Tom Eccles, Jake Bruce, Ali Razavi, Ashley Edwards, Nicolas Heess, Yutian Chen, Raia Hadsell, Oriol Vinyals, Mahyar Bordbar, Nando de Freitas

Abstract: Inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, dec… ▽ More Inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, deciding based on its context whether to output text, joint torques, button presses, or other tokens. In this report we describe the model and the data, and document the current capabilities of Gato. △ Less

Submitted 11 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

Comments: Published at TMLR, 42 pages

Journal ref: Transactions on Machine Learning Research, 11/2022, https://openreview.net/forum?id=1ikK0kHjvj

arXiv:2203.01187 [pdf, other]

Visual Feature Encoding for GNNs on Road Networks

Authors: Oliver Stromann, Alireza Razavi, Michael Felsberg

Abstract: In this work, we present a novel approach to learning an encoding of visual features into graph neural networks with the application on road network data. We propose an architecture that combines state-of-the-art vision backbone networks with graph neural networks. More specifically, we perform a road type classification task on an Open Street Map road network through encoding of satellite imagery… ▽ More In this work, we present a novel approach to learning an encoding of visual features into graph neural networks with the application on road network data. We propose an architecture that combines state-of-the-art vision backbone networks with graph neural networks. More specifically, we perform a road type classification task on an Open Street Map road network through encoding of satellite imagery using various ResNet architectures. Our architecture further enables fine-tuning and a transfer-learning approach is evaluated by pretraining on the NWPU-RESISC45 image classification dataset for remote sensing and comparing them to purely ImageNet-pretrained ResNet models as visual feature encoders. The results show not only that the visual feature encoders are superior to low-level visual features, but also that the fine-tuning of the visual feature encoder to a general remote sensing dataset such as NWPU-RESISC45 can further improve the performance of a GNN on a machine learning task like road type classification. △ Less

Submitted 2 March, 2022; originally announced March 2022.

arXiv:2112.10624 [pdf, other]

Learning to integrate vision data into road network data

Authors: Oliver Stromann, Alireza Razavi, Michael Felsberg

Abstract: Road networks are the core infrastructure for connected and autonomous vehicles, but creating meaningful representations for machine learning applications is a challenging task. In this work, we propose to integrate remote sensing vision data into road network data for improved embeddings with graph neural networks. We present a segmentation of road edges based on spatio-temporal road and traffic… ▽ More Road networks are the core infrastructure for connected and autonomous vehicles, but creating meaningful representations for machine learning applications is a challenging task. In this work, we propose to integrate remote sensing vision data into road network data for improved embeddings with graph neural networks. We present a segmentation of road edges based on spatio-temporal road and traffic characteristics, which allows to enrich the attribute set of road networks with visual features of satellite imagery and digital surface models. We show that both, the segmentation and the integration of vision data can increase performance on a road type classification task, and we achieve state-of-the-art performance on the OSM+DiDi Chuxing dataset on Chengdu, China. △ Less

Submitted 2 March, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

arXiv:2106.04615 [pdf, other]

Vector Quantized Models for Planning

Authors: Sherjil Ozair, Yazhe Li, Ali Razavi, Ioannis Antonoglou, Aäron van den Oord, Oriol Vinyals

Abstract: Recent developments in the field of model-based RL have proven successful in a range of environments, especially ones where planning is essential. However, such successes have been limited to deterministic fully-observed environments. We present a new approach that handles stochastic and partially-observable environments. Our key insight is to use discrete autoencoders to capture the multiple poss… ▽ More Recent developments in the field of model-based RL have proven successful in a range of environments, especially ones where planning is essential. However, such successes have been limited to deterministic fully-observed environments. We present a new approach that handles stochastic and partially-observable environments. Our key insight is to use discrete autoencoders to capture the multiple possible effects of an action in a stochastic environment. We use a stochastic variant of Monte Carlo tree search to plan over both the agent's actions and the discrete latent variables representing the environment's response. Our approach significantly outperforms an offline version of MuZero on a stochastic interpretation of chess where the opponent is considered part of the environment. We also show that our approach scales to DeepMind Lab, a first-person 3D environment with large visual observations and partial observability. △ Less

Submitted 10 June, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

Comments: ICML 2021

arXiv:2103.01950 [pdf, other]

Predicting Video with VQVAE

Authors: Jacob Walker, Ali Razavi, Aäron van den Oord

Abstract: In recent years, the task of video prediction-forecasting future video given past video frames-has attracted attention in the research community. In this paper we propose a novel approach to this problem with Vector Quantized Variational AutoEncoders (VQ-VAE). With VQ-VAE we compress high-resolution videos into a hierarchical set of multi-scale discrete latent variables. Compared to pixels, this c… ▽ More In recent years, the task of video prediction-forecasting future video given past video frames-has attracted attention in the research community. In this paper we propose a novel approach to this problem with Vector Quantized Variational AutoEncoders (VQ-VAE). With VQ-VAE we compress high-resolution videos into a hierarchical set of multi-scale discrete latent variables. Compared to pixels, this compressed latent space has dramatically reduced dimensionality, allowing us to apply scalable autoregressive generative models to predict video. In contrast to previous work that has largely emphasized highly constrained datasets, we focus on very diverse, large-scale datasets such as Kinetics-600. We predict video at a higher resolution on unconstrained videos, 256x256, than any other previous method to our knowledge. We further validate our approach against prior work via a crowdsourced human evaluation. △ Less

Submitted 2 March, 2021; originally announced March 2021.

Comments: 13 Pages

ACM Class: I.2.6; I.2.10

arXiv:2007.03356 [pdf, other]

Do Transformers Need Deep Long-Range Memory

Authors: Jack W. Rae, Ali Razavi

Abstract: Deep attention models have advanced the modelling of sequential data across many domains. For language modelling in particular, the Transformer-XL -- a Transformer augmented with a long-range memory of past activations -- has been shown to be state-of-the-art across a variety of well-studied benchmarks. The Transformer-XL incorporates a long-range memory at every layer of the network, which render… ▽ More Deep attention models have advanced the modelling of sequential data across many domains. For language modelling in particular, the Transformer-XL -- a Transformer augmented with a long-range memory of past activations -- has been shown to be state-of-the-art across a variety of well-studied benchmarks. The Transformer-XL incorporates a long-range memory at every layer of the network, which renders its state to be thousands of times larger than RNN predecessors. However it is unclear whether this is necessary. We perform a set of interventions to show that comparable performance can be obtained with 6X fewer long range memories and better performance can be obtained by limiting the range of attention in lower layers of the network. △ Less

Submitted 7 July, 2020; originally announced July 2020.

Comments: published at 58th Annual Meeting of the Association for Computational Linguistics. 6 pages, 4 figures, 1 table

arXiv:2001.10568 [pdf, other]

Landmark2Vec: An Unsupervised Neural Network-Based Landmark Positioning Method

Authors: Alireza Razavi

Abstract: A Neural Network-based method for unsupervised landmarks map estimation from measurements taken from landmarks is introduced. The measurements needed for training the network are the signals observed/received from landmarks by an agent. The definition of landmarks, agent, and the measurements taken by agent from landmarks is rather broad here: landmarks can be visual objects, e.g., poles along a r… ▽ More A Neural Network-based method for unsupervised landmarks map estimation from measurements taken from landmarks is introduced. The measurements needed for training the network are the signals observed/received from landmarks by an agent. The definition of landmarks, agent, and the measurements taken by agent from landmarks is rather broad here: landmarks can be visual objects, e.g., poles along a road, with measurements being the size of landmark in a visual sensor mounted on a vehicle (agent), or they can be radio transmitters, e.g., WiFi access points inside a building, with measurements being the Received Signal Strength (RSS) heard from them by a mobile device carried by a person (agent). The goal of the map estimation is then to find the positions of landmarks up to a scale, rotation, and shift (i.e., the topological map of the landmarks). Assuming that there are $L$ landmarks, the measurements will be $L \times 1$ vectors collected over the area. A shallow network then will be trained to learn the map without any ground truth information. △ Less

Submitted 28 January, 2020; originally announced January 2020.

arXiv:1906.00446 [pdf, other]

Generating Diverse High-Fidelity Images with VQ-VAE-2

Authors: Ali Razavi, Aaron van den Oord, Oriol Vinyals

Abstract: We explore the use of Vector Quantized Variational AutoEncoder (VQ-VAE) models for large scale image generation. To this end, we scale and enhance the autoregressive priors used in VQ-VAE to generate synthetic samples of much higher coherence and fidelity than possible before. We use simple feed-forward encoder and decoder networks, making our model an attractive candidate for applications where t… ▽ More We explore the use of Vector Quantized Variational AutoEncoder (VQ-VAE) models for large scale image generation. To this end, we scale and enhance the autoregressive priors used in VQ-VAE to generate synthetic samples of much higher coherence and fidelity than possible before. We use simple feed-forward encoder and decoder networks, making our model an attractive candidate for applications where the encoding and/or decoding speed is critical. Additionally, VQ-VAE requires sampling an autoregressive model only in the compressed latent space, which is an order of magnitude faster than sampling in the pixel space, especially for large images. We demonstrate that a multi-scale hierarchical organization of VQ-VAE, augmented with powerful priors over the latent codes, is able to generate samples with quality that rivals that of state of the art Generative Adversarial Networks on multifaceted datasets such as ImageNet, while not suffering from GAN's known shortcomings such as mode collapse and lack of diversity. △ Less

Submitted 2 June, 2019; originally announced June 2019.

arXiv:1905.09272 [pdf, other]

Data-Efficient Image Recognition with Contrastive Predictive Coding

Authors: Olivier J. Hénaff, Aravind Srinivas, Jeffrey De Fauw, Ali Razavi, Carl Doersch, S. M. Ali Eslami, Aaron van den Oord

Abstract: Human observers can learn to recognize new categories of images from a handful of examples, yet doing so with artificial ones remains an open challenge. We hypothesize that data-efficient recognition is enabled by representations which make the variability in natural signals more predictable. We therefore revisit and improve Contrastive Predictive Coding, an unsupervised objective for learning suc… ▽ More Human observers can learn to recognize new categories of images from a handful of examples, yet doing so with artificial ones remains an open challenge. We hypothesize that data-efficient recognition is enabled by representations which make the variability in natural signals more predictable. We therefore revisit and improve Contrastive Predictive Coding, an unsupervised objective for learning such representations. This new implementation produces features which support state-of-the-art linear classification accuracy on the ImageNet dataset. When used as input for non-linear classification with deep neural networks, this representation allows us to use 2-5x less labels than classifiers trained directly on image pixels. Finally, this unsupervised representation substantially improves transfer learning to object detection on the PASCAL VOC dataset, surpassing fully supervised pre-trained ImageNet classifiers. △ Less

Submitted 1 July, 2020; v1 submitted 22 May, 2019; originally announced May 2019.

arXiv:1901.03416 [pdf, other]

Preventing Posterior Collapse with delta-VAEs

Authors: Ali Razavi, Aäron van den Oord, Ben Poole, Oriol Vinyals

Abstract: Due to the phenomenon of "posterior collapse," current latent variable generative models pose a challenging design choice that either weakens the capacity of the decoder or requires augmenting the objective so it does not only maximize the likelihood of the data. In this paper, we propose an alternative that utilizes the most powerful generative models as decoders, whilst optimising the variationa… ▽ More Due to the phenomenon of "posterior collapse," current latent variable generative models pose a challenging design choice that either weakens the capacity of the decoder or requires augmenting the objective so it does not only maximize the likelihood of the data. In this paper, we propose an alternative that utilizes the most powerful generative models as decoders, whilst optimising the variational lower bound all while ensuring that the latent variables preserve and encode useful information. Our proposed $δ$-VAEs achieve this by constraining the variational family for the posterior to have a minimum distance to the prior. For sequential latent variable models, our approach resembles the classic representation learning approach of slow feature analysis. We demonstrate the efficacy of our approach at modeling text on LM1B and modeling images: learning representations, improving sample quality, and achieving state of the art log-likelihood on CIFAR-10 and ImageNet $32\times 32$. △ Less

Submitted 10 January, 2019; originally announced January 2019.

arXiv:1805.09786 [pdf, other]

Hyperbolic Attention Networks

Authors: Caglar Gulcehre, Misha Denil, Mateusz Malinowski, Ali Razavi, Razvan Pascanu, Karl Moritz Hermann, Peter Battaglia, Victor Bapst, David Raposo, Adam Santoro, Nando de Freitas

Abstract: We introduce hyperbolic attention networks to endow neural networks with enough capacity to match the complexity of data with hierarchical and power-law structure. A few recent approaches have successfully demonstrated the benefits of imposing hyperbolic geometry on the parameters of shallow networks. We extend this line of work by imposing hyperbolic geometry on the activations of neural networks… ▽ More We introduce hyperbolic attention networks to endow neural networks with enough capacity to match the complexity of data with hierarchical and power-law structure. A few recent approaches have successfully demonstrated the benefits of imposing hyperbolic geometry on the parameters of shallow networks. We extend this line of work by imposing hyperbolic geometry on the activations of neural networks. This allows us to exploit hyperbolic geometry to reason about embeddings produced by deep networks. We achieve this by re-expressing the ubiquitous mechanism of soft attention in terms of operations defined for hyperboloid and Klein models. Our method shows improvements in terms of generalization on neural machine translation, learning on graphs and visual question answering tasks while keeping the neural representations compact. △ Less

Submitted 24 May, 2018; originally announced May 2018.

arXiv:1711.09846 [pdf, other]

Population Based Training of Neural Networks

Authors: Max Jaderberg, Valentin Dalibard, Simon Osindero, Wojciech M. Czarnecki, Jeff Donahue, Ali Razavi, Oriol Vinyals, Tim Green, Iain Dunning, Karen Simonyan, Chrisantha Fernando, Koray Kavukcuoglu

Abstract: Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of hyperparameters such as model architecture, loss function, and optimisation algorithm. In this work we present \emph{Population Based Training (PBT)}, a simple asynchronous optimisation algorithm which effectively utilises a fixed computational budget… ▽ More Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of hyperparameters such as model architecture, loss function, and optimisation algorithm. In this work we present \emph{Population Based Training (PBT)}, a simple asynchronous optimisation algorithm which effectively utilises a fixed computational budget to jointly optimise a population of models and their hyperparameters to maximise performance. Importantly, PBT discovers a schedule of hyperparameter settings rather than following the generally sub-optimal strategy of trying to find a single fixed set to use for the whole course of training. With just a small modification to a typical distributed hyperparameter training framework, our method allows robust and reliable training of models. We demonstrate the effectiveness of PBT on deep reinforcement learning problems, showing faster wall-clock convergence and higher final performance of agents by optimising over a suite of hyperparameters. In addition, we show the same method can be applied to supervised learning for machine translation, where PBT is used to maximise the BLEU score directly, and also to training of Generative Adversarial Networks to maximise the Inception score of generated images. In all cases PBT results in the automatic discovery of hyperparameter schedules and model selection which results in stable training and better final performance. △ Less

Submitted 28 November, 2017; v1 submitted 27 November, 2017; originally announced November 2017.

arXiv:1509.01600 [pdf, ps, other]

doi 10.1109/GLOCOMW.2015.7414026

K-Means Fingerprint Clustering for Low-Complexity Floor Estimation in Indoor Mobile Localization

Authors: Alireza Razavi, Mikko Valkama, Elena-Simona Lohan

Abstract: Indoor localization in multi-floor buildings is an important research problem. Finding the correct floor, in a fast and efficient manner, in a shopping mall or an unknown university building can save the users' search time and can enable a myriad of Location Based Services in the future. One of the most widely spread techniques for floor estimation in multi-floor buildings is the fingerprinting-ba… ▽ More Indoor localization in multi-floor buildings is an important research problem. Finding the correct floor, in a fast and efficient manner, in a shopping mall or an unknown university building can save the users' search time and can enable a myriad of Location Based Services in the future. One of the most widely spread techniques for floor estimation in multi-floor buildings is the fingerprinting-based localization using Received Signal Strength (RSS) measurements coming from indoor networks, such as WLAN and BLE. The clear advantage of RSS-based floor estimation is its ease of implementation on a multitude of mobile devices at the Application Programming Interface (API) level, because RSS values are directly accessible through API interface. However, the downside of a fingerprinting approach, especially for large-scale floor estimation and positioning solutions, is their need to store and transmit a huge amount of fingerprinting data. The problem becomes more severe when the localization is intended to be done on mobile devices which have limited memory, power, and computational resources. An alternative floor estimation method, which has lower complexity and is faster than the fingerprinting is the Weighted Centroid Localization (WCL) method. The trade-off is however paid in terms of a lower accuracy than the one obtained with traditional fingerprinting with Nearest Neighbour (NN) estimates. In this paper a novel K-means-based method for floor estimation via fingerprint clustering of WiFi and various other positioning sensor outputs is introduced. Our method achieves a floor estimation accuracy close to the one with NN fingerprinting, while significantly improves the complexity and the speed of the floor detection algorithm. The decrease in the database size is achieved through storing and transmitting only the cluster heads (CH's) and their corresponding floor labels. △ Less

Submitted 24 September, 2015; v1 submitted 4 September, 2015; originally announced September 2015.

Comments: Accepted to IEEE Globecom 2015, Workshop on Localization and Tracking: Indoors, Outdoors and Emerging Networks

arXiv:1507.02999 [pdf, ps, other]

doi 10.1109/TSP.2016.2560132

Compressive Detection of Random Subspace Signals

Authors: Alireza Razavi, Mikko Valkama, Danijela Cabric

Abstract: The problem of compressive detection of random subspace signals is studied. We consider signals modeled as $\mathbf{s} = \mathbf{H} \mathbf{x}$ where $\mathbf{H}$ is an $N \times K$ matrix with $K \le N$ and $\mathbf{x} \sim \mathcal{N}(\mathbf{0}_{K,1},σ_x^2 \mathbf{I}_K)$. We say that signal $\mathbf{s}$ lies in or leans toward a subspace if the largest eigenvalue of $\mathbf{H} \mathbf{H}^T$ is… ▽ More The problem of compressive detection of random subspace signals is studied. We consider signals modeled as $\mathbf{s} = \mathbf{H} \mathbf{x}$ where $\mathbf{H}$ is an $N \times K$ matrix with $K \le N$ and $\mathbf{x} \sim \mathcal{N}(\mathbf{0}_{K,1},σ_x^2 \mathbf{I}_K)$. We say that signal $\mathbf{s}$ lies in or leans toward a subspace if the largest eigenvalue of $\mathbf{H} \mathbf{H}^T$ is strictly greater than its smallest eigenvalue. We first design a measurement matrix $\mathbfΦ=[\mathbfΦ_s^T,\mathbfΦ_o^T]^T$ comprising of two sub-matrices $\mathbfΦ_s$ and $\mathbfΦ_o$ where $\mathbfΦ_s$ projects the signals to the strongest left-singular vectors, i.e., the left-singular vectors corresponding to the largest singular values, of subspace matrix $\mathbf{H}$ and $\mathbfΦ_o$ projects it to the weakest left-singular vectors. We then propose two detectors which work based on the difference in energies of the samples measured by two sub-matrices $\mathbfΦ_s$ and $\mathbfΦ_o$ and prove their optimality. Simplified versions of the proposed detectors for the case when the variance of noise is known are also provided. Furthermore, we study the performance of the detector when measurements are imprecise and show how imprecision can be compensated by employing more measurement devices. The problem is then re-formulated for the case when the signal lies in the union of a finite number of linear subspaces instead of a single linear subspace. Finally, we study the performance of the proposed methods by simulation examples. △ Less

Submitted 30 December, 2015; v1 submitted 10 July, 2015; originally announced July 2015.

Comments: 33 pages, 11 figures, Revised version

arXiv:1507.02455 [pdf, ps, other]

doi 10.1109/GLOCOM.2014.7417565

Compressive Identification of Active OFDM Subcarriers in Presence of Timing Offset

Authors: Alireza Razavi, Mikko Valkama, Danijela Cabric

Abstract: In this paper we study the problem of identifying active subcarriers in an OFDM signal from compressive measurements sampled at sub-Nyquist rate. The problem is of importance in Cognitive Radio systems when secondary users (SUs) are looking for available spectrum opportunities to communicate over them while sensing at Nyquist rate sampling can be costly or even impractical in case of very wide ban… ▽ More In this paper we study the problem of identifying active subcarriers in an OFDM signal from compressive measurements sampled at sub-Nyquist rate. The problem is of importance in Cognitive Radio systems when secondary users (SUs) are looking for available spectrum opportunities to communicate over them while sensing at Nyquist rate sampling can be costly or even impractical in case of very wide bandwidth. We first study the effect of timing offset and derive the necessary and sufficient conditions for signal recovery in the oracle-assisted case when the true active sub-carriers are assumed known. Then we propose an Orthogonal Matching Pursuit (OMP)-based joint sparse recovery method for identifying active subcarriers when the timing offset is known. Finally we extend the problem to the case of unknown timing offset and develop a joint dictionary learning and sparse approximation algorithm, where in the dictionary learning phase the timing offset is estimated and in the sparse approximation phase active subcarriers are identified. The obtained results demonstrate that active subcarrier identification can be carried out reliably, by using the developed framework. △ Less

Submitted 9 July, 2015; originally announced July 2015.

Comments: To appear in the proceedings of the IEEE Global Communications Conference (GLOBECOM) 2015

arXiv:1501.02405 [pdf, ps, other]

doi 10.1016/j.sigpro.2014.11.017

Covariance-Based OFDM Spectrum Sensing with Sub-Nyquist Samples

Authors: Alireza Razavi, Mikko Valkama, Danijela Cabric

Abstract: In this paper, we propose a feature-based method for spectrum sensing of OFDM signals from sub-Nyquist samples over a single band. We exploit the structure of the covariance matrix of OFDM signals to convert an underdetermined set of covariance-based equations to an overdetermined one. The statistical properties of sample covariance matrix are analyzed and then based on that an approximate General… ▽ More In this paper, we propose a feature-based method for spectrum sensing of OFDM signals from sub-Nyquist samples over a single band. We exploit the structure of the covariance matrix of OFDM signals to convert an underdetermined set of covariance-based equations to an overdetermined one. The statistical properties of sample covariance matrix are analyzed and then based on that an approximate Generalized Likelihood Ratio Test (GLRT) for detection of OFDM signals from sub-Nyquist samples is derived. The method is also extended to the frequency-selective channels. △ Less

Submitted 10 January, 2015; originally announced January 2015.

Comments: 30 pages, 5 figures

Journal ref: Signal Processing, Volume 109, April 2015, Pages 261-268

arXiv:1203.6177 [pdf, ps, other]

On Distance Function among Finite Set of Points

Authors: Hajar Ghahremani Gol, Asadollah Razavi, Farzad Didehva

Abstract: In practical purposes for some geometrical problems in computer science we have as information the coordinates of some finite points in surface instead of the whole body of a surface. The problem arised here is: "How to define a distance function in a finite space?" as we will show the appropriate function for this purpose is not a metric function. Here we try to define this distance function in o… ▽ More In practical purposes for some geometrical problems in computer science we have as information the coordinates of some finite points in surface instead of the whole body of a surface. The problem arised here is: "How to define a distance function in a finite space?" as we will show the appropriate function for this purpose is not a metric function. Here we try to define this distance function in order to apply it in further proposes, specially in the field setting of transportation theory and vehicle routing problem. More precisely in this paper we consider VRP problem for two dimensional manifolds in R3. △ Less

Submitted 28 March, 2012; originally announced March 2012.

MSC Class: 97PXX

Showing 1–19 of 19 results for author: Razavi, A