Zum Hauptinhalt springen

Showing 1–12 of 12 results for author: Sahraee-Ardakan, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2201.08082  [pdf, other

    stat.ML cs.LG

    Kernel Methods and Multi-layer Perceptrons Learn Linear Models in High Dimensions

    Authors: Mojtaba Sahraee-Ardakan, Melikasadat Emami, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Empirical observation of high dimensional phenomena, such as the double descent behaviour, has attracted a lot of interest in understanding classical techniques such as kernel methods, and their implications to explain generalization properties of neural networks. Many recent works analyze such models in a certain high-dimensional regime where the covariates are independent and the number of sampl… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

  2. arXiv:2103.04557  [pdf, other

    stat.ML cs.LG

    Asymptotics of Ridge Regression in Convolutional Models

    Authors: Mojtaba Sahraee-Ardakan, Tung Mai, Anup Rao, Ryan Rossi, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Understanding generalization and estimation error of estimators for simple models such as linear and generalized linear models has attracted a lot of attention recently. This is in part due to an interesting observation made in machine learning community that highly over-parameterized neural networks achieve zero training error, and yet they are able to generalize well over the test samples. This… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

  3. arXiv:2101.07833  [pdf, ps, other

    cs.LG cs.NE eess.SY stat.ML

    Implicit Bias of Linear RNNs

    Authors: Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Contemporary wisdom based on empirical studies suggests that standard recurrent neural networks (RNNs) do not perform well on tasks requiring long-term memory. However, precise reasoning for this behavior is still unknown. This paper provides a rigorous explanation of this property in the special case of linear RNNs. Although this work is limited to linear RNNs, even these systems have traditional… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

    Comments: 30 pages, 4 figures

  4. arXiv:2005.05053  [pdf, other

    q-bio.NC cs.LG cs.NE eess.SP stat.ML

    Low-Rank Nonlinear Decoding of $μ$-ECoG from the Primary Auditory Cortex

    Authors: Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Alyson K. Fletcher, Sundeep Rangan, Michael Trumpis, Brinnae Bent, Chia-Han Chiang, Jonathan Viventi

    Abstract: This paper considers the problem of neural decoding from parallel neural measurements systems such as micro-electrocorticography ($μ$-ECoG). In systems with large numbers of array elements at very high sampling rates, the dimension of the raw measurement data may be large. Learning neural decoders for this high-dimensional data can be challenging, particularly when the number of training samples i… ▽ More

    Submitted 6 May, 2020; originally announced May 2020.

    Comments: 4 pages, 3 figures

  5. arXiv:2005.00180  [pdf, other

    cs.LG stat.ML

    Generalization Error of Generalized Linear Models in High Dimensions

    Authors: Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: At the heart of machine learning lies the question of generalizability of learned rules over previously unseen data. While over-parameterized models based on neural networks are now ubiquitous in machine learning applications, our understanding of their generalization capabilities is incomplete. This task is made harder by the non-convexity of the underlying learning problems. We provide a general… ▽ More

    Submitted 30 April, 2020; originally announced May 2020.

    Comments: 20 pages, 4 figures

  6. arXiv:2001.09396  [pdf, other

    cs.LG cs.IT cs.NE eess.SP stat.ML

    Inference in Multi-Layer Networks with Matrix-Valued Unknowns

    Authors: Parthe Pandit, Mojtaba Sahraee-Ardakan, Sundeep Rangan, Philip Schniter, Alyson K. Fletcher

    Abstract: We consider the problem of inferring the input and hidden variables of a stochastic multi-layer neural network from an observation of the output. The hidden variables in each layer are represented as matrices. This problem applies to signal recovery via deep generative prior models, multi-task and mixed regression and learning certain classes of two-layer neural networks. A unified approximation a… ▽ More

    Submitted 25 January, 2020; originally announced January 2020.

    Comments: 3 figures, 6 pages (two-column) + Appendix. arXiv admin note: text overlap with arXiv:1911.03409

  7. arXiv:1911.03409  [pdf, other

    cs.LG cs.IT cs.NE eess.SP stat.ML

    Inference with Deep Generative Priors in High Dimensions

    Authors: Parthe Pandit, Mojtaba Sahraee-Ardakan, Sundeep Rangan, Philip Schniter, Alyson K. Fletcher

    Abstract: Deep generative priors offer powerful models for complex-structured data, such as images, audio, and text. Using these priors in inverse problems typically requires estimating the input and/or hidden signals in a multi-layer deep neural network from observation of its output. While these approaches have been successful in practice, rigorous performance analysis is complicated by the non-convex nat… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: 50 pages, double-spaced

  8. arXiv:1910.13672  [pdf, other

    cs.LG stat.ML

    Input-Output Equivalence of Unitary and Contractive RNNs

    Authors: M. Emami, M. Sahraee-Ardakan, S. Rangan, A. K. Fletcher

    Abstract: Unitary recurrent neural networks (URNNs) have been proposed as a method to overcome the vanishing and exploding gradient problem in modeling data with long-term dependencies. A basic question is how restrictive is the unitary constraint on the possible input-output mappings of such a network? This work shows that for any contractive RNN with ReLU activations, there is a URNN with at most twice th… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

  9. arXiv:1903.09631  [pdf, other

    math.ST cs.LG eess.SP stat.ML

    High-Dimensional Bernoulli Autoregressive Process with Long-Range Dependence

    Authors: Parthe Pandit, Mojtaba Sahraee-Ardakan, Arash A. Amini, Sundeep Rangan, Alyson K. Fletcher

    Abstract: We consider the problem of estimating the parameters of a multivariate Bernoulli process with auto-regressive feedback in the high-dimensional setting where the number of samples available is much less than the number of parameters. This problem arises in learning interconnections of networks of dynamical systems with spiking or binary-valued data. We allow the process to depend on its past up to… ▽ More

    Submitted 19 March, 2019; originally announced March 2019.

    Comments: To appear at AISTATS 2019 titled "Sparse Multivariate Bernoulli Processes in High Dimensions"

    Journal ref: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) 2019, Naha, Okinawa, Japan. PMLR: Volume 89

  10. arXiv:1706.06054  [pdf, other

    cs.IT cs.LG

    Rigorous Dynamics and Consistent Estimation in Arbitrarily Conditioned Linear Systems

    Authors: Alyson K. Fletcher, Mojtaba Sahraee-Ardakan, Philip Schniter, Sundeep Rangan

    Abstract: The problem of estimating a random vector x from noisy linear measurements y = A x + w with unknown parameters on the distributions of x and w, which must also be learned, arises in a wide range of statistical learning and linear inverse problems. We show that a computationally simple iterative message-passing algorithm can provably obtain asymptotically consistent estimates in a certain high-dime… ▽ More

    Submitted 19 June, 2017; originally announced June 2017.

  11. arXiv:1701.03420  [pdf, other

    cs.CV

    Joint Dictionary Learning for Example-based Image Super-resolution

    Authors: Mojtaba Sahraee-Ardakan, Mohsen Joneidi

    Abstract: In this paper, we propose a new joint dictionary learning method for example-based image super-resolution (SR), using sparse representation. The low-resolution (LR) dictionary is trained from a set of LR sample image patches. Using the sparse representation coefficients of these LR patches over the LR dictionary, the high-resolution (HR) dictionary is trained by minimizing the reconstruction error… ▽ More

    Submitted 12 January, 2017; originally announced January 2017.

    Comments: 5 pages, 1 figure, 1 table

  12. arXiv:1602.07795  [pdf, ps, other

    cs.IT stat.ML

    Expectation Consistent Approximate Inference: Generalizations and Convergence

    Authors: Alyson K. Fletcher, Mojtaba Sahraee-Ardakan, Sundeep Rangan, Philip Schniter

    Abstract: Approximations of loopy belief propagation, including expectation propagation and approximate message passing, have attracted considerable attention for probabilistic inference problems. This paper proposes and analyzes a generalization of Opper and Winther's expectation consistent (EC) approximate inference method. The proposed method, called Generalized Expectation Consistency (GEC), can be appl… ▽ More

    Submitted 24 January, 2017; v1 submitted 25 February, 2016; originally announced February 2016.

    Comments: 10 pages