Showing 1–31 of 31 results for author: Mhaskar, H

Searching in archive cs.
  1. arXiv:2405.15992

    cs.LG math.NA

    Data Complexity Estimates for Operator Learning

    Authors: Nikola B. Kovachki, Samuel Lanthaler, Hrushikesh Mhaskar

    Abstract: Operator learning has emerged as a new paradigm for the data-driven approximation of nonlinear operators. Despite its empirical success, the theoretical underpinnings governing the conditions for efficient operator learning remain incomplete. The present work develops theory to study the data complexity of operator learning, complementing existing research on the parametric complexity. We investig…

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2402.12687

    cs.LG stat.ML

    Learning on manifolds without manifold learning

    Authors: H. N. Mhaskar, Ryan O'Dowd

    Abstract: Function approximation based on data drawn randomly from an unknown distribution is an important problem in machine learning. The manifold hypothesis assumes that the data is sampled from an unknown submanifold of a high dimensional Euclidean space. A great deal of research deals with obtaining information about this manifold, such as the eigendecomposition of the Laplace-Beltrami operator or coor…

    Submitted 18 August, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  3. arXiv:2308.03230

    cs.LG math.NA

    Tractability of approximation by general shallow networks

    Authors: Hrushikesh Mhaskar, Tong Mao

    Abstract: In this paper, we present a sharper version of the results in the paper Dimension independent bounds for general shallow networks; Neural Networks, 123 (2020), 142-152. Let $\mathbb{X}$ and $\mathbb{Y}$ be compact metric spaces. We consider approximation of functions of the form $x\mapsto\int_{\mathbb{Y}} G(x, y)\,dτ(y)$, $x\in\mathbb{X}$, by $G$-networks of the form…

    Submitted 10 December, 2023; v1 submitted 6 August, 2023; originally announced August 2023.

  4. arXiv:2305.03890

    cs.LG math.NA

    Approximation by non-symmetric networks for cross-domain learning

    Authors: Hrushikesh Mhaskar

    Abstract: For the past 30 years or so, machine learning has stimulated a great deal of research in the study of approximation capabilities (expressive power) of a multitude of processes, such as approximation by shallow or deep neural networks, radial basis function networks, and a variety of kernel based methods. Motivated by applications such as invariant learning, transfer learning, and synthetic apertur…

    Submitted 5 January, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

  5. arXiv:2303.00984

    cs.LG

    Encoding of data sets and algorithms

    Authors: Katarina Doctor, Tong Mao, Hrushikesh Mhaskar

    Abstract: In many high-impact applications, it is important to ensure the quality of output of a machine learning algorithm as well as its reliability in comparison with the complexity of the algorithm used. In this paper, we have initiated a mathematically rigorous theory to decide which models (algorithms applied on data sets) are close to each other in terms of certain metrics, such as performance and th…

    Submitted 2 March, 2023; originally announced March 2023.

  6. arXiv:2302.00160

    cs.LG stat.ML

    Local transfer learning from one data space to another

    Authors: H. N. Mhaskar, Ryan O'Dowd

    Abstract: A fundamental problem in manifold learning is to approximate a functional relationship in a data chosen randomly from a probability distribution supported on a low dimensional sub-manifold of a high dimensional ambient Euclidean space. The manifold is essentially defined by the data set itself and, typically, designed so that the data is dense on the manifold in some sense. The notion of a data sp…

    Submitted 7 July, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: To appear in Proceedings of ICAIPA 2022, Editors: S. Pereverzyev, R. Radha, S. Sivananthan, Springer Verlag

  7. arXiv:2202.06392

    math.NA cs.LG math.FA

    Local approximation of operators

    Authors: Hrushikesh Mhaskar

    Abstract: Many applications, such as system identification, classification of time series, direct and inverse problems in partial differential equations, and uncertainty quantification lead to the question of approximation of a non-linear operator between metric spaces $\mathfrak{X}$ and $\mathfrak{Y}$. We study the problem of determining the degree of approximation of such operators on a compact subset…

    Submitted 1 December, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

  8. arXiv:2110.01670

    cs.LG eess.SP math.NA

    A manifold learning approach for gesture recognition from micro-Doppler radar measurements

    Authors: Eric Mason, Hrushikesh Mhaskar, Adam Guo

    Abstract: A recent paper (Neural Networks, 132 (2020), 253-268) introduces a straightforward and simple kernel based approximation for manifold learning that does not require the knowledge of anything about the manifold, except for its dimension. In this paper, we examine how the pointwise error in approximation using least squares optimization based on similarly localized kernels depends upon the dat…

    Submitted 21 April, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: To appear in Neural Networks

  9. arXiv:2109.14752

    stat.ML cs.LG

    Kernel distance measures for time series, random fields and other structured data

    Authors: Srinjoy Das, Hrushikesh Mhaskar, Alexander Cloninger

    Abstract: This paper introduces kdiff, a novel kernel-based measure for estimating distances between instances of time series, random fields and other forms of structured data. This measure is based on the idea of matching distributions that only overlap over a portion of their region of support. Our proposed measure is inspired by MPdist which has been previously proposed for such datasets and is construct…

    Submitted 29 September, 2021; originally announced September 2021.

  10. arXiv:2105.05893

    cs.LG math.NA

    A function approximation approach to the prediction of blood glucose levels

    Authors: H. N. Mhaskar, S. V. Pereverzyev, M. D. van der Walt

    Abstract: The problem of real time prediction of blood glucose (BG) levels based on the readings from a continuous glucose monitoring (CGM) device is a problem of great importance in diabetes care, and therefore, has attracted a lot of research in recent years, especially based on machine learning. An accurate prediction with a 30, 60, or 90 minute prediction horizon has the potential of saving millions of…

    Submitted 29 June, 2021; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:1707.05828

  11. arXiv:2010.04227

    cs.LG math.PR

    A low discrepancy sequence on graphs

    Authors: A. Cloninger, H. N. Mhaskar

    Abstract: Many applications such as election forecasting, environmental monitoring, health policy, and graph based machine learning require taking expectation of functions defined on the vertices of a graph. We describe a construction of a sampling scheme analogous to the so called Leja points in complex potential theory that can be proved to give low discrepancy estimates for the approximation of the expec…

    Submitted 7 June, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: Accepted for publication in Journal of Fourier Analysis and Applications

  12. arXiv:2008.01245

    cs.LG math.ST stat.ML

    Cautious Active Clustering

    Authors: Alexander Cloninger, Hrushikesh Mhaskar

    Abstract: We consider the problem of classification of points sampled from an unknown probability measure on a Euclidean space. We study the question of querying the class label at a very small number of judiciously chosen points so as to be able to attach the appropriate class label to every point in the set. Our approach is to consider the unknown probability measure as a convex combination of the conditi…

    Submitted 7 December, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

  13. arXiv:2003.13226

    cs.LG stat.ML

    Kernel based analysis of massive data

    Authors: Hrushikesh N Mhaskar

    Abstract: Dealing with massive data is a challenging task for machine learning. An important aspect of machine learning is function approximation. In the context of massive data, some of the commonly used tools for this purpose are sparsity, divide-and-conquer, and distributed learning. In this paper, we develop a very general theory of approximation by networks, which we have called eignets, to achieve loc…

    Submitted 7 July, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: Accepted for publication in Frontiers in Applied Mathematics and Statistics, section Mathematics of Computation and Data Science. Special issue on Fundamental Mathematical Topics in Data Science

  14. arXiv:2001.12006

    cs.LG eess.SP stat.ML

    Theory inspired deep network for instantaneous-frequency extraction and signal components recovery from discrete blind-source data

    Authors: Charles K. Chui, Ningning Han, Hrushikesh N. Mhaskar

    Abstract: This paper is concerned with the inverse problem of recovering the unknown signal components, along with extraction of their instantaneous frequencies (IFs), governed by the adaptive harmonic model (AHM), from discrete (and possibly non-uniform) samples of the blind-source composite signal. None of the existing decomposition methods and algorithms, including the most popular empirical mode decom…

    Submitted 31 January, 2020; originally announced January 2020.

  15. arXiv:1908.09880

    cs.LG stat.ML

    Dimension independent bounds for general shallow networks

    Authors: Hrushikesh N. Mhaskar

    Abstract: This paper proves an abstract theorem addressing in a unified manner two important problems in function approximation: avoiding curse of dimensionality and estimating the degree of approximation for out-of-sample extension in manifold learning. We consider an abstract (shallow) network that includes, for example, neural networks, radial basis function networks, and kernels on data defined manifold…

    Submitted 4 November, 2019; v1 submitted 26 August, 2019; originally announced August 2019.

  16. arXiv:1908.00156

    cs.LG math.FA stat.ML

    A direct approach for function approximation on data defined manifolds

    Authors: Hrushikesh Mhaskar

    Abstract: In much of the literature on function approximation by deep networks, the function is assumed to be defined on some known domain, such as a cube or a sphere. In practice, the data might not be dense on these domains, and therefore, the approximation theory results are observed to be too conservative. In manifold learning, one assumes instead that the data is sampled from an unknown manifold; i.e.,…

    Submitted 20 August, 2020; v1 submitted 31 July, 2019; originally announced August 2019.

    Comments: Version 1 was submitted on August 1, 2019 under the title Deep Gaussian networks for function approximation on data defined manifolds. This version is accepted for publication in Neural Networks

  17. arXiv:1907.04895

    math.FA cs.LG stat.ML

    Super-resolution meets machine learning: approximation of measures

    Authors: H. N. Mhaskar

    Abstract: The problem of super-resolution in general terms is to recuperate a finitely supported measure $μ$ given finitely many of its coefficients $\hatμ(k)$ with respect to some orthonormal system. The interesting case concerns situations, where the number of coefficients required is substantially smaller than a power of the reciprocal of the minimal separation among the points in the support of $μ$. In…

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: 14 pages, To appear in Journal of Fourier Analysis and Applications

  18. arXiv:1905.12882

    cs.LG stat.ML

    Function approximation by deep networks

    Authors: H. N. Mhaskar, T. Poggio

    Abstract: We show that deep networks are better than shallow networks at approximating functions that can be expressed as a composition of functions described by a directed acyclic graph, because the deep networks can be designed to have the same compositional structure, while a shallow network cannot exploit this knowledge. Thus, the blessing of compositionality mitigates the curse of dimensionality. On th…

    Submitted 23 November, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: To appear in Communications in pure and applied mathematics

  19. arXiv:1901.02975

    cs.LG stat.ML

    A witness function based construction of discriminative models using Hermite polynomials

    Authors: H. N. Mhaskar, A. Cloninger, X. Cheng

    Abstract: In machine learning, we are given a dataset of the form $\{(\mathbf{x}_j,y_j)\}_{j=1}^M$, drawn as i.i.d. samples from an unknown probability distribution $μ$; the marginal distribution for the $\mathbf{x}_j$'s being $μ^*$. We propose that rather than using a positive kernel such as the Gaussian for estimation of these measures, using a non-positive kernel that preserves a large number of moments…

    Submitted 9 January, 2019; originally announced January 2019.

    Comments: 20 pages, 3.1 MB

  20. arXiv:1806.02003

    cs.LG cs.CV cs.NE stat.ML

    Deep Algorithms: designs for networks

    Authors: Abhejit Rajagopal, Shivkumar Chandrasekaran, Hrushikesh N. Mhaskar

    Abstract: A new design methodology for neural networks that is guided by traditional algorithm design is presented. To prove our point, we present two heuristics and demonstrate an algorithmic technique for incorporating additional weights in their signal-flow graphs. We show that with training the performance of these networks can not only exceed the performance of the initial network, but can match the pe…

    Submitted 6 June, 2018; originally announced June 2018.

    Comments: submitted to Thirty-second Annual Conference on Neural Information Processing Systems (NIPS), May 2018

  21. arXiv:1802.06266

    cs.LG math.NA

    An analysis of training and generalization errors in shallow and deep networks

    Authors: Hrushikesh Mhaskar, Tomaso Poggio

    Abstract: This paper is motivated by an open problem around deep networks, namely, the apparent absence of over-fitting despite large over-parametrization which allows perfect fitting of the training data. In this paper, we analyze this phenomenon in the case of regression problems when each unit evaluates a periodic activation function. We argue that the minimal expected value of the square loss is inappro…

    Submitted 27 August, 2019; v1 submitted 17 February, 2018; originally announced February 2018.

    Comments: 21 pages; Accepted for publication in Neural Networks

  22. arXiv:1801.00173

    cs.LG

    Theory of Deep Learning III: explaining the non-overfitting puzzle

    Authors: Tomaso Poggio, Kenji Kawaguchi, Qianli Liao, Brando Miranda, Lorenzo Rosasco, Xavier Boix, Jack Hidary, Hrushikesh Mhaskar

    Abstract: A main puzzle of deep networks revolves around the absence of overfitting despite large overparametrization and despite the large capacity demonstrated by zero training error on randomly labeled data. In this note, we show that the dynamics associated to gradient descent minimization of nonlinear networks is topologically equivalent, near the asymptotically stable minima of the empirical error, to…

    Submitted 16 January, 2018; v1 submitted 30 December, 2017; originally announced January 2018.

  23. arXiv:1709.08174

    cs.LG math.NA

    Function approximation with zonal function networks with activation functions analogous to the rectified linear unit functions

    Authors: Hrushikesh N. Mhaskar

    Abstract: A zonal function (ZF) network on the $q$ dimensional sphere $\mathbb{S}^q$ is a network of the form $\mathbf{x}\mapsto \sum_{k=1}^n a_kφ(\mathbf{x}\cdot\mathbf{x}_k)$ where $φ:[-1,1]\to\mathbb{R}$ is the activation function, $\mathbf{x}_k\in\mathbb{S}^q$ are the centers, and $a_k\in\mathbb{R}$. While the approximation properties of such networks are well studied in the context of positive definite…

    Submitted 8 July, 2018; v1 submitted 24 September, 2017; originally announced September 2017.

    Comments: 18 pages, Title changed from the previous version

  24. arXiv:1707.09428

    math.NA cs.LG

    A unified method for super-resolution recovery and real exponential-sum separation

    Authors: Charles K. Chui, Hrushikesh N. Mhaskar

    Abstract: In this paper, motivated by diffraction of traveling light waves, a simple mathematical model is proposed, both for the multivariate super-resolution problem and the problem of blind-source separation of real-valued exponential sums. This model facilitates the development of a unified theory and a unified solution of both problems in this paper. Our consideration of the super-resolution problem is…

    Submitted 26 July, 2017; originally announced July 2017.

  25. arXiv:1707.09319

    stat.OT cs.LG math.NA

    A Fourier-invariant method for locating point-masses and computing their attributes

    Authors: Charles K. Chui, Hrushikesh N. Mhaskar

    Abstract: Motivated by the interest of observing the growth of cancer cells among normal living cells and exploring how galaxies and stars are truly formed, the objective of this paper is to introduce a rigorous and effective method for counting point-masses, determining their spatial locations, and computing their attributes. Based on computation of Hermite moments that are Fourier-invariant, our approach…

    Submitted 26 July, 2017; originally announced July 2017.

  26. A deep learning approach to diabetic blood glucose prediction

    Authors: H. N. Mhaskar, S. V. Pereverzyev, M. D. van der Walt

    Abstract: We consider the question of 30-minute prediction of blood glucose levels measured by continuous glucose monitoring devices, using clinical data. While most studies of this nature deal with one patient at a time, we take a certain percentage of patients in the data set as training data, and test on the remainder of the patients; i.e., the machine need not re-calibrate on the new patients in the dat…

    Submitted 18 July, 2017; originally announced July 2017.

    Journal ref: Front. Appl. Math. Stat., 14 July 2017

  27. arXiv:1611.00740

    cs.LG

    Why and When Can Deep -- but Not Shallow -- Networks Avoid the Curse of Dimensionality: a Review

    Authors: Tomaso Poggio, Hrushikesh Mhaskar, Lorenzo Rosasco, Brando Miranda, Qianli Liao

    Abstract: The paper characterizes classes of functions for which deep learning can be exponentially better than shallow learning. Deep convolutional networks are a special case of these conditions, though weight sharing is not the main reason for their exponential advantage.

    Submitted 4 February, 2017; v1 submitted 2 November, 2016; originally announced November 2016.

  28. arXiv:1608.03287

    cs.LG math.FA

    Deep vs. shallow networks : An approximation theory perspective

    Authors: Hrushikesh Mhaskar, Tomaso Poggio

    Abstract: The paper briefly reviews several recent results on hierarchical architectures for learning from examples, that may formally explain the conditions under which Deep Convolutional Neural Networks perform much better in function approximation problems than shallow, one-hidden layer architectures. The paper announces new results for a non-smooth activation function, the ReLU function, used in presen…

    Submitted 10 August, 2016; originally announced August 2016.

    Comments: 14 pages, 4 figures, to be published in a Journal

    Report number: CBMM Memo 54

  29. arXiv:1607.07110

    cs.LG

    Deep nets for local manifold learning

    Authors: Charles K. Chui, H. N. Mhaskar

    Abstract: The problem of extending a function $f$ defined on a training data $\mathcal{C}$ on an unknown manifold $\mathbb{X}$ to the entire manifold and a tubular neighborhood of this manifold is considered in this paper. For $\mathbb{X}$ embedded in a high dimensional ambient Euclidean space $\mathbb{R}^D$, a deep learning algorithm is developed for finding a local coordinate system for the manifold {\bf…

    Submitted 24 July, 2016; originally announced July 2016.

    Comments: Submitted on Sept. 17, 2015

  30. arXiv:1603.00988

    cs.LG

    Learning Functions: When Is Deep Better Than Shallow

    Authors: Hrushikesh Mhaskar, Qianli Liao, Tomaso Poggio

    Abstract: While the universal approximation property holds both for hierarchical and shallow networks, we prove that deep (hierarchical) networks can approximate the class of compositional functions with the same accuracy as shallow networks but with exponentially lower number of training parameters as well as VC-dimension. This theorem settles an old conjecture by Bengio on the role of depth in networks. W…

    Submitted 29 May, 2016; v1 submitted 3 March, 2016; originally announced March 2016.

  31. arXiv:0909.5000

    cs.LG cs.NE math.NA

    Eignets for function approximation on manifolds

    Authors: H. N. Mhaskar

    Abstract: Let $\mathbb{X}$ be a compact, smooth, connected, Riemannian manifold without boundary, $G:\mathbb{X}\times\mathbb{X}\to \mathbb{R}$ be a kernel. Analogous to a radial basis function network, an eignet is an expression of the form $\sum_{j=1}^M a_jG(\circ,y_j)$, where $a_j\in\mathbb{R}$, $y_j\in\mathbb{X}$, $1\le j\le M$. We describe a deterministic, universal algorithm for constructing an eignet for approximating functions in…

    Submitted 28 September, 2009; originally announced September 2009.

    Comments: 28 pages. Articles in press; Applied and Computational Harmonic Analysis, 2009