Showing 1–16 of 16 results for author: Ho, L S T

Searching in archive cs.
  1. arXiv:2312.00656  [pdf, other]

    cs.LG cs.AI stat.ML

    Simple Transferability Estimation for Regression Tasks

    Authors: Cuong N. Nguyen, Phong Tran, Lam Si Tung Ho, Vu Dinh, Anh T. Tran, Tal Hassner, Cuong V. Nguyen

    Abstract: We consider transferability estimation, the problem of estimating how well deep learning models transfer from a source to a target task. We focus on regression tasks, which received little previous attention, and propose two simple and computationally efficient approaches that estimate transferability based on the negative regularized mean squared error of a linear regression model. We prove novel…

    Submitted 3 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Paper published at The 39th Conference on Uncertainty in Artificial Intelligence (UAI) 2023

  2. arXiv:2310.05892  [pdf, ps, other]

    stat.ML cs.LG

    A Generalization Bound of Deep Neural Networks for Dependent Data

    Authors: Quan Huu Do, Binh T. Nguyen, Lam Si Tung Ho

    Abstract: Existing generalization bounds for deep neural networks require data to be independent and identically distributed (iid). This assumption may not hold in real-life applications such as evolutionary biology, infectious disease epidemiology, and stock price prediction. This work establishes a generalization bound of feed-forward neural networks for non-stationary $φ$-mixing data.

    Submitted 9 October, 2023; originally announced October 2023.

  3. arXiv:2211.08277  [pdf, other]

    cs.LG physics.soc-ph q-bio.PE

    SPADE4: Sparsity and Delay Embedding based Forecasting of Epidemics

    Authors: Esha Saha, Lam Si Tung Ho, Giang Tran

    Abstract: Predicting the evolution of diseases is challenging, especially when the data availability is scarce and incomplete. The most popular tools for modelling and predicting infectious disease epidemics are compartmental models. They stratify the population into compartments according to health status and model the dynamics of these compartments using dynamical systems. However, these predefined system…

    Submitted 13 June, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 24 pages, 13 figures, 2 tables

    Journal ref: Bull.Math.Bio.85.8 (2023) 71

  4. arXiv:2209.05709  [pdf, ps, other]

    cs.LG cs.AI

    Generalization Bounds for Deep Transfer Learning Using Majority Predictor Accuracy

    Authors: Cuong N. Nguyen, Lam Si Tung Ho, Vu Dinh, Tal Hassner, Cuong V. Nguyen

    Abstract: We analyze new generalization bounds for deep learning models trained by transfer learning from a source to a target task. Our bounds utilize a quantity called the majority predictor accuracy, which can be computed efficiently from data. We show that our theory is useful in practice since it implies that the majority predictor accuracy can be used as a transferability measure, a fact that is also…

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 5 pages, Paper published at the International Symposium on Information Theory and Its Applications (ISITA 2022)

  5. arXiv:2111.10243  [pdf, other]

    math.ST cs.LG

    Posterior concentration and fast convergence rates for generalized Bayesian learning

    Authors: Lam Si Tung Ho, Binh T. Nguyen, Vu Dinh, Duy Nguyen

    Abstract: In this paper, we study the learning rate of generalized Bayes estimators in a general setting where the hypothesis class can be uncountable and have an irregular shape, the loss function can have heavy tails, and the optimal hypothesis may not be unique. We prove that under the multi-scale Bernstein's condition, the generalized posterior distribution concentrates around the set of optimal hypothe…

    Submitted 19 November, 2021; originally announced November 2021.

  6. arXiv:2109.13061  [pdf, other]

    cs.LG stat.ML

    Searching for Minimal Optimal Neural Networks

    Authors: Lam Si Tung Ho, Vu Dinh

    Abstract: Large neural network models have high predictive power but may suffer from overfitting if the training set is not large enough. Therefore, it is desirable to select an appropriate size for neural networks. The destructive approach, which starts with a large architecture and then reduces the size using a Lasso-type penalty, has been used extensively for this task. Despite its popularity, there is n…

    Submitted 27 September, 2021; originally announced September 2021.

  7. arXiv:2108.10825  [pdf, other]

    cs.LG math.NA

    Adaptive Group Lasso Neural Network Models for Functions of Few Variables and Time-Dependent Data

    Authors: Lam Si Tung Ho, Nicholas Richardson, Giang Tran

    Abstract: In this paper, we propose an adaptive group Lasso deep neural network for high-dimensional function approximation where input data are generated from a dynamical system and the target function depends on few active variables or few linear combinations of variables. We approximate the target function by a deep neural network and enforce an adaptive group Lasso constraint to the weights of a suitabl…

    Submitted 3 December, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

  8. arXiv:2105.15024  [pdf, other]

    cs.LG

    OASIS: An Active Framework for Set Inversion

    Authors: Binh T. Nguyen, Duy M. Nguyen, Lam Si Tung Ho, Vu Dinh

    Abstract: In this work, we introduce a novel method for solving the set inversion problem by formulating it as a binary classification problem. Aiming to develop a fast algorithm that can work effectively with high-dimensional and computationally expensive nonlinear models, we focus on active learning, a family of new and powerful techniques which can achieve the same level of accuracy with fewer data point…

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: 13 pages, 8 figures

    Journal ref: Frontiers in Artificial Intelligence and Applications, 2018

  9. arXiv:2010.08097  [pdf, other]

    cs.LG math.ST stat.ML

    Consistent Feature Selection for Analytic Deep Neural Networks

    Authors: Vu Dinh, Lam Si Tung Ho

    Abstract: One of the most important steps toward interpretability and explainability of neural network models is feature selection, which aims to identify the subset of relevant features. Theoretical results in the field have mostly focused on the prediction aspect of the problem with virtually no work on feature selection consistency for deep neural networks due to the model's severe nonlinearity and unide…

    Submitted 15 October, 2020; originally announced October 2020.

  10. arXiv:2006.00334  [pdf, other]

    stat.ML cs.LG math.ST

    Consistent feature selection for neural networks via Adaptive Group Lasso

    Authors: Vu Dinh, Lam Si Tung Ho

    Abstract: One main obstacle for the wide use of deep learning in medical and engineering sciences is its interpretability. While neural network models are strong tools for making predictions, they often provide little information about which features play significant roles in influencing the prediction accuracy. To overcome this issue, many regularization procedures for learning with neural networks have be…

    Submitted 2 December, 2021; v1 submitted 30 May, 2020; originally announced June 2020.

  11. arXiv:1906.02179  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    Bayesian Active Learning With Abstention Feedbacks

    Authors: Cuong V. Nguyen, Lam Si Tung Ho, Huan Xu, Vu Dinh, Binh Nguyen

    Abstract: We study pool-based active learning with abstention feedbacks where a labeler can abstain from labeling a queried example with some unknown abstention rate. This is an important problem with many useful applications. We take a Bayesian approach to the problem and develop two new greedy algorithms that learn both the classification problem and the unknown abstention rate at the same time. These are…

    Submitted 30 December, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Poster presented at 2019 ICML Workshop on Human in the Loop Learning 2019 (non-archival). arXiv admin note: substantial text overlap with arXiv:1705.08481

  12. arXiv:1811.10115  [pdf, other]

    cs.IT cs.LG stat.ML

    Recovery guarantees for polynomial approximation from dependent data with outliers

    Authors: Lam Si Tung Ho, Hayden Schaeffer, Giang Tran, Rachel Ward

    Abstract: Learning non-linear systems from noisy, limited, and/or dependent data is an important task across various scientific fields including statistics, engineering, computer science, mathematics, and many more. In general, this learning task is ill-posed; however, additional information about the data's structure or on the behavior of the unknown function can make the task well-posed. In this work, we…

    Submitted 25 November, 2018; originally announced November 2018.

    Comments: 17 pages, 1 figure

    MSC Class: 68T05; 41A10; 60F05; 68Q32; 62G08; 94A15; 65K10

  13. arXiv:1705.08481   

    stat.ML cs.LG

    Bayesian Pool-based Active Learning With Abstention Feedbacks

    Authors: Cuong V. Nguyen, Lam Si Tung Ho, Huan Xu, Vu Dinh, Binh Nguyen

    Abstract: We study pool-based active learning with abstention feedbacks, where a labeler can abstain from labeling a queried example with some unknown abstention rate. This is an important problem with many useful applications. We take a Bayesian approach to the problem and develop two new greedy algorithms that learn both the classification problem and the unknown abstention rate at the same time. These ar…

    Submitted 2 January, 2021; v1 submitted 23 May, 2017; originally announced May 2017.

    Comments: There is a new version at arXiv:1906.02179

  14. arXiv:1609.09481  [pdf, ps, other]

    stat.ML cs.LG

    Fast learning rates with heavy-tailed losses

    Authors: Vu Dinh, Lam Si Tung Ho, Duy Nguyen, Binh T. Nguyen

    Abstract: We study fast learning rates when the losses are not necessarily bounded and may have a distribution with heavy tails. To enable such analyses, we introduce two new conditions: (i) the envelope function $\sup_{f \in \mathcal{F}}|\ell \circ f|$, where $\ell$ is the loss function and $\mathcal{F}$ is the hypothesis class, exists and is $L^r$-integrable, and (ii) $\ell$ satisfies the multi-scale Bern…

    Submitted 29 September, 2016; originally announced September 2016.

    Comments: Advances in Neural Information Processing Systems (NIPS 2016): 11 pages

  15. arXiv:1411.7338  [pdf, other]

    q-bio.PE cs.DS

    Bounds on the Expected Size of the Maximum Agreement Subtree

    Authors: Daniel Irving Bernstein, Lam Si Tung Ho, Colby Long, Mike Steel, Katherine St. John, Seth Sullivant

    Abstract: We prove polynomial upper and lower bounds on the expected size of the maximum agreement subtree of two random binary phylogenetic trees under both the uniform distribution and Yule-Harding distribution. This positively answers a question posed in earlier work. Determining tight upper and lower bounds remains an open problem.

    Submitted 31 August, 2015; v1 submitted 26 November, 2014; originally announced November 2014.

    Comments: Revised version

  16. arXiv:1406.1568  [pdf, other]

    q-bio.PE cs.CE math.PR math.ST

    Phase transition on the convergence rate of parameter estimation under an Ornstein-Uhlenbeck diffusion on a tree

    Authors: Cécile Ané, Lam Si Tung Ho, Sebastien Roch

    Abstract: Diffusion processes on trees are commonly used in evolutionary biology to model the joint distribution of continuous traits, such as body mass, across species. Estimating the parameters of such processes from tip values presents challenges because of the intrinsic correlation between the observations produced by the shared evolutionary history, thus violating the standard independence assumption o…

    Submitted 25 May, 2016; v1 submitted 5 June, 2014; originally announced June 2014.