-
Informativeness of Weighted Conformal Prediction
Authors:
Mufang Ying,
Wenge Guo,
Koulik Khamaru,
Ying Hung
Abstract:
Weighted conformal prediction (WCP), a recently proposed framework, provides uncertainty quantification with the flexibility to accommodate different covariate distributions between training and test data. However, it is pointed out in this paper that the effectiveness of WCP heavily relies on the overlap between covariate distributions; insufficient overlap can lead to uninformative prediction in…
▽ More
Weighted conformal prediction (WCP), a recently proposed framework, provides uncertainty quantification with the flexibility to accommodate different covariate distributions between training and test data. However, it is pointed out in this paper that the effectiveness of WCP heavily relies on the overlap between covariate distributions; insufficient overlap can lead to uninformative prediction intervals. To enhance the informativeness of WCP, we propose two methods for scenarios involving multiple sources with varied covariate distributions. We establish theoretical guarantees for our proposed methods and demonstrate their efficacy through simulations.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
A Unified Gaussian Process for Branching and Nested Hyperparameter Optimization
Authors:
Jiazhao Zhang,
Ying Hung,
Chung-Ching Lin,
Zicheng Liu
Abstract:
Choosing appropriate hyperparameters plays a crucial role in the success of neural networks as hyper-parameters directly control the behavior and performance of the training algorithms. To obtain efficient tuning, Bayesian optimization methods based on Gaussian process (GP) models are widely used. Despite numerous applications of Bayesian optimization in deep learning, the existing methodologies a…
▽ More
Choosing appropriate hyperparameters plays a crucial role in the success of neural networks as hyper-parameters directly control the behavior and performance of the training algorithms. To obtain efficient tuning, Bayesian optimization methods based on Gaussian process (GP) models are widely used. Despite numerous applications of Bayesian optimization in deep learning, the existing methodologies are developed based on a convenient but restrictive assumption that the tuning parameters are independent of each other. However, tuning parameters with conditional dependence are common in practice. In this paper, we focus on two types of them: branching and nested parameters. Nested parameters refer to those tuning parameters that exist only within a particular setting of another tuning parameter, and a parameter within which other parameters are nested is called a branching parameter. To capture the conditional dependence between branching and nested parameters, a unified Bayesian optimization framework is proposed. The sufficient conditions are rigorously derived to guarantee the validity of the kernel function, and the asymptotic convergence of the proposed optimization framework is proven under the continuum-armed-bandit setting. Based on the new GP model, which accounts for the dependent structure among input variables through a new kernel function, higher prediction accuracy and better optimization efficiency are observed in a series of synthetic simulations and real data applications of neural networks. Sensitivity analysis is also performed to provide insights into how changes in hyperparameter values affect prediction accuracy.
△ Less
Submitted 19 January, 2024;
originally announced February 2024.
-
Advancing inverse scattering with surrogate modeling and Bayesian inference for functional inputs
Authors:
Chih-Li Sung,
Yao Song,
Ying Hung
Abstract:
Inverse scattering aims to infer information about a hidden object by using the received scattered waves and training data collected from forward mathematical models. Recent advances in computing have led to increasing attention towards functional inverse inference, which can reveal more detailed properties of a hidden object. However, rigorous studies on functional inverse, including the reconstr…
▽ More
Inverse scattering aims to infer information about a hidden object by using the received scattered waves and training data collected from forward mathematical models. Recent advances in computing have led to increasing attention towards functional inverse inference, which can reveal more detailed properties of a hidden object. However, rigorous studies on functional inverse, including the reconstruction of the functional input and quantification of uncertainty, remain scarce. Motivated by an inverse scattering problem where the objective is to infer the functional input representing the refractive index of a bounded scatterer, a new Bayesian framework is proposed. It contains a surrogate model that takes into account the functional inputs directly through kernel functions, and a Bayesian procedure that infers functional inputs through the posterior distribution. Furthermore, the proposed Bayesian framework is extended to reconstruct functional inverse by integrating multi-fidelity simulations, including a high-fidelity simulator solved by finite element methods and a low-fidelity simulator called the Born approximation. When compared with existing alternatives developed by finite basis expansion, the proposed method provides more accurate functional recoveries with smaller prediction variations.
△ Less
Submitted 1 May, 2023;
originally announced May 2023.
-
Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits
Authors:
Yu-Heng Hung,
Ping-Chun Hsieh
Abstract:
Reward-biased maximum likelihood estimation (RBMLE) is a classic principle in the adaptive control literature for tackling explore-exploit trade-offs. This paper studies the stochastic contextual bandit problem with general bounded reward functions and proposes NeuralRBMLE, which adapts the RBMLE principle by adding a bias term to the log-likelihood to enforce exploration. NeuralRBMLE leverages th…
▽ More
Reward-biased maximum likelihood estimation (RBMLE) is a classic principle in the adaptive control literature for tackling explore-exploit trade-offs. This paper studies the stochastic contextual bandit problem with general bounded reward functions and proposes NeuralRBMLE, which adapts the RBMLE principle by adding a bias term to the log-likelihood to enforce exploration. NeuralRBMLE leverages the representation power of neural networks and directly encodes exploratory behavior in the parameter space, without constructing confidence intervals of the estimated rewards. We propose two variants of NeuralRBMLE algorithms: The first variant directly obtains the RBMLE estimator by gradient ascent, and the second variant simplifies RBMLE to a simple index policy through an approximation. We show that both algorithms achieve $\widetilde{\mathcal{O}}(\sqrt{T})$ regret. Through extensive experiments, we demonstrate that the NeuralRBMLE algorithms achieve comparable or better empirical regrets than the state-of-the-art methods on real-world datasets with non-linear reward functions.
△ Less
Submitted 29 May, 2022; v1 submitted 8 March, 2022;
originally announced March 2022.
-
Functional-Input Gaussian Processes with Applications to Inverse Scattering Problems
Authors:
Chih-Li Sung,
Wenjia Wang,
Fioralba Cakoni,
Isaac Harris,
Ying Hung
Abstract:
Surrogate modeling based on Gaussian processes (GPs) has received increasing attention in the analysis of complex problems in science and engineering. Despite extensive studies on GP modeling, the developments for functional inputs are scarce. Motivated by an inverse scattering problem in which functional inputs representing the support and material properties of the scatterer are involved in the…
▽ More
Surrogate modeling based on Gaussian processes (GPs) has received increasing attention in the analysis of complex problems in science and engineering. Despite extensive studies on GP modeling, the developments for functional inputs are scarce. Motivated by an inverse scattering problem in which functional inputs representing the support and material properties of the scatterer are involved in the partial differential equations, a new class of kernel functions for functional inputs is introduced for GPs. Based on the proposed GP models, the asymptotic convergence properties of the resulting mean squared prediction errors are derived and the finite sample performance is demonstrated by numerical examples. In the application to inverse scattering, a surrogate model is constructed with functional inputs, which is crucial to recover the reflective index of an inhomogeneous isotropic scattering region of interest for a given far-field pattern.
△ Less
Submitted 3 January, 2023; v1 submitted 5 January, 2022;
originally announced January 2022.
-
Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits
Authors:
Yu-Heng Hung,
Ping-Chun Hsieh,
Xi Liu,
P. R. Kumar
Abstract:
Modifying the reward-biased maximum likelihood method originally proposed in the adaptive control literature, we propose novel learning algorithms to handle the explore-exploit trade-off in linear bandits problems as well as generalized linear bandits problems. We develop novel index policies that we prove achieve order-optimality, and show that they achieve empirical performance competitive with…
▽ More
Modifying the reward-biased maximum likelihood method originally proposed in the adaptive control literature, we propose novel learning algorithms to handle the explore-exploit trade-off in linear bandits problems as well as generalized linear bandits problems. We develop novel index policies that we prove achieve order-optimality, and show that they achieve empirical performance competitive with the state-of-the-art benchmark methods in extensive experiments. The new policies achieve this with low computation time per pull for linear bandits, and thereby resulting in both favorable regret as well as computational efficiency.
△ Less
Submitted 8 October, 2020;
originally announced October 2020.
-
Efficient calibration for imperfect epidemic models with applications to the analysis of COVID-19
Authors:
Chih-Li Sung,
Ying Hung
Abstract:
The estimation of unknown parameters in simulations, also known as calibration, is crucial for practical management of epidemics and prediction of pandemic risk. A simple yet widely used approach is to estimate the parameters by minimizing the sum of the squared distances between actual observations and simulation outputs. It is shown in this paper that this method is inefficient, particularly whe…
▽ More
The estimation of unknown parameters in simulations, also known as calibration, is crucial for practical management of epidemics and prediction of pandemic risk. A simple yet widely used approach is to estimate the parameters by minimizing the sum of the squared distances between actual observations and simulation outputs. It is shown in this paper that this method is inefficient, particularly when the epidemic models are developed based on certain simplifications of reality, also known as imperfect models which are commonly used in practice. To address this issue, a new estimator is introduced that is asymptotically consistent, has a smaller estimation variance than the least squares estimator, and achieves the semiparametric efficiency. Numerical studies are performed to examine the finite sample performance. The proposed method is applied to the analysis of the COVID-19 pandemic for 20 countries based on the SEIR (Susceptible-Exposed-Infectious-Recovered) model with both deterministic and stochastic simulations. The estimation of the parameters, including the basic reproduction number and the average incubation period, reveal the risk of disease outbreaks in each country and provide insights to the design of public health interventions.
△ Less
Submitted 22 June, 2023; v1 submitted 26 September, 2020;
originally announced September 2020.
-
CLAIMED: A CLAssification-Incorporated Minimum Energy Design to explore a multivariate response surface with feasibility constraints
Authors:
Mert Y. Sengul,
Yao Song,
Linglin He,
Adri C. T. van Duin,
Ying Hung,
Tirthankar Dasgupta
Abstract:
Motivated by the problem of optimization of force-field systems in physics using large-scale computer simulations, we consider exploration of a deterministic complex multivariate response surface. The objective is to find input combinations that generate output close to some desired or "target" vector. In spite of reducing the problem to exploration of the input space with respect to a one-dimensi…
▽ More
Motivated by the problem of optimization of force-field systems in physics using large-scale computer simulations, we consider exploration of a deterministic complex multivariate response surface. The objective is to find input combinations that generate output close to some desired or "target" vector. In spite of reducing the problem to exploration of the input space with respect to a one-dimensional loss function, the search is nontrivial and challenging due to infeasible input combinations, high dimensionalities of the input and output space and multiple "desirable" regions in the input space and the difficulty of emulating the objective function well with a surrogate model. We propose an approach that is based on combining machine learning techniques with smart experimental design ideas to locate multiple good regions in the input space.
△ Less
Submitted 13 September, 2021; v1 submitted 8 June, 2020;
originally announced June 2020.
-
Compacting, Picking and Growing for Unforgetting Continual Learning
Authors:
Steven C. Y. Hung,
Cheng-Hao Tu,
Cheng-En Wu,
Chien-Hung Chen,
Yi-Ming Chan,
Chu-Song Chen
Abstract:
Continual lifelong learning is essential to many applications. In this paper, we propose a simple but effective approach to continual deep learning. Our approach leverages the principles of deep model compression, critical weights selection, and progressive networks expansion. By enforcing their integration in an iterative manner, we introduce an incremental learning method that is scalable to the…
▽ More
Continual lifelong learning is essential to many applications. In this paper, we propose a simple but effective approach to continual deep learning. Our approach leverages the principles of deep model compression, critical weights selection, and progressive networks expansion. By enforcing their integration in an iterative manner, we introduce an incremental learning method that is scalable to the number of sequential tasks in a continual learning process. Our approach is easy to implement and owns several favorable characteristics. First, it can avoid forgetting (i.e., learn new tasks while remembering all previous tasks). Second, it allows model expansion but can maintain the model compactness when handling sequential tasks. Besides, through our compaction and selection/expansion mechanism, we show that the knowledge accumulated through learning previous tasks is helpful to build a better model for the new tasks compared to training the models independently with tasks. Experimental results show that our approach can incrementally learn a deep model tackling multiple tasks without forgetting, while the model compactness is maintained with the performance more satisfiable than individual task training.
△ Less
Submitted 30 October, 2019; v1 submitted 15 October, 2019;
originally announced October 2019.
-
Calibration for computer experiments with binary responses and application to cell adhesion study
Authors:
Chih-Li Sung,
Ying Hung,
William Rittase,
Cheng Zhu,
C. F. Jeff Wu
Abstract:
Calibration refers to the estimation of unknown parameters which are present in computer experiments but not available in physical experiments. An accurate estimation of these parameters is important because it provides a scientific understanding of the underlying system which is not available in physical experiments. Most of the work in the literature is limited to the analysis of continuous resp…
▽ More
Calibration refers to the estimation of unknown parameters which are present in computer experiments but not available in physical experiments. An accurate estimation of these parameters is important because it provides a scientific understanding of the underlying system which is not available in physical experiments. Most of the work in the literature is limited to the analysis of continuous responses. Motivated by a study of cell adhesion experiments, we propose a new calibration framework for binary responses. Its application to the T cell adhesion data provides insight into the unknown values of the kinetic parameters which are difficult to determine by physical experiments due to the limitation of the existing experimental techniques.
△ Less
Submitted 20 March, 2019; v1 submitted 4 June, 2018;
originally announced June 2018.
-
Hit Song Prediction for Pop Music by Siamese CNN with Ranking Loss
Authors:
Lang-Chi Yu,
Yi-Hsuan Yang,
Yun-Ning Hung,
Yi-An Chen
Abstract:
A model for hit song prediction can be used in the pop music industry to identify emerging trends and potential artists or songs before they are marketed to the public. While most previous work formulates hit song prediction as a regression or classification problem, we present in this paper a convolutional neural network (CNN) model that treats it as a ranking problem. Specifically, we use a comm…
▽ More
A model for hit song prediction can be used in the pop music industry to identify emerging trends and potential artists or songs before they are marketed to the public. While most previous work formulates hit song prediction as a regression or classification problem, we present in this paper a convolutional neural network (CNN) model that treats it as a ranking problem. Specifically, we use a commercial dataset with daily play-counts to train a multi-objective Siamese CNN model with Euclidean loss and pairwise ranking loss to learn from audio the relative ranking relations among songs. Besides, we devise a number of pair sampling methods according to some empirical observation of the data. Our experiment shows that the proposed model with a sampling method called A/B sampling leads to much higher accuracy in hit song prediction than the baseline regression model. Moreover, we can further improve the accuracy by using a neural attention mechanism to extract the highlights of songs and by using a separate CNN model to offer high-level features of songs.
△ Less
Submitted 30 October, 2017;
originally announced October 2017.
-
A generalized Gaussian process model for computer experiments with binary time series
Authors:
Chih-Li Sung,
Ying Hung,
William Rittase,
Cheng Zhu,
C. F. Jeff Wu
Abstract:
Non-Gaussian observations such as binary responses are common in some computer experiments. Motivated by the analysis of a class of cell adhesion experiments, we introduce a generalized Gaussian process model for binary responses, which shares some common features with standard GP models. In addition, the proposed model incorporates a flexible mean function that can capture different types of time…
▽ More
Non-Gaussian observations such as binary responses are common in some computer experiments. Motivated by the analysis of a class of cell adhesion experiments, we introduce a generalized Gaussian process model for binary responses, which shares some common features with standard GP models. In addition, the proposed model incorporates a flexible mean function that can capture different types of time series structures. Asymptotic properties of the estimators are derived, and an optimal predictor as well as its predictive distribution are constructed. Their performance is examined via two simulation studies. The methodology is applied to study computer simulations for cell adhesion experiments. The fitted model reveals important biological information in repeated cell bindings, which is not directly observable in lab experiments.
△ Less
Submitted 24 September, 2018; v1 submitted 6 May, 2017;
originally announced May 2017.
-
Analysis of Computer Experiments with Functional Response
Authors:
Ying Hung,
V. Roshan Joseph,
Shreyes N. Melkote
Abstract:
This paper is motivated by a computer experiment conducted for optimizing residual stresses in the machining of metals. Although kriging is widely used in the analysis of computer experiments, it cannot be easily applied to model the residual stresses because they are obtained as a profile. The high dimensionality caused by this functional response introduces severe computational challenges in kri…
▽ More
This paper is motivated by a computer experiment conducted for optimizing residual stresses in the machining of metals. Although kriging is widely used in the analysis of computer experiments, it cannot be easily applied to model the residual stresses because they are obtained as a profile. The high dimensionality caused by this functional response introduces severe computational challenges in kriging. It is well known that if the functional data are observed on a regular grid, the computations can be simplified using an application of Kronecker products. However, the case of irregular grid is quite complex. In this paper, we develop a Gibbs sampling-based expectation maximization algorithm, which converts the irregularly spaced data into a regular grid so that the Kronecker product-based approach can be employed for efficiently fitting a kriging model to the functional data.
△ Less
Submitted 7 November, 2012;
originally announced November 2012.
-
Order selection in nonlinear time series models with application to the study of cell memory
Authors:
Ying Hung
Abstract:
Cell adhesion experiments are biomechanical experiments studying the binding of a cell to another cell at the level of single molecules. Such a study plays an important role in tumor metastasis in cancer study. Motivated by analyzing a repeated cell adhesion experiment, a new class of nonlinear time series models with an order selection procedure is developed in this paper. Due to the nonlinearity…
▽ More
Cell adhesion experiments are biomechanical experiments studying the binding of a cell to another cell at the level of single molecules. Such a study plays an important role in tumor metastasis in cancer study. Motivated by analyzing a repeated cell adhesion experiment, a new class of nonlinear time series models with an order selection procedure is developed in this paper. Due to the nonlinearity, there are two types of overfitting. Therefore, a double penalized approach is introduced for order selection. To implement this approach, a global optimization algorithm using mixed integer programming is discussed. The procedure is shown to be asymptotically consistent in estimating both the order and parameters of the proposed model. Simulations show that the new order selection approach outperforms standard methods. The finite-sample performance of the estimator is also examined via a simulation study. The application of the proposed methodology to a T-cell experiment provides a better understanding of the kinetics and mechanics of cell adhesion, including quantifying the memory effect on a repeated unbinding force experiment and identifying the order of the memory.
△ Less
Submitted 1 October, 2012;
originally announced October 2012.