Zum Hauptinhalt springen

Showing 1–20 of 20 results for author: Charoenphakdee, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.10656  [pdf, other

    cs.LG cs.AI stat.ML

    Virtual Human Generative Model: Masked Modeling Approach for Learning Human Characteristics

    Authors: Kenta Oono, Nontawat Charoenphakdee, Kotatsu Bito, Zhengyan Gao, Yoshiaki Ota, Shoichiro Yamaguchi, Yohei Sugawara, Shin-ichi Maeda, Kunihiko Miyoshi, Yuki Saito, Koki Tsuda, Hiroshi Maruyama, Kohei Hayashi

    Abstract: Identifying the relationship between healthcare attributes, lifestyles, and personality is vital for understanding and improving physical and mental conditions. Machine learning approaches are promising for modeling their relationships and offering actionable suggestions. In this paper, we propose Virtual Human Generative Model (VHGM), a machine learning model for estimating attributes about healt… ▽ More

    Submitted 14 August, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: 14 pages, 4 figures

  2. arXiv:2210.17128  [pdf, other

    cs.LG cs.AI

    Diffusion models for missing value imputation in tabular data

    Authors: Shuhan Zheng, Nontawat Charoenphakdee

    Abstract: Missing value imputation in machine learning is the task of estimating the missing values in the dataset accurately using available information. In this task, several deep generative modeling methods have been proposed and demonstrated their usefulness, e.g., generative adversarial imputation networks. Recently, diffusion models have gained popularity because of their effectiveness in the generati… ▽ More

    Submitted 10 March, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: Accepted to Table Representation Learning Workshop at NeurIPS 2022. Renamed proposed method name to TabCSDI

  3. arXiv:2202.00395  [pdf, other

    cs.LG stat.ML

    Is the Performance of My Deep Network Too Good to Be True? A Direct Approach to Estimating the Bayes Error in Binary Classification

    Authors: Takashi Ishida, Ikko Yamane, Nontawat Charoenphakdee, Gang Niu, Masashi Sugiyama

    Abstract: There is a fundamental limitation in the prediction performance that a machine learning model can achieve due to the inevitable uncertainty of the prediction target. In classification problems, this can be characterized by the Bayes error, which is the best achievable error with any classifier. The Bayes error can be used as a criterion to evaluate classifiers with state-of-the-art performance and… ▽ More

    Submitted 13 March, 2023; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: ICLR 2023 (notable-top-5%)

  4. arXiv:2109.04400  [pdf

    cs.CL cs.AI cs.LG

    Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous Graph

    Authors: Nuttapong Chairatanakul, Noppayut Sriwatanasakdi, Nontawat Charoenphakdee, Xin Liu, Tsuyoshi Murata

    Abstract: In cross-lingual text classification, it is required that task-specific training data in high-resource source languages are available, where the task is identical to that of a low-resource target language. However, collecting such training data can be infeasible because of the labeling cost, task characteristics, and privacy concerns. This paper proposes an alternative solution that uses only task… ▽ More

    Submitted 9 September, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Published in Findings of EMNLP 2021

  5. arXiv:2101.01366  [pdf, other

    stat.ML cs.LG

    A Symmetric Loss Perspective of Reliable Machine Learning

    Authors: Nontawat Charoenphakdee, Jongyeong Lee, Masashi Sugiyama

    Abstract: When minimizing the empirical risk in binary classification, it is a common practice to replace the zero-one loss with a surrogate loss to make the learning objective feasible to optimize. Examples of well-known surrogate losses for binary classification include the logistic loss, hinge loss, and sigmoid loss. It is known that the choice of a surrogate loss can highly influence the performance of… ▽ More

    Submitted 5 June, 2023; v1 submitted 5 January, 2021; originally announced January 2021.

    Comments: Invited article preprint

  6. arXiv:2011.09172  [pdf, other

    stat.ML cs.LG

    On Focal Loss for Class-Posterior Probability Estimation: A Theoretical Perspective

    Authors: Nontawat Charoenphakdee, Jayakorn Vongkulbhisal, Nuttapong Chairatanakul, Masashi Sugiyama

    Abstract: The focal loss has demonstrated its effectiveness in many real-world applications such as object detection and image classification, but its theoretical understanding has been limited so far. In this paper, we first prove that the focal loss is classification-calibrated, i.e., its minimizer surely yields the Bayes-optimal classifier and thus the use of the focal loss in classification can be theor… ▽ More

    Submitted 13 December, 2020; v1 submitted 18 November, 2020; originally announced November 2020.

    Comments: 57 pages

  7. arXiv:2010.11748  [pdf, other

    stat.ML cs.LG

    Classification with Rejection Based on Cost-sensitive Classification

    Authors: Nontawat Charoenphakdee, Zhenghang Cui, Yivan Zhang, Masashi Sugiyama

    Abstract: The goal of classification with rejection is to avoid risky misclassification in error-critical applications such as medical diagnosis and product inspection. In this paper, based on the relationship between classification with rejection and cost-sensitive classification, we propose a novel method of classification with rejection by learning an ensemble of cost-sensitive classifiers, which satisfi… ▽ More

    Submitted 29 September, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: 40 pages. Added the discussion of the recent work by Gangrade et al. (2021) at the end of Section 3.4, where the idea of constructing cost-sensitive classifiers for classification with rejection has also been explored in a different framework of classification with rejection (where the goal is not minimizing the 0-1-c risk as in our paper)

  8. arXiv:2010.10181  [pdf, other

    stat.ML cs.AI cs.LG

    Robust Imitation Learning from Noisy Demonstrations

    Authors: Voot Tangkaratt, Nontawat Charoenphakdee, Masashi Sugiyama

    Abstract: Robust learning from noisy demonstrations is a practical but highly challenging problem in imitation learning. In this paper, we first theoretically show that robust imitation learning can be achieved by optimizing a classification risk with a symmetric loss. Based on this theoretical finding, we then propose a new imitation learning method that optimizes the classification risk by effectively com… ▽ More

    Submitted 19 February, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: 16 pages, 9 figures. Accepted to AISTATS 2021

  9. arXiv:2004.06316  [pdf, other

    stat.ML cs.LG

    Learning from Aggregate Observations

    Authors: Yivan Zhang, Nontawat Charoenphakdee, Zhenguo Wu, Masashi Sugiyama

    Abstract: We study the problem of learning from aggregate observations where supervision signals are given to sets of instances instead of individual instances, while the goal is still to predict labels of unseen individuals. A well-known example is multiple instance learning (MIL). In this paper, we extend MIL beyond binary classification to other problems such as multiclass classification and regression.… ▽ More

    Submitted 7 January, 2021; v1 submitted 14 April, 2020; originally announced April 2020.

    Comments: NeurIPS 2020 proceedings version

  10. arXiv:2003.04691  [pdf, other

    stat.ML cs.LG

    Time-varying Gaussian Process Bandit Optimization with Non-constant Evaluation Time

    Authors: Hideaki Imamura, Nontawat Charoenphakdee, Futoshi Futami, Issei Sato, Junya Honda, Masashi Sugiyama

    Abstract: The Gaussian process bandit is a problem in which we want to find a maximizer of a black-box function with the minimum number of function evaluations. If the black-box function varies with time, then time-varying Bayesian optimization is a promising framework. However, a drawback with current methods is in the assumption that the evaluation time for every observation is constant, which can be unre… ▽ More

    Submitted 10 March, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

  11. arXiv:1910.04394  [pdf, other

    stat.ML cs.LG

    Learning from Indirect Observations

    Authors: Yivan Zhang, Nontawat Charoenphakdee, Masashi Sugiyama

    Abstract: Weakly-supervised learning is a paradigm for alleviating the scarcity of labeled data by leveraging lower-quality but larger-scale supervision signals. While existing work mainly focuses on utilizing a certain type of weak supervision, we present a probabilistic framework, learning from indirect observations, for learning from a wide range of weak supervision in real-world problems, e.g., noisy la… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

  12. arXiv:1910.04385  [pdf, other

    cs.CL cs.LG stat.ML

    Learning Only from Relevant Keywords and Unlabeled Documents

    Authors: Nontawat Charoenphakdee, Jongyeong Lee, Yiping Jin, Dittaya Wanvarie, Masashi Sugiyama

    Abstract: We consider a document classification problem where document labels are absent but only relevant keywords of a target class and unlabeled documents are given. Although heuristic methods based on pseudo-labeling have been considered, theoretical understanding of this problem has still been limited. Moreover, previous methods cannot easily incorporate well-developed techniques in supervised text cla… ▽ More

    Submitted 29 October, 2019; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: EMNLP-IJCNLP2019, fix typos in Theorem 1: change $π$ and $π'$ to $θ$ and $θ'$

  13. arXiv:1907.10225  [pdf, ps, other

    cs.LG stat.ML

    Classification from Triplet Comparison Data

    Authors: Zhenghang Cui, Nontawat Charoenphakdee, Issei Sato, Masashi Sugiyama

    Abstract: Learning from triplet comparison data has been extensively studied in the context of metric learning, where we want to learn a distance metric between two instances, and ordinal embedding, where we want to learn an embedding in an Euclidean space of the given instances that preserves the comparison order as well as possible. Unlike fully-labeled data, triplet comparison data can be collected in a… ▽ More

    Submitted 18 April, 2020; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: Code: https://github.com/zchenry/triplet_classification

  14. arXiv:1901.11351  [pdf, other

    cs.LG stat.ML

    Semi-Supervised Ordinal Regression Based on Empirical Risk Minimization

    Authors: Taira Tsuchiya, Nontawat Charoenphakdee, Issei Sato, Masashi Sugiyama

    Abstract: Ordinal regression is aimed at predicting an ordinal class label. In this paper, we consider its semi-supervised formulation, in which we have unlabeled data along with ordinal-labeled data to train an ordinal regressor. There are several metrics to evaluate the performance of ordinal regression, such as the mean absolute error, mean zero-one error, and mean squared error. However, the existing st… ▽ More

    Submitted 10 June, 2021; v1 submitted 31 January, 2019; originally announced January 2019.

    Comments: 38 pages, 9 figures

  15. arXiv:1901.10655  [pdf, other

    stat.ML cs.LG

    On the Calibration of Multiclass Classification with Rejection

    Authors: Chenri Ni, Nontawat Charoenphakdee, Junya Honda, Masashi Sugiyama

    Abstract: We investigate the problem of multiclass classification with rejection, where a classifier can choose not to make a prediction to avoid critical misclassification. First, we consider an approach based on simultaneous training of a classifier and a rejector, which achieves the state-of-the-art performance in the binary case. We analyze this approach for the multiclass case and derive a general cond… ▽ More

    Submitted 29 October, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: NeurIPS2019 camera-ready, 31 pages

  16. arXiv:1901.10654  [pdf, other

    stat.ML cs.LG

    Domain Discrepancy Measure for Complex Models in Unsupervised Domain Adaptation

    Authors: Jongyeong Lee, Nontawat Charoenphakdee, Seiichi Kuroki, Masashi Sugiyama

    Abstract: Appropriately evaluating the discrepancy between domains is essential for the success of unsupervised domain adaptation. In this paper, we first point out that existing discrepancy measures are less informative when complex models such as deep neural networks are used, in addition to the facts that they can be computationally highly demanding and their range of applications is limited only to bina… ▽ More

    Submitted 21 October, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: 21 pages

  17. arXiv:1901.09387  [pdf, other

    cs.LG cs.AI stat.ML

    Imitation Learning from Imperfect Demonstration

    Authors: Yueh-Hua Wu, Nontawat Charoenphakdee, Han Bao, Voot Tangkaratt, Masashi Sugiyama

    Abstract: Imitation learning (IL) aims to learn an optimal policy from demonstrations. However, such demonstrations are often imperfect since collecting optimal ones is costly. To effectively learn from imperfect demonstrations, we propose a novel approach that utilizes confidence scores, which describe the quality of demonstrations. More specifically, we propose two confidence-based IL methods, namely two-… ▽ More

    Submitted 29 January, 2019; v1 submitted 27 January, 2019; originally announced January 2019.

  18. arXiv:1901.09314  [pdf, other

    stat.ML cs.LG

    On Symmetric Losses for Learning from Corrupted Labels

    Authors: Nontawat Charoenphakdee, Jongyeong Lee, Masashi Sugiyama

    Abstract: This paper aims to provide a better understanding of a symmetric loss. First, we emphasize that using a symmetric loss is advantageous in the balanced error rate (BER) minimization and area under the receiver operating characteristic curve (AUC) maximization from corrupted labels. Second, we prove general theoretical properties of symmetric losses, including a classification-calibration condition,… ▽ More

    Submitted 7 September, 2019; v1 submitted 26 January, 2019; originally announced January 2019.

    Comments: ICML2019 with minor typo fixes

  19. arXiv:1809.07011  [pdf, other

    stat.ML cs.LG

    Positive-Unlabeled Classification under Class Prior Shift and Asymmetric Error

    Authors: Nontawat Charoenphakdee, Masashi Sugiyama

    Abstract: Bottlenecks of binary classification from positive and unlabeled data (PU classification) are the requirements that given unlabeled patterns are drawn from the test marginal distribution, and the penalty of the false positive error is identical to the false negative error. However, such requirements are often not fulfilled in practice. In this paper, we generalize PU classification to the class pr… ▽ More

    Submitted 9 November, 2020; v1 submitted 19 September, 2018; originally announced September 2018.

    Comments: Fixed typos

  20. arXiv:1809.03839  [pdf, other

    cs.LG stat.ML

    Unsupervised Domain Adaptation Based on Source-guided Discrepancy

    Authors: Seiichi Kuroki, Nontawat Charoenphakdee, Han Bao, Junya Honda, Issei Sato, Masashi Sugiyama

    Abstract: Unsupervised domain adaptation is the problem setting where data generating distributions in the source and target domains are different, and labels in the target domain are unavailable. One important question in unsupervised domain adaptation is how to measure the difference between the source and target domains. A previously proposed discrepancy that does not use the source domain labels require… ▽ More

    Submitted 19 November, 2018; v1 submitted 11 September, 2018; originally announced September 2018.

    Comments: To appear in AAAI-19