Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Komiyama, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.10429  [pdf, ps, other

    stat.ML cs.LG

    Fixed Confidence Best Arm Identification in the Bayesian Setting

    Authors: Kyoungseok Jang, Junpei Komiyama, Kazutoshi Yamazaki

    Abstract: We consider the fixed-confidence best arm identification (FC-BAI) problem in the Bayesian setting. This problem aims to find the arm of the largest mean with a fixed confidence level when the bandit model has been sampled from the known prior. Most studies on the FC-BAI problem have been conducted in the frequentist setting, where the bandit model is predetermined before the game starts. We show t… ▽ More

    Submitted 22 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  2. arXiv:2402.07391  [pdf, other

    stat.ML cs.LG

    Replicability is Asymptotically Free in Multi-armed Bandits

    Authors: Junpei Komiyama, Shinji Ito, Yuichi Yoshida, Souta Koshino

    Abstract: This work is motivated by the growing demand for reproducible machine learning. We study the stochastic multi-armed bandit problem. In particular, we consider a replicable algorithm that ensures, with high probability, that the algorithm's sequence of actions is not affected by the randomness inherent in the dataset. We observe that existing algorithms require $O(1/ρ^2)$ times more regret than non… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  3. arXiv:2311.09068  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Fair Division from Bandit Feedback

    Authors: Hakuei Yamada, Junpei Komiyama, Kenshi Abe, Atsushi Iwasaki

    Abstract: This work addresses learning online fair division under uncertainty, where a central planner sequentially allocates items without precise knowledge of agents' values or utilities. Departing from conventional online algorithm, the planner here relies on noisy, estimated values obtained after allocating items. We introduce wrapper algorithms utilizing \textit{dual averaging}, enabling gradual learni… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  4. arXiv:2306.11017  [pdf, ps, other

    stat.ML cs.LG

    High-dimensional Contextual Bandit Problem without Sparsity

    Authors: Junpei Komiyama, Masaaki Imaizumi

    Abstract: In this research, we investigate the high-dimensional linear contextual bandit problem where the number of features $p$ is greater than the budget $T$, or it may even be infinite. Differing from the majority of previous works in this field, we do not impose sparsity on the regression coefficients. Instead, we rely on recent findings on overparameterized models, which enables us to analyze the perf… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  5. arXiv:2207.04480  [pdf, other

    econ.GN cs.CY

    Strategic Choices of Migrants and Smugglers in the Central Mediterranean Sea

    Authors: Katherine Hoffmann Pham, Junpei Komiyama

    Abstract: The sea crossing from Libya to Italy is one of the world's most dangerous and politically contentious migration routes, and yet over half a million people have attempted the crossing since 2014. Leveraging data on aggregate migration flows and individual migration incidents, we estimate how migrants and smugglers have reacted to changes in border enforcement, namely the rise in interceptions by th… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

  6. arXiv:2206.04646  [pdf, other

    stat.ML cs.LG

    Minimax Optimal Algorithms for Fixed-Budget Best Arm Identification

    Authors: Junpei Komiyama, Taira Tsuchiya, Junya Honda

    Abstract: We consider the fixed-budget best arm identification problem where the goal is to find the arm of the largest mean with a fixed number of samples. It is known that the probability of misidentifying the best arm is exponentially small to the number of rounds. However, limited characterizations have been discussed on the rate (exponent) of this value. In this paper, we characterize the minimax optim… ▽ More

    Submitted 26 October, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 version https://openreview.net/forum?id=TIQfmR7IF6H

  7. arXiv:2202.06570  [pdf, other

    cs.GT cs.AI

    Anytime Capacity Expansion in Medical Residency Match by Monte Carlo Tree Search

    Authors: Kenshi Abe, Junpei Komiyama, Atsushi Iwasaki

    Abstract: This paper considers the capacity expansion problem in two-sided matchings, where the policymaker is allowed to allocate some extra seats as well as the standard seats. In medical residency match, each hospital accepts a limited number of doctors. Such capacity constraints are typically given in advance. However, such exogenous constraints can compromise the welfare of the doctors; some popular ho… ▽ More

    Submitted 22 May, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Journal ref: IJCAI 2022

  8. arXiv:2202.05193  [pdf, other

    stat.ML cs.LG math.PR

    Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification

    Authors: Junpei Komiyama

    Abstract: We consider the fixed-budget best arm identification problem with rewards following normal distributions. In this problem, the forecaster is given $K$ arms (or treatments) and $T$ time steps. The forecaster attempts to find the arm with the largest mean, via an adaptive experiment conducted using an algorithm. The algorithm's performance is evaluated by simple regret, reflecting the quality of the… ▽ More

    Submitted 14 April, 2024; v1 submitted 10 February, 2022; originally announced February 2022.

  9. arXiv:2111.09885  [pdf, other

    cs.LG stat.ML

    Rate-optimal Bayesian Simple Regret in Best Arm Identification

    Authors: Junpei Komiyama, Kaito Ariu, Masahiro Kato, Chao Qin

    Abstract: We consider best arm identification in the multi-armed bandit problem. Assuming certain continuity conditions of the prior, we characterize the rate of the Bayesian simple regret. Differing from Bayesian regret minimization (Lai, 1987), the leading term in the Bayesian simple regret derives from the region where the gap between optimal and suboptimal arms is smaller than $\sqrt{\frac{\log T}{T}}$.… ▽ More

    Submitted 25 July, 2023; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: To appear in Mathematics of Operations Research. Changed the title from the previous version

    MSC Class: Primary: 62L05; secondary: 62C10; 68W27

  10. arXiv:2109.09816  [pdf, other

    econ.TH cs.IR stat.ML

    Deviation-Based Learning: Training Recommender Systems Using Informed User Choice

    Authors: Junpei Komiyama, Shunya Noda

    Abstract: This paper proposes a new approach to training recommender systems called deviation-based learning. The recommender and rational users have different knowledge. The recommender learns user knowledge by observing what action users take upon receiving recommendations. Learning eventually stalls if the recommender always suggests a choice: Before the recommender completes learning, users start follow… ▽ More

    Submitted 18 August, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

  11. arXiv:2109.08229  [pdf, ps, other

    econ.EM cs.LG stat.ME

    Policy Choice and Best Arm Identification: Asymptotic Analysis of Exploration Sampling

    Authors: Kaito Ariu, Masahiro Kato, Junpei Komiyama, Kenichiro McAlinn, Chao Qin

    Abstract: We consider the "policy choice" problem -- otherwise known as best arm identification in the bandit literature -- proposed by Kasy and Sautmann (2021) for adaptive experimental design. Theorem 1 of Kasy and Sautmann (2021) provides three asymptotic results that give theoretical guarantees for exploration sampling developed for this setting. We first show that the proof of Theorem 1 (1) has technic… ▽ More

    Submitted 24 November, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: Submitted to Econometrica

  12. arXiv:2107.11419  [pdf, other

    stat.ML cs.LG

    Finite-time Analysis of Globally Nonstationary Multi-Armed Bandits

    Authors: Junpei Komiyama, Edouard Fouché, Junya Honda

    Abstract: We consider nonstationary multi-armed bandit problems where the model parameters of the arms change over time. We introduce the adaptive resetting bandit (ADR-bandit), a bandit algorithm class that leverages adaptive windowing techniques from literature on data streams. We first provide new guarantees on the quality of estimators resulting from adaptive windowing techniques, which are of independe… ▽ More

    Submitted 25 October, 2023; v1 submitted 23 July, 2021; originally announced July 2021.

    Comments: Revision: Regret bound for ADR-Bandit + TS

  13. arXiv:2102.07826  [pdf, other

    stat.ME cs.AI cs.LG stat.ML

    Controlling False Discovery Rates under Cross-Sectional Correlations

    Authors: Junpei Komiyama, Masaya Abe, Kei Nakagawa, Kenichiro McAlinn

    Abstract: We consider controlling the false discovery rate for testing many time series with an unknown cross-sectional correlation structure. Given a large number of hypotheses, false and missing discoveries can plague an analysis. While many procedures have been proposed to control false discovery, most of them either assume independent hypotheses or lack statistical power. A problem of particular interes… ▽ More

    Submitted 9 June, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  14. arXiv:2010.01079  [pdf, other

    econ.TH cs.GT econ.EM stat.ML

    On Statistical Discrimination as a Failure of Social Learning: A Multi-Armed Bandit Approach

    Authors: Junpei Komiyama, Shunya Noda

    Abstract: We analyze statistical discrimination in hiring markets using a multi-armed bandit model. Myopic firms face workers arriving with heterogeneous observable characteristics. The association between the worker's skill and characteristics is unknown ex ante; thus, firms need to learn it. Laissez-faire causes perpetual underestimation: minority workers are rarely hired, and therefore, the underestimati… ▽ More

    Submitted 14 July, 2023; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: 1st round of revision (management science)

  15. arXiv:1910.01491  [pdf, other

    q-fin.ST cs.LG stat.AP stat.ML

    A Robust Transferable Deep Learning Framework for Cross-sectional Investment Strategy

    Authors: Kei Nakagawa, Masaya Abe, Junpei Komiyama

    Abstract: Stock return predictability is an important research theme as it reflects our economic and social organization, and significant efforts are made to explain the dynamism therein. Statistics of strong explanative power, called "factor" have been proposed to summarize the essence of predictive stock returns. Although machine learning methods are increasingly popular in stock return prediction, an inf… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

  16. arXiv:1810.04996  [pdf, other

    stat.ME cs.LG stat.ML

    A Simple Way to Deal with Cherry-picking

    Authors: Junpei Komiyama, Takanori Maehara

    Abstract: Statistical hypothesis testing serves as statistical evidence for scientific innovation. However, if the reported results are intentionally biased, hypothesis testing no longer controls the rate of false discovery. In particular, we study such selection bias in machine learning models where the reporter is motivated to promote an algorithmic innovation. When the number of possible configurations (… ▽ More

    Submitted 11 October, 2018; originally announced October 2018.

  17. arXiv:1806.05112  [pdf, other

    cs.AI cs.LG stat.ML

    Comparing Fairness Criteria Based on Social Outcome

    Authors: Junpei Komiyama, Hajime Shimao

    Abstract: Fairness in algorithmic decision-making processes is attracting increasing concern. When an algorithm is applied to human-related decision-making an estimator solely optimizing its predictive power can learn biases on the existing data, which motivates us the notion of fairness in machine learning. while several different notions are studied in the literature, little studies are done on how these… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

  18. arXiv:1710.04924  [pdf, other

    stat.ML cs.AI cs.LG

    Two-stage Algorithm for Fairness-aware Machine Learning

    Authors: Junpei Komiyama, Hajime Shimao

    Abstract: Algorithmic decision making process now affects many aspects of our lives. Standard tools for machine learning, such as classification and regression, are subject to the bias in data, and thus direct application of such off-the-shelf tools could lead to a specific group being unfairly discriminated. Removing sensitive attributes of data does not solve this problem because a \textit{disparate impac… ▽ More

    Submitted 13 October, 2017; originally announced October 2017.

  19. arXiv:1605.01677  [pdf, other

    stat.ML cs.LG

    Copeland Dueling Bandit Problem: Regret Lower Bound, Optimal Algorithm, and Computationally Efficient Algorithm

    Authors: Junpei Komiyama, Junya Honda, Hiroshi Nakagawa

    Abstract: We study the K-armed dueling bandit problem, a variation of the standard stochastic bandit problem where the feedback is limited to relative comparisons of a pair of arms. The hardness of recommending Copeland winners, the arms that beat the greatest number of other arms, is characterized by deriving an asymptotic regret bound. We propose Copeland Winners Relative Minimum Empirical Divergence (CW-… ▽ More

    Submitted 24 May, 2016; v1 submitted 5 May, 2016; originally announced May 2016.

    Comments: To appear in ICML2016

  20. arXiv:1509.09011  [pdf, ps, other

    stat.ML cs.LG

    Regret Lower Bound and Optimal Algorithm in Finite Stochastic Partial Monitoring

    Authors: Junpei Komiyama, Junya Honda, Hiroshi Nakagawa

    Abstract: Partial monitoring is a general model for sequential learning with limited feedback formalized as a game between two players. In this game, the learner chooses an action and at the same time the opponent chooses an outcome, then the learner suffers a loss and receives a feedback signal. The goal of the learner is to minimize the total loss. In this paper, we study partial monitoring with finite ac… ▽ More

    Submitted 30 September, 2015; originally announced September 2015.

    Comments: 24 pages, to appear in NIPS2015

  21. arXiv:1506.02550  [pdf, ps, other

    stat.ML cs.LG

    Regret Lower Bound and Optimal Algorithm in Dueling Bandit Problem

    Authors: Junpei Komiyama, Junya Honda, Hisashi Kashima, Hiroshi Nakagawa

    Abstract: We study the $K$-armed dueling bandit problem, a variation of the standard stochastic bandit problem where the feedback is limited to relative comparisons of a pair of arms. We introduce a tight asymptotic regret lower bound that is based on the information divergence. An algorithm that is inspired by the Deterministic Minimum Empirical Divergence algorithm (Honda and Takemura, 2010) is proposed,… ▽ More

    Submitted 29 June, 2015; v1 submitted 8 June, 2015; originally announced June 2015.

    Comments: 26 pages, 10 figures, to appear in COLT2015 (ver.3: revised related work (RUCB))

  22. arXiv:1506.00779  [pdf, other

    stat.ML cs.LG

    Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays

    Authors: Junpei Komiyama, Junya Honda, Hiroshi Nakagawa

    Abstract: We discuss a multiple-play multi-armed bandit (MAB) problem in which several arms are selected at each round. Recently, Thompson sampling (TS), a randomized algorithm with a Bayesian spirit, has attracted much attention for its empirically excellent performance, and it is revealed to have an optimal regret bound in the standard single-play MAB problem. In this paper, we propose the multiple-play T… ▽ More

    Submitted 20 March, 2019; v1 submitted 2 June, 2015; originally announced June 2015.

    Comments: Appeared in ICML2015. Fixed the evaluation of term (B) in Lemma 3. Replaced \tildeμ->θ