
Showing 1–1 of 1 results for author: Balakir, A

  1. arXiv:2310.09426  [pdf, other]

    cs.LG stat.ML

    Offline Reinforcement Learning for Optimizing Production Bidding Policies

    Authors: Dmytro Korenkevych, Frank Cheng, Artsiom Balakir, Alex Nikulkov, Lingnan Gao, Zhihao Cen, Zuobing Xu, Zheqing Zhu

    Abstract: The online advertising market, with its thousands of auctions run per second, presents a daunting challenge for advertisers who wish to optimize their spend under a budget constraint. Thus, advertising platforms typically provide automated agents to their customers, which act on their behalf to bid for impression opportunities in real time at scale. Because these proxy agents are owned by the plat…

    Submitted 13 October, 2023; originally announced October 2023.