Search | arXiv e-print repository

Online Matching: A Real-time Bandit System for Large-scale Recommendations

Authors: Xinyang Yi, Shao-Chuan Wang, Ruining He, Hariharan Chandrasekaran, Charles Wu, Lukasz Heldt, Lichan Hong, Minmin Chen, Ed H. Chi

Abstract: The last decade has witnessed many successes of deep learning-based models for industry-scale recommender systems. These models are typically trained offline in a batch manner. While being effective in capturing users' past interactions with recommendation platforms, batch learning suffers from long model-update latency and is vulnerable to system biases, making it hard to adapt to distribution sh… ▽ More The last decade has witnessed many successes of deep learning-based models for industry-scale recommender systems. These models are typically trained offline in a batch manner. While being effective in capturing users' past interactions with recommendation platforms, batch learning suffers from long model-update latency and is vulnerable to system biases, making it hard to adapt to distribution shift and explore new items or user interests. Although online learning-based approaches (e.g., multi-armed bandits) have demonstrated promising theoretical results in tackling these challenges, their practical real-time implementation in large-scale recommender systems remains limited. First, the scalability of online approaches in servicing a massive online traffic while ensuring timely updates of bandit parameters poses a significant challenge. Additionally, exploring uncertainty in recommender systems can easily result in unfavorable user experience, highlighting the need for devising intricate strategies that effectively balance the trade-off between exploitation and exploration. In this paper, we introduce Online Matching: a scalable closed-loop bandit system learning from users' direct feedback on items in real time. We present a hybrid "offline + online" approach for constructing this system, accompanied by a comprehensive exposition of the end-to-end system architecture. We propose Diag-LinUCB -- a novel extension of the LinUCB algorithm -- to enable distributed updates of bandits parameter in a scalable and timely manner. We conduct live experiments in YouTube and show that Online Matching is able to enhance the capabilities of fresh content discovery and item exploration in the present platform. △ Less

Submitted 29 July, 2023; originally announced July 2023.

Comments: RecSys 2023

arXiv:2001.06610 [pdf]

doi 10.1109/SusTech47890.2020.9150529

A New Model to Analyze Power and Communication System Intra-and-Inter Dependencies

Authors: Sohini Roy, Harish Chandrasekaran, Anamitra Pal, Arunabha Sen

Abstract: The reliable and resilient operation of the smart grid necessitates a clear understanding of the intra-and-inter dependencies of its power and communication systems. This understanding can only be achieved by accurately depicting the interactions between the different components of these two systems. This paper presents a model, called modified implicative interdependency model (MIIM), for capturi… ▽ More The reliable and resilient operation of the smart grid necessitates a clear understanding of the intra-and-inter dependencies of its power and communication systems. This understanding can only be achieved by accurately depicting the interactions between the different components of these two systems. This paper presents a model, called modified implicative interdependency model (MIIM), for capturing these interactions. Data obtained from a power utility in the U.S. Southwest is used to ensure the validity of the model. The performance of the model for a specific power system application namely, state estimation, is demonstrated using the IEEE 118-bus system. The results indicate that the proposed model is more accurate than its predecessor, the implicative interdependency model (IIM) [1], in predicting the system state in case of failures in the power and/or communication systems. △ Less

Submitted 18 January, 2020; originally announced January 2020.

Comments: 8 pages. Accepted for publication in SusTech 2020 : 2020 IEEE Conference on Technologies for Sustainability

arXiv:1905.10979 [pdf, other]

Scalable K-Medoids via True Error Bound and Familywise Bandits

Authors: Aravindakshan Babu, Saurabh Agarwal, Sudarshan Babu, Hariharan Chandrasekaran

Abstract: K-Medoids(KM) is a standard clustering method, used extensively on semi-metric data.Error analyses of KM have traditionally used an in-sample notion of error,which can be far from the true error and suffer from generalization gap. We formalize the true K-Medoid error based on the underlying data distribution.We decompose the true error into fundamental statistical problems of: minimum estimation (… ▽ More K-Medoids(KM) is a standard clustering method, used extensively on semi-metric data.Error analyses of KM have traditionally used an in-sample notion of error,which can be far from the true error and suffer from generalization gap. We formalize the true K-Medoid error based on the underlying data distribution.We decompose the true error into fundamental statistical problems of: minimum estimation (ME) and minimum mean estimation (MME). We provide a convergence result for MME. We show $\errMME$ decreases no slower than $Θ(\frac{1}{n^{\frac{2}{3}}})$, where $n$ is a measure of sample size. Inspired by this bound, we propose a computationally efficient, distributed KM algorithm namely MCPAM. MCPAM has expected runtime $\mathcal{O}(km)$,where $k$ is the number of medoids and $m$ is number of samples. MCPAM provides massive computational savings for a small tradeoff in accuracy. We verify the quality and scaling properties of MCPAM on various datasets. And achieve the hitherto unachieved feat of calculating the KM of 1 billion points on semi-metric spaces. △ Less

Submitted 29 October, 2019; v1 submitted 27 May, 2019; originally announced May 2019.

Showing 1–3 of 3 results for author: Chandrasekaran, H