Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Gampa, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.03148  [pdf, other

    cs.IR cs.LG

    Multi-Task Learning For Reduced Popularity Bias In Multi-Territory Video Recommendations

    Authors: Phanideep Gampa, Farnoosh Javadi, Belhassen Bayar, Ainur Yessenalina

    Abstract: Various data imbalances that naturally arise in a multi-territory personalized recommender system can lead to a significant item bias for globally prevalent items. A locally popular item can be overshadowed by a globally prevalent item. Moreover, users' viewership patterns/statistics can drastically change from one geographic location to another which may suggest to learn specific user embeddings.… ▽ More

    Submitted 24 September, 2023; originally announced October 2023.

    Comments: Recsys CARS 2023 Workshop paper

  2. arXiv:2310.01419  [pdf, other

    cs.IR cs.LG

    Design Principles of Robust Multi-Armed Bandit Framework in Video Recommendations

    Authors: Belhassen Bayar, Phanideep Gampa, Ainur Yessenalina, Zhen Wen

    Abstract: Current multi-armed bandit approaches in recommender systems (RS) have focused more on devising effective exploration techniques, while not adequately addressing common exploitation challenges related to distributional changes and item cannibalization. Little work exists to guide the design of robust bandit frameworks that can address these frequent challenges in RS. In this paper, we propose a ne… ▽ More

    Submitted 24 September, 2023; originally announced October 2023.

    Comments: RecSys CARS 2023 Workshop paper

  3. arXiv:2111.01166  [pdf, other

    cs.LG math.DS stat.ML

    Dynamics of Local Elasticity During Training of Neural Nets

    Authors: Soham Dan, Anirbit Mukherjee, Avirup Das, Phanideep Gampa

    Abstract: In the recent past, a property of neural training trajectories in weight-space had been isolated, that of "local elasticity" (denoted as $S_{\rm rel}$). Local elasticity attempts to quantify the propagation of the influence of a sampled data point on the prediction at another data. In this work, we embark on a comprehensive study of the existing notion of $S_{\rm rel}$ and also propose a new defin… ▽ More

    Submitted 24 August, 2023; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: 40 pages (single column), the experiments have been significantly improved than the previous version

  4. arXiv:2006.16225  [pdf, other

    cs.LG stat.ML

    Object Files and Schemata: Factorizing Declarative and Procedural Knowledge in Dynamical Systems

    Authors: Anirudh Goyal, Alex Lamb, Phanideep Gampa, Philippe Beaudoin, Sergey Levine, Charles Blundell, Yoshua Bengio, Michael Mozer

    Abstract: Modeling a structured, dynamic environment like a video game requires keeping track of the objects and their states declarative knowledge) as well as predicting how objects behave (procedural knowledge). Black-box models with a monolithic hidden state often fail to apply procedural knowledge consistently and uniformly, i.e., they lack systematicity. For example, in a video game, correct prediction… ▽ More

    Submitted 12 November, 2020; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: Type/Token Distinction in Deep learning Framework

  5. arXiv:1910.10410  [pdf, other

    cs.IR cs.LG

    BanditRank: Learning to Rank Using Contextual Bandits

    Authors: Phanideep Gampa, Sumio Fujita

    Abstract: We propose an extensible deep learning method that uses reinforcement learning to train neural networks for offline ranking in information retrieval (IR). We call our method BanditRank as it treats ranking as a contextual bandit problem. In the domain of learning to rank for IR, current deep learning models are trained on objective functions different from the measures they are evaluated on. Since… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: 9 pages

  6. A Tractable Algorithm For Finite-Horizon Continuous Reinforcement Learning

    Authors: Phanideep Gampa, Sairam Satwik Kondamudi, Lakshmanan Kailasam

    Abstract: We consider the finite horizon continuous reinforcement learning problem. Our contribution is three-fold. First,we give a tractable algorithm based on optimistic value iteration for the problem. Next,we give a lower bound on regret of order $Ω(T^{2/3})$ for any algorithm discretizes the state space, improving the previous regret bound of $Ω(T^{1/2})$ of Ortner and Ryabko \cite{contrl} for the same… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

    Comments: InProceedings of International Conference on Intelligent Autonomous System, ICOIAS 2019