Zum Hauptinhalt springen

Showing 1–13 of 13 results for author: Phade, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.06951   

    cs.AI cs.LG

    AI For Global Climate Cooperation 2023 Competition Proceedings

    Authors: Yoshua Bengio, Prateek Gupta, Lu Li, Soham Phade, Sunil Srinivasa, Andrew Williams, Tianyu Zhang, Yang Zhang, Stephan Zheng

    Abstract: The international community must collaborate to mitigate climate change and sustain economic growth. However, collaboration is hard to achieve, partly because no global authority can ensure compliance with international climate agreements. Combining AI with climate-economic simulations offers a promising solution to design international frameworks, including negotiation protocols and climate agree… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  2. arXiv:2304.04668  [pdf, other

    cs.LG

    MERMAIDE: Learning to Align Learners using Model-Based Meta-Learning

    Authors: Arundhati Banerjee, Soham Phade, Stefano Ermon, Stephan Zheng

    Abstract: We study how a principal can efficiently and effectively intervene on the rewards of a previously unseen learning agent in order to induce desirable outcomes. This is relevant to many real-world settings like auctions or taxation, where the principal may not know the learning behavior nor the rewards of real people. Moreover, the principal should be few-shot adaptable and minimize the number of in… ▽ More

    Submitted 9 January, 2024; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: Published in TMLR

  3. arXiv:2212.06891  [pdf, other

    cs.LG cs.GT

    Interactive Learning with Pricing for Optimal and Stable Allocations in Markets

    Authors: Yigit Efe Erginbas, Soham Phade, Kannan Ramchandran

    Abstract: Large-scale online recommendation systems must facilitate the allocation of a limited number of items among competing users while learning their preferences from user feedback. As a principled way of incorporating market constraints and user incentives in the design, we consider our objectives to be two-fold: maximal social welfare with minimal instability. To maximize social welfare, our proposed… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2207.04143

  4. arXiv:2208.07004  [pdf, other

    cs.LG cs.MA

    AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N

    Authors: Tianyu Zhang, Andrew Williams, Soham Phade, Sunil Srinivasa, Yang Zhang, Prateek Gupta, Yoshua Bengio, Stephan Zheng

    Abstract: Comprehensive global cooperation is essential to limit global temperature increases while continuing economic development, e.g., reducing severe inequality or achieving long-term economic growth. Achieving long-term cooperation on climate change mitigation with n strategic agents poses a complex game-theoretic problem. For example, agents may negotiate and reach climate agreements, but there is no… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: 12 pages (21 with appendices), 5 figures. For associated working group, see https://www.ai4climatecoop.org/

    MSC Class: 93A16; 91-10; 68T07 ACM Class: I.2.11; J.2; J.4

  5. arXiv:2207.04143  [pdf, other

    cs.LG cs.GT cs.IR

    Interactive Recommendations for Optimal Allocations in Markets with Constraints

    Authors: Yigit Efe Erginbas, Soham Phade, Kannan Ramchandran

    Abstract: Recommendation systems when employed in markets play a dual role: they assist users in selecting their most desired items from a large pool and they help in allocating a limited number of items to the users who desire them the most. Despite the prevalence of capacity constraints on allocations in many real-world recommendation settings, a principled way of incorporating them in the design of these… ▽ More

    Submitted 28 July, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

  6. arXiv:2201.01163  [pdf, other

    cs.GT cs.AI cs.LG econ.GN

    Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning

    Authors: Michael Curry, Alexander Trott, Soham Phade, Yu Bai, Stephan Zheng

    Abstract: Real economies can be modeled as a sequential imperfect-information game with many heterogeneous agents, such as consumers, firms, and governments. Dynamic general equilibrium (DGE) models are often used for macroeconomic analysis in this setting. However, finding general equilibria is challenging using existing theoretical or computational methods, especially when using microfoundations to model… ▽ More

    Submitted 23 February, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

  7. arXiv:2101.08722  [pdf, other

    cs.GT econ.TH

    Mechanism Design for Cumulative Prospect Theoretic Agents: A General Framework and the Revelation Principle

    Authors: Soham R. Phade, Venkat Anantharam

    Abstract: This paper initiates a discussion of mechanism design when the participating agents exhibit preferences that deviate from expected utility theory (EUT). In particular, we consider mechanism design for systems where the agents are modeled as having cumulative prospect theory (CPT) preferences, which is a generalization of EUT preferences. We point out some of the key modifications needed in the the… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

  8. arXiv:2012.02125  [pdf, other

    cs.GT stat.ML

    On the Impossibility of Convergence of Mixed Strategies with No Regret Learning

    Authors: Vidya Muthukumar, Soham Phade, Anant Sahai

    Abstract: We study the limiting behavior of the mixed strategies that result from optimal no-regret learning strategies in a repeated game setting where the stage game is any 2 by 2 competitive game. We consider optimal no-regret algorithms that are mean-based and monotonic in their argument. We show that for any such algorithm, the limiting mixed strategies of the players cannot converge almost surely to a… ▽ More

    Submitted 2 March, 2022; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: 47 pages, 12 figures

  9. arXiv:2008.07793  [pdf, other

    cs.DC cs.GT

    Utility-based Resource Allocation and Pricing for Serverless Computing

    Authors: Vipul Gupta, Soham Phade, Thomas Courtade, Kannan Ramchandran

    Abstract: Serverless computing platforms currently rely on basic pricing schemes that are static and do not reflect customer feedback. This leads to significant inefficiencies from a total utility perspective. As one of the fastest-growing cloud services, serverless computing provides an opportunity to better serve both users and providers through the incorporation of market-based strategies for pricing and… ▽ More

    Submitted 24 January, 2022; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: 31 pages, 10 figures

  10. arXiv:2004.09592  [pdf, other

    econ.TH cs.GT

    Black-Box Strategies and Equilibrium for Games with Cumulative Prospect Theoretic Players

    Authors: Soham R. Phade, Venkat Anantharam

    Abstract: The betweenness property of preference relations states that a probability mixture of two lotteries should lie between them in preference. It is a weakened form of the independence property and hence satisfied in expected utility theory (EUT). Experimental violations of betweenness are well-documented and several preference theories, notably cumulative prospect theory (CPT), do not satisfy between… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

  11. arXiv:1812.00501  [pdf, ps, other

    econ.TH cs.GT cs.NI math.OC q-fin.RM

    Optimal Resource Allocation over Networks via Lottery-Based Mechanisms

    Authors: Soham R. Phade, Venkat Anantharam

    Abstract: We show that, in a resource allocation problem, the ex ante aggregate utility of players with cumulative-prospect-theoretic preferences can be increased over deterministic allocations by implementing lotteries. We formulate an optimization problem, called the system problem, to find the optimal lottery allocation. The system problem exhibits a two-layer structure comprised of a permutation profile… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

  12. arXiv:1804.08005  [pdf, other

    cs.GT

    Learning in Games with Cumulative Prospect Theoretic Preferences

    Authors: Soham R. Phade, Venkat Anantharam

    Abstract: We consider repeated games where the players behave according to cumulative prospect theory (CPT). We show that, when the players have calibrated strategies and behave according to CPT, the natural analog of the notion of correlated equilibrium in the CPT case, as defined by Keskin, is not enough to capture all subsequential limits of the empirical distribution of action play. We define the notion… ▽ More

    Submitted 16 July, 2020; v1 submitted 21 April, 2018; originally announced April 2018.

  13. On the Geometry of Nash and Correlated Equilibria with Cumulative Prospect Theoretic Preferences

    Authors: Soham R. Phade, Venkat Anantharam

    Abstract: It is known that the set of all correlated equilibria of an n-player non-cooperative game is a convex polytope and includes all the Nash equilibria. Further, the Nash equilibria all lie on the boundary of this polytope. We study the geometry of both these equilibrium notions when the players have cumulative prospect theoretic (CPT) preferences. The set of CPT correlated equilibria includes all the… ▽ More

    Submitted 3 December, 2017; originally announced December 2017.