Showing 1–21 of 21 results for author: Lykouris, T

  1. arXiv:2406.06929  [pdf, ps, other]

    cs.GT

    Social Learning with Bounded Rationality: Negative Reviews Persist under Newest First

    Authors: Jackie Baek, Atanas Dinev, Thodoris Lykouris

    Abstract: We study a model of social learning from reviews where customers are computationally limited and make purchases based on reading only the first few reviews displayed by the platform. Under this bounded rationality, we establish that the review ordering policy can have a significant impact. In particular, the popular Newest First ordering induces a negative review to persist as the most recent revi…

    Submitted 22 August, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: An extended abstract appeared at the Twenty-Fifth ACM Conference on Economics and Computation (EC 2024)

  2. arXiv:2402.12237  [pdf, other]

    cs.LG cs.AI cs.GT cs.HC cs.PF

    Learning to Defer in Content Moderation: The Human-AI Interplay

    Authors: Thodoris Lykouris, Wentao Weng

    Abstract: Successful content moderation in online platforms relies on a human-AI collaboration approach. A typical heuristic estimates the expected harmfulness of a post and uses fixed thresholds to decide whether to remove it and whether to send it for human review. This disregards the prediction uncertainty, the time-varying element of human review capacity and post arrivals, and the selective sampling in…

    Submitted 2 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  3. arXiv:2308.07817  [pdf, other]

    cs.LG cs.DS cs.PF math.PR

    Quantifying the Cost of Learning in Queueing Systems

    Authors: Daniel Freund, Thodoris Lykouris, Wentao Weng

    Abstract: Queueing systems are widely applicable stochastic models with use cases in communication networks, healthcare, service systems, etc. Although their optimal control has been extensively studied, most existing approaches assume perfect knowledge of the system parameters. Of course, this assumption rarely holds in practice where there is parameter uncertainty, thus motivating a recent line of work on…

    Submitted 27 October, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: A condensed version of this work was accepted for presentation at the Conference on Neural Information Processing Systems (NeurIPS 2023). Compared to the first version of the paper, the current version expands the comparison with related work

  4. arXiv:2301.10642  [pdf, other]

    cs.GT

    Group fairness in dynamic refugee assignment

    Authors: Daniel Freund, Thodoris Lykouris, Elisabeth Paulson, Bradley Sturt, Wentao Weng

    Abstract: Ensuring that refugees and asylum seekers thrive (e.g., find employment) in their host countries is a profound humanitarian goal, and a primary driver of employment is the geographic location within a host country to which the refugee or asylum seeker is assigned. Recent research has proposed and implemented algorithms that assign refugees and asylum seekers to geographic locations in a manner tha…

    Submitted 11 January, 2024; v1 submitted 25 January, 2023; originally announced January 2023.

  5. arXiv:2208.09407  [pdf, other]

    cs.GT cs.DS cs.LG

    Learning in Stackelberg Games with Non-myopic Agents

    Authors: Nika Haghtalab, Thodoris Lykouris, Sloan Nietert, Alex Wei

    Abstract: We study Stackelberg games where a principal repeatedly interacts with a long-lived, non-myopic agent, without knowing the agent's payoff function. Although learning in Stackelberg games is well-understood when the agent is myopic, non-myopic agents pose additional complications. In particular, non-myopic agents may strategically select actions that are inferior in the present to mislead the princ…

    Submitted 19 August, 2022; originally announced August 2022.

    Comments: An extended abstract of this work appeared at the ACM Conference on Economics and Computation (EC) 2022

  6. arXiv:2206.03324  [pdf, other]

    cs.LG

    Efficient decentralized multi-agent learning in asymmetric bipartite queueing systems

    Authors: Daniel Freund, Thodoris Lykouris, Wentao Weng

    Abstract: We study decentralized multi-agent learning in bipartite queueing systems, a standard model for service systems. In particular, N agents request service from K servers in a fully decentralized way, i.e., by running the same algorithm without communication. Previous decentralized algorithms are restricted to symmetric systems, have performance that is degrading exponentially in the number of servers…

    Submitted 5 August, 2023; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: To appear in Operations Research. A preliminary version of this work was accepted for presentation at the Conference on Learning Theory (COLT) 2022. Compared to the first version of the paper, the current version expands upon the related work and adds intuition on the technical content

  7. arXiv:2107.01509  [pdf, other]

    cs.LG math.ST stat.ML

    Bayesian decision-making under misspecified priors with applications to meta-learning

    Authors: Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy, Daniel Hsu, Thodoris Lykouris, Miroslav Dudík, Robert E. Schapire

    Abstract: Thompson sampling and other Bayesian sequential decision-making algorithms are among the most popular approaches to tackle explore/exploit trade-offs in (contextual) bandits. The choice of prior in these algorithms offers flexibility to encode domain knowledge but can also lead to poor performance when misspecified. In this paper, we demonstrate that performance degrades gracefully with misspecifi…

    Submitted 3 July, 2021; originally announced July 2021.

  8. arXiv:2007.07990  [pdf, other]

    cs.GT cs.DS

    Static pricing for multi-unit prophet inequalities

    Authors: Shuchi Chawla, Nikhil Devanur, Thodoris Lykouris

    Abstract: We study a pricing problem where a seller has $k$ identical copies of a product, buyers arrive sequentially, and the seller prices the items aiming to maximize social welfare. When $k=1$, this is the so-called "prophet inequality" problem for which there is a simple pricing scheme achieving a competitive ratio of $1/2$. On the other end of the spectrum, as $k$ goes to infinity, the asymptotic perf…

    Submitted 20 June, 2023; v1 submitted 15 July, 2020; originally announced July 2020.

  9. arXiv:2006.05051  [pdf, other]

    cs.LG cs.AI cs.DS stat.ML

    Constrained episodic reinforcement learning in concave-convex and knapsack settings

    Authors: Kianté Brantley, Miroslav Dudík, Thodoris Lykouris, Sobhan Miryoosefi, Max Simchowitz, Aleksandrs Slivkins, Wen Sun

    Abstract: We propose an algorithm for tabular episodic reinforcement learning with constraints. We provide a modular analysis with strong theoretical guarantees for settings with concave rewards and convex constraints, and for settings with hard constraints (knapsacks). Most of the previous work in constrained reinforcement learning is limited to linear constraints, and the remaining work focuses on either…

    Submitted 5 June, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: The NeurIPS 2020 version of this paper includes a small bug, leading to an incorrect dependence on H in Theorem 3.4. This version fixes it by adjusting Eq. (9), Theorem 3.4 and the relevant proofs. Changes in the main text are noted in red. Changes in the appendix are limited to Appendices B.1, B.5, and B.6 and the statement of Lemma F.3

  10. arXiv:2003.02287  [pdf, other]

    cs.LG cs.GT stat.ML

    Bandits with adversarial scaling

    Authors: Thodoris Lykouris, Vahab Mirrokni, Renato Paes Leme

    Abstract: We study "adversarial scaling", a multi-armed bandit model where rewards have a stochastic and an adversarial component. Our model captures display advertising where the "click-through-rate" can be decomposed to a (fixed across time) arm-quality component and a non-stochastic user-relevance component (fixed across arms). Despite the relative stochasticity of our model, we demonstrate two settings…

    Submitted 28 August, 2020; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: Appeared in ICML 2020

  11. arXiv:2002.11650  [pdf, other]

    cs.LG cs.DS cs.GT econ.GN stat.ML

    Contextual Search in the Presence of Adversarial Corruptions

    Authors: Akshay Krishnamurthy, Thodoris Lykouris, Chara Podimata, Robert Schapire

    Abstract: We study contextual search, a generalization of binary search in higher dimensions, which captures settings such as feature-based dynamic pricing. Standard formulations of this problem assume that agents act in accordance with a specific homogeneous response model. In practice, however, some responses may be adversarially corrupted. Existing algorithms heavily depend on the assumed response model…

    Submitted 6 August, 2022; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: The first version was titled "Corrupted multidimensional binary search: Learning in the presence of irrational agents". An 8-page extended abstract titled "Contextual search in the presence of irrational agents" appeared at the 53rd ACM Symposium on the Theory of Computing (STOC '21)

  12. arXiv:1911.08689  [pdf, ps, other]

    cs.LG cs.AI cs.DS stat.ML

    Corruption-robust exploration in episodic reinforcement learning

    Authors: Thodoris Lykouris, Max Simchowitz, Aleksandrs Slivkins, Wen Sun

    Abstract: We initiate the study of multi-stage episodic reinforcement learning under adversarial corruptions in both the rewards and the transition probabilities of the underlying system extending recent results for the special case of stochastic bandits. We provide a framework which modifies the aggressive exploration enjoyed by existing reinforcement learning approaches based on "optimism in the face of u…

    Submitted 31 October, 2023; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Accepted in Mathematics of Operations Research. Preliminary version was accepted for presentation at COLT'21

  13. arXiv:1909.08375  [pdf, other]

    cs.LG cs.GT stat.ML

    Advancing subgroup fairness via sleeping experts

    Authors: Avrim Blum, Thodoris Lykouris

    Abstract: We study methods for improving fairness to subgroups in settings with overlapping populations and sequential predictions. Classical notions of fairness focus on the balance of some property across different populations. However, in many applications the goal of the different groups is not to be predicted equally but rather to be predicted well. We demonstrate that the task of satisfying this guara…

    Submitted 2 December, 2019; v1 submitted 18 September, 2019; originally announced September 2019.

    Comments: To appear in ITCS 2020

  14. arXiv:1905.09898  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Feedback graph regret bounds for Thompson Sampling and UCB

    Authors: Thodoris Lykouris, Eva Tardos, Drishti Wali

    Abstract: We study the stochastic multi-armed bandit problem with the graph-based feedback structure introduced by Mannor and Shamir. We analyze the performance of the two most prominent stochastic bandit algorithms, Thompson Sampling and Upper Confidence Bound (UCB), in the graph-based feedback setting. We show that these algorithms achieve regret guarantees that combine the graph structure and the gaps be…

    Submitted 14 February, 2020; v1 submitted 23 May, 2019; originally announced May 2019.

    Comments: Appeared in ALT 2020

  15. arXiv:1810.11829  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    On preserving non-discrimination when combining expert advice

    Authors: Avrim Blum, Suriya Gunasekar, Thodoris Lykouris, Nathan Srebro

    Abstract: We study the interplay between sequential decision making and avoiding discrimination against protected groups, when examples arrive online and do not follow distributional assumptions. We consider the most basic extension of classical online learning: "Given a class of predictors that are individually non-discriminatory with respect to a particular metric, how can we combine them to perform as we…

    Submitted 29 March, 2019; v1 submitted 28 October, 2018; originally announced October 2018.

    Comments: Appeared in NIPS 2018

  16. arXiv:1803.09353  [pdf, ps, other]

    cs.LG cs.DS cs.GT stat.ML

    Stochastic bandits robust to adversarial corruptions

    Authors: Thodoris Lykouris, Vahab Mirrokni, Renato Paes Leme

    Abstract: We introduce a new model of stochastic bandits with adversarial corruptions which aims to capture settings where most of the input follows a stochastic pattern but some fraction of it can be adversarially changed to trick the algorithm, e.g., click fraud, fake reviews and email spam. The goal of this model is to encourage the design of bandit algorithms that (i) work well in mixed adversarial and…

    Submitted 25 March, 2018; originally announced March 2018.

    Comments: To appear in STOC 2018

  17. arXiv:1802.05399  [pdf, other]

    cs.DS cs.LG

    Competitive caching with machine learned advice

    Authors: Thodoris Lykouris, Sergei Vassilvitskii

    Abstract: Traditional online algorithms encapsulate decision making under uncertainty, and give ways to hedge against all possible future events, while guaranteeing a nearly optimal solution as compared to an offline optimum. On the other hand, machine learning algorithms are in the business of extrapolating patterns found in the data to predict the future, and usually come with strong guarantees on the exp…

    Submitted 21 August, 2020; v1 submitted 14 February, 2018; originally announced February 2018.

    Comments: Preliminary versions appeared in ICML 18 and SysML 18. The current version improves the presentation of the suggested framework (Section 2.2), provides a more clear discussion on how it can be more broadly applied, and fixes some more minor presentation issues in other sections

  18. arXiv:1711.03639  [pdf, ps, other]

    cs.LG cs.DS

    Small-loss bounds for online learning with partial information

    Authors: Thodoris Lykouris, Karthik Sridharan, Eva Tardos

    Abstract: We consider the problem of adversarial (non-stochastic) online learning with partial information feedback, where at each round, a decision maker selects an action from a finite set of alternatives. We develop a black-box approach for such problems where the learner observes as feedback only losses of a subset of the actions that includes the selected action. When losses of actions are non-negative…

    Submitted 26 July, 2021; v1 submitted 9 November, 2017; originally announced November 2017.

    Comments: The current version represents the content that will appear in Mathematics of Operations Research. An extended abstract of the paper appeared at the 31st Annual Conference on Learning Theory (COLT 2018)

  19. arXiv:1608.06819  [pdf, ps, other]

    cs.GT cs.DS cs.SI math.OC

    Pricing and Optimization in Shared Vehicle Systems: An Approximation Framework

    Authors: Siddhartha Banerjee, Daniel Freund, Thodoris Lykouris

    Abstract: Optimizing shared vehicle systems (bike/scooter/car/ride-sharing) is more challenging compared to traditional resource allocation settings due to the presence of \emph{complex network externalities} -- changes in the demand/supply at any location affect future supply throughout the system within short timescales. These externalities are well captured by steady-state Markovian models, which are the…

    Submitted 10 May, 2021; v1 submitted 24 August, 2016; originally announced August 2016.

    Comments: The current version represents the content that will appear in Operations Research. A one-page abstract of the paper appeared at the 18th ACM Conference on Economics and Computation (EC 2017)

  20. arXiv:1606.06244  [pdf, ps, other]

    cs.GT cs.LG

    Learning in Games: Robustness of Fast Convergence

    Authors: Dylan J. Foster, Zhiyuan Li, Thodoris Lykouris, Karthik Sridharan, Eva Tardos

    Abstract: We show that learning algorithms satisfying a $\textit{low approximate regret}$ property experience fast convergence to approximate optimality in a large class of repeated games. Our property, which simply requires that each learner has small regret compared to a $(1+ε)$-multiplicative approximation to the best action in hindsight, is ubiquitous among learning algorithms; it is satisfied even by t…

    Submitted 16 December, 2016; v1 submitted 20 June, 2016; originally announced June 2016.

    Comments: 27 pages. NIPS 2016

  21. arXiv:1505.00391  [pdf, ps, other]

    cs.GT

    Learning and Efficiency in Games with Dynamic Population

    Authors: Thodoris Lykouris, Vasilis Syrgkanis, Eva Tardos

    Abstract: We study the quality of outcomes in repeated games when the population of players is dynamically changing and participants use learning algorithms to adapt to the changing environment. Game theory classically considers Nash equilibria of one-shot games, while in practice many games are played repeatedly, and in such games players often use algorithmic tools to learn to play in the given environmen…

    Submitted 22 May, 2020; v1 submitted 2 May, 2015; originally announced May 2015.

    Comments: Preliminary version appeared in ACM Symposium on Discrete Algorithms 2016 (SODA 2016). This version adds a major new result: asymptotic optimality of simultaneous second-price auctions with dynamic population. Presentation is significantly simplified and all results are presented parametrically