Zum Hauptinhalt springen

Showing 1–50 of 120 results for author: Singla, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11334  [pdf, other

    cs.AI

    Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment

    Authors: Chao Wen, Jacqueline Staub, Adish Singla

    Abstract: Large language and multimodal models have shown remarkable successes on various benchmarks focused on specific skills such as general-purpose programming, natural language understanding, math word problem-solving, and visual question answering. However, it is unclear how well these models perform on tasks that require a combination of these skills. In this paper, we curate a novel program synthesi… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.09891  [pdf, other

    cs.AI

    Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming

    Authors: Victor-Alexandru Pădurean, Adish Singla

    Abstract: Generative models have demonstrated human-level proficiency in various benchmarks across domains like programming, natural sciences, and general knowledge. Despite these promising results on competitive benchmarks, they still struggle with seemingly simple problem-solving tasks typically carried out by elementary-level students. How do state-of-the-art models perform on standardized tests designed… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2406.05053  [pdf, other

    cs.LG cs.AI

    Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation

    Authors: Nachiket Kotalwar, Alkis Gotovos, Adish Singla

    Abstract: Generative AI and large language models hold great promise in enhancing programming education by generating individualized feedback and hints for learners. Recent works have primarily focused on improving the quality of generated feedback to achieve human tutors' quality. While quality is an important performance criterion, it is not the only criterion to optimize for real-world educational deploy… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  4. arXiv:2405.02481  [pdf, other

    cs.LG cs.AI

    Proximal Curriculum with Task Correlations for Deep Reinforcement Learning

    Authors: Georgios Tzannetos, Parameswaran Kamalaruban, Adish Singla

    Abstract: Curriculum design for reinforcement learning (RL) can speed up an agent's learning process and help it learn to perform well on complex tasks. However, existing techniques typically require domain-specific hyperparameter tuning, involve expensive optimization procedures for task selection, or are suitable only for specific learning objectives. In this work, we consider curriculum design in context… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: IJCAI'24 paper (longer version)

  5. arXiv:2405.02173  [pdf, other

    cs.HC cs.CY

    Task Synthesis for Elementary Visual Programming in XLogoOnline Environment

    Authors: Chao Wen, Ahana Ghosh, Jacqueline Staub, Adish Singla

    Abstract: In recent years, the XLogoOnline programming platform has gained popularity among novice learners. It integrates the Logo programming language with visual programming, providing a visual interface for learning computing concepts. However, XLogoOnline offers only a limited set of tasks, which are inadequate for learners to master the computing concepts that require sufficient practice. To address t… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Accepted as a paper at the AIED'24 conference in the late-breaking results track

  6. arXiv:2404.18978  [pdf, other

    cs.LG cs.AI cs.CY

    Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs

    Authors: Bahar Radmehr, Adish Singla, Tanja Käser

    Abstract: There has been a growing interest in developing learner models to enhance learning and teaching experiences in educational environments. However, existing works have primarily focused on structured environments relying on meticulously crafted representations of tasks, thereby limiting the agent's ability to generalize skills across tasks. In this paper, we aim to enhance the generalization capabil… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted as a full paper at EDM 2024: The 17th International Conference on Educational Data Mining, 14-17 of July 2024, Atlanta

  7. arXiv:2403.12227  [pdf, other

    cs.CY

    Analyzing-Evaluating-Creating: Assessing Computational Thinking and Problem Solving in Visual Programming Domains

    Authors: Ahana Ghosh, Liina Malva, Adish Singla

    Abstract: Computational thinking (CT) and problem-solving skills are increasingly integrated into K-8 school curricula worldwide. Consequently, there is a growing need to develop reliable assessments for measuring students' proficiency in these skills. Recent works have proposed tests for assessing these skills across various CT concepts and practices, in particular, based on multi-choice items enabling psy… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: This extended version of the SIGCSE 2024 paper includes all 21 test items from ACE along with their answers in the appendix

  8. arXiv:2403.07933  [pdf, other

    cs.GT cs.LG

    Corruption-Robust Offline Two-Player Zero-Sum Markov Games

    Authors: Andi Nika, Debmalya Mandal, Adish Singla, Goran Radanović

    Abstract: We study data corruption robustness in offline two-player zero-sum Markov games. Given a dataset of realized trajectories of two players, an adversary is allowed to modify an $ε$-fraction of it. The learner's goal is to identify an approximate Nash Equilibrium policy pair from the corrupted data. We consider this problem in linear Markov games under different degrees of data coverage and corruptio… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  9. arXiv:2403.01857  [pdf, ps, other

    cs.LG

    Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences

    Authors: Andi Nika, Debmalya Mandal, Parameswaran Kamalaruban, Georgios Tzannetos, Goran Radanović, Adish Singla

    Abstract: In this paper, we take a step towards a deeper understanding of learning from human preferences by systematically comparing the paradigm of reinforcement learning from human feedback (RLHF) with the recently proposed paradigm of direct preference optimization (DPO). We focus our attention on the class of loglinear policy parametrization and linear reward functions. In order to compare the two para… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  10. arXiv:2402.07019  [pdf, other

    cs.LG

    Informativeness of Reward Functions in Reinforcement Learning

    Authors: Rati Devidze, Parameswaran Kamalaruban, Adish Singla

    Abstract: Reward functions are central in specifying the task we want a reinforcement learning agent to perform. Given a task and desired optimal behavior, we study the problem of designing informative reward functions so that the designed rewards speed up the agent's convergence. In particular, we consider expert-driven reward design settings where an expert or teacher seeks to provide informative and inte… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: Longer version of the AAMAS'24 paper

  11. arXiv:2402.06734  [pdf, ps, other

    cs.LG cs.AI

    Corruption Robust Offline Reinforcement Learning with Human Feedback

    Authors: Debmalya Mandal, Andi Nika, Parameswaran Kamalaruban, Adish Singla, Goran Radanović

    Abstract: We study data corruption robustness for reinforcement learning with human feedback (RLHF) in an offline setting. Given an offline dataset of pairs of trajectories along with feedback about human preferences, an $\varepsilon$-fraction of the pairs is corrupted (e.g., feedback flipped or trajectory features manipulated), capturing an adversarial attack or noisy human preferences. We aim to design al… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  12. arXiv:2402.01580  [pdf, other

    cs.CY cs.AI

    Generative AI for Education (GAIED): Advances, Opportunities, and Challenges

    Authors: Paul Denny, Sumit Gulwani, Neil T. Heffernan, Tanja Käser, Steven Moore, Anna N. Rafferty, Adish Singla

    Abstract: This survey article has grown out of the GAIED (pronounced "guide") workshop organized by the authors at the NeurIPS 2023 conference. We organized the GAIED workshop as part of a community-building effort to bring together researchers, educators, and practitioners to explore the potential of generative AI for enhancing education. This article aims to provide an overview of the workshop activities… ▽ More

    Submitted 6 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  13. arXiv:2312.16365  [pdf, other

    cs.LG cs.AI stat.ML

    Active Third-Person Imitation Learning

    Authors: Timo Klein, Susanna Weinberger, Adish Singla, Sebastian Tschiatschek

    Abstract: We consider the problem of third-person imitation learning with the additional challenge that the learner must select the perspective from which they observe the expert. In our setting, each perspective provides only limited information about the expert's behavior, and the learning agent must carefully select and combine information from different perspectives to achieve competitive performance. T… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  14. arXiv:2312.16119  [pdf, other

    cs.LG cs.CL cs.NE

    A bi-objective $ε$-constrained framework for quality-cost optimization in language model ensembles

    Authors: Aditi Singla, Aditya Singh, Kanishk Kukreja

    Abstract: We propose an ensembling framework that uses diverse open-sourced Large Language Models (LLMs) to achieve high response quality while maintaining cost efficiency. We formulate a bi-objective optimization problem to represent the quality-cost tradeoff and then introduce an additional budget constraint that reduces the problem to a straightforward 0/1 knapsack problem. We empirically demonstrate tha… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  15. arXiv:2311.15399  [pdf, other

    cs.LG cs.AI

    Optimally Teaching a Linear Behavior Cloning Agent

    Authors: Shubham Kumar Bharti, Stephen Wright, Adish Singla, Xiaojin Zhu

    Abstract: We study optimal teaching of Linear Behavior Cloning (LBC) learners. In this setup, the teacher can select which states to demonstrate to an LBC learner. The learner maintains a version space of infinite linear hypotheses consistent with the demonstration. The goal of the teacher is to teach a realizable target policy to the learner using minimum number of state demonstrations. This number is know… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  16. arXiv:2310.10690  [pdf, other

    cs.CL cs.AI

    Large Language Models for In-Context Student Modeling: Synthesizing Student's Behavior in Visual Programming

    Authors: Manh Hung Nguyen, Sebastian Tschiatschek, Adish Singla

    Abstract: Student modeling is central to many educational technologies as it enables predicting future learning outcomes and designing targeted instructional strategies. However, open-ended learning domains pose challenges for accurately modeling students due to the diverse behaviors and a large space of possible misconceptions. To approach these challenges, we explore the application of large language mode… ▽ More

    Submitted 3 May, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: Published in International Conference on Educational Data Mining (EDM) 2024

  17. arXiv:2310.03780  [pdf, other

    cs.AI

    Automating Human Tutor-Style Programming Feedback: Leveraging GPT-4 Tutor Model for Hint Generation and GPT-3.5 Student Model for Hint Validation

    Authors: Tung Phung, Victor-Alexandru Pădurean, Anjali Singh, Christopher Brooks, José Cambronero, Sumit Gulwani, Adish Singla, Gustavo Soares

    Abstract: Generative AI and large language models hold great promise in enhancing programming education by automatically generating individualized feedback for students. We investigate the role of generative AI models in providing human tutor-style programming hints to help students resolve errors in their buggy programs. Recent works have benchmarked state-of-the-art models for various feedback generation… ▽ More

    Submitted 6 August, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Published in Learning Analytics and Knowledge Conference (LAK) 2024

  18. arXiv:2308.02522  [pdf, other

    cs.LG cs.AI cs.CY

    Evaluating ChatGPT and GPT-4 for Visual Programming

    Authors: Adish Singla

    Abstract: Generative AI and large language models have the potential to drastically improve the landscape of computing education by automatically generating personalized feedback and content. Recent works have studied the capabilities of these models for different programming education scenarios; however, these works considered only text-based programming, in particular, Python programming. Consequently, th… ▽ More

    Submitted 30 July, 2023; originally announced August 2023.

    Comments: This article is a full version of the poster (extended abstract) from ICER'23

  19. arXiv:2306.17156  [pdf, other

    cs.CY cs.AI cs.CL

    Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors

    Authors: Tung Phung, Victor-Alexandru Pădurean, José Cambronero, Sumit Gulwani, Tobias Kohn, Rupak Majumdar, Adish Singla, Gustavo Soares

    Abstract: Generative AI and large language models hold great promise in enhancing computing education by powering next-generation educational technologies for introductory programming. Recent works have studied these models for different scenarios relevant to programming education; however, these works are limited for several reasons, as they typically consider already outdated models or only specific scena… ▽ More

    Submitted 31 July, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: This article is a full version of the poster (extended abstract) from ICER'23

  20. arXiv:2306.03311  [pdf, other

    cs.LG cs.AI

    Learning Embeddings for Sequential Tasks Using Population of Agents

    Authors: Mridul Mahajan, Georgios Tzannetos, Goran Radanovic, Adish Singla

    Abstract: We present an information-theoretic framework to learn fixed-dimensional embeddings for tasks in reinforcement learning. We leverage the idea that two tasks are similar if observing an agent's performance on one task reduces our uncertainty about its performance on the other. This intuition is captured by our information-theoretic criterion which uses a diverse agent population as an approximation… ▽ More

    Submitted 8 May, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: IJCAI'24 paper (longer version)

  21. arXiv:2306.00262  [pdf, other

    cs.CV cs.LG

    Maximal Domain Independent Representations Improve Transfer Learning

    Authors: Adrian Shuai Li, Elisa Bertino, Xuan-Hong Dang, Ankush Singla, Yuhai Tu, Mark N Wegman

    Abstract: The most effective domain adaptation (DA) involves the decomposition of data representation into a domain independent representation (DIRep), and a domain dependent representation (DDRep). A classifier is trained by using the DIRep of the labeled source images. Since the DIRep is domain invariant, the classifier can be "transferred" to make predictions for the target domain with no (or few) labels… ▽ More

    Submitted 6 June, 2024; v1 submitted 31 May, 2023; originally announced June 2023.

  22. arXiv:2305.18342  [pdf, other

    cs.LG cs.AI cs.CL cs.CY cs.PL

    Neural Task Synthesis for Visual Programming

    Authors: Victor-Alexandru Pădurean, Georgios Tzannetos, Adish Singla

    Abstract: Generative neural models hold great promise in enhancing programming education by synthesizing new content. We seek to design neural models that can automatically generate programming tasks for a given specification in the context of visual programming domains. Despite the recent successes of large generative models like GPT-4, our initial results show that these models are ineffective in synthesi… ▽ More

    Submitted 14 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR) 2024

  23. arXiv:2305.17518  [pdf, other

    cs.AI cs.CY

    Synthesizing a Progression of Subtasks for Block-Based Visual Programming Tasks

    Authors: Alperen Tercan, Ahana Ghosh, Hasan Ferit Eniser, Maria Christakis, Adish Singla

    Abstract: Block-based visual programming environments play an increasingly important role in introducing computing concepts to K-12 students. In recent years, they have also gained popularity in neuro-symbolic AI, serving as a benchmark to evaluate general problem-solving and logical reasoning skills. The open-ended and conceptual nature of these visual programming tasks make them challenging, both for stat… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

  24. arXiv:2304.12877  [pdf, other

    cs.LG

    Proximal Curriculum for Reinforcement Learning Agents

    Authors: Georgios Tzannetos, Bárbara Gomes Ribeiro, Parameswaran Kamalaruban, Adish Singla

    Abstract: We consider the problem of curriculum design for reinforcement learning (RL) agents in contextual multi-task settings. Existing techniques on automatic curriculum design typically require domain-specific hyperparameter tuning or have limited theoretical underpinnings. To tackle these limitations, we design our curriculum strategy, ProCuRL, inspired by the pedagogical concept of Zone of Proximal De… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR) 2023

  25. arXiv:2303.16359  [pdf, ps, other

    cs.AI cs.CY cs.PL

    Adaptive Scaffolding in Block-Based Programming via Synthesizing New Tasks as Pop Quizzes

    Authors: Ahana Ghosh, Sebastian Tschiatschek, Sam Devlin, Adish Singla

    Abstract: Block-based programming environments are increasingly used to introduce computing concepts to beginners. However, novice students often struggle in these environments, given the conceptual and open-ended nature of programming tasks. To effectively support a student struggling to solve a given task, it is important to provide adaptive scaffolding that guides the student towards a solution. We intro… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: Preprint. Accepted as a paper at the AIED'22 conference

  26. arXiv:2302.13851  [pdf, other

    cs.LG cs.AI cs.CR cs.MA

    Implicit Poisoning Attacks in Two-Agent Reinforcement Learning: Adversarial Policies for Training-Time Attacks

    Authors: Mohammad Mohammadi, Jonathan Nöther, Debmalya Mandal, Adish Singla, Goran Radanovic

    Abstract: In targeted poisoning attacks, an attacker manipulates an agent-environment interaction to force the agent into adopting a policy of interest, called target policy. Prior work has primarily focused on attacks that modify standard MDP primitives, such as rewards or transitions. In this paper, we study targeted poisoning attacks in a two-agent setting where an attacker implicitly poisons the effecti… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  27. arXiv:2302.04662  [pdf, other

    cs.PL cs.AI cs.CL

    Generating High-Precision Feedback for Programming Syntax Errors using Large Language Models

    Authors: Tung Phung, José Cambronero, Sumit Gulwani, Tobias Kohn, Rupak Majumdar, Adish Singla, Gustavo Soares

    Abstract: Large language models (LLMs), such as Codex, hold great promise in enhancing programming education by automatically generating feedback for students. We investigate using LLMs to generate feedback for fixing syntax errors in Python programs, a key scenario in introductory programming. More concretely, given a student's buggy program, our goal is to generate feedback comprising a fixed program alon… ▽ More

    Submitted 28 April, 2023; v1 submitted 24 January, 2023; originally announced February 2023.

    Comments: Published in International Conference on Educational Data Mining (EDM) 2023

  28. arXiv:2302.03608  [pdf, other

    cs.LG

    Online Reinforcement Learning with Uncertain Episode Lengths

    Authors: Debmalya Mandal, Goran Radanovic, Jiarui Gan, Adish Singla, Rupak Majumdar

    Abstract: Existing episodic reinforcement algorithms assume that the length of an episode is fixed across time and known a priori. In this paper, we consider a general framework of episodic reinforcement learning when the length of each episode is drawn from a distribution. We first establish that this problem is equivalent to online reinforcement learning with general discounting where the learner is tryin… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: To appear at AAAI-2023

  29. arXiv:2212.13442  [pdf, other

    eess.IV cs.MM cs.SD eess.AS

    Audiovisual Database with 360 Video and Higher-Order Ambisonics Audio for Perception, Cognition, Behavior, and QoE Evaluation Research

    Authors: Thomas Robotham, Ashutosh Singla, Olli S. Rummukainen, Alexander Raake, Emanuël A. P. Habets

    Abstract: Research into multi-modal perception, human cognition, behavior, and attention can benefit from high-fidelity content that may recreate real-life-like scenes when rendered on head-mounted displays. Moreover, aspects of audiovisual perception, cognitive processes, and behavior may complement questionnaire-based Quality of Experience (QoE) evaluation of interactive virtual environments. Currently, t… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

    Comments: 6 pages, 2 figures, accepted and presented at the 2022 14th International Conference on Quality of Multimedia Experience (QoMEX). Database is publicly accessible at https://qoevave.github.io/database/

  30. arXiv:2211.10530  [pdf, other

    cs.LG cs.AI

    Provable Defense against Backdoor Policies in Reinforcement Learning

    Authors: Shubham Kumar Bharti, Xuezhou Zhang, Adish Singla, Xiaojin Zhu

    Abstract: We propose a provable defense mechanism against backdoor policies in reinforcement learning under subspace trigger assumption. A backdoor policy is a security threat where an adversary publishes a seemingly well-behaved policy which in fact allows hidden triggers. During deployment, the adversary can modify observed states in a particular way to trigger unexpected actions and harm the agent. We as… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: Accepted at Neurips 2022

  31. Multiple Instance Neuroimage Transformer

    Authors: Ayush Singla, Qingyu Zhao, Daniel K. Do, Yuyin Zhou, Kilian M. Pohl, Ehsan Adeli

    Abstract: For the first time, we propose using a multiple instance learning based convolution-free transformer model, called Multiple Instance Neuroimage Transformer (MINiT), for the classification of T1weighted (T1w) MRIs. We first present several variants of transformer models adopted for neuroimages. These models extract non-overlapping 3D blocks from the input volume and perform multi-headed self-attent… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

  32. arXiv:2208.08429  [pdf, other

    cs.NI

    New primitives for bounded degradation in network service

    Authors: Simon Kassing, Vojislav Dukic, Ce Zhang, Ankit Singla

    Abstract: Certain new ascendant data center workloads can absorb some degradation in network service, not needing fully reliable data transport and/or their fair-share of network bandwidth. This opens up opportunities for superior network and infrastructure multiplexing by having this flexible traffic cede capacity under congestion to regular traffic with stricter needs. We posit there is opportunity in net… ▽ More

    Submitted 17 August, 2022; originally announced August 2022.

    ACM Class: C.2.2

  33. arXiv:2206.06054  [pdf, other

    cs.LG cs.SE

    Specifying and Testing $k$-Safety Properties for Machine-Learning Models

    Authors: Maria Christakis, Hasan Ferit Eniser, Jörg Hoffmann, Adish Singla, Valentin Wüstholz

    Abstract: Machine-learning models are becoming increasingly prevalent in our lives, for instance assisting in image-classification or decision-making tasks. Consequently, the reliability of these models is of critical importance and has resulted in the development of numerous approaches for validating and verifying their robustness and fairness. However, beyond such specific properties, it is challenging to… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

  34. arXiv:2205.02333  [pdf, other

    cs.LG

    Equity and Fairness of Bayesian Knowledge Tracing

    Authors: Sebastian Tschiatschek, Maria Knobelsdorf, Adish Singla

    Abstract: We consider the equity and fairness of curricula derived from Knowledge Tracing models. We begin by defining a unifying notion of an equitable tutoring system as a system that achieves maximum possible knowledge in minimal time for each student interacting with it. Realizing perfect equity requires tutoring systems that can provide individualized curricula per student. In particular, we investigat… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

  35. arXiv:2205.01265  [pdf, other

    cs.AI cs.CY cs.LG

    From {Solution Synthesis} to {Student Attempt Synthesis} for Block-Based Visual Programming Tasks

    Authors: Adish Singla, Nikitas Theodoropoulos

    Abstract: Block-based visual programming environments are increasingly used to introduce computing concepts to beginners. Given that programming tasks are open-ended and conceptual, novice students often struggle when learning in these environments. AI-driven programming tutors hold great promise in automatically assisting struggling students, and need several components to realize this potential. We invest… ▽ More

    Submitted 28 March, 2023; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: Longer version of EDM 2022 paper

  36. arXiv:2204.00302  [pdf, other

    cs.AI

    Actual Causality and Responsibility Attribution in Decentralized Partially Observable Markov Decision Processes

    Authors: Stelios Triantafyllou, Adish Singla, Goran Radanovic

    Abstract: Actual causality and a closely related concept of responsibility attribution are central to accountable decision making. Actual causality focuses on specific outcomes and aims to identify decisions (actions) that were critical in realizing an outcome of interest. Responsibility attribution is complementary and aims to identify the extent to which decision makers (agents) are responsible for this o… ▽ More

    Submitted 9 August, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: In Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society (AIES22)

  37. arXiv:2202.09112  [pdf, other

    cs.NI

    Prepare your video for streaming with Segue

    Authors: Melissa Licciardello, Lukas Humbel, Fabian Rohr, Maximilian Grüner, Ankit Singla

    Abstract: We identify new opportunities in video streaming, involving the joint consideration of offline video chunking and online rate adaptation. Due to a video's complexity varying over time, certain parts are more likely to cause performance impairments during playback with a particular rate adaptation algorithm. To address such an issue, we propose Segue, which carefully uses variable-length video segm… ▽ More

    Submitted 12 April, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    ACM Class: C.2.0; C.2.2

  38. arXiv:2201.02185  [pdf, other

    cs.LG cs.AI

    Admissible Policy Teaching through Reward Design

    Authors: Kiarash Banihashem, Adish Singla, Jiarui Gan, Goran Radanovic

    Abstract: We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy from a set of admissible policies. The goal of the reward designer is to modify the underlying reward function cost-efficiently while ensuring that any approximately optimal deterministic policy under the new reward function is admissible and performs well under the original reward function. This p… ▽ More

    Submitted 6 January, 2022; originally announced January 2022.

  39. arXiv:2110.14888  [pdf, other

    cs.LG stat.ML

    Teaching an Active Learner with Contrastive Examples

    Authors: Chaoqi Wang, Adish Singla, Yuxin Chen

    Abstract: We study the problem of active learning with the added twist that the learner is assisted by a helpful teacher. We consider the following natural interaction protocol: At each round, the learner proposes a query asking for the label of an instance $x^q$, the teacher provides the requested label $\{x^q, y^q\}$ along with explanatory information to guide the learning process. In this paper, we view… ▽ More

    Submitted 10 December, 2021; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: Fix the illustrative example

  40. arXiv:2110.12020  [pdf, ps, other

    cs.LG cs.CY

    Fairness Degrading Adversarial Attacks Against Clustering Algorithms

    Authors: Anshuman Chhabra, Adish Singla, Prasant Mohapatra

    Abstract: Clustering algorithms are ubiquitous in modern data science pipelines, and are utilized in numerous fields ranging from biology to facility location. Due to their widespread use, especially in societal resource allocation problems, recent research has aimed at making clustering algorithms fair, with great success. Furthermore, it has also been shown that clustering algorithms, much like other mach… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

    Comments: Accepted at AFCR workshop, NeurIPS 2021

  41. arXiv:2109.11328  [pdf, other

    cs.LG

    Reinforcement Learning Under Algorithmic Triage

    Authors: Eleni Straitouri, Adish Singla, Vahid Balazadeh Meresht, Manuel Gomez-Rodriguez

    Abstract: Methods to learn under algorithmic triage have predominantly focused on supervised learning settings where each decision, or prediction, is independent of each other. Under algorithmic triage, a supervised learning model predicts a fraction of the instances and humans predict the remaining ones. In this work, we take a first step towards developing reinforcement learning models that are optimized… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

  42. arXiv:2107.11927  [pdf, other

    cs.AI

    On Blame Attribution for Accountable Multi-Agent Sequential Decision Making

    Authors: Stelios Triantafyllou, Adish Singla, Goran Radanovic

    Abstract: Blame attribution is one of the key aspects of accountable decision making, as it provides means to quantify the responsibility of an agent for a decision making outcome. In this paper, we study blame attribution in the context of cooperative multi-agent sequential decision making. As a particular setting of interest, we focus on cooperative decision making formalized by Multi-Agent Markov Decisio… ▽ More

    Submitted 25 January, 2022; v1 submitted 25 July, 2021; originally announced July 2021.

    Comments: NeurIPS 2021

  43. arXiv:2107.08828  [pdf, ps, other

    cs.LG cs.AI

    Reinforcement Learning for Education: Opportunities and Challenges

    Authors: Adish Singla, Anna N. Rafferty, Goran Radanovic, Neil T. Heffernan

    Abstract: This survey article has grown out of the RL4ED workshop organized by the authors at the Educational Data Mining (EDM) 2021 conference. We organized this workshop as part of a community-building effort to bring together researchers and practitioners interested in the broad areas of reinforcement learning (RL) and education (ED). This article aims to provide an overview of the workshop activities an… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

  44. arXiv:2106.05137  [pdf, ps, other

    cs.GT

    Bayesian Persuasion in Sequential Decision-Making

    Authors: Jiarui Gan, Rupak Majumdar, Goran Radanovic, Adish Singla

    Abstract: We study a dynamic model of Bayesian persuasion in sequential decision-making settings. An informed principal observes an external parameter of the world and advises an uninformed agent about actions to take over time. The agent takes actions in each time step based on the current state, the principal's advice/signal, and beliefs about the external parameter. The action of the agent updates the st… ▽ More

    Submitted 24 May, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

  45. arXiv:2106.04696  [pdf, other

    cs.LG cs.AI

    Curriculum Design for Teaching via Demonstrations: Theory and Applications

    Authors: Gaurav Yengera, Rati Devidze, Parameswaran Kamalaruban, Adish Singla

    Abstract: We consider the problem of teaching via demonstrations in sequential decision-making settings. In particular, we study how to design a personalized curriculum over demonstrations to speed up the learner's convergence. We provide a unified curriculum strategy for two popular learner models: Maximum Causal Entropy Inverse Reinforcement Learning (MaxEnt-IRL) and Cross-Entropy Behavioral Cloning (Cros… ▽ More

    Submitted 15 December, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  46. arXiv:2106.00600  [pdf, other

    cs.LG

    Fair Clustering Using Antidote Data

    Authors: Anshuman Chhabra, Adish Singla, Prasant Mohapatra

    Abstract: Clustering algorithms are widely utilized for many modern data science applications. This motivates the need to make outputs of clustering algorithms fair. Traditionally, new fair algorithmic variants to clustering algorithms are developed for specific notions of fairness. However, depending on the application context, different definitions of fairness might need to be employed. As a result, new a… ▽ More

    Submitted 22 October, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: Accepted at AFCR workshop, NeurIPS 2021

  47. arXiv:2105.12663  [pdf, other

    cs.NI cs.DC cs.PF

    Towards Million-Server Network Simulations on Just a Laptop

    Authors: Maciej Besta, Marcel Schneider, Salvatore Di Girolamo, Ankit Singla, Torsten Hoefler

    Abstract: The growing size of data center and HPC networks pose unprecedented requirements on the scalability of simulation infrastructure. The ability to simulate such large-scale interconnects on a simple PC would facilitate research efforts. Unfortunately, as we first show in this work, existing shared-memory packet-level simulators do not scale to the sizes of the largest networks considered today. We t… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

  48. Towards Demystifying Serverless Machine Learning Training

    Authors: Jiawei Jiang, Shaoduo Gan, Yue Liu, Fanlin Wang, Gustavo Alonso, Ana Klimovic, Ankit Singla, Wentao Wu, Ce Zhang

    Abstract: The appeal of serverless (FaaS) has triggered a growing interest on how to use it in data-intensive applications such as ETL, query processing, or machine learning (ML). Several systems exist for training large-scale ML models on top of serverless infrastructures (e.g., AWS Lambda) but with inconclusive results in terms of their performance and relative advantage over "serverful" infrastructures (… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

  49. Loss-Aversively Fair Classification

    Authors: Junaid Ali, Muhammad Bilal Zafar, Adish Singla, Krishna P. Gummadi

    Abstract: The use of algorithmic (learning-based) decision making in scenarios that affect human lives has motivated a number of recent studies to investigate such decision making systems for potential unfairness, such as discrimination against subjects based on their sensitive features like gender or race. However, when judging the fairness of a newly designed decision making system, these studies have ove… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: 8 pages, Accepted at AIES 2019

    Journal ref: In AAAI/ACM Conference on AI, Ethics, and Society (AIES 2019), January 27-28 2019 Honolulu, HI, USA

  50. arXiv:2103.16437  [pdf, other

    cs.CR cs.NI

    Order P4-66: Characterizing and mitigating surreptitious programmable network device exploitation

    Authors: Simon Kassing, Hussain Abbas, Laurent Vanbever, Ankit Singla

    Abstract: Substantial efforts are invested in improving network security, but the threat landscape is rapidly evolving, particularly with the recent interest in programmable network hardware. We explore a new security threat, from an attacker who has gained control of such devices. While it should be obvious that such attackers can trivially cause substantial damage, the challenge and novelty are in doing s… ▽ More

    Submitted 27 May, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: 14 pages, 13 figures, 1 table

    ACM Class: C.2.3