Zum Hauptinhalt springen

Showing 1–19 of 19 results for author: Biswas, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.14491  [pdf, other

    cs.LG cs.MM

    Multimodal Methods for Analyzing Learning and Training Environments: A Systematic Literature Review

    Authors: Clayton Cohn, Eduardo Davalos, Caleb Vatral, Joyce Horn Fonteles, Hanchen David Wang, Meiyi Ma, Gautam Biswas

    Abstract: Recent technological advancements have enhanced our ability to collect and analyze rich multimodal data (e.g., speech, video, and eye gaze) to better inform learning and training experiences. While previous reviews have focused on parts of the multimodal pipeline (e.g., conceptual models and data fusion), a comprehensive literature review on the methods informing multimodal learning and training e… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Submitted to ACM Computing Surveys. Currently under review

  2. arXiv:2407.08021  [pdf, other

    cs.MA

    Field Deployment of Multi-Agent Reinforcement Learning Based Variable Speed Limit Controllers

    Authors: Yuhang Zhang, Zhiyao Zhang, Marcos Quiñones-Grueiro, William Barbour, Clay Weston, Gautam Biswas, Daniel Work

    Abstract: This article presents the first field deployment of a multi-agent reinforcement-learning (MARL) based variable speed limit (VSL) control system on the I-24 freeway near Nashville, Tennessee. We describe how we train MARL agents in a traffic simulator and directly deploy the simulation-based policy on a 17-mile stretch of Interstate 24 with 67 VSL controllers. We use invalid action masking and seve… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  3. arXiv:2406.15283  [pdf, other

    cs.LG

    FT-AED: Benchmark Dataset for Early Freeway Traffic Anomalous Event Detection

    Authors: Austin Coursey, Junyi Ji, Marcos Quinones-Grueiro, William Barbour, Yuhang Zhang, Tyler Derr, Gautam Biswas, Daniel B. Work

    Abstract: Early and accurate detection of anomalous events on the freeway, such as accidents, can improve emergency response and clearance. However, existing delays and errors in event identification and reporting make it a difficult problem to solve. Current large-scale freeway traffic datasets are not designed for anomaly detection and ignore these challenges. In this paper, we introduce the first large-s… ▽ More

    Submitted 24 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  4. arXiv:2406.11003  [pdf, other

    cs.CV cs.AI

    3D Gaze Tracking for Studying Collaborative Interactions in Mixed-Reality Environments

    Authors: Eduardo Davalos, Yike Zhang, Ashwin T. S., Joyce H. Fonteles, Umesh Timalsina, Guatam Biswas

    Abstract: This study presents a novel framework for 3D gaze tracking tailored for mixed-reality settings, aimed at enhancing joint attention and collaborative efforts in team-based scenarios. Conventional gaze tracking, often limited by monocular cameras and traditional eye-tracking apparatus, struggles with simultaneous data synchronization and analysis from multiple participants in group contexts. Our pro… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 9 pages, 8 figures, conference, submitted to ICMI 2024

  5. arXiv:2405.06203  [pdf, other

    cs.AI

    A First Step in Using Machine Learning Methods to Enhance Interaction Analysis for Embodied Learning Environments

    Authors: Joyce Fonteles, Eduardo Davalos, Ashwin T. S., Yike Zhang, Mengxi Zhou, Efrat Ayalon, Alicia Lane, Selena Steinberg, Gabriella Anton, Joshua Danish, Noel Enyedy, Gautam Biswas

    Abstract: Investigating children's embodied learning in mixed-reality environments, where they collaboratively simulate scientific processes, requires analyzing complex multimodal data to interpret their learning and coordination behaviors. Learning scientists have developed Interaction Analysis (IA) methodologies for analyzing such data, but this requires researchers to watch hours of videos to extract and… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  6. Towards A Human-in-the-Loop LLM Approach to Collaborative Discourse Analysis

    Authors: Clayton Cohn, Caitlin Snyder, Justin Montenegro, Gautam Biswas

    Abstract: LLMs have demonstrated proficiency in contextualizing their outputs using human input, often matching or beating human-level performance on a variety of tasks. However, LLMs have not yet been used to characterize synergistic learning in students' collaborative discourse. In this exploratory work, we take a first step towards adopting a human-in-the-loop prompt engineering approach with GPT-4-Turbo… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: In press at the 25th international conference on Artificial Intelligence in Education (AIED) Late-Breaking Results (LBR) track

  7. A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science

    Authors: Clayton Cohn, Nicole Hutchins, Tuan Le, Gautam Biswas

    Abstract: This paper explores the use of large language models (LLMs) to score and explain short-answer assessments in K-12 science. While existing methods can score more structured math and computer science assessments, they often do not provide explanations for the scores. Our study focuses on employing GPT-4 for automated assessment in middle school Earth Science, combining few-shot and active learning w… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: In press at EAAI-24: The 14th Symposium on Educational Advances in Artificial Intelligence

  8. arXiv:2310.12359  [pdf, other

    cs.MA cs.LG

    MARVEL: Multi-Agent Reinforcement-Learning for Large-Scale Variable Speed Limits

    Authors: Yuhang Zhang, Marcos Quinones-Grueiro, Zhiyao Zhang, Yanbing Wang, William Barbour, Gautam Biswas, Daniel Work

    Abstract: Variable Speed Limit (VSL) control acts as a promising highway traffic management strategy with worldwide deployment, which can enhance traffic safety by dynamically adjusting speed limits according to real-time traffic conditions. Most of the deployed VSL control algorithms so far are rule-based, lacking generalizability under varying and complex traffic scenarios. In this work, we propose MARVEL… ▽ More

    Submitted 17 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

  9. arXiv:2305.12543  [pdf, other

    eess.SY cs.LG

    A Reinforcement Learning Approach for Robust Supervisory Control of UAVs Under Disturbances

    Authors: Ibrahim Ahmed, Marcos Quinones-Grueiro, Gautam Biswas

    Abstract: In this work, we present an approach to supervisory reinforcement learning control for unmanned aerial vehicles (UAVs). UAVs are dynamic systems where control decisions in response to disturbances in the environment have to be made in the order of milliseconds. We formulate a supervisory control architecture that interleaves with extant embedded control and demonstrates robustness to environmental… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: In review (2023-05-16)

  10. arXiv:2305.12158  [pdf, other

    eess.SY cs.LG

    Model-based adaptation for sample efficient transfer in reinforcement learning control of parameter-varying systems

    Authors: Ibrahim Ahmed, Marcos Quinones-Grueiro, Gautam Biswas

    Abstract: In this paper, we leverage ideas from model-based control to address the sample efficiency problem of reinforcement learning (RL) algorithms. Accelerating learning is an active field of RL highly relevant in the context of time-varying systems. Traditional transfer learning methods propose to use prior knowledge of the system behavior to devise a gradual or immediate data-driven transformation of… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: Published to IEEE CoDiT 2023

  11. arXiv:2205.09836  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    Concurrent Policy Blending and System Identification for Generalized Assistive Control

    Authors: Luke Bhan, Marcos Quinones-Grueiro, Gautam Biswas

    Abstract: In this work, we address the problem of solving complex collaborative robotic tasks subject to multiple varying parameters. Our approach combines simultaneous policy blending with system identification to create generalized policies that are robust to changes in system parameters. We employ a blending network whose state space relies solely on parameter estimates from a system identification techn… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted to ICRA 2022

  12. arXiv:2202.09698  [pdf

    cs.CY

    Analyzing Adaptive Scaffolds that Help Students Develop Self-Regulated Learning Behaviors

    Authors: Anabil Munshi, Gautam Biswas, Ryan Baker, Jaclyn Ocumpaugh, Stephen Hutt, Luc Paquette

    Abstract: Providing adaptive scaffolds to help learners develop self-regulated learning (SRL) processes has been an important goal for intelligent learning environments. Adaptive scaffolding is especially important in open-ended learning environments (OELE), where novice learners often face difficulties in completing their learning tasks. This paper presents a systematic framework for adaptive scaffolding i… ▽ More

    Submitted 1 June, 2022; v1 submitted 19 February, 2022; originally announced February 2022.

  13. arXiv:2012.06016  [pdf, other

    cs.LG eess.SY

    Performance-Weighed Policy Sampling for Meta-Reinforcement Learning

    Authors: Ibrahim Ahmed, Marcos Quinones-Grueiro, Gautam Biswas

    Abstract: This paper discusses an Enhanced Model-Agnostic Meta-Learning (E-MAML) algorithm that generates fast convergence of the policy function from a small number of training examples when applied to new learning tasks. Built on top of Model-Agnostic Meta-Learning (MAML), E-MAML maintains a set of policy parameters learned in the environment for previous tasks. We apply E-MAML to developing reinforcement… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.

  14. Complementary Meta-Reinforcement Learning for Fault-Adaptive Control

    Authors: Ibrahim Ahmed, Marcos Quinones-Grueiro, Gautam Biswas

    Abstract: Faults are endemic to all systems. Adaptive fault-tolerant control maintains degraded performance when faults occur as opposed to unsafe conditions or catastrophic events. In systems with abrupt faults and strict time constraints, it is imperative for control to adapt quickly to system changes to maintain system operations. We present a meta-reinforcement learning approach that quickly adapts its… ▽ More

    Submitted 26 September, 2020; originally announced September 2020.

    Comments: Accepted to PHM Conference 2020

    Journal ref: Annual Conference of the PHM Society. Vol. 12. No. 1. 2020

  15. arXiv:2008.04407  [pdf, other

    eess.SY cs.AI cs.LG

    Fault-Tolerant Control of Degrading Systems with On-Policy Reinforcement Learning

    Authors: Ibrahim Ahmed, Marcos Quiñones-Grueiro, Gautam Biswas

    Abstract: We propose a novel adaptive reinforcement learning control approach for fault tolerant control of degrading systems that is not preceded by a fault detection and diagnosis step. Therefore, \textit{a priori} knowledge of faults that may occur in the system is not required. The adaptive scheme combines online and offline learning of the on-policy control method to improve exploration and sample effi… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: Published in IFAC World Congress 2020

  16. arXiv:2008.04403  [pdf, other

    eess.SY cs.AI cs.LG

    Comparison of Model Predictive and Reinforcement Learning Methods for Fault Tolerant Control

    Authors: Ibrahim Ahmed, Hamed Khorasgani, Gautam Biswas

    Abstract: A desirable property in fault-tolerant controllers is adaptability to system changes as they evolve during systems operations. An adaptive controller does not require optimal control policies to be enumerated for possible faults. Instead it can approximate one in real-time. We present two adaptive fault-tolerant control schemes for a discrete time system based on hierarchical reinforcement learnin… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: Published in IFAC SAFEPROCESS 2018

  17. arXiv:2008.01879  [pdf, other

    cs.LG eess.SP

    A Relearning Approach to Reinforcement Learning for Control of Smart Buildings

    Authors: Avisek Naug, Marcos Quiñones-Grueiro, Gautam Biswas

    Abstract: This paper demonstrates that continual relearning of control policies using incremental deep reinforcement learning (RL) can improve policy learning for non-stationary processes. We demonstrate this approach for a data-driven 'smart building environment' that we use as a test-bed for developing HVAC controllers for reducing energy consumption of large buildings on our university campus. The non-st… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

  18. arXiv:1304.2721  [pdf

    cs.AI

    Using the Dempster-Shafer Scheme in a Diagnostic Expert System Shell

    Authors: Gautam Biswas, Teywansh S. Anand

    Abstract: This paper discusses an expert system shell that integrates rule-based reasoning and the Dempster-Shafer evidence combination scheme. Domain knowledge is stored as rules with associated belief functions. The reasoning component uses a combination of forward and backward inferencing mechanisms to allow interaction with users in a mixed-initiative format.

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Third Conference on Uncertainty in Artificial Intelligence (UAI1987)

    Report number: UAI-P-1987-PG-98-105

  19. arXiv:1111.7051  [pdf

    cs.CR

    Design of Image Cryptosystem by Simultaneous VQ-Compression and Shuffling of Codebook and Index Matrix

    Authors: Arup Kumar Pal, G. P. Biswas, S. Mukhopadhyay

    Abstract: The popularity of Internet usage although increases exponentially, it is incapable of providing the security for exchange of confidential data between the users. As a result, several cryptosystems for encryption of data and images have been developed for secured transmission over Internet. In this work, a scheme for Image encryption/decryption based on Vector Quantization (VQ) has been proposed th… ▽ More

    Submitted 30 November, 2011; originally announced November 2011.

    Journal ref: The International journal of Multimedia & Its Applications (IJMA), Vol.1, No.1, November 2009