Showing 1–50 of 99 results for author: Thomas, J

Searching in archive cs.
  1. arXiv:2408.09311  [pdf, other]

    cs.CL

    An Open-Source American Sign Language Fingerspell Recognition and Semantic Pose Retrieval Interface

    Authors: Kevin Jose Thomas

    Abstract: This paper introduces an open-source interface for American Sign Language fingerspell recognition and semantic pose retrieval, aimed to serve as a stepping stone towards more advanced sign language translation systems. Utilizing a combination of convolutional neural networks and pose estimation models, the interface provides two modular components: a recognition module for translating ASL fingersp…

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: 8 pages, 9 figures

  2. arXiv:2408.08941  [pdf]

    quant-ph cs.ET

    Quantum Circuit Optimization: Current trends and future direction

    Authors: Geetha Karuppasamy, Varun Puram, Stevens Johnson, Johnson P Thomas

    Abstract: Optimization of quantum circuits for a given problem is very important in order to achieve faster calculations as well as reduce errors due to noise. Optimization has to be achieved while ensuring correctness at all times. In this survey paper, recent advancements in quantum circuit optimization are explored. Both hardware independent as well as hardware dependent optimization are presented. State…

    Submitted 16 August, 2024; originally announced August 2024.

  3. arXiv:2408.08940  [pdf]

    cs.DS

    Quantum Algorithm for Jaccard Similarity

    Authors: Varun Puram, Ruthvik Rao Bobbili, Johnson P Thomas

    Abstract: Jaccard Similarity is a very common proximity measurement used to compute the similarity between two asymmetric binary vectors. Jaccard Similarity is the ratio between the 1s (Intersection of two vectors) to 1s (Union of two vectors). This paper introduces a quantum algorithm for finding the Jaccard Similarity 1s, in the Intersection and Union of two binary vectors. There are two sub-algorithms on…

    Submitted 16 August, 2024; originally announced August 2024.
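
For reference, the classical quantity described in the abstract above (the count of positions where both vectors are 1, divided by the count of positions where at least one is 1) can be computed directly. The following is a minimal classical sketch in Python, not the quantum algorithm the paper introduces; the function name `jaccard_similarity` and the convention of returning 0.0 for two all-zero vectors are assumptions made for illustration.

```python
def jaccard_similarity(a, b):
    """Classical Jaccard similarity of two equal-length binary vectors.

    Hypothetical helper for illustration only; it is not taken from the paper.
    """
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    # Positions where both vectors hold a 1 (the intersection).
    intersection = sum(1 for x, y in zip(a, b) if x == 1 and y == 1)
    # Positions where at least one vector holds a 1 (the union).
    union = sum(1 for x, y in zip(a, b) if x == 1 or y == 1)
    # Assumed convention: two all-zero vectors have similarity 0.0.
    return intersection / union if union else 0.0


u = [1, 0, 1, 1, 0]
v = [1, 1, 0, 1, 0]
print(jaccard_similarity(u, v))  # 2 shared 1s over 4 positions in the union -> 0.5
```

Positions where both vectors are 0 do not enter either count, which is why the measure suits asymmetric binary data.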

  4. arXiv:2407.13183  [pdf]

    eess.IV cs.CV

    Methods to Measure the Broncho-Arterial Ratio and Wall Thickness in the Right Lower Lobe for Defining Radiographic Reversibility of Bronchiectasis

    Authors: Abhijith R. Beeravolu, Ian Brent Masters, Mirjam Jonkman, Kheng Cher Yeo, Spyridon Prountzos, Rahul J Thomas, Eva Ignatious, Sami Azam, Gabrielle B McCallum, Efthymia Alexopoulou, Anne B Chang, Friso De Boer

    Abstract: The diagnosis of bronchiectasis requires measuring abnormal bronchial dilation. It is confirmed using a chest CT scan, where the key feature is an increased broncho-arterial ratio (BAR) (>0.8 in children), often with bronchial wall thickening. Image processing methods facilitate quicker interpretation and detailed evaluations by lobes and segments. Challenges like inclined nature, oblique orientat…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 14 pages

  5. Optimizing Nurse Scheduling: A Supply Chain Approach for Healthcare Institutions

    Authors: Jubin Thomas

    Abstract: When managing an organization, planners often encounter numerous challenging scenarios. In such instances, relying solely on intuition or managerial experience may not suffice, necessitating a quantitative approach. This demand is further accentuated in the era of big data, where the sheer scale and complexity of constraints pose significant challenges. Therefore, the aim of this study is to provi…

    Submitted 29 May, 2024; originally announced July 2024.

    Journal ref: Vol. 20 No. 6s (2024)

  6. arXiv:2407.04522  [pdf, other]

    cs.LG

    Graph Reinforcement Learning for Power Grids: A Comprehensive Survey

    Authors: Mohamed Hassouna, Clara Holzhüter, Pawel Lytaev, Josephine Thomas, Bernhard Sick, Christoph Scholz

    Abstract: The rise of renewable energy and distributed generation requires new approaches to overcome the limitations of traditional methods. In this context, Graph Neural Networks are promising due to their ability to learn from graph-structured data. Combined with Reinforcement Learning, they can serve as control approaches to determine remedial network actions. This review analyses how Graph Reinforcemen…

    Submitted 26 August, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  7. arXiv:2406.18718  [pdf]

    cs.HC eess.SY

    State-Based Automation for Time-Restricted Eating Adherence

    Authors: Samuel E. Armstrong, Aaron D. Mullen, J. Matthew Thomas, Dorothy D. Sears, Julie S. Pendergast, Jeffrey Talbert, Cody Bumgardner

    Abstract: Developing and enforcing study protocols is a foundational component of medical research. As study complexity for participant interactions increases, translating study protocols to supporting application code becomes challenging. A collaboration exists between the University of Kentucky and Arizona State University to determine the efficacy of time-restricted eating in improving metabolic risk amo…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 8 pages, 4 figures, submitted to AMIA 2024 Annual Symposium

  8. arXiv:2406.03251  [pdf, other]

    cs.SD cs.AI eess.AS

    ASoBO: Attentive Beamformer Selection for Distant Speaker Diarization in Meetings

    Authors: Theo Mariotte, Anthony Larcher, Silvio Montresor, Jean-Hugh Thomas

    Abstract: Speaker Diarization (SD) aims at grouping speech segments that belong to the same speaker. This task is required in many speech-processing applications, such as rich meeting transcription. In this context, distant microphone arrays usually capture the audio signal. Beamforming, i.e., spatial filtering, is a common practice to process multi-microphone audio data. However, it often requires an expli…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures, 2 tables, accepted at Interspeech 2024

  9. arXiv:2405.14445  [pdf]

    cs.CL cs.AI

    Exploring the use of a Large Language Model for data extraction in systematic reviews: a rapid feasibility study

    Authors: Lena Schmidt, Kaitlyn Hair, Sergio Graziozi, Fiona Campbell, Claudia Kapp, Alireza Khanteymoori, Dawn Craig, Mark Engelbert, James Thomas

    Abstract: This paper describes a rapid feasibility study of using GPT-4, a large language model (LLM), to (semi)automate data extraction in systematic reviews. Despite the recent surge of interest in LLMs there is still a lack of understanding of how to design LLM-based automation tools and how to robustly evaluate their performance. During the 2023 Evidence Synthesis Hackathon we conducted two feasibility…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Conference proceedings, peer-reviewed and presented at the 3rd Workshop on Augmented Intelligence for Technology-Assisted Reviews Systems, Glasgow, 2024

    Journal ref: Proceedings of the 3rd Workshop on Augmented Intelligence for Technology-Assisted Reviews Systems, 2024

  10. arXiv:2404.15549  [pdf, other]

    cs.CL cs.AI

    PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models

    Authors: Shashi Kant Gupta, Aditya Basu, Mauro Nievas, Jerrin Thomas, Nathan Wolfrath, Adhitya Ramamurthi, Bradley Taylor, Anai N. Kothari, Regina Schwind, Therica M. Miller, Sorena Nadaf-Rahrov, Yanshan Wang, Hrituraj Singh

    Abstract: Clinical trial matching is the task of identifying trials for which patients may be potentially eligible. Typically, this task is labor-intensive and requires detailed verification of patient electronic health records (EHRs) against the stringent inclusion and exclusion criteria of clinical trials. This process is manual, time-intensive, and challenging to scale up, resulting in many patients miss…

    Submitted 26 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: 30 Pages, 8 Figures, Supplementary Work Attached

  11. arXiv:2402.08312  [pdf, other]

    eess.AS cs.SD

    Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection

    Authors: Théo Mariotte, Anthony Larcher, Silvio Montrésor, Jean-Hugh Thomas

    Abstract: Voice Activity Detection (VAD) and Overlapped Speech Detection (OSD) are key pre-processing tasks for speaker diarization. In the meeting context, it is often easier to capture speech with a distant device. This consideration however leads to severe performance degradation. We study a unified supervised learning framework to solve distant multi-microphone joint VAD and OSD (VAD+OSD). This paper in…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 14 pages, 5 figures, accepted at IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)

  12. The Devil Is in the Command Line: Associating the Compiler Flags With the Binary and Build Metadata

    Authors: Gunnar Kudrjavets, Aditya Kumar, Jeff Thomas, Ayushi Rastogi

    Abstract: Engineers build large software systems for multiple architectures, operating systems, and configurations. A set of inconsistent or missing compiler flags generates code that catastrophically impacts the system's behavior. In the authors' industry experience, defects caused by an undesired combination of compiler flags are common in nontrivial software projects. We are unaware of any build and CI/C…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 3 pages. To be published in the 46th International Conference on Software Engineering (ICSE 2024), April 14 - April 20 2024, Lisbon, Portugal

  13. What Do You Mean by Memory? When Engineers Are Lost in the Maze of Complexity

    Authors: Gunnar Kudrjavets, Aditya Kumar, Jeff Thomas, Ayushi Rastogi

    Abstract: An accepted practice to decrease applications' memory usage is to reduce the amount and frequency of memory allocations. Factors such as (a) the prevalence of out-of-memory (OOM) killers, (b) memory allocations in modern programming languages done implicitly, (c) overcommitting being a default strategy in the Linux kernel, and (d) the rise in complexity and terminology related to memory management…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 3 pages. To be published in the 46th International Conference on Software Engineering (ICSE 2024), April 14 - April 20 2024, Lisbon, Portugal

  14. arXiv:2308.07247  [pdf, other]

    cs.LG cs.AI stat.ML

    Can we Agree? On the Rashōmon Effect and the Reliability of Post-Hoc Explainable AI

    Authors: Clement Poiret, Antoine Grigis, Justin Thomas, Marion Noulhiane

    Abstract: The Rashōmon effect poses challenges for deriving reliable knowledge from machine learning models. This study examined the influence of sample size on explanations from models in a Rashōmon set using SHAP. Experiments on 5 public datasets showed that explanations gradually converged as the sample size increased. Explanations from <128 samples exhibited high variability, limiting reliable knowledge…

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 13 pages, 6 figures and 6 tables

    ACM Class: H.1.1; I.6.4; I.2.1

  15. arXiv:2307.13012  [pdf, other]

    cs.SD cs.AI cs.NE eess.AS eess.SP

    Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains

    Authors: Martin Lebourdais, Théo Mariotte, Marie Tahon, Anthony Larcher, Antoine Laurent, Silvio Montresor, Sylvain Meignier, Jean-Hugh Thomas

    Abstract: Voice activity and overlapped speech detection (respectively VAD and OSD) are key pre-processing tasks for speaker diarization. The final segmentation performance highly relies on the robustness of these sub-tasks. Recent studies have shown VAD and OSD can be trained jointly using a multi-class classification model. However, these works are often restricted to a specific speech domain, lacking inf…

    Submitted 24 July, 2023; originally announced July 2023.

  16. arXiv:2307.08175  [pdf, other]

    cs.LG cs.NE stat.ML

    Multi-Objective Optimization of Performance and Interpretability of Tabular Supervised Machine Learning Models

    Authors: Lennart Schneider, Bernd Bischl, Janek Thomas

    Abstract: We present a model-agnostic framework for jointly optimizing the predictive performance and interpretability of supervised machine learning models for tabular data. Interpretability is quantified via three measures: feature sparsity, interaction sparsity of features, and sparsity of non-monotone feature effects. By treating hyperparameter optimization of a machine learning algorithm as a multi-obj…

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: Extended version of the paper accepted at GECCO 2023. 16 pages, 7 tables, 7 figures

  17. arXiv:2306.04268  [pdf, other]

    cs.SD cs.CL eess.AS

    Multi-microphone Automatic Speech Segmentation in Meetings Based on Circular Harmonics Features

    Authors: Théo Mariotte, Anthony Larcher, Silvio Montrésor, Jean-Hugh Thomas

    Abstract: Speaker diarization is the task of answering "Who spoke and when?" in an audio stream. Pipeline systems rely on speech segmentation to extract speakers' segments and achieve robust speaker diarization. This paper proposes a common framework to solve three segmentation tasks in the distant speech scenario: Voice Activity Detection (VAD), Overlapped Speech Detection (OSD), and Speaker Change Detection…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Interspeech 2023, international Speech Communication Association (ISCA), Aug 2023, Dublin, Ireland

  18. arXiv:2305.14394  [pdf, other]

    cs.NE cs.AI cs.LG q-bio.NC

    Unsupervised Spiking Neural Network Model of Prefrontal Cortex to study Task Switching with Synaptic deficiency

    Authors: Ashwin Viswanathan Kannan, Goutam Mylavarapu, Johnson P Thomas

    Abstract: In this study, we build a computational model of Prefrontal Cortex (PFC) using Spiking Neural Networks (SNN) to understand how neurons adapt and respond to tasks switched under short and longer duration of stimulus changes. We also explore behavioral deficits arising out of the PFC lesions by simulating lesioned states in our Spiking architecture model. Although there are some computational models…

    Submitted 23 May, 2023; originally announced May 2023.

  19. arXiv:2305.04502  [pdf, other]

    cs.LG cs.NE

    MO-DEHB: Evolutionary-based Hyperband for Multi-Objective Optimization

    Authors: Noor Awad, Ayushi Sharma, Philipp Muller, Janek Thomas, Frank Hutter

    Abstract: Hyperparameter optimization (HPO) is a powerful technique for automating the tuning of machine learning (ML) models. However, in many real-world applications, accuracy is only one of multiple performance criteria that must be considered. Optimizing these objectives simultaneously on a complex and diverse search space remains a challenging task. In this paper, we propose MO-DEHB, an effective and f…

    Submitted 11 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

  20. arXiv:2304.00111  [pdf]

    cs.CL

    Identifying Symptoms of Delirium from Clinical Narratives Using Natural Language Processing

    Authors: Aokun Chen, Daniel Paredes, Zehao Yu, Xiwei Lou, Roberta Brunson, Jamie N. Thomas, Kimberly A. Martinez, Robert J. Lucero, Tanja Magoc, Laurence M. Solberg, Urszula A. Snigurska, Sarah E. Ser, Mattia Prosperi, Jiang Bian, Ragnhildur I. Bjarnadottir, Yonghui Wu

    Abstract: Delirium is an acute decline or fluctuation in attention, awareness, or other cognitive function that can lead to serious adverse outcomes. Despite the severe outcomes, delirium is frequently unrecognized and uncoded in patients' electronic health records (EHRs) due to its transient and diverse nature. Natural language processing (NLP), a key technology that extracts medical concepts from clinical…

    Submitted 31 March, 2023; originally announced April 2023.

  21. Investigating Strategies for Clause Recommendation

    Authors: Sagar Joshi, Sumanth Balaji, Jerrin Thomas, Aparna Garimella, Vasudeva Varma

    Abstract: Clause recommendation is the problem of recommending a clause to a legal contract, given the context of the contract in question and the clause type to which the clause should belong. With not much prior work being done toward the generation of legal contracts, this problem was proposed as a first step toward the bigger problem of contract generation. As an open-ended text generation problem, the…

    Submitted 21 January, 2023; originally announced January 2023.

    Comments: Published in Legal Knowledge and Information Systems (JURIX) 2022. (10 pages, 4 figures)

    ACM Class: I.2.7

    Journal ref: Volume 362: Legal Knowledge and Information Systems (2022), Frontiers in Artificial Intelligence and Applications

  22. arXiv:2301.00843  [pdf, other]

    eess.SP cs.IT q-bio.QM

    Explicitly Solvable Continuous-time Inference for Partially Observed Markov Processes

    Authors: Daniel Chen, Alexander G. Strang, Andrew W. Eckford, Peter J. Thomas

    Abstract: Many natural and engineered systems can be modeled as discrete state Markov processes. Often, only a subset of states are directly observable. Inferring the conditional probability that a system occupies a particular hidden state, given the partial observation, is a problem with broad application. In this paper, we introduce a continuous-time formulation of the sum-product algorithm, which is a we…

    Submitted 2 January, 2023; originally announced January 2023.

    Comments: Accepted for publication in IEEE Transactions on Signal Processing

  23. Who Ate My Memory? Towards Attribution in Memory Management

    Authors: Gunnar Kudrjavets, Ayushi Rastogi, Jeff Thomas, Nachiappan Nagappan

    Abstract: To understand applications' memory usage details, engineers use instrumented builds and profiling tools. Both approaches are impractical for use in production environments or deployed mobile applications. As a result, developers can gather only high-level memory-related statistics for deployed software. In our experience, the lack of granular field data makes fixing performance and reliability-rel…

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: 3 pages. To be published in the 45th International Conference on Software Engineering (ICSE 2023), May 14 - May 20 2023, Melbourne, Australia

  24. arXiv:2212.11498  [pdf, other]

    cs.LG cs.AI cs.MA cs.RO

    Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers

    Authors: Aleksandar Krnjaic, Raul D. Steleac, Jonathan D. Thomas, Georgios Papoudakis, Lukas Schäfer, Andrew Wing Keung To, Kuan-Ho Lao, Murat Cubuktepe, Matthew Haley, Peter Börsting, Stefano V. Albrecht

    Abstract: We consider a warehouse in which dozens of mobile robots and human pickers work together to collect and deliver items within the warehouse. The fundamental problem we tackle, called the order-picking problem, is how these worker agents must coordinate their movement and actions in the warehouse to maximise performance in this task. Established industry methods using heuristic approaches require la…

    Submitted 30 August, 2024; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024

  25. arXiv:2210.03990  [pdf, other]

    cs.LG cs.AI

    Weisfeiler-Lehman goes Dynamic: An Analysis of the Expressive Power of Graph Neural Networks for Attributed and Dynamic Graphs

    Authors: Silvia Beddar-Wiesing, Giuseppe Alessio D'Inverno, Caterina Graziani, Veronica Lachi, Alice Moallemy-Oureh, Franco Scarselli, Josephine Maria Thomas

    Abstract: Graph Neural Networks (GNNs) are a large class of relational models for graph processing. Recent theoretical studies on the expressive power of GNNs have focused on two issues. On the one hand, it has been proven that GNNs are as powerful as the Weisfeiler-Lehman test (1-WL) in their ability to distinguish graphs. Moreover, it has been shown that the equivalence enforced by 1-WL equals unfolding e…

    Submitted 3 May, 2024; v1 submitted 8 October, 2022; originally announced October 2022.

  26. arXiv:2209.05874  [pdf, other]

    cs.NI cs.LG

    Federated Meta-Learning for Traffic Steering in O-RAN

    Authors: Hakan Erdol, Xiaoyang Wang, Peizheng Li, Jonathan D. Thomas, Robert Piechocki, George Oikonomou, Rui Inacio, Abdelrahim Ahmad, Keith Briggs, Shipra Kapoor

    Abstract: The vision of 5G lies in providing high data rates, low latency (for the aim of near-real-time applications), significantly increased base station capacity, and near-perfect quality of service (QoS) for users, compared to LTE networks. In order to provide such services, 5G systems will support various combinations of access technologies such as LTE, NR, NR-U and Wi-Fi. Each radio access technology…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 7 pages, 3 figures, 2 algorithms, and 3 tables

  27. When malloc() Never Returns NULL -- Reliability as an Illusion

    Authors: Gunnar Kudrjavets, Jeff Thomas, Aditya Kumar, Nachiappan Nagappan, Ayushi Rastogi

    Abstract: For decades, the guidance given to software engineers has been to check the memory allocation results. This validation step is necessary to avoid crashes. However, in user mode, in modern operating systems (OS), such as Android, FreeBSD, iOS, and macOS, the caller does not have an opportunity to handle the memory allocation failures. This behavioral trait results from the actions of a system compo…

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: 6 pages. To be published in the 33rd IEEE International Symposium on Software Reliability Engineering (ISSRE 2022), Oct 31 - Nov 3 2022, Charlotte, North Carolina, USA

  28. arXiv:2208.00204  [pdf, other]

    cs.LG cs.NE stat.ML

    Tackling Neural Architecture Search With Quality Diversity Optimization

    Authors: Lennart Schneider, Florian Pfisterer, Paul Kent, Juergen Branke, Bernd Bischl, Janek Thomas

    Abstract: Neural architecture search (NAS) has been studied extensively and has grown to become a research field with substantial impact. While classical single-objective NAS searches for the architecture with the best performance, multi-objective NAS considers multiple objectives that should be optimized simultaneously, e.g., minimizing resource usage along the validation error. Although considerable progr…

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: Accepted at the First Conference on Automated Machine Learning (Main Track). 30 pages, 8 tables, 13 figures

  29. arXiv:2208.00160  [pdf, other]

    cs.CV

    Learning Feature Decomposition for Domain Adaptive Monocular Depth Estimation

    Authors: Shao-Yuan Lo, Wei Wang, Jim Thomas, Jingjing Zheng, Vishal M. Patel, Cheng-Hao Kuo

    Abstract: Monocular depth estimation (MDE) has attracted intense study due to its low cost and critical functions for robotic tasks such as localization, mapping and obstacle detection. Supervised approaches have led to great success with the advance of deep learning, but they rely on large quantities of ground-truth depth annotations that are expensive to acquire. Unsupervised domain adaptation (UDA) trans…

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: Accepted at IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022

  30. arXiv:2207.12560  [pdf, other]

    cs.LG stat.ML

    AMLB: an AutoML Benchmark

    Authors: Pieter Gijsbers, Marcos L. P. Bueno, Stefan Coors, Erin LeDell, Sébastien Poirier, Janek Thomas, Bernd Bischl, Joaquin Vanschoren

    Abstract: Comparing different AutoML frameworks is notoriously challenging and often done incorrectly. We introduce an open and extensible benchmark that follows best practices and avoids common mistakes when comparing AutoML frameworks. We conduct a thorough comparison of 9 well-known AutoML frameworks across 71 classification and 33 regression tasks. The differences between the AutoML frameworks are explo…

    Submitted 16 November, 2023; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: UNDER REVIEW: Revised submission to JMLR, with updated results from June 2023

  31. There Ain't No Such Thing as a Free Custom Memory Allocator

    Authors: Gunnar Kudrjavets, Jeff Thomas, Aditya Kumar, Nachiappan Nagappan, Ayushi Rastogi

    Abstract: Using custom memory allocators is an efficient performance optimization technique. However, dependency on a custom allocator can introduce several maintenance-related issues. We present lessons learned from the industry and provide critical guidance for using custom memory allocators and enumerate various challenges associated with integrating them. These recommendations are based on years of expe…

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: 4 pages. To be published in 38th IEEE International Conference on Software Maintenance and Evolution (ICSME 2022), Oct 3-7, 2022, Limassol, Cyprus

  32. arXiv:2206.07438  [pdf, other]

    cs.LG stat.ML

    Multi-Objective Hyperparameter Optimization in Machine Learning -- An Overview

    Authors: Florian Karl, Tobias Pielok, Julia Moosbauer, Florian Pfisterer, Stefan Coors, Martin Binder, Lennart Schneider, Janek Thomas, Jakob Richter, Michel Lang, Eduardo C. Garrido-Merchán, Juergen Branke, Bernd Bischl

    Abstract: Hyperparameter optimization constitutes a large part of typical modern machine learning workflows. This arises from the fact that machine learning methods and corresponding preprocessing steps often only yield optimal performance when hyperparameters are properly tuned. But in many applications, we are not only interested in optimizing ML pipelines solely for predictive accuracy; additional metric…

    Submitted 6 June, 2024; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: Published at ACM TELO

    Journal ref: ACM Transactions on Evolutionary Learning and Optimization 3.4 (2023): 1-50

  33. Is Kernel Code Different From Non-Kernel Code? A Case Study of BSD Family Operating Systems

    Authors: Gunnar Kudrjavets, Jeff Thomas, Nachiappan Nagappan, Ayushi Rastogi

    Abstract: Code churn and code velocity describe the evolution of a code base. Current research quantifies and studies code churn and velocity at a high level of abstraction, often at the overall project level or even at the level of an entire company. We argue that such an approach ignores noticeable differences among the subsystems of large projects. We conducted an exploratory study on four BSD family ope…

    Submitted 11 June, 2022; originally announced June 2022.

    Comments: 13 pages. To be published in 38th IEEE International Conference on Software Maintenance and Evolution (ICSME 2022), Oct 3-7, 2022, Limassol, Cyprus

  34. arXiv:2206.03846  [pdf, other]

    cs.LG cs.NI

    Sim2real for Reinforcement Learning Driven Next Generation Networks

    Authors: Peizheng Li, Jonathan Thomas, Xiaoyang Wang, Hakan Erdol, Abdelrahim Ahmad, Rui Inacio, Shipra Kapoor, Arjun Parekh, Angela Doufexi, Arman Shojaeifard, Robert Piechocki

    Abstract: The next generation of networks will actively embrace artificial intelligence (AI) and machine learning (ML) technologies for automation networks and optimal network operation strategies. The emerging network structure represented by Open RAN (O-RAN) conforms to this trend, and the radio intelligent controller (RIC) at the centre of its specification serves as an ML applications host. Various ML m…

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: 7 pages, 4 figures

  35. arXiv:2206.03469  [pdf, other]

    cs.LG

    Marked Neural Spatio-Temporal Point Process Involving a Dynamic Graph Neural Network

    Authors: Alice Moallemy-Oureh, Silvia Beddar-Wiesing, Yannick Nagel, Rüdiger Nather, Josephine M. Thomas

    Abstract: Temporal Point Processes (TPPs) have recently become increasingly interesting for learning dynamics in graph data. A reason for this is that learning on dynamic graph data is becoming more relevant, since data from many scientific fields, ranging from mathematics, biology, social sciences, and physics to computer science, is naturally related and inherently dynamic. In addition, TPPs provide a mea…

    Submitted 28 August, 2024; v1 submitted 7 June, 2022; originally announced June 2022.

  36. arXiv:2206.02462  [pdf, other]

    cs.RO

    Achieving Goals using Reward Shaping and Curriculum Learning

    Authors: Mihai Anca, Jonathan D. Thomas, Dabal Pedamonti, Matthew Studley, Mark Hansen

    Abstract: Real-time control for robotics is a popular research area in the reinforcement learning community. Through the use of techniques such as reward shaping, researchers have managed to train online agents across a multitude of domains. Despite these advances, solving goal-oriented tasks still requires complex architectural changes or hard constraints to be placed on the problem. In this article, we so…

    Submitted 20 April, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: To be published at Future Technologies Conference (FTC) 2023

  37. The Evolving Landscape of Software Performance Engineering

    Authors: Gunnar Kudrjavets, Jeff Thomas, Nachiappan Nagappan

    Abstract: Satisfactory software performance is essential for the adoption and the success of a product. In organizations that follow traditional software development models (e.g., waterfall), Software Performance Engineering (SPE) involves time-consuming experimental modeling and performance testing outside the actual production environment. Such existing SPE methods, however, are not optimized for environm…

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: 2 pages. To be published in The International Conference on Evaluation and Assessment in Software Engineering 2022 (EASE 2022), June 13-15, 2022, Gothenburg, Sweden

  38. arXiv:2204.14061  [pdf, other]

    cs.LG

    A Collection of Quality Diversity Optimization Problems Derived from Hyperparameter Optimization of Machine Learning Models

    Authors: Lennart Schneider, Florian Pfisterer, Janek Thomas, Bernd Bischl

    Abstract: The goal of Quality Diversity Optimization is to generate a collection of diverse yet high-performing solutions to a given problem at hand. Typical benchmark problems are, for example, finding a repertoire of robot arm configurations or a collection of game playing strategies. In this paper, we propose a set of Quality Diversity Optimization problems that tackle hyperparameter optimization of mach…

    Submitted 30 July, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: Accepted at the GECCO'22 Workshop on Quality Diversity Algorithm Benchmarks. 7 pages, 6 tables, 7 figures

  39. arXiv:2204.03080  [pdf, other]

    cs.LG

    Graph Neural Networks Designed for Different Graph Types: A Survey

    Authors: Josephine M. Thomas, Alice Moallemy-Oureh, Silvia Beddar-Wiesing, Clara Holzhüter

    Abstract: Graphs are ubiquitous in nature and can therefore serve as models for many practical but also theoretical problems. For this purpose, they can be defined as many different types which suitably reflect the individual contexts of the represented problem. To address cutting-edge problems based on graph data, the research field of Graph Neural Networks (GNNs) has emerged. Despite the field's youth and…

    Submitted 26 April, 2023; v1 submitted 6 April, 2022; originally announced April 2022.

  40. arXiv:2203.15423  [pdf]

    cs.HC

    Development of a Scale to Measure Technology Acceptance in Smart Agriculture

    Authors: Rosemary J Thomas, Rebecca Whetton, Andy Doyle, David Coyle

    Abstract: This paper describes the development of a scale to measure technology acceptance in smart agriculture. The scale is intended for use in diverse situations, ranging from the evaluation of existing technologies already in widespread use to the evaluation of prototype systems. A systematic screening of prior literature was conducted to identify initial scale items regarding how technology acceptance…

    Submitted 29 March, 2022; originally announced March 2022.

  41. arXiv:2203.06732  [pdf, other]

    q-bio.QM cs.CE q-bio.MN

    BioSimulators: a central registry of simulation engines and services for recommending specific tools

    Authors: Bilal Shaikh, Lucian P. Smith, Dan Vasilescu, Gnaneswara Marupilla, Michael Wilson, Eran Agmon, Henry Agnew, Steven S. Andrews, Azraf Anwar, Moritz E. Beber, Frank T. Bergmann, David Brooks, Lutz Brusch, Laurence Calzone, Kiri Choi, Joshua Cooper, John Detloff, Brian Drawert, Michel Dumontier, G. Bard Ermentrout, James R. Faeder, Andrew P. Freiburger, Fabian Fröhlich, Akira Funahashi, Alan Garny, et al. (46 additional authors not shown)

    Abstract: Computational models have great potential to accelerate bioscience, bioengineering, and medicine. However, it remains challenging to reproduce and reuse simulations, in part, because the numerous formats and methods for simulating various subsystems and scales remain siloed by different software tools. For example, each tool must be executed through a distinct interface. To help investigators find…

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: 6 pages, 2 figures

  42. Quantifying Daily Evolution of Mobile Software Based on Memory Allocator Churn

    Authors: Gunnar Kudrjavets, Jeff Thomas, Aditya Kumar, Nachiappan Nagappan, Ayushi Rastogi

    Abstract: The pace and volume of code churn necessary to evolve modern software systems present challenges for analyzing the performance impact of any set of code changes. Traditional methods used in performance analysis rely on extensive data collection and profiling, which often takes days. For large organizations utilizing Continuous Integration (CI) and Continuous Deployment (CD), these traditional tech…

    Submitted 6 May, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: 5 pages. To be published in Proceedings of The 9th International Conference on Mobile Software Engineering and Systems (MobileSoft '22). ACM, New York, NY, USA

  43. Better Modelling Out-of-Distribution Regression on Distributed Acoustic Sensor Data Using Anchored Hidden State Mixup

    Authors: Hasan Asyari Arief, Peter James Thomas, Tomasz Wiktorski

    Abstract: Generalizing the application of machine learning models to situations where the statistical distribution of training and test data are different has been a complex problem. Our contributions in this paper are threefold: (1) we introduce an anchored-based Out of Distribution (OOD) Regression Mixup algorithm, leveraging manifold hidden state mixup and observation similarities to form a novel regular…

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: TII Accepted Version

    Journal ref: IEEE Transactions on Industrial Informatics (TII.2022.3154783)

  44. arXiv:2112.03577  [pdf, other]

    cs.RO cs.AI

    Pragmatic Implementation of Reinforcement Algorithms For Path Finding On Raspberry Pi

    Authors: Serena Raju, Sherin Shibu, Riya Mol Raji, Joel Thomas

    Abstract: In this paper, pragmatic implementation of an indoor autonomous delivery system that exploits Reinforcement Learning algorithms for path planning and collision avoidance is audited. The proposed system is a cost-efficient approach that is implemented to facilitate a Raspberry Pi controlled four-wheel-drive non-holonomic robot map a grid. This approach computes and navigates the shortest path from…

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: 5 pages, 7 figures

    ACM Class: I.2.9; F.2.0

  45. arXiv:2111.14535  [pdf, other]

    cs.AR

    Enabling Reusable Physical Design Flows with Modular Flow Generators

    Authors: Alex Carsello, James Thomas, Ankita Nayak, Po-Han Chen, Mark Horowitz, Priyanka Raina, Christopher Torng

    Abstract: Achieving high code reuse in physical design flows is challenging but increasingly necessary to build complex systems. Unfortunately, existing approaches based on parameterized Tcl generators support very limited reuse and struggle to preserve reusable code as designers customize flows for specific designs and technologies. We present a vision and framework based on modular flow generators that en…

    Submitted 29 November, 2021; originally announced November 2021.

  46. arXiv:2111.11129  [pdf, other]

    cs.AI cs.MA

    Multi-lingual agents through multi-headed neural networks

    Authors: J. D. Thomas, R. Santos-Rodríguez, R. Piechocki, M. Anca

    Abstract: This paper considers cooperative Multi-Agent Reinforcement Learning, focusing on emergent communication in settings where multiple pairs of independent learners interact at varying frequencies. In this context, multiple distinct and incompatible languages can emerge. When an agent encounters a speaker of an alternative language, there is a requirement for a period of adaptation before they can eff…

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: Cooperative AI workshop NeurIPS 2021

  47. RLOps: Development Life-cycle of Reinforcement Learning Aided Open RAN

    Authors: Peizheng Li, Jonathan Thomas, Xiaoyang Wang, Ahmed Khalil, Abdelrahim Ahmad, Rui Inacio, Shipra Kapoor, Arjun Parekh, Angela Doufexi, Arman Shojaeifard, Robert Piechocki

    Abstract: Radio access network (RAN) technologies continue to evolve, with Open RAN gaining the most recent momentum. In the O-RAN specifications, the RAN intelligent controllers (RICs) are software-defined orchestration and automation functions for the intelligent management of RAN. This article introduces principles for machine learning (ML), in particular, reinforcement learning (RL) applications in the…

    Submitted 25 November, 2022; v1 submitted 12 November, 2021; originally announced November 2021.

    Comments: 17 pages, 6 figures

    Journal ref: IEEE Access (2022), vol. 10, pp. 113808-113826

  48. arXiv:2109.10708  [pdf, other]

    cs.DM cs.CC cs.DS cs.IT

    A Note on the Modeling Power of Different Graph Types

    Authors: Josephine M. Thomas, Silvia Beddar-Wiesing, Alice Moallemy-Oureh, Rüdiger Nather

    Abstract: Graphs can have different properties that lead to several graph types and may allow for a varying representation of diverse information. In order to clarify the modeling power of graphs, we introduce a partial order on the most common graph types based on an expressivity relation. The expressivity relation quantifies how many properties a graph type can encode compared to another type. Additionall…

    Submitted 7 September, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

  49. arXiv:2107.05847  [pdf, other]

    stat.ML cs.LG

    Hyperparameter Optimization: Foundations, Algorithms, Best Practices and Open Challenges

    Authors: Bernd Bischl, Martin Binder, Michel Lang, Tobias Pielok, Jakob Richter, Stefan Coors, Janek Thomas, Theresa Ullmann, Marc Becker, Anne-Laure Boulesteix, Difan Deng, Marius Lindauer

    Abstract: Most machine learning algorithms are configured by one or several hyperparameters that must be carefully chosen and often considerably impact performance. To avoid a time consuming and unreproducible manual trial-and-error process to find well-performing hyperparameter configurations, various automatic hyperparameter optimization (HPO) methods, e.g., based on resampling error estimation for superv…

    Submitted 24 November, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

  50. Linear solvers for power grid optimization problems: a review of GPU-accelerated linear solvers

    Authors: Kasia Swirydowicz, Eric Darve, Wesley Jones, Jonathan Maack, Shaked Regev, Michael A. Saunders, Stephen J. Thomas, Slaven Peles

    Abstract: The linear equations that arise in interior methods for constrained optimization are sparse symmetric indefinite and become extremely ill-conditioned as the interior method converges. These linear systems present a challenge for existing solver frameworks based on sparse LU or LDL^T decompositions. We benchmark five well known direct linear solver packages using matrices extracted from power grid…

    Submitted 13 August, 2021; v1 submitted 25 June, 2021; originally announced June 2021.