Zum Hauptinhalt springen

Showing 1–50 of 258 results for author: Le, V

Searching in archive cs. Search in all archives.
.
  1. A Survey on Small-Scale Testbeds for Connected and Automated Vehicles and Robot Swarms

    Authors: Armin Mokhtarian, Jianye Xu, Patrick Scheffe, Maximilian Kloock, Simon Schäfer, Heeseung Bang, Viet-Anh Le, Sangeet Ulhas, Johannes Betz, Sean Wilson, Spring Berman, Liam Paull, Amanda Prorok, Bassam Alrifaee

    Abstract: Connected and automated vehicles and robot swarms hold transformative potential for enhancing safety, efficiency, and sustainability in the transportation and manufacturing sectors. Extensive testing and validation of these technologies is crucial for their deployment in the real world. While simulations are essential for initial testing, they often have limitations in capturing the complex dynami… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 16 pages, 11 figures, 1 table. This work has been submitted to the IEEE Robotics & Automation Magazine for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  2. Leveraging LLMs for the Quality Assurance of Software Requirements

    Authors: Sebastian Lubos, Alexander Felfernig, Thi Ngoc Trang Tran, Damian Garber, Merfat El Mansi, Seda Polat Erdeniz, Viet-Man Le

    Abstract: Successful software projects depend on the quality of software requirements. Creating high-quality requirements is a crucial step toward successful software development. Effective support in this area can significantly reduce development costs and enhance the software quality. In this paper, we introduce and assess the capabilities of a Large Language Model (LLM) to evaluate the quality characteri… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: Accepted for publication at the RE@Next! track of RE 2024

  3. arXiv:2408.08781  [pdf, other

    cs.AI cs.CL

    Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions

    Authors: Bhuvanashree Murugadoss, Christian Poelitz, Ian Drosos, Vu Le, Nick McKenna, Carina Suzana Negreanu, Chris Parnin, Advait Sarkar

    Abstract: LLMs-as-a-judge is a recently popularized method which replaces human judgements in task evaluation (Zheng et al. 2024) with automatic evaluation using LLMs. Due to widespread use of RLHF (Reinforcement Learning from Human Feedback), state-of-the-art LLMs like GPT4 and Llama3 are expected to have strong alignment with human preferences when prompted for a quality judgement, such as the coherence o… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  4. arXiv:2408.05865  [pdf, ps, other

    cs.CC cs.DM cs.DS

    The complexity of strong conflict-free vertex-connection $k$-colorability

    Authors: Sun-Yuan Hsieh, Hoang-Oanh Le, Van Bang Le, Sheng-Lung Peng

    Abstract: We study a new variant of graph coloring by adding a connectivity constraint. A path in a vertex-colored graph is called conflict-free if there is a color that appears exactly once on its vertices. A connected graph $G$ is said to be strongly conflict-free vertex-connection $k$-colorable if $G$ admits a vertex $k$-coloring such that any two distinct vertices of $G$ are connected by a conflict-free… ▽ More

    Submitted 14 August, 2024; v1 submitted 11 August, 2024; originally announced August 2024.

    Comments: The full version of a COCOON 2024 paper

  5. arXiv:2407.21787  [pdf, other

    cs.LG cs.AI

    Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

    Authors: Bradley Brown, Jordan Juravsky, Ryan Ehrlich, Ronald Clark, Quoc V. Le, Christopher Ré, Azalia Mirhoseini

    Abstract: Scaling the amount of compute used to train language models has dramatically improved their capabilities. However, when it comes to inference, we often limit the amount of compute to only one attempt per problem. Here, we explore inference compute as another axis for scaling by increasing the number of generated samples. Across multiple tasks and models, we observe that coverage - the fraction of… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

  6. arXiv:2407.17790  [pdf, other

    cs.LG cs.AR

    Exploring the Limitations of Kolmogorov-Arnold Networks in Classification: Insights to Software Training and Hardware Implementation

    Authors: Van Duy Tran, Tran Xuan Hieu Le, Thi Diem Tran, Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Tinh Nguyen, Yasuhiko Nakashima

    Abstract: Kolmogorov-Arnold Networks (KANs), a novel type of neural network, have recently gained popularity and attention due to the ability to substitute multi-layer perceptions (MLPs) in artificial intelligence (AI) with higher accuracy and interoperability. However, KAN assessment is still limited and cannot provide an in-depth analysis of a specific domain. Furthermore, no study has been conducted on t… ▽ More

    Submitted 25 July, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures, 2 tables

  7. arXiv:2407.10657  [pdf, other

    cs.CL cs.AI

    An Empirical Study of Validating Synthetic Data for Formula Generation

    Authors: Usneek Singh, José Cambronero, Sumit Gulwani, Aditya Kanade, Anirudh Khatry, Vu Le, Mukul Singh, Gust Verbruggen

    Abstract: Large language models (LLMs) can be leveraged to help with writing formulas in spreadsheets, but resources on these formulas are scarce, impacting both the base performance of pre-trained models and limiting the ability to fine-tune them. Given a corpus of formulas, we can use a(nother) model to generate synthetic natural language utterances for fine-tuning. However, it is important to validate wh… ▽ More

    Submitted 23 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  8. KAT: Dependency-aware Automated API Testing with Large Language Models

    Authors: Tri Le, Thien Tran, Duy Cao, Vy Le, Tien Nguyen, Vu Nguyen

    Abstract: API testing has increasing demands for software companies. Prior API testing tools were aware of certain types of dependencies that needed to be concise between operations and parameters. However, their approaches, which are mostly done manually or using heuristic-based algorithms, have limitations due to the complexity of these dependencies. In this paper, we present KAT (Katalon API Testing), a… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: ICST 2024

  9. arXiv:2407.02086  [pdf, ps, other

    cs.DS

    On polynomial kernelization for Stable Cutset

    Authors: Stefan Kratsch, Van Bang Le

    Abstract: A stable cutset in a graph $G$ is a set $S\subseteq V(G)$ such that vertices of $S$ are pairwise non-adjacent and such that $G-S$ is disconnected, i.e., it is both stable (or independent) set and a cutset (or separator). Unlike general cutsets, it is $NP$-complete to determine whether a given graph $G$ has any stable cutset. Recently, Rauch et al.\ [FCT 2023] gave a number of fixed-parameter tract… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: For Dieter Kratsch on his 65th birthday

  10. arXiv:2407.01983  [pdf, other

    cs.CV

    SADL: An Effective In-Context Learning Method for Compositional Visual QA

    Authors: Long Hoang Dang, Thao Minh Le, Vuong Le, Tu Minh Phuong, Truyen Tran

    Abstract: Large vision-language models (LVLMs) offer a novel capability for performing in-context learning (ICL) in Visual QA. When prompted with a few demonstrations of image-question-answer triplets, LVLMs have demonstrated the ability to discern underlying patterns and transfer this latent knowledge to answer new questions about unseen images without the need for expensive supervised fine-tuning. However… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  11. arXiv:2406.06156  [pdf, other

    cs.SE

    Stronger, Cheaper and Demonstration-Free Log Parsing with LLMs

    Authors: Yi Xiao, Van-Hoang Le, Hongyu Zhang

    Abstract: Log parsing, the process of converting raw log messages into structured formats, is an important initial step for automated analysis of logs of large-scale software systems. Traditional log parsers often rely on heuristics or handcrafted features, which may not generalize well across diverse log sources or require extensive model tuning. Recently, some log parsers have utilized powerful generative… ▽ More

    Submitted 12 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  12. arXiv:2406.04520  [pdf, other

    cs.CL cs.AI

    NATURAL PLAN: Benchmarking LLMs on Natural Language Planning

    Authors: Huaixiu Steven Zheng, Swaroop Mishra, Hugh Zhang, Xinyun Chen, Minmin Chen, Azade Nova, Le Hou, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou

    Abstract: We introduce NATURAL PLAN, a realistic planning benchmark in natural language containing 3 key tasks: Trip Planning, Meeting Planning, and Calendar Scheduling. We focus our evaluation on the planning capabilities of LLMs with full information on the task, by providing outputs from tools such as Google Flights, Google Maps, and Google Calendar as contexts to the models. This eliminates the need for… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  13. arXiv:2405.15130  [pdf, other

    cs.SE cs.CL cs.LG

    OptLLM: Optimal Assignment of Queries to Large Language Models

    Authors: Yueyue Liu, Hongyu Zhang, Yuantian Miao, Van-Hoang Le, Zhiqiang Li

    Abstract: Large Language Models (LLMs) have garnered considerable attention owing to their remarkable capabilities, leading to an increasing number of companies offering LLMs as services. Different LLMs achieve different performance at different costs. A challenge for users lies in choosing the LLMs that best fit their needs, balancing cost and performance. In this paper, we propose a framework for addressi… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: This paper is accepted by ICWS 2024

  14. arXiv:2405.07475  [pdf, other

    cs.HC

    How Non-native English Speakers Use, Assess, and Select AI-Generated Paraphrases with Information Aids

    Authors: Yewon Kim, Thanh-Long V. Le, Donghwi Kim, Mina Lee, Sung-Ju Lee

    Abstract: Non-native English speakers (NNESs) often face challenges in achieving fluency in their written English. AI paraphrasing tools have the potential to improve their writing by suggesting more fluent paraphrases to their original sentences. Yet, the effectiveness of these tools depends on the user's ability to accurately assess and select context-appropriate suggestions, which is a significant challe… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  15. arXiv:2405.03411  [pdf, other

    cs.RO

    Greedy Heuristics for Sampling-based Motion Planning in High-Dimensional State Spaces

    Authors: Phone Thiha Kyaw, Anh Vu Le, Lim Yi, Prabakaran Veerajagadheswar, Mohan Rajesh Elara, Dinh Tung Vo, Minh Bui Vu

    Abstract: Sampling-based motion planning algorithms are very effective at finding solutions in high-dimensional continuous state spaces as they do not require prior approximations of the problem domain compared to traditional discrete graph-based searches. The anytime version of the Rapidly-exploring Random Trees (RRT) algorithm, denoted as RRT*, often finds high-quality solutions by incrementally approxima… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: To be published at the International Journal of Robotics Research (IJRR)

  16. arXiv:2405.01556  [pdf, other

    cs.SE cs.AI cs.CL

    Semantically Aligned Question and Code Generation for Automated Insight Generation

    Authors: Ananya Singha, Bhavya Chopra, Anirudh Khatry, Sumit Gulwani, Austin Z. Henley, Vu Le, Chris Parnin, Mukul Singh, Gust Verbruggen

    Abstract: Automated insight generation is a common tactic for helping knowledge workers, such as data scientists, to quickly understand the potential value of new and unfamiliar data. Unfortunately, automated insights produced by large-language models can generate code that does not correctly correspond (or align) to the insight. In this paper, we leverage the semantic knowledge of large language models to… ▽ More

    Submitted 21 March, 2024; originally announced May 2024.

  17. arXiv:2405.00688  [pdf

    cs.RO cs.AI cs.CL cs.HC cs.LG

    Understanding Social Perception, Interactions, and Safety Aspects of Sidewalk Delivery Robots Using Sentiment Analysis

    Authors: Yuchen Du, Tho V. Le

    Abstract: This article presents a comprehensive sentiment analysis (SA) of comments on YouTube videos related to Sidewalk Delivery Robots (SDRs). We manually annotated the collected YouTube comments with three sentiment labels: negative (0), positive (1), and neutral (2). We then constructed models for text sentiment classification and tested the models' performance on both binary and ternary classification… ▽ More

    Submitted 9 March, 2024; originally announced May 2024.

    Comments: 34 pages, 7 figures, 2 tables

  18. arXiv:2403.18802  [pdf, other

    cs.CL cs.AI cs.LG

    Long-form factuality in large language models

    Authors: Jerry Wei, Chengrun Yang, Xinying Song, Yifeng Lu, Nathan Hu, Jie Huang, Dustin Tran, Daiyi Peng, Ruibo Liu, Da Huang, Cosmo Du, Quoc V. Le

    Abstract: Large language models (LLMs) often generate content that contains factual errors when responding to fact-seeking prompts on open-ended topics. To benchmark a model's long-form factuality in open domains, we first use GPT-4 to generate LongFact, a prompt set comprising thousands of questions spanning 38 topics. We then propose that LLM agents can be used as automated evaluators for long-form factua… ▽ More

    Submitted 3 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  19. arXiv:2403.06119  [pdf, other

    cs.CV

    CLEAR: Cross-Transformers with Pre-trained Language Model is All you need for Person Attribute Recognition and Retrieval

    Authors: Doanh C. Bui, Thinh V. Le, Ba Hung Ngo, Tae Jong Choi

    Abstract: Person attribute recognition and attribute-based retrieval are two core human-centric tasks. In the recognition task, the challenge is specifying attributes depending on a person's appearance, while the retrieval task involves searching for matching persons based on attribute queries. There is a significant relationship between recognition and retrieval tasks. In this study, we demonstrate that if… ▽ More

    Submitted 30 April, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  20. Assessing User Apprehensions About Mixed Reality Artifacts and Applications: The Mixed Reality Concerns (MRC) Questionnaire

    Authors: Christopher Katins, Paweł W. Woźniak, Aodi Chen, Ihsan Tumay, Luu Viet Trinh Le, John Uschold, Thomas Kosch

    Abstract: Current research in Mixed Reality (MR) presents a wide range of novel use cases for blending virtual elements with the real world. This yet-to-be-ubiquitous technology challenges how users currently work and interact with digital content. While offering many potential advantages, MR technologies introduce new security, safety, and privacy challenges. Thus, it is relevant to understand users' appre… ▽ More

    Submitted 5 April, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  21. arXiv:2402.11734  [pdf, other

    cs.PL cs.AI cs.SE

    Solving Data-centric Tasks using Large Language Models

    Authors: Shraddha Barke, Christian Poelitz, Carina Suzana Negreanu, Benjamin Zorn, José Cambronero, Andrew D. Gordon, Vu Le, Elnaz Nouri, Nadia Polikarpova, Advait Sarkar, Brian Slininger, Neil Toronto, Jack Williams

    Abstract: Large language models (LLMs) are rapidly replacing help forums like StackOverflow, and are especially helpful for non-professional programmers and end users. These users are often interested in data-centric tasks, such as spreadsheet manipulation and data wrangling, which are hard to solve if the intent is only communicated using a natural-language description, without including the data. But how… ▽ More

    Submitted 24 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Paper accepted to NAACL 2024 (Findings)

  22. arXiv:2402.04931  [pdf, other

    cs.DM cs.CC cs.DS math.CO

    Complexity of the (Connected) Cluster Vertex Deletion problem on $H$-free graphs

    Authors: Hoang-Oanh Le, Van Bang Le

    Abstract: The well-known Cluster Vertex Deletion problem (CVD) asks for a given graph $G$ and an integer $k$ whether it is possible to delete a set $S$ of at most $k$ vertices of $G$ such that the resulting graph $G-S$ is a cluster graph (a disjoint union of cliques). We give a complete characterization of graphs $H$ for which CVD on $H$-free graphs is polynomially solvable and for which it is NP-complete.… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Extended version of a MFCS 2022 paper. To appear in Theory of Computing Systems

  23. arXiv:2402.03620  [pdf, other

    cs.AI cs.CL

    Self-Discover: Large Language Models Self-Compose Reasoning Structures

    Authors: Pei Zhou, Jay Pujara, Xiang Ren, Xinyun Chen, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou, Swaroop Mishra, Huaixiu Steven Zheng

    Abstract: We introduce SELF-DISCOVER, a general framework for LLMs to self-discover the task-intrinsic reasoning structures to tackle complex reasoning problems that are challenging for typical prompting methods. Core to the framework is a self-discovery process where LLMs select multiple atomic reasoning modules such as critical thinking and step-by-step thinking, and compose them into an explicit reasonin… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 17 pages, 11 figures, 5 tables

  24. arXiv:2312.12960  [pdf, ps, other

    math.CO cs.CC cs.DM cs.DS

    Maximizing Matching Cuts

    Authors: Van Bang Le, Felicia Lucke, Daniël Paulusma, Bernard Ries

    Abstract: A matching cut in a graph G is an edge cut of G that is also a matching. This short survey gives an overview of old and new results and open problems for Maximum Matching Cut, which is to determine the size of a largest matching cut in a graph. We also compare this problem with the related problems Matching Cut, Minimum Matching Cut, and Perfect Matching Cut, which are to determine if a graph has… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  25. arXiv:2312.11524  [pdf, other

    cs.CL cs.AI cs.CV

    Assessing GPT4-V on Structured Reasoning Tasks

    Authors: Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Gust Verbruggen

    Abstract: Multi-modality promises to unlock further uses for large language models. Recently, the state-of-the-art language model GPT-4 was enhanced with vision capabilities. We carry out a prompting evaluation of GPT-4V and five other baselines on structured reasoning tasks, such as mathematical reasoning, visual data analysis, and code generation. We show that visual Chain-of-Thought, an extension of Chai… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 9 pages, 9 figures

  26. arXiv:2312.08472  [pdf, other

    cs.NE cs.LG math.NA

    AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions

    Authors: Esteban Real, Yao Chen, Mirko Rossini, Connal de Souza, Manav Garg, Akhil Verghese, Moritz Firsching, Quoc V. Le, Ekin Dogus Cubuk, David H. Park

    Abstract: Computers calculate transcendental functions by approximating them through the composition of a few limited-precision instructions. For example, an exponential can be calculated with a Taylor series. These approximation methods were developed over the centuries by mathematicians, who emphasized the attainability of arbitrary precision. Computers, however, operate on few limited precision types, su… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    ACM Class: I.2.2; I.2.6; G.1.2

  27. arXiv:2312.03785  [pdf, ps, other

    cs.IR cs.AI

    Sports Recommender Systems: Overview and Research Issues

    Authors: Alexander Felfernig, Manfred Wundara, Thi Ngoc Trang Tran, Viet-Man Le, Sebastian Lubos, Seda Polat-Erdeniz

    Abstract: Sports recommender systems receive an increasing attention due to their potential of fostering healthy living, improving personal well-being, and increasing performances in sport. These systems support people in sports, for example, by the recommendation of healthy and performance boosting food items, the recommendation of training practices, talent and team recommendation, and the recommendation… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Article under review in the Journal of Intelligent Information Systems (Springer JIIS)

    ACM Class: I.2; J.3

  28. arXiv:2311.17317  [pdf

    cs.SE

    Digital Twins for Logistics and Supply Chain Systems: Literature Review, Conceptual Framework, Research Potential, and Practical Challenges

    Authors: Tho V. Le, Ruoling Fan

    Abstract: To facilitate an effective, efficient, transparent, and timely decision-making process as well as to provide guidelines for industry planning and public policy development, a conceptual framework of digital twins (DTs) for logistics and supply chain systems (LSCS) is needed. This paper first introduces the background of the logistics and supply chain industry, the DT and its potential benefits, an… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 45 pages

  29. arXiv:2311.08502  [pdf, other

    quant-ph cs.LG math.OC

    Variational Quantum Eigensolver with Constraints (VQEC): Solving Constrained Optimization Problems via VQE

    Authors: Thinh Viet Le, Vassilis Kekatos

    Abstract: Variational quantum approaches have shown great promise in finding near-optimal solutions to computationally challenging tasks. Nonetheless, enforcing constraints in a disciplined fashion has been largely unexplored. To address this gap, this work proposes a hybrid quantum-classical algorithmic paradigm termed VQEC that extends the celebrated VQE to handle optimization with constraints. As with th… ▽ More

    Submitted 26 April, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 22 pages, 13 figures, 1 table

  30. arXiv:2311.02803  [pdf, other

    cs.CV

    Fast and Interpretable Face Identification for Out-Of-Distribution Data Using Vision Transformers

    Authors: Hai Phan, Cindy Le, Vu Le, Yihui He, Anh Totti Nguyen

    Abstract: Most face identification approaches employ a Siamese neural network to compare two images at the image embedding level. Yet, this technique can be subject to occlusion (e.g. faces with masks or sunglasses) and out-of-distribution data. DeepFace-EMD (Phan et al. 2022) reaches state-of-the-art accuracy on out-of-distribution data by first comparing two images at the image level, and then at the patc… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: 20 pages, 15 Figures

  31. arXiv:2310.17680   

    cs.SE cs.AI cs.CL cs.PL

    CodeFusion: A Pre-trained Diffusion Model for Code Generation

    Authors: Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

    Abstract: Imagine a developer who can only change their last line of code, how often would they have to start writing a function from scratch before it is correct? Auto-regressive models for code generation from natural language have a similar limitation: they do not easily allow reconsidering earlier tokens generated. We introduce CodeFusion, a pre-trained diffusion code generation model that addresses thi… ▽ More

    Submitted 1 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Contains inappropriately sourced conjecture of OpenAI's ChatGPT parameter count from www.forbes.com/sites/forbestechcouncil/2023/02/17/is-bigger-better-why-the-chatgpt-vs-gpt-3-vs-gpt-4-battle-is-just-a-family-chat, a citation which was omitted. The authors do not have direct knowledge or verification of this information, and relied solely on this article, which may lead to public confusion

  32. arXiv:2310.17475  [pdf

    cs.CY

    Analytical model for large-scale design of sidewalk delivery robot systems

    Authors: Hai Yang, Yuchen Du, Tho V. Le, Joseph Y. J. Chow

    Abstract: With the rise in demand for local deliveries and e-commerce, robotic deliveries are being considered as efficient and sustainable solutions. However, the deployment of such systems can be highly complex due to numerous factors involving stochastic demand, stochastic charging and maintenance needs, complex routing, etc. We propose a model that uses continuous approximation methods for evaluating se… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  33. arXiv:2310.17306   

    cs.AI cs.CL cs.DB cs.PL

    FormaT5: Abstention and Examples for Conditional Table Formatting with Natural Language

    Authors: Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Elnaz Nouri, Mohammad Raza, Gust Verbruggen

    Abstract: Formatting is an important property in tables for visualization, presentation, and analysis. Spreadsheet software allows users to automatically format their tables by writing data-dependent conditional formatting (CF) rules. Writing such rules is often challenging for users as it requires them to understand and implement the underlying logic. We present FormaT5, a transformer-based model that can… ▽ More

    Submitted 1 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Contains inappropriately sourced conjecture of OpenAI's ChatGPT parameter count from www.forbes.com/sites/forbestechcouncil/2023/02/17/is-bigger-better-why-the-chatgpt-vs-gpt-3-vs-gpt-4-battle-is-just-a-family-chat, a citation which was omitted. The authors do not have direct knowledge or verification of this information, and relied solely on this article, which may lead to public confusion

  34. arXiv:2310.17228  [pdf, other

    cs.AI cs.CL cs.SE

    TST$^\mathrm{R}$: Target Similarity Tuning Meets the Real World

    Authors: Anirudh Khatry, Sumit Gulwani, Priyanshu Gupta, Vu Le, Ananya Singha, Mukul Singh, Gust Verbruggen

    Abstract: Target similarity tuning (TST) is a method of selecting relevant examples in natural language (NL) to code generation through large language models (LLMs) to improve performance. Its goal is to adapt a sentence embedding model to have the similarity between two NL inputs match the similarity between their associated code outputs. In this paper, we propose different methods to apply and improve TST… ▽ More

    Submitted 28 October, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted for EMNLP-Findings, 2023

  35. arXiv:2310.10358  [pdf, other

    cs.CL cs.AI

    Tabular Representation, Noisy Operators, and Impacts on Table Structure Understanding Tasks in LLMs

    Authors: Ananya Singha, José Cambronero, Sumit Gulwani, Vu Le, Chris Parnin

    Abstract: Large language models (LLMs) are increasingly applied for tabular tasks using in-context learning. The prompt representation for a table may play a role in the LLMs ability to process the table. Inspired by prior work, we generate a collection of self-supervised structural tasks (e.g. navigate to a cell and row; transpose the table) and evaluate the performance differences when using 8 formats. In… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  36. arXiv:2310.06964  [pdf, other

    cs.RO cs.MA

    Multi-Robot Cooperative Navigation in Crowds: A Game-Theoretic Learning-Based Model Predictive Control Approach

    Authors: Viet-Anh Le, Vaishnav Tadiparthi, Behdad Chalaki, Hossein Nourkhiz Mahjoub, Jovin D'sa, Ehsan Moradi-Pari, Andreas A. Malikopoulos

    Abstract: In this paper, we develop a control framework for the coordination of multiple robots as they navigate through crowded environments. Our framework comprises of a local model predictive control (MPC) for each robot and a social long short-term memory model that forecasts pedestrians' trajectories. We formulate the local MPC formulation for each individual robot that includes both individual and sha… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  37. arXiv:2310.06117  [pdf, other

    cs.LG cs.AI cs.CL

    Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models

    Authors: Huaixiu Steven Zheng, Swaroop Mishra, Xinyun Chen, Heng-Tze Cheng, Ed H. Chi, Quoc V Le, Denny Zhou

    Abstract: We present Step-Back Prompting, a simple prompting technique that enables LLMs to do abstractions to derive high-level concepts and first principles from instances containing specific details. Using the concepts and principles to guide reasoning, LLMs significantly improve their abilities in following a correct reasoning path towards the solution. We conduct experiments of Step-Back Prompting with… ▽ More

    Submitted 12 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  38. arXiv:2310.02658  [pdf, other

    cs.AI

    Solving Multi-Configuration Problems: A Performance Analysis with Choco Solver

    Authors: Benjamin Ritz, Alexander Felfernig, Viet-Man Le, Sebastian Lubos

    Abstract: In many scenarios, configurators support the configuration of a solution that satisfies the preferences of a single user. The concept of \emph{multi-configuration} is based on the idea of configuring a set of configurations. Such a functionality is relevant in scenarios such as the configuration of personalized exams, the configuration of project teams, and the configuration of different trips for… ▽ More

    Submitted 19 October, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: The paper was presented at ConfWS'23: 25th International Workshop on Configuration, September 6-7, 2023, Málaga, Spain and is published in the conference proceedings: https://ceur-ws.org/Vol-3509/

  39. arXiv:2309.16838  [pdf, other

    cs.RO cs.MA

    Social Navigation in Crowded Environments with Model Predictive Control and Deep Learning-Based Human Trajectory Prediction

    Authors: Viet-Anh Le, Behdad Chalaki, Vaishnav Tadiparthi, Hossein Nourkhiz Mahjoub, Jovin D'sa, Ehsan Moradi-Pari

    Abstract: Crowd navigation has received increasing attention from researchers over the last few decades, resulting in the emergence of numerous approaches aimed at addressing this problem to date. Our proposed approach couples agent motion prediction and planning to avoid the freezing robot problem while simultaneously capturing multi-agent social interactions by utilizing a state-of-the-art trajectory pred… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 7 pages, 3 figures, 6 tables

  40. arXiv:2309.09479  [pdf, other

    cs.SE

    LogShrink: Effective Log Compression by Leveraging Commonality and Variability of Log Data

    Authors: Xiaoyun Li, Hongyu Zhang, Van-Hoang Le, Pengfei Chen

    Abstract: Log data is a crucial resource for recording system events and states during system execution. However, as systems grow in scale, log data generation has become increasingly explosive, leading to an expensive overhead on log storage, such as several petabytes per day in production. To address this issue, log compression has become a crucial task in reducing disk storage while allowing for further… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted by ICSE 2024 Research Track

  41. arXiv:2309.03409  [pdf, other

    cs.LG cs.AI cs.CL

    Large Language Models as Optimizers

    Authors: Chengrun Yang, Xuezhi Wang, Yifeng Lu, Hanxiao Liu, Quoc V. Le, Denny Zhou, Xinyun Chen

    Abstract: Optimization is ubiquitous. While derivative-based algorithms have been powerful tools for various problems, the absence of gradient imposes challenges on many real-world applications. In this work, we propose Optimization by PROmpting (OPRO), a simple and effective approach to leverage large language models (LLMs) as optimizers, where the optimization task is described in natural language. In eac… ▽ More

    Submitted 15 April, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: ICLR 2024; 42 pages, 26 figures, 15 tables. Code at https://github.com/google-deepmind/opro

  42. arXiv:2308.10922  [pdf, other

    cs.DB cs.AI

    DataVinci: Learning Syntactic and Semantic String Repairs

    Authors: Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

    Abstract: String data is common in real-world datasets: 67.6% of values in a sample of 1.8 million real Excel spreadsheets from the web were represented as text. Systems that successfully clean such string data can have a significant impact on real users. While prior work has explored errors in string data, proposed approaches have often been limited to error detection or require that the user provide annot… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 13 pages

  43. arXiv:2308.10756  [pdf, ps, other

    math.CO cs.DM

    Computing Optimal Leaf Roots of Chordal Cographs in Linear Time

    Authors: Van Bang Le, Christian Rosenke

    Abstract: A graph G is a k-leaf power, for an integer k >= 2, if there is a tree T with leaf set V(G) such that, for all vertices x, y in V(G), the edge xy exists in G if and only if the distance between x and y in T is at most k. Such a tree T is called a k-leaf root of G. The computational problem of constructing a k-leaf root for a given graph G and an integer k, if any, is motivated by the challenge fro… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 22 pages, 2 figures, full version of the FCT 2023 paper

    MSC Class: 05C85 ACM Class: F.2.2

  44. Towards a Formally Verified Security Monitor for VM-based Confidential Computing

    Authors: Wojciech Ozga, Guerney D. H. Hunt, Michael V. Le, Elaine R. Palmer, Avraham Shinnar

    Abstract: Confidential computing is a key technology for isolating high-assurance applications from the large amounts of untrusted code typical in modern systems. Existing confidential computing systems cannot be certified for use in critical applications, like systems controlling critical infrastructure, hardware security modules, or aircraft, as they lack formal verification. This paper presents an appr… ▽ More

    Submitted 1 October, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

    Journal ref: HASP '23: Proceedings of the 12th International Workshop on Hardware and Architectural Support for Security and Privacy, October 2023

  45. arXiv:2308.07357  [pdf, other

    cs.SE cs.AI cs.DB

    Demonstration of CORNET: A System For Learning Spreadsheet Formatting Rules By Example

    Authors: Mukul Singh, Jose Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

    Abstract: Data management and analysis tasks are often carried out using spreadsheet software. A popular feature in most spreadsheet platforms is the ability to define data-dependent formatting rules. These rules can express actions such as "color red all entries in a column that are negative" or "bold all rows not containing error or failure." Unfortunately, users who want to exercise this functionality ne… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 4 Pages, VLDB 2023 Demonstration Track

  46. arXiv:2308.06539  [pdf, other

    cs.IT eess.SP

    Phase Shift Design for RIS-Aided Cell-Free Massive MIMO with Improved Differential Evolution

    Authors: Trinh Van Chien, Cuong V. Le, Huynh Thi Thanh Binh, Hien Quoc Ngo, Symeon Chatzinotas

    Abstract: This paper proposes a novel phase shift design for cell-free massive multiple-input and multiple-output (MIMO) systems assisted by reconfigurable intelligent surface (RIS), which only utilizes channel statistics to achieve the uplink sum ergodic throughput maximization under spatial channel correlations. Due to the non-convexity and the scale of the derived optimization problem, we develop an impr… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: 5 pages, 2 figures. Accepted by IEEE WCL

  47. arXiv:2308.03958  [pdf, other

    cs.CL

    Simple synthetic data reduces sycophancy in large language models

    Authors: Jerry Wei, Da Huang, Yifeng Lu, Denny Zhou, Quoc V. Le

    Abstract: Sycophancy is an undesirable behavior where models tailor their responses to follow a human user's view even when that view is not objectively correct (e.g., adapting liberal views once a user reveals that they are liberal). In this paper, we study the prevalence of sycophancy in language models and propose a simple synthetic-data intervention to reduce this behavior. First, on a set of three sy… ▽ More

    Submitted 14 February, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

  48. arXiv:2308.03290  [pdf, other

    cs.CV cs.LG

    FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search

    Authors: Jordan Dotzel, Gang Wu, Andrew Li, Muhammad Umar, Yun Ni, Mohamed S. Abdelfattah, Zhiru Zhang, Liqun Cheng, Martin G. Dixon, Norman P. Jouppi, Quoc V. Le, Sheng Li

    Abstract: Quantization has become a mainstream compression technique for reducing model size, computational requirements, and energy consumption for modern deep neural networks (DNNs). With improved numerical support in recent hardware, including multiple variants of integer and floating point, mixed-precision quantization has become necessary to achieve high-quality results with low model cost. Prior mixed… ▽ More

    Submitted 1 May, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: Accepted to AutoML 2024

  49. arXiv:2308.03139  [pdf, other

    cs.CV math.OC

    Unfolded proximal neural networks for robust image Gaussian denoising

    Authors: Hoang Trieu Vy Le, Audrey Repetti, Nelly Pustelnik

    Abstract: A common approach to solve inverse imaging problems relies on finding a maximum a posteriori (MAP) estimate of the original unknown image, by solving a minimization problem. In thiscontext, iterative proximal algorithms are widely used, enabling to handle non-smooth functions and linear operators. Recently, these algorithms have been paired with deep learning strategies, to further improve the est… ▽ More

    Submitted 21 August, 2024; v1 submitted 6 August, 2023; originally announced August 2023.

  50. arXiv:2307.12729  [pdf, ps, other

    cs.CV

    Persistent-Transient Duality: A Multi-mechanism Approach for Modeling Human-Object Interaction

    Authors: Hung Tran, Vuong Le, Svetha Venkatesh, Truyen Tran

    Abstract: Humans are highly adaptable, swiftly switching between different modes to progressively handle different tasks, situations and contexts. In Human-object interaction (HOI) activities, these modes can be attributed to two mechanisms: (1) the large-scale consistent plan for the whole activity and (2) the small-scale children interactive actions that start and end along the timeline. While neuroscienc… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted at ICCV 2023