Zum Hauptinhalt springen

Showing 1–18 of 18 results for author: Mars, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.18078  [pdf, other

    cs.CL cs.AI

    PEFT-U: Parameter-Efficient Fine-Tuning for User Personalization

    Authors: Christopher Clarke, Yuzhao Heng, Lingjia Tang, Jason Mars

    Abstract: The recent emergence of Large Language Models (LLMs) has heralded a new era of human-AI interaction. These sophisticated models, exemplified by Chat-GPT and its successors, have exhibited remarkable capabilities in language understanding. However, as these LLMs have undergone exponential growth, a crucial dimension that remains understudied is the personalization of these models. Large foundation… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  2. arXiv:2407.12847  [pdf, other

    cs.CL cs.AI cs.HC

    Aligning Model Evaluations with Human Preferences: Mitigating Token Count Bias in Language Model Assessments

    Authors: Roland Daynauth, Jason Mars

    Abstract: The SLAM paper demonstrated that on-device Small Language Models (SLMs) are a viable and cost-effective alternative to API-based Large Language Models (LLMs), such as OpenAI's GPT-4, offering comparable performance and stability. However, SLAM also identified discrepancies between human preferences and traditional auto-evaluators. This follow-up paper explores methods to align LLM evaluator prefer… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  3. arXiv:2405.08965  [pdf, other

    cs.PL cs.AI

    LLMs are Meaning-Typed Code Constructs

    Authors: Jason Mars, Yiping Kang, Jayanaka Dantanarayana, Chandra Irugalbandara, Kugesan Sivasothynathan, Lingjia Tang

    Abstract: Programming with Generative AI (GenAI) models is a type of Neurosymbolic programming and has seen tremendous adoption across many domains. However, leveraging GenAI models in code today can be complex, counter-intuitive and often require specialized frameworks, leading to increased complexity. This is because it is currently unclear as to the right abstractions through which we should marry GenAI… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  4. arXiv:2405.03832  [pdf, other

    cs.CL cs.AI

    Guylingo: The Republic of Guyana Creole Corpora

    Authors: Christopher Clarke, Roland Daynauth, Charlene Wilkinson, Hubert Devonish, Jason Mars

    Abstract: While major languages often enjoy substantial attention and resources, the linguistic diversity across the globe encompasses a multitude of smaller, indigenous, and regional languages that lack the same level of computational support. One such region is the Caribbean. While commonly labeled as "English speaking", the ex-British Caribbean region consists of a myriad of Creole languages thriving alo… ▽ More

    Submitted 2 July, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted to NAACL 2024 Main Conference Special Theme Track: Languages of Latin America and The Caribbean

  5. arXiv:2401.07123  [pdf, other

    cs.HC cs.CL

    One Agent Too Many: User Perspectives on Approaches to Multi-agent Conversational AI

    Authors: Christopher Clarke, Karthik Krishnamurthy, Walter Talamonti, Yiping Kang, Lingjia Tang, Jason Mars

    Abstract: Conversational agents have been gaining increasing popularity in recent years. Influenced by the widespread adoption of task-oriented agents such as Apple Siri and Amazon Alexa, these agents are being deployed into various applications to enhance user experience. Although these agents promote "ask me anything" functionality, they are typically built to focus on a single or finite set of expertise.… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

  6. arXiv:2312.14972  [pdf, other

    cs.SE cs.AI cs.LG

    Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's LLM with Open Source SLMs in Production

    Authors: Chandra Irugalbandara, Ashish Mahendra, Roland Daynauth, Tharuka Kasthuri Arachchige, Jayanaka Dantanarayana, Krisztian Flautner, Lingjia Tang, Yiping Kang, Jason Mars

    Abstract: Many companies use large language models (LLMs) offered as a service, like OpenAI's GPT-4, to create AI-enabled product experiences. Along with the benefits of ease-of-use and shortened time-to-solution, this reliance on proprietary services has downsides in model control, performance reliability, uptime predictability, and cost. At the same time, a flurry of open-source small language models (SLM… ▽ More

    Submitted 16 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Updated title, Revised content

    Journal ref: ISPASS-2024: 2024 IEEE International Symposium on Performance Analysis of Systems and Software

  7. arXiv:2307.12935  [pdf, other

    cs.CL cs.AI

    Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection

    Authors: Christopher Clarke, Matthew Hall, Gaurav Mittal, Ye Yu, Sandra Sajeev, Jason Mars, Mei Chen

    Abstract: Classic approaches to content moderation typically apply a rule-based heuristic approach to flag content. While rules are easily customizable and intuitive for humans to interpret, they are inherently fragile and lack the flexibility or robustness needed to moderate the vast amount of undesirable content found online today. Recent advances in deep learning have demonstrated the promise of using hi… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: ACL 2023 Main Conference

  8. arXiv:2305.16521  [pdf, other

    cs.CL cs.LG

    Label Agnostic Pre-training for Zero-shot Text Classification

    Authors: Christopher Clarke, Yuzhao Heng, Yiping Kang, Krisztian Flautner, Lingjia Tang, Jason Mars

    Abstract: Conventional approaches to text classification typically assume the existence of a fixed set of predefined labels to which a given text can be classified. However, in real-world applications, there exists an infinite label space for describing a given text. In addition, depending on the aspect (sentiment, topic, etc.) and domain of the text (finance, legal, etc.), the interpretation of the label c… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  9. arXiv:2305.09864  [pdf, other

    cs.CL cs.DC cs.PL cs.SE

    The Jaseci Programming Paradigm and Runtime Stack: Building Scale-out Production Applications Easy and Fast

    Authors: Jason Mars, Yiping Kang, Roland Daynauth, Baichuan Li, Ashish Mahendra, Krisztian Flautner, Lingjia Tang

    Abstract: Today's production scale-out applications include many sub-application components, such as storage backends, logging infrastructure and AI models. These components have drastically different characteristics, are required to work in collaboration, and interface with each other as microservices. This leads to increasingly high complexity in developing, optimizing, configuring, and deploying scale-ou… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  10. arXiv:2206.08434  [pdf, other

    cs.DC cs.AI cs.AR cs.PL

    The Case for a Wholistic Serverless Programming Paradigm and Full Stack Automation for AI and Beyond -- The Philosophy of Jaseci and Jac

    Authors: Jason Mars

    Abstract: In this work, the case is made for a wholistic top-down re-envisioning of the system stack from the programming language level down through the system architecture to bridge this complexity gap. The key goal of our design is to address the critical need for the programmer to articulate solutions with higher level abstractions at the problem level while having the runtime system stack subsume and h… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  11. arXiv:2203.07665  [pdf, other

    cs.CL cs.AI cs.IR

    One Agent To Rule Them All: Towards Multi-agent Conversational AI

    Authors: Christopher Clarke, Joseph Joshua Peper, Karthik Krishnamurthy, Walter Talamonti, Kevin Leach, Walter Lasecki, Yiping Kang, Lingjia Tang, Jason Mars

    Abstract: The increasing volume of commercially available conversational agents (CAs) on the market has resulted in users being burdened with learning and adopting multiple agents to accomplish their tasks. Though prior work has explored supporting a multitude of domains within the design of a single agent, the interaction experience suffers due to the large action space of desired capabilities. To address… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  12. arXiv:2203.06668  [pdf, other

    cs.CL cs.AI

    Towards Personalized Intelligence at Scale

    Authors: Yiping Kang, Ashish Mahendra, Christopher Clarke, Lingjia Tang, Jason Mars

    Abstract: Personalized Intelligence (PI) is the problem of providing customized AI experiences tailored to each individual user. In many applications, PI is preferred or even required. Existing personalization approaches involve fine-tuning pre-trained models to create new customized models. However, these approaches require a significant amount of computation to train, scaling with model size and the numbe… ▽ More

    Submitted 13 March, 2022; originally announced March 2022.

  13. arXiv:2006.13378  [pdf, other

    cs.DC cs.GR

    A Benchmarking Framework for Interactive 3D Applications in the Cloud

    Authors: Tianyi Liu, Sen He, Sunzhou Huang, Danny Tsang, Lingjia Tang, Jason Mars, Wei Wang

    Abstract: With the growing popularity of cloud gaming and cloud virtual reality (VR), interactive 3D applications have become a major type of workloads for the cloud. However, despite their growing importance, there is limited public research on how to design cloud systems to efficiently support these applications, due to the lack of an open and reliable research infrastructure, including benchmarks and per… ▽ More

    Submitted 2 August, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

  14. arXiv:1909.02027  [pdf, other

    cs.CL cs.AI cs.LG

    An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction

    Authors: Stefan Larson, Anish Mahendran, Joseph J. Peper, Christopher Clarke, Andrew Lee, Parker Hill, Jonathan K. Kummerfeld, Kevin Leach, Michael A. Laurenzano, Lingjia Tang, Jason Mars

    Abstract: Task-oriented dialog systems need to know when a query falls outside their range of supported intents, but current text classification corpora only define label sets that cover every example. We introduce a new dataset that includes queries that are out-of-scope---i.e., queries that do not fall into any of the system's supported intents. This poses a new challenge because models cannot assume that… ▽ More

    Submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted to EMNLP-IJCNLP 2019

  15. arXiv:1904.03122  [pdf, other

    cs.CL

    Outlier Detection for Improved Data Quality and Diversity in Dialog Systems

    Authors: Stefan Larson, Anish Mahendran, Andrew Lee, Jonathan K. Kummerfeld, Parker Hill, Michael A. Laurenzano, Johann Hauswald, Lingjia Tang, Jason Mars

    Abstract: In a corpus of data, outliers are either errors: mistakes in the data that are counterproductive, or are unique: informative samples that improve model robustness. Identifying outliers can lead to better datasets by (1) removing noise in datasets and (2) guiding collection of additional data to fill gaps. However, the problem of detecting both outlier types has received relatively little attention… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: Accepted as long paper to NAACL 2019

  16. arXiv:1808.02513  [pdf, other

    cs.LG stat.ML

    Rethinking Numerical Representations for Deep Neural Networks

    Authors: Parker Hill, Babak Zamirai, Shengshuo Lu, Yu-Wei Chao, Michael Laurenzano, Mehrzad Samadi, Marios Papaefthymiou, Scott Mahlke, Thomas Wenisch, Jia Deng, Lingjia Tang, Jason Mars

    Abstract: With ever-increasing computational demand for deep learning, it is critical to investigate the implications of the numeric representation and precision of DNN model weights and activations on computational efficiency. In this work, we explore unconventional narrow-precision floating-point representations as it relates to inference accuracy and efficiency to steer the improved design of future DNN… ▽ More

    Submitted 7 August, 2018; originally announced August 2018.

  17. arXiv:1604.03450  [pdf, ps, other

    physics.data-an cs.IT

    A Noise-Robust Method with Smoothed \ell_1/\ell_2 Regularization for Sparse Moving-Source Mapping

    Authors: Mai Quyen Pham, Benoit Oudompheng, Jérôme I. Mars, Barbara Nicolas

    Abstract: The method described here performs blind deconvolution of the beamforming output in the frequency domain. To provide accurate blind deconvolution, sparsity priors are introduced with a smooth \ell_1/\ell_2 regularization term. As the mean of the noise in the power spectrum domain is dependent on its variance in the time domain, the proposed method includes a variance estimation step, which allows… ▽ More

    Submitted 1 April, 2016; originally announced April 2016.

  18. arXiv:1303.0742  [pdf, ps, other

    cs.LG q-bio.NC stat.ML

    Multivariate Temporal Dictionary Learning for EEG

    Authors: Quentin Barthélemy, Cédric Gouy-Pailler, Yoann Isaac, Antoine Souloumiac, Anthony Larue, Jérôme I. Mars

    Abstract: This article addresses the issue of representing electroencephalographic (EEG) signals in an efficient way. While classical approaches use a fixed Gabor dictionary to analyze EEG signals, this article proposes a data-driven method to obtain an adapted dictionary. To reach an efficient dictionary learning, appropriate spatial and temporal modeling is required. Inter-channels links are taken into ac… ▽ More

    Submitted 4 March, 2013; originally announced March 2013.

    Journal ref: Published in Journal of Neuroscience Methods, vol. 215, pp. 19-28, 2013