Showing 1–4 of 4 results for author: AlShikh, W

Searching in archive cs.
  1. arXiv:2408.14906  [pdf, other]

    cs.CL cs.IR

    Writing in the Margins: Better Inference Pattern for Long Context Retrieval

    Authors: Melisa Russak, Umar Jamil, Christopher Bryant, Kiran Kamble, Axel Magnuson, Mateusz Russak, Waseem AlShikh

    Abstract: In this paper, we introduce Writing in the Margins (WiM), a new inference pattern for Large Language Models designed to optimize the handling of long input sequences in retrieval-oriented tasks. This approach leverages the chunked prefill of the key-value cache to perform segment-wise inference, which enables efficient processing of extensive contexts along with the generation and classification o…

    Submitted 27 August, 2024; originally announced August 2024.

  2. arXiv:2405.02048  [pdf, ps, other]

    cs.IR cs.AI

    Comparative Analysis of Retrieval Systems in the Real World

    Authors: Dmytro Mozolevskyi, Waseem AlShikh

    Abstract: This research paper presents a comprehensive analysis of integrating advanced language models with search and retrieval systems in the fields of information retrieval and natural language processing. The objective is to evaluate and compare various state-of-the-art methods based on their performance in terms of accuracy and efficiency. The analysis explores different combinations of technologies,…

    Submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2402.17553  [pdf, other]

    cs.AI cs.CL cs.CV cs.HC

    OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

    Authors: Raghav Kapoor, Yash Parag Butala, Melisa Russak, Jing Yu Koh, Kiran Kamble, Waseem Alshikh, Ruslan Salakhutdinov

    Abstract: For decades, human-computer interaction has fundamentally been manual. Even today, almost all productive work done on the computer necessitates human input at every step. Autonomous virtual agents represent an exciting step in automating many of these menial tasks. Virtual agents would empower users with limited technical proficiency to harness the full possibilities of computer systems. They coul…

    Submitted 21 July, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  4. arXiv:2307.03692  [pdf, other]

    cs.CL cs.AI

    Becoming self-instruct: introducing early stopping criteria for minimal instruct tuning

    Authors: Waseem AlShikh, Manhal Daaboul, Kirk Goddard, Brock Imel, Kiran Kamble, Parikshith Kulkarni, Melisa Russak

    Abstract: In this paper, we introduce the Instruction Following Score (IFS), a metric that detects language models' ability to follow instructions. The metric has a dual purpose. First, IFS can be used to distinguish between base and instruct models. We benchmark publicly available base and instruct models, and show that the ratio of well formatted responses to partial and full sentences can be an effective…

    Submitted 5 July, 2023; originally announced July 2023.