Showing 1–2 of 2 results for author: Sadrieh, A

  1. arXiv:2409.04114  [pdf, other]

    cs.CL cs.AI

    Multi-Programming Language Ensemble for Code Generation in Large Language Model

    Authors: Tengfei Xue, Xuefeng Li, Tahir Azim, Roman Smirnov, Jianhui Yu, Arash Sadrieh, Babak Pahlavan

    Abstract: Large language models (LLMs) have significantly improved code generation, particularly in one-pass code generation. However, most existing approaches focus solely on generating code in a single programming language, overlooking the potential of leveraging the multi-language capabilities of LLMs. LLMs have varying patterns of errors across different languages, suggesting that a more robust approach…

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: Code available at https://github.com/NinjaTech-AI/MPLE

  2. arXiv:2407.12057  [pdf, other]

    cs.CL cs.AI

    NinjaLLM: Fast, Scalable and Cost-effective RAG using Amazon SageMaker and AWS Trainium and Inferentia2

    Authors: Tengfei Xue, Xuefeng Li, Roman Smirnov, Tahir Azim, Arash Sadrieh, Babak Pahlavan

    Abstract: Retrieval-augmented generation (RAG) techniques are widely used today to retrieve and present information in a conversational format. This paper presents a set of enhancements to traditional RAG techniques, focusing on large language models (LLMs) fine-tuned and hosted on AWS Trainium and Inferentia2 AI chips via SageMaker. These chips are characterized by their elasticity, affordability, and effi…

    Submitted 11 July, 2024; originally announced July 2024.

    ACM Class: I.2.7