
Showing 1–3 of 3 results for author: Krema, M

Searching in archive cs.
  1. arXiv:2408.05365  [pdf, other]

    cs.CL cs.AI cs.CE

    FiST-Financial Style Transfer with Hallucination and Creativity Control Framework

    Authors: Sohini Roychowdhury, Marko Krema, Brian Moore, Xingjian Lai, Dike Effedua, Bharat Jethwani

    Abstract: Financial report generation using general-purpose large language models poses two major challenges: the lack of compound sentences and hallucinations. Advanced prompt engineering and retrieval-augmented generation (RAG) techniques are incapable of curing the writing style discrepancies. In this work we propose a novel two-stage fine-tuning process wherein public domain financial reports a…

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 8 pages, 13 figures, 5 tables, conference

  2. arXiv:2405.03963  [pdf, other]

    cs.AI cs.LG

    ERATTA: Extreme RAG for Table To Answers with Large Language Models

    Authors: Sohini Roychowdhury, Marko Krema, Anvar Mahammad, Brian Moore, Arijit Mukherjee, Punit Prakashchandra

    Abstract: Large language models (LLMs) with retrieval-augmented generation (RAG) have been the optimal choice for scalable generative AI solutions in the recent past. However, the choice of use-cases that incorporate RAG with LLMs has been either generic or extremely domain specific, thereby questioning the scalability and generalizability of RAG-LLM approaches. In this work, we propose a unique LLM-based…

    Submitted 14 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 5 pages, 3 tables, Asilomar SSC Conference, 2024

  3. arXiv:2311.07592  [pdf, other]

    cs.CL cs.AI cs.IR

    Hallucination-minimized Data-to-answer Framework for Financial Decision-makers

    Authors: Sohini Roychowdhury, Andres Alvarez, Brian Moore, Marko Krema, Maria Paz Gelpi, Federico Martin Rodriguez, Angel Rodriguez, Jose Ramon Cabrejas, Pablo Martinez Serrano, Punit Agrawal, Arijit Mukherjee

    Abstract: Large Language Models (LLMs) have been applied to build several automation and personalized question-answering prototypes so far. However, scaling such prototypes to robust products with minimized hallucinations or fake responses remains an open challenge, especially in niche data-table-heavy domains such as financial decision making. In this work, we present a novel Langchain-based framewor…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 11 pages, 5 figures, 4 tables