Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Ziletti, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.03010  [pdf, other

    cs.CL cs.IR

    Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs

    Authors: Daniel Steinigen, Roman Teucher, Timm Heine Ruland, Max Rudat, Nicolas Flores-Herr, Peter Fischer, Nikola Milosevic, Christopher Schymura, Angelo Ziletti

    Abstract: Recent advancements in Large Language Models (LLMs) have showcased their proficiency in answering natural language queries. However, their effectiveness is hindered by limited domain-specific knowledge, raising concerns about the reliability of their responses. We introduce a hybrid system that augments LLMs with domain-specific knowledge graphs (KGs), thereby aiming to enhance factual correctness… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: 10 pages, 7 figures

  2. arXiv:2403.09226  [pdf, other

    cs.CL

    Retrieval augmented text-to-SQL generation for epidemiological question answering using electronic health records

    Authors: Angelo Ziletti, Leonardo D'Ambrosi

    Abstract: Electronic health records (EHR) and claims data are rich sources of real-world data that reflect patient health status and healthcare utilization. Querying these databases to answer epidemiological questions is challenging due to the intricacy of medical terminology and the need for complex SQL queries. Here, we introduce an end-to-end methodology that combines text-to-SQL generation with retrieva… ▽ More

    Submitted 16 May, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 6 pages, 1 figure

    Journal ref: NAACL 2024 Clinical NLP Workshop

  3. Medical Coding with Biomedical Transformer Ensembles and Zero/Few-shot Learning

    Authors: Angelo Ziletti, Alan Akbik, Christoph Berns, Thomas Herold, Marion Legler, Martina Viell

    Abstract: Medical coding (MC) is an essential pre-requisite for reliable data retrieval and reporting. Given a free-text reported term (RT) such as "pain of right thigh to the knee", the task is to identify the matching lowest-level term (LLT) - in this case "unilateral leg pain" - from a very large and continuously growing repository of standardized medical terms. However, automating this task is challengi… ▽ More

    Submitted 1 May, 2022; originally announced June 2022.

    Comments: NAACL-HLT 2022 Industry Track

    Journal ref: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track

  4. arXiv:2109.01518  [pdf, other

    cs.LG cs.CL

    Biomedical Data-to-Text Generation via Fine-Tuning Transformers

    Authors: Ruslan Yermakov, Nicholas Drago, Angelo Ziletti

    Abstract: Data-to-text (D2T) generation in the biomedical domain is a promising - yet mostly unexplored - field of research. Here, we apply neural models for D2T generation to a real-world dataset consisting of package leaflets of European medicines. We show that fine-tuned transformers are able to generate realistic, multisentence text from data in the biomedical domain, yet have important limitations. We… ▽ More

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: Accepted at ACL-INGL2021 (International Conference on Natural Language Generation organised by the Association for Computational Linguistics)

  5. Discovering key topics from short, real-world medical inquiries via natural language processing and unsupervised learning

    Authors: Angelo Ziletti, Christoph Berns, Oliver Treichel, Thomas Weber, Jennifer Liang, Stephanie Kammerath, Marion Schwaerzler, Jagatheswari Virayah, David Ruau, Xin Ma, Andreas Mattern

    Abstract: Millions of unsolicited medical inquiries are received by pharmaceutical companies every year. It has been hypothesized that these inquiries represent a treasure trove of information, potentially giving insight into matters regarding medicinal products and the associated medical treatments. However, due to the large volume and specialized nature of the inquiries, it is difficult to perform timely,… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

    Journal ref: Front. Comput. Sci 88 (3) (2021)