Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Mishra, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.09869  [pdf, other

    cs.CL cs.CV cs.SE

    Docling Technical Report

    Authors: Christoph Auer, Maksym Lysak, Ahmed Nassar, Michele Dolfi, Nikolaos Livathinos, Panos Vagenas, Cesar Berrospi Ramis, Matteo Omenetti, Fabian Lindlbauer, Kasper Dinkla, Lokesh Mishra, Yusik Kim, Shubham Gupta, Rafael Teixeira de Lima, Valery Weber, Lucas Morin, Ingmar Meijer, Viktor Kuropiatnyk, Peter W. J. Staar

    Abstract: This technical report introduces Docling, an easy to use, self-contained, MIT-licensed open-source package for PDF document conversion. It is powered by state-of-the-art specialized AI models for layout analysis (DocLayNet) and table structure recognition (TableFormer), and runs efficiently on commodity hardware in a small resource budget. The code interface allows for easy extensibility and addit… ▽ More

    Submitted 30 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  2. arXiv:2408.04661  [pdf, other

    cs.CL cond-mat.mtrl-sci

    MaterioMiner -- An ontology-based text mining dataset for extraction of process-structure-property entities

    Authors: Ali Riza Durmaz, Akhil Thomas, Lokesh Mishra, Rachana Niranjan Murthy, Thomas Straub

    Abstract: While large language models learn sound statistical representations of the language and information therein, ontologies are symbolic knowledge representations that can complement the former ideally. Research at this critical intersection relies on datasets that intertwine ontologies and text corpora to enable training and comprehensive benchmarking of neurosymbolic models. We present the MaterioMi… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  3. arXiv:2406.19102  [pdf, other

    cs.CL cs.AI cs.IR

    Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs

    Authors: Lokesh Mishra, Sohayl Dhibi, Yusik Kim, Cesar Berrospi Ramis, Shubham Gupta, Michele Dolfi, Peter Staar

    Abstract: Environment, Social, and Governance (ESG) KPIs assess an organization's performance on issues such as climate change, greenhouse gas emissions, water consumption, waste management, human rights, diversity, and policies. ESG reports convey this valuable quantitative information through tables. Unfortunately, extracting this information is difficult due to high variability in the table structure as… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted at the NLP4Climate workshop in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

  4. ESG Accountability Made Easy: DocQA at Your Service

    Authors: Lokesh Mishra, Cesar Berrospi, Kasper Dinkla, Diego Antognini, Francesco Fusco, Benedikt Bothur, Maksym Lysak, Nikolaos Livathinos, Ahmed Nassar, Panagiotis Vagenas, Lucas Morin, Christoph Auer, Michele Dolfi, Peter Staar

    Abstract: We present Deep Search DocQA. This application enables information extraction from documents via a question-answering conversational assistant. The system integrates several technologies from different AI disciplines consisting of document conversion to machine-readable format (via computer vision), finding relevant data (via natural language processing), and formulating an eloquent response (via… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted at the Demonstration Track of the 38th Annual AAAI Conference on Artificial Intelligence (AAAI 24)

    Journal ref: AAAI 2024, 38, 23814-23816

  5. arXiv:2212.02794  [pdf

    cs.CV

    Hybrid Model using Feature Extraction and Non-linear SVM for Brain Tumor Classification

    Authors: Lalita Mishra, Shekhar Verma, Shirshu Varma

    Abstract: It is essential to classify brain tumors from magnetic resonance imaging (MRI) accurately for better and timely treatment of the patients. In this paper, we propose a hybrid model, using VGG along with Nonlinear-SVM (Soft and Hard) to classify the brain tumors: glioma and pituitary and tumorous and non-tumorous. The VGG-SVM model is trained for two different datasets of two classes; thus, we perfo… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.