
Showing 1–1 of 1 results for author: Talwarr, A S

  1. arXiv:2407.10380  [pdf, other]

    cs.CV cs.AI cs.CL cs.IR

    NTSEBENCH: Cognitive Reasoning Benchmark for Vision Language Models

    Authors: Pranshu Pandya, Agney S Talwarr, Vatsal Gupta, Tushar Kataria, Vivek Gupta, Dan Roth

    Abstract: Cognitive textual and visual reasoning tasks, such as puzzles, series, and analogies, demand the ability to quickly reason, decipher, and evaluate patterns both textually and spatially. While LLMs and VLMs, through extensive training on large amounts of human-curated data, have attained a high level of pseudo-human intelligence in some common sense reasoning tasks, they still struggle with more co…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 15 pages, 2 figures, 5 tables