Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Feinglass, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.15961  [pdf, other

    cs.CV

    Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images

    Authors: Yiran Luo, Joshua Feinglass, Tejas Gokhale, Kuan-Cheng Lee, Chitta Baral, Yezhou Yang

    Abstract: Domain Generalization (DG) is a challenging task in machine learning that requires a coherent ability to comprehend shifts across various domains through extraction of domain-invariant features. DG performance is typically evaluated by performing image classification in domains of various image styles. However, current methodology lacks quantitative understanding about shifts in stylistic domain,… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted at the 3rd CVPR Workshop on Vision Datasets Understanding

  2. arXiv:2404.08761  [pdf, ps, other

    cs.CV cs.LG

    `Eyes of a Hawk and Ears of a Fox': Part Prototype Network for Generalized Zero-Shot Learning

    Authors: Joshua Feinglass, Jayaraman J. Thiagarajan, Rushil Anirudh, T. S. Jayram, Yezhou Yang

    Abstract: Current approaches in Generalized Zero-Shot Learning (GZSL) are built upon base models which consider only a single class attribute vector representation over the entire image. This is an oversimplification of the process of novel category recognition, where different regions of the image may have properties from different seen classes and thus have different predominant attributes. With this in m… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted to the CVPR 2024 LIMIT Workshop

  3. arXiv:2309.00215  [pdf, other

    cs.CV cs.CL

    Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding

    Authors: Joshua Feinglass, Yezhou Yang

    Abstract: Object proposal generation serves as a standard pre-processing step in Vision-Language (VL) tasks (image captioning, visual question answering, etc.). The performance of object proposals generated for VL tasks is currently evaluated across all available annotations, a protocol that we show is misaligned - higher scores do not necessarily correspond to improved performance on downstream VL tasks. O… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: Accepted to WACV 2024 (Round 1)

  4. arXiv:2106.01444  [pdf, ps, other

    cs.CL cs.CV

    SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation via Typicality Analysis

    Authors: Joshua Feinglass, Yezhou Yang

    Abstract: The open-ended nature of visual captioning makes it a challenging area for evaluation. The majority of proposed models rely on specialized training to improve human-correlation, resulting in limited adoption, generalizability, and explainabilty. We introduce "typicality", a new formulation of evaluation rooted in information theory, which is uniquely suited for problems lacking a definite ground t… ▽ More

    Submitted 7 January, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted to and presented at ACL 2021 Main Conference (Oral)