Showing 1–1 of 1 results for author: Aravinthan, A

Searching in archive cs.
  1. arXiv:2312.07532  [pdf, other]

    cs.CV cs.AI cs.CL

    Interfacing Foundation Models' Embeddings

    Authors: Xueyan Zou, Linjie Li, Jianfeng Wang, Jianwei Yang, Mingyu Ding, Junyi Wei, Zhengyuan Yang, Feng Li, Hao Zhang, Shilong Liu, Arul Aravinthan, Yong Jae Lee, Lijuan Wang

    Abstract: Foundation models possess strong capabilities in reasoning and memorizing across modalities. To further unleash the power of foundation models, we present FIND, a generalized interface for aligning foundation models' embeddings with unified image and dataset-level understanding spanning modality and granularity. As shown in the teaser figure, a lightweight transformer interface without tuning any…

    Submitted 15 July, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: CODE: https://github.com/UX-Decoder/FIND