Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Priyadarshan, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.02251  [pdf, other

    cs.CV cs.RO

    Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving

    Authors: Tushar Choudhary, Vikrant Dewangan, Shivam Chandhok, Shubham Priyadarshan, Anushka Jain, Arun K. Singh, Siddharth Srivastava, Krishna Murthy Jatavallabhula, K. Madhava Krishna

    Abstract: Talk2BEV is a large vision-language model (LVLM) interface for bird's-eye view (BEV) maps in autonomous driving contexts. While existing perception systems for autonomous driving scenarios have largely focused on a pre-defined (closed) set of object categories and driving scenarios, Talk2BEV blends recent advances in general-purpose language and vision models with BEV-structured map representation… ▽ More

    Submitted 14 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Project page at https://llmbev.github.io/talk2bev/