HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map Construction

Zhou, Yi; Zhang, Hui; Yu, Jiaqian; Yang, Yifan; Jung, Sangil; Park, Seung-In; Yoo, ByungIn

arXiv:2403.08639 (cs)

[Submitted on 13 Mar 2024 (v1), last revised 26 Mar 2024 (this version, v2)]

Abstract: Vectorized High-Definition (HD) map construction requires predicting the category and point coordinates of map elements (e.g., road boundaries, lane dividers, and pedestrian crossings). State-of-the-art methods are mainly based on point-level representation learning for regressing accurate point coordinates. However, this pipeline has limitations in obtaining element-level information and in handling element-level failures, e.g., erroneous element shapes or entanglement between elements. To tackle these issues, we propose a simple yet effective HybrId framework named HIMap that learns both point-level and element-level information and lets the two interact. Concretely, we introduce a hybrid representation called HIQuery to represent all map elements, and propose a point-element interactor to interactively extract and encode the hybrid information of elements, e.g., point position and element shape, into the HIQuery. Additionally, we present a point-element consistency constraint to enhance the consistency between the point-level and element-level information. Finally, the output point-element integrated HIQuery can be directly converted into each map element's class, point coordinates, and mask. We conduct extensive experiments and consistently outperform previous methods on both the nuScenes and Argoverse2 datasets. Notably, our method achieves $77.8$ mAP on the nuScenes dataset, surpassing previous state-of-the-art methods by at least $8.3$ mAP.
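
As a rough illustration of the output format the abstract describes (a category plus point coordinates for each map element), a vectorized element could be held in a small structure like the one below. This is only a sketch: the `MapElement` class, its field names, and the `polyline_length` helper are hypothetical and are not taken from the paper or its code, and neither the mask output nor the HIQuery is modelled here.

```python
import math
from dataclasses import dataclass
from typing import List, Tuple


# Hypothetical container for one vectorized map element.
# Field names are illustrative assumptions, not the paper's API.
@dataclass
class MapElement:
    category: str                      # e.g. "lane_divider"
    points: List[Tuple[float, float]]  # ordered (x, y) point coordinates


def polyline_length(element: MapElement) -> float:
    """Sum the Euclidean lengths of consecutive segments.

    A simple example of an element-level quantity computed
    from point-level coordinates.
    """
    total = 0.0
    for (x0, y0), (x1, y1) in zip(element.points, element.points[1:]):
        total += math.hypot(x1 - x0, y1 - y0)
    return total


# Made-up coordinates, used only to exercise the structure.
divider = MapElement("lane_divider", [(0.0, 0.0), (3.0, 4.0), (3.0, 10.0)])
print(divider.category, len(divider.points), polyline_length(divider))
```

The point list carries the point-level information (individual positions), while anything computed over the whole list, such as the length above, is an element-level property; the abstract's hybrid representation is motivated by needing both views.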

Comments:	Accepted to CVPR 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.08639 [cs.CV]
	(or arXiv:2403.08639v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.08639

Submission history

From: Yi Zhou
[v1] Wed, 13 Mar 2024 15:51:23 UTC (16,543 KB)
[v2] Tue, 26 Mar 2024 15:40:20 UTC (16,510 KB)
