A Neural Network Architecture for Accurate 4D Vehicle Pose Estimation from Monocular Images with Uncertainty Assessment

Tomasz Nowak; Piotr Skrzypczyński

doi:10.1007/978-981-99-8132-8_30

System Informacji Naukowej Politechniki Poznańskiej

PL EN

Strona główna / Publikacje / A Neural Network Architecture for Accurate 4D Vehicle Pose Estimation from Monocular Images with Uncertainty Assessment

Zgłoś uwagę

Rozdział

Pobierz BibTeX

Tytuł

A Neural Network Architecture for Accurate 4D Vehicle Pose Estimation from Monocular Images with Uncertainty Assessment

Autorzy

Tomasz Nowak (WARiE) ^{[ 1 ][ 2.2 ][ SzD ]}
Piotr Skrzypczyński (WARiE) ^{[ 2 ][ 2.2 ][ P ]}

^{[ 1 ]} Wydział Automatyki, Robotyki i Elektrotechniki, Politechnika Poznańska | ^{[ 2 ]} Instytut Robotyki i Inteligencji Maszynowej, Wydział Automatyki, Robotyki i Elektrotechniki, Politechnika Poznańska | ^{[ SzD ]} doktorant ze Szkoły Doktorskiej | ^{[ P ]} pracownik

Dyscyplina naukowa (Ustawa 2.0)

[2.2] Automatyka, elektronika, elektrotechnika i technologie kosmiczne

Rok publikacji

2024

Typ rozdziału

rozdział w monografii naukowej / referat

Język publikacji

angielski

Słowa kluczowe

EN

Vehicle pose estimation
3D scene understanding
Deep learning

Streszczenie

EN This paper proposes a new neural network architecture for estimating the four degrees of freedom poses of vehicles from monocular images in an uncontrolled environment. The neural network learns how to reconstruct 3D characteristic points of vehicles from image crops and coordinates of 2D keypoints estimated from these images. The 3D and 2D points are used to compute the vehicle pose solving the Perspective-n-Point problem, while the uncertainty is propagated by applying the Unscented Transform. Our network is trained and tested on the ApolloCar3D dataset, and we introduce a novel method to automatically obtain approximate labels for 3D points in this dataset. Our system outperforms state-of-the-art pose estimation methods on the ApolloCar3D dataset, and unlike competitors, it implements a full pipeline of uncertainty propagation.

Strony (od-do)

396 - 412

DOI

10.1007/978-981-99-8132-8_30

URL

https://link.springer.com/chapter/10.1007/978-981-99-8132-8_30

Książka

Neural Information Processing : 30th International Conference, ICONIP 2023, Changsha, China, November 20–23, 2023, Proceedings, Part VIII

Zaprezentowany na

30th International Conference on Neural Information Processing, ICONIP 2023, 20-23.11.2023, Changsha, China

Punktacja Ministerstwa / rozdział

20