Monocular Ground Normal Prediction for the Road Ahead

1Machine Perception Research Laboratory, HUN-REN SZTAKI, Budapest, Hungary 2Vehicle Industry Research Center, Széchenyi István University, Győr, Hungary
Pitch GIF

Predicted pitch angle (calculated from normal) visualized over time for sequence 010 on GradeSet.

Abstract

Robust fusion of monocular and inertial data has the potential to offer a low-cost alternative for ground surface normal prediction ahead, compared to more expensive sensors, such as LiDAR. Yet robust camera-based prediction remains challenging, particularly for steep grades and texture-poor, ho- mogeneous road surfaces.

To address these issues, we propose an enhanced monocular camera-IMU fusion pipeline incorporating a lightweight transformer-based feature matcher for improved correspondence accuracy, and robust temporal filtering, using a SLERP filter, to enhance consistency and reduce drift. To enable rigorous benchmarking and reproducibility, we also standardize the evaluation protocol and release a novel dataset containing synchronized camera, LiDAR, and IMU-derived pose data, specifically captured across diverse incline and decline scenarios.

Extensive continuous validation demonstrates that our method significantly improves both accuracy and temporal stability over existing approaches, setting a new state of the art for robust, continuous ground normal estimation ahead. The proposed pipeline and dataset are publicly available at: https://norbertmarko.github.io/imu_cam_normal_prediction/.

Normal GIF

Predicted pitch angle visualized over time for sequence 029 on PandaSet.

Normal GIF

Predicted Normal vector angle visualized over time for sequence 029 on PandaSet.

BibTeX

@article{marko2025roboust,
  author    = {Norbert Markó and Zoltán Rózsa and Áron Ballagi and Tamás Szirányi},
  title     = {Monocular Ground Normal Prediction for the Road Ahead},
  journal   = {},
  year      = {2025},
}
-->