Predicted pitch angle (calculated from normal) visualized over time for sequence 010 on GradeSet.
Robust fusion of monocular and inertial data has the potential to offer a low-cost alternative for ground surface normal prediction ahead, compared to more expensive sensors, such as LiDAR. Yet robust camera-based prediction remains challenging, particularly for steep grades and texture-poor, ho- mogeneous road surfaces.
To address these issues, we propose an enhanced monocular camera-IMU fusion pipeline incorporating a lightweight transformer-based feature matcher for improved correspondence accuracy, and robust temporal filtering, using a SLERP filter, to enhance consistency and reduce drift. To enable rigorous benchmarking and reproducibility, we also standardize the evaluation protocol and release a novel dataset containing synchronized camera, LiDAR, and IMU-derived pose data, specifically captured across diverse incline and decline scenarios.
Extensive continuous validation demonstrates that our method significantly improves both accuracy and temporal stability over existing approaches, setting a new state of the art for robust, continuous ground normal estimation ahead. The proposed pipeline and dataset are publicly available at: https://norbertmarko.github.io/imu_cam_normal_prediction/.
Predicted pitch angle visualized over time for sequence 029 on PandaSet.
Predicted Normal vector angle visualized over time for sequence 029 on PandaSet.
@article{marko2025roboust,
author = {Norbert Markó and Zoltán Rózsa and Áron Ballagi and Tamás Szirányi},
title = {Monocular Ground Normal Prediction for the Road Ahead},
journal = {},
year = {2025},
}