AAU Student Projects is unavailable between June 15th 1.30pm and 17th 1.30pm due to planned system maintenance. The projects cannot be downloaded during this period.
AAU Student Projects - visit Aalborg University's student projects portal
A master's thesis from Aalborg University
Book cover


Real-Time Depth Estimation in Limited Hardware Environments: A Performance and Resource Trade-off Analysis of SOTA Methods

Author

Term

4. semester

Education

Publication year

2026

Submitted on

Pages

95

Abstract

This thesis explores how to estimate object distances with a single camera (monocular) on resource-constrained hardware. We use a Raspberry Pi 5 with a Hailo AI HAT+ accelerator as the embedded platform and compare it with higher-capacity desktop models (Depth Anything and Metric3D). Within a shared test framework, we compare and also combine several approaches: geometric homography (mapping image points to the ground plane using calibration), object detection with YOLO (a common model), relative-depth models that order what is nearer or farther, and metric-depth models that estimate actual distances. We built an application to capture frames from the Raspberry Pi, annotate scenes, run both desktop and Hailo pipelines, evaluate static scenes, and replay saved scenes for repeatable real-time testing. The final evaluation uses 19 annotated scenes and a repeatable real-time replay. The results show that no single method fully solves the task. YOLO with homography is efficient and interpretable when the object class is detected, but it remains class-dependent. Metric3D provides useful metric-depth reference values on the desktop, but is too heavy for the current real-time loop. Relative-depth models are most valuable as selection signals rather than as direct metric sensors. The most practical direction is a hybrid pipeline: use learned depth to select a close object or region and suppress background, then apply calibrated homography to convert the selected contact point into a metric ground-plane distance.

Denne afhandling undersøger, hvordan man kan estimere afstande til objekter med ét kamera (monokulær afstandsestimering) på ressourcebegrænset hardware. Vi bruger en Raspberry Pi 5 med et Hailo AI HAT+ som indlejret accelerator og sammenligner med større desktop-modeller (Depth Anything og Metric3D). Inden for en fælles testramme sammenlignes og kombineres flere tilgange: geometrisk homografi (at kortlægge billedpunkter til jordplan via kalibrering), objektdetektion med YOLO (en udbredt model), relative-dybdemodeller, der rangerer nær/fjern, samt metriske dybdemodeller, der estimerer faktiske afstande. Vi udviklede en applikation, der indsamler billeder fra Raspberry Pi, annoterer scener, kører både desktop- og Hailo-pipelines, evaluerer statiske scener og afspiller gemte scener til gentagelige test i realtid. Den endelige evaluering omfatter 19 annoterede scener og en gentagelig realtids-afspilning. Resultaterne viser, at ingen enkelt metode løser opgaven fuldt ud. YOLO med homografi er effektiv og let at fortolke, når den relevante objektklasse detekteres, men er klasseafhængig. Metric3D giver nyttige referenceværdier for metrisk dybde på desktop, men er for tung til den nuværende realtidsløkke. Relative-dybdemodeller er mest nyttige som udvælgelsessignaler frem for som direkte metriske sensorer. Den mest praktiske retning er derfor en hybrid pipeline: brug lært dybde til at vælge et nært objekt eller område og frasortere baggrund, og brug derefter kalibreret homografi til at omdanne det valgte kontaktpunkt til en metrisk jordplansafstand.

[This apstract has been rewritten with the help of AI based on the project's original abstract]