We propose PRO, an efficient and generalizable tile-based depth refinement framework. Our PRO consists of two key components: (i) Grouped Patch Consistency Training that enhances test-time efficiency while mitigating the depth discontinuity problem, and (ii) Bias Free Masking that prevents the depth refinement models from overfitting to dataset-specific biases, enabling better generalization to real-world datasets even after training on synthetic data.
The gallery below presents several images from the internet and refinement results of PRO (Ours) based on DepthAnythingv2. Use the slider and gestures to reveal details on both sides.
@misc{kwon2025onelook,
title={One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images},
author={Byeongjun Kwon and Munchurl Kim},
year={2025},
eprint={2503.22351},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2503.22351},
}