Real2Sim2Real: RetinalDepth-64K for Depth Estimation in Posterior Segment Ophthalmic Surgery

Bingwen Dong1,* Gan Liu1,* Xiaoxi Lu1,* Guangcheng Chen1 Jialu Zhang1
Yan Hu2,1 Xiaoqing Zhang3,1,† Jiang Liu1,4,5,6,†
1 SUSTech 2 Institute of AI for Industries CAS 3 Shenzhen Institutes of Advanced Technology CAS
4 University of Nottingham Ningbo 5 Wenzhou Medical University 6 Changchun University
* Equal contribution Corresponding authors
CVPR 2026

Abstract

Accurate depth estimation is crucial for 3D reconstruction and precise navigation in posterior segment ophthalmic surgery. However, acquiring annotated data remains challenging due to the impracticality of depth sensors under surgical microscopes. To overcome this limitation, we introduce RetinalDepth, a novel synthetic dataset comprising 44,800 temporal stereo image pairs across 896 diverse scenes for posterior segment surgery, developed via a Real2Sim2Real pipeline. In the Real-to-Sim phase, we model anatomically accurate eyes and instruments in Blender, integrating ultra-wide-field retinal textures, refractive aqueous humor, and dynamic trajectories, enhanced by post-processing for realism. RetinalDepth provides synchronized left and right RGB frames with pixel-perfect depth maps, surface normals, instrument segmentation masks, and camera parameters, enabling robust training of monocular, stereo, and video depth estimation models. In the Sim-to-Real phase, we introduce temporal depth variance, a novel metric for quantifying the stability of frame-to-frame depth estimation. Fine-tuning on RetinalDepth significantly boosts model performance on both synthetic and real surgical videos, enhancing generalization, surgical precision, and novice training. As the first synthetic benchmark for posterior segment surgery, RetinalDepth bridges the sim-to-real gap in ophthalmic computer vision.

Citation

@inproceedings{retinaldepth64k_2026,
  title={Real2Sim2Real: RetinalDepth-64K for Depth Estimation in Posterior Segment Ophthalmic Surgery},
  author={...},
  booktitle={CVPR},
  year={2026}
}