Casser, V., Pirk, S., Mahjourian, R., Angelova, A.: Unsupervised monocular depth and ego-motion learning with structure and semantics. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 381–388 (2019). https://doi.org/10.1109/CVPRW.2019.00051
Zhou, Z., Fan, X., Shi, P., Xin, Y.: R-msfm: recurrent multi-scale feature modulation for monocular depth estimating. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 12757–12766 (2021). https://doi.org/10.1109/ICCV48922.2021.01254
Klingner, M., Termóhlen, J.-A., Mikolajczyk, J., Fingscheidt, T.: Self-supervised monocular depth estimation: solving the dynamic object problem by semantic guidance. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) Computer Vision–ECCV 2020, pp. 582–600. Springer, Cham (2020) Chapter Google Scholar
Lyu, X., Liu, L., Wang, M., Kong, X., Liu, L., Liu, Y., Chen, X., Yuan, Y.: Hr-depth: high resolution self-supervised monocular depth estimation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 2294–2301 (2021). https://doi.org/10.1609/aaai.v35i3.16329
Zhou, T., Brown, M., Snavely, N., Lowe, D.G.: Unsupervised learning of depth and ego-motion from video. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6612–6619 (2017). https://doi.org/10.1109/CVPR.2017.700
Guizilini, V.C., Hou, R., Li, J., Ambrus, R., Gaidon., A.: Semantically-guided representation learning for self-supervised monocular depth. arXiv abs/2002.12319 (2020)
Klingner, M., Termóhlen, J.-A., Mikolajczyk, J., Fingscheidt, T.: Self-supervised monocular depth estimation: solving the dynamic object problem by semantic guidance. In: Computer Vision—ECCV 2020, pp. 582–600. Springer, Berlin, Heidelberg (2020). https://doi.org/10.1007/978-3-030-58565-5_35
Ranjan, A., Jampani, V., Balles, L., Kim, K., Sun, D., Wulff, J., Black, M.J.: Competitive collaboration: joint unsupervised learning of depth, camera motion, optical flow and motion segmentation. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 12232–12241 (2019). https://doi.org/10.1109/CVPR.2019.01252
Bian, J., Zhan, H., Wang, N., Li, Z., Zhang, L., Shen, C., Cheng, M.-M.: Unsupervised scale-consistent depth learning from video. Int. J. Comput. Vis. 129, 2548–2564 (2021) ArticleMATH Google Scholar
Zhang, N., Nex, F., Vosselman, G., Kerle, N.: Lite-mono: a lightweight cnn and transformer architecture for self-supervised monocular depth estimation. In: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 18537–18546 (2022)
Krizhevsky, A., Sutskever, I.: Imagenet classification with deep convolutional neural networks. Commun. ACM 60, 84–90 (2012) ArticleMATH Google Scholar
Eigen, D., Puhrsch, C., Fergus, R.: Depth map prediction from a single image using a multi-scale deep network. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems, vol. 27. Curran Associates Inc., USA (2014) MATH Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016). https://doi.org/10.1109/CVPR.2016.90
Laina, I., Rupprecht, C., Belagiannis, V., Tombari, F., Navab, N.: Deeper depth prediction with fully convolutional residual networks. In: 2016 Fourth International Conference on 3D Vision (3DV), pp. 239–248 (2016). https://doi.org/10.1109/3DV.2016.32
Kendall, A., Grimes, M.K., Cipolla, R.: Posenet: a convolutional network for real-time 6-dof camera relocalization. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 2938–2946 (2015)
Zhang, R., Luo, Z., Dhanjal, S.S., Schmotzer, C., Hasija, S.: Posenet + + : A CNN Framework for Online Pose Regression and Robot Re-localization (2018)
Huang, Z., Xu, Y., Shi, J., Zhou, X., Bao, H., Zhang, G.: Prior guided dropout for robust visual localization in dynamic environments. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2791–2800 (2019). https://doi.org/10.1109/ICCV.2019.00288
Tian, M., Nie, Q., Shen, H.: 3d scene geometry-aware constraint for camera localization with deep learning. In: 2020 IEEE International Conference on Robotics and Automation (ICRA), pp. 4211–4217 (2020)
Xian, K., Zhang, J., Wang, O., Mai, L., Lin, Z., Cao, Z.: Structure-guided ranking loss for single image depth prediction. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 608–617 (2020). https://doi.org/10.1109/CVPR42600.2020.00069
Godard, C., Aodha, O.M., Firman, M., Brostow, G.: Digging into self-supervised monocular depth estimation. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 3827–3837 (2019). https://doi.org/10.1109/ICCV.2019.00393