3D face parsing based on 2D CPFNet: conformal parameterized face parsing network (original) (raw)

References

  1. Khan, K., Attique, M., Khan, R.U., Syed, I., Chung, T.-S.: A multi-task framework for facial attributes classification through end-to-end face parsing and deep convolutional neural networks. Sensors 20(2), 328 (2020). https://doi.org/10.3390/s20020328
    Article MATH Google Scholar
  2. Jin, X., Li, Z., Ning, N., Lu, H., Li, X., Zhang, X., Zhu, X., Fang, X.: Face illumination transfer and swapping via dense landmark and semantic parsing. IEEE Sens. J. 22(18), 17391–17398 (2022). https://doi.org/10.1109/JSEN.2020.3025918
    Article Google Scholar
  3. Shen, Z., Lai, W.-S., Xu, T., Kautz, J., Yang, M.-H.: Deep semantic face deblurring. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8260–8269. IEEE, Salt Lake City, UT, USA (2018). https://doi.org/10.1109/CVPR.2018.00862
  4. Li, T., Qian, R., Dong, C., Liu, S., Yan, Q., Zhu, W., Lin, L.: BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network. In: Proceedings of the 26th ACM International conference on multimedia. MM ’18, pp. 645–653. Association for Computing Machinery, New York, NY, USA (2018). https://doi.org/10.1145/3240508.3240618
  5. Yu, Y., Mora, K.A.F., Odobez, J.-M.: Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction. In: 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), pp. 711–718 (2017). https://doi.org/10.1109/FG.2017.90
  6. Paysan, P., Knothe, R., Amberg, B., Romdhani, S., Vetter, T.: A 3D Face Model for Pose and Illumination Invariant Face Recognition. In: 2009 Sixth IEEE international conference on advanced video and signal based surveillance, pp. 296–301 (2009). https://doi.org/10.1109/AVSS.2009.58
  7. Wang, Z.: Robust three-dimensional face reconstruction by one-shot structured light line pattern. Opt. Lasers Eng. 124, 105798 (2020). https://doi.org/10.1016/j.optlaseng.2019.105798
    Article Google Scholar
  8. Kim, J., Park, S., Kim, S., Lee, S.: Registration method between ToF and color cameras for face recognition. In: 2011 6th IEEE conference on industrial electronics and applications, pp. 1977–1980 (2011). https://doi.org/10.1109/ICIEA.2011.5975916
  9. Tchapmi, L., Choy, C., Armeni, I., Gwak, J., Savarese, S.: SEGCloud: Semantic Segmentation of 3D Point Clouds. In: 2017 International conference on 3D vision (3DV), pp. 537–547 (2017). https://doi.org/10.1109/3DV.2017.00067
  10. Meng, H.-Y., Gao, L., Lai, Y.-K., Manocha, D.: VV-Net: Voxel VAE Net With Group Convolutions for Point Cloud Segmentation. In: 2019 IEEE/CVF International conference on computer vision (ICCV), pp. 8499–8507 (2019). https://doi.org/10.1109/ICCV.2019.00859
  11. Charles, R.Q., Su, H., Kaichun, M., Guibas, L.J.: PointNet: Deep learning on point sets for 3D classification and segmentation. In: 2017 IEEE Conference on computer vision and pattern recognition (CVPR), pp. 77–85 (2017). https://doi.org/10.1109/CVPR.2017.16
  12. Qi, C.R., Yi, L., Su, H., Guibas, L.J.: PointNet++: Deep hierarchical feature learning on point sets in a metric space (2017). https://doi.org/10.48550/arXiv.1706.02413
  13. Thomas, H., Qi, C.R., Deschaud, J.-E., Marcotegui, B., Goulette, F., Guibas, L.: KPConv: Flexible and Deformable Convolution for Point Clouds. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 6410–6419 (2019). https://doi.org/10.1109/ICCV.2019.00651
  14. Te, G., Hu, W., Liu, Y., Shi, H., Mei, T.: AGRNet: Adaptive graph representation learning and reasoning for face parsing. IEEE Trans. Image Process. 30, 8236–8250 (2021). https://doi.org/10.1109/TIP.2021.3113780
    Article MATH Google Scholar
  15. Wang, L., Huang, Y., Hou, Y., Zhang, S., Shan, J.: Graph attention convolution for point cloud semantic segmentation. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp. 10288–10297 (2019). https://doi.org/10.1109/CVPR.2019.01054
  16. Su, H., Maji, S., Kalogerakis, E., Learned-Miller, E.: Multi-view convolutional neural networks for 3D shape recognition. In: 2015 IEEE international conference on computer vision (ICCV), pp. 945–953 (2015). https://doi.org/10.1109/ICCV.2015.114
  17. Lawin, F.J., Danelljan, M., Tosteberg, P., Bhat, G., Khan, F.S., Felsberg, M.: Deep projective 3D semantic segmentation. In: Felsberg, M., Heyden, A., Krüger, N. (eds.) Computer analysis of images and patterns. Lecture notes in computer science, pp. 95–107. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-64689-3_8
  18. Qi, X., Liao, R., Jia, J., Fidler, S., Urtasun, R.: 3D Graph Neural Networks for RGBD Semantic Segmentation. In: 2017 IEEE International conference on computer vision (ICCV), pp. 5209–5218 (2017). https://doi.org/10.1109/ICCV.2017.556
  19. Hu, X., Yang, K., Fei, L., Wang, K.: ACNET: attention based network to exploit complementary features for RGBD semantic segmentation. In: 2019 IEEE International conference on image processing (ICIP), pp. 1440–1444 (2019). https://doi.org/10.1109/ICIP.2019.8803025
  20. Gu, X., Wang, Y., Chan, T.F., Thompson, P.M., Yau, S.-T.: Genus zero surface conformal mapping and its application to brain surface mapping. IEEE Trans. Med. Imaging 23(8), 949–958 (2004). https://doi.org/10.1109/TMI.2004.831226
    Article MATH Google Scholar
  21. Warrell, J., Prince, S.J.D.: Labelfaces: Parsing facial features by multiclass labeling with an epitome prior. In: 2009 16th IEEE International Conference on Image Processing (ICIP), pp. 2481–2484 (2009). https://doi.org/10.1109/ICIP.2009.5413918
  22. Smith, B.M., Zhang, L., Brandt, J., Lin, Z., Yang, J.: Exemplar-based face parsing. In: 2013 IEEE conference on computer vision and pattern recognition, pp. 3484–3491 (2013). https://doi.org/10.1109/CVPR.2013.447
  23. Luo, P., Wang, X., Tang, X.: Hierarchical face parsing via deep learning. In: 2012 IEEE conference on computer vision and pattern recognition, pp. 2480–2487 (2012). https://doi.org/10.1109/CVPR.2012.6247963
  24. Liu, S., Yang, J., Huang, C., Yang, M.-H.: Multi-objective convolutional learning for face labeling. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), pp. 3451–3459 (2015). https://doi.org/10.1109/CVPR.2015.7298967
  25. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), pp. 3431–3440 (2015). https://doi.org/10.1109/CVPR.2015.7298965
  26. Jackson, A.S., Valstar, M., Tzimiropoulos, G.: A CNN Cascade for Landmark Guided Semantic Part Segmentation. In: Hua, G., Jégou, H. (eds.) Computer Vision– ECCV 2016 Workshops. Lecture Notes in Computer Science, pp. 143–155. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-49409-8_14
  27. Guo, T., Kim, Y., Zhang, H., Qian, D., Yoo, B., Xu, J., Zou, D., Han, J.-J., Choi, C.: Residual encoder decoder network and adaptive prior for face parsing. In: Proceedings of the thirty-second AAAI conference on artificial intelligence and thirtieth innovative applications of artificial intelligence conference and Eighth AAAI symposium on educational advances in artificial intelligence. AAAI’18/IAAI’18/EAAI’18, pp. 6861–6869. AAAI Press, New Orleans, Louisiana, USA (2018)
  28. Wei, Z., Sun, Y., Wang, J., Lai, H., Liu, S.: Learning adaptive receptive fields for deep image parsing network. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp. 3947–3955 (2017). https://doi.org/10.1109/CVPR.2017.420
  29. Luo, L., Xue, D., Feng, X.: EHANet: an effective hierarchical aggregation network for face parsing. Appl. Sci. 10(9), 3135 (2020). https://doi.org/10.3390/app10093135
    Article MATH Google Scholar
  30. Tatarchenko, M., Park, J., Koltun, V., Zhou, Q.-Y.: Tangent convolutions for dense prediction in 3D. In: 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp. 3887–3896 (2018). https://doi.org/10.1109/CVPR.2018.00409
  31. Li, Y., Bu, R., Sun, M., Wu, W., Di, X., Chen, B.: PointCNN: convolution on \(X\)-transformed points. In: Proceedings of the 32nd international conference on neural information processing systems. NIPS’18, pp. 828–838. Curran Associates Inc., Red Hook, NY, USA (2018)
  32. Wang, Y., Sun, Y., Liu, Z., Sarma, S.E., Bronstein, M.M., Solomon, J.M.: Dynamic graph CNN for learning on point clouds. ACM Trans. Graphics 38(5), 146–114612 (2019). https://doi.org/10.1145/3326362
    Article Google Scholar
  33. Hanocka, R., Hertz, A., Fish, N., Giryes, R., Fleishman, S., Cohen-Or, D.: MeshCNN: a network with an edge. ACM Trans. Graphics 38(4), 90–19012 (2019). https://doi.org/10.1145/3306346.3322959
    Article Google Scholar
  34. Sinha, A., Bai, J., Ramani, K.: Deep learning 3D shape surfaces using geometry images. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) Computer Vision– ECCV 2016. Lecture notes in computer science, pp. 223–240. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46466-4_14
  35. Tulsiani, S., Su, H., Guibas, L.J., Efros, A.A., Malik, J.: Learning shape abstractions by assembling volumetric primitives. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp. 1466–1474 (2017). https://doi.org/10.1109/CVPR.2017.160
  36. Choi, P.T., Lui, L.M.: Fast disk conformal parameterization of simply-connected open surfaces. J. Sci. Comput. 65(3), 1065–1090 (2015). https://doi.org/10.1007/s10915-015-9998-2
    Article MathSciNet MATH Google Scholar
  37. Gong, Y., Wang, L., Guo, R., Lazebnik, S.: Multi-scale orderless pooling of deep convolutional activation features. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) Computer Vision– ECCV 2014. Lecture notes in computer science. Springer, Cham, pp. 392–407 (2014). https://doi.org/10.1007/978-3-319-10584-0_26
  38. Baocai, Y., Yanfeng, S., Chengzhang, W., Yun, aG.: BJUT-3D large scale 3D face database and information processing. J. Comput. Res. Devel. 46(6), 1009–1018 (2009)
    MATH Google Scholar
  39. CASIA: CASIA-3D FaceV1 (2004). http://biometrics.idealtest.org/
  40. Heseltine, T., Pears, N., Austin, J.: Three-dimensional face recognition using combinations of surface feature map subspace components. Image Vis. Comput. 26(3), 382–396 (2008). https://doi.org/10.1016/j.imavis.2006.12.008
    Article MATH Google Scholar
  41. Hu, Q., Yang, B., Xie, L., Rosa, S., Guo, Y., Wang, Z., Trigoni, N., Markham, A.: RandLA-Net: efficient semantic segmentation of large-scale point clouds. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp. 11105–11114. IEEE, Seattle, WA, USA (2020). https://doi.org/10.1109/CVPR42600.2020.01112
  42. Lai, X., Liu, J., Jiang, L., Wang, L., Zhao, H., Liu, S., Qi, X., Jia, J.: Stratified Transformer for 3D Point Cloud Segmentation. In: 2022 IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp. 8490–8499 (2022). https://doi.org/10.1109/CVPR52688.2022.00831
  43. Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017). https://doi.org/10.1109/TPAMI.2016.2644615
    Article MATH Google Scholar
  44. Ronneberger, O., Fischer, P., Brox, T.: U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) Medical image computing and computer-assisted intervention– MICCAI 2015. Lecture notes in computer science, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
  45. Chen, L.-C., Papandreou, G., Schroff, F., Adam, H.: Rethinking Atrous Convolution for Semantic Image Segmentation (2017). https://doi.org/10.48550/arXiv.1706.05587
  46. Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision– ECCV 2018. Lecture notes in computer science, pp. 833–851. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_49
  47. Fu, J., Liu, J., Tian, H., Li, Y., Bao, Y., Fang, Z., Lu, H.: Dual attention network for scene segmentation. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp. 3141–3149 (2019). https://doi.org/10.1109/CVPR.2019.00326
  48. Paszke, A., Chaurasia, A., Kim, S., Culurciello, E.: ENet: A deep neural network architecture for real-time semantic segmentation (2016). https://doi.org/10.48550/arXiv.1606.02147
  49. Romera, E., Álvarez, J.M., Bergasa, L.M., Arroyo, R.: ERFNet: efficient residual factorized convnet for real-time semantic segmentation. IEEE Trans. Intell. Transp. Syst. 19(1), 263–272 (2018). https://doi.org/10.1109/TITS.2017.2750080
    Article MATH Google Scholar
  50. Lyu, H., Fu, H., Hu, X., Liu, L.: Esnet: edge-based segmentation network for real-time semantic segmentation in traffic scenes. In: 2019 IEEE International conference on image processing (ICIP), pp. 1855–1859 (2019). https://doi.org/10.1109/ICIP.2019.8803132
  51. Gen Li, Joongkyu Kim: DABNet: depth-wise asymmetric bottleneck for real-time semantic segmentation. In: 2019 British machine vision conference (BMCV) (2019). https://doi.org/10.48550/arXiv.1907.11357
  52. Li, H., Xiong, P., Fan, H., Sun, J.: DFANet: deep feature aggregation for real-time semantic segmentation. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp. 9514–9523 (2019). https://doi.org/10.1109/CVPR.2019.00975
  53. Rudra, P., Poudel, K., Stephan L.: Fast-SCNN: Fast Semantic Segmentation Network. In: 2019 British machine vision conference (BMCV) (2019). https://doi.org/10.48550/arXiv.1902.04502
  54. Wu, T., Tang, S., Zhang, R., Cao, J., Zhang, Y.: CGNet: a light-weight context guided network for semantic segmentation. IEEE Trans. Image Process. 30, 1169–1179 (2021). https://doi.org/10.1109/TIP.2020.3042065
    Article MATH Google Scholar

Download references