Pillar-Based Adaptive Sparse Transformer with Cost-Optimized Positive Sample Selection for 4D Radar Object Detection (original) (raw)

References

  1. Sun, S., Petropulu, A.P., Poor, H.V.: MIMO radar for advanced driver-assistance systems and autonomous driving: Advantages and challenges. IEEE Signal Process. Mag. 37(4), 98–117 (2020)
    Article Google Scholar
  2. Sun, P., Li, S., Zhu, B., Zuo, Z., Xia, X.: Vision-based fixed-time uncooperative aerial target tracking for UAV. IEEE/CAA J. Autom. Sinica 10(5), 1322–1324 (2023)
    Article Google Scholar
  3. Liu, J., Bai, L., Xia, Y., Huang, T., Zhu, B., Han, Q.-L.: GNNPMB: A simple but effective online 3D multi-object tracker without bells and whistles. IEEE Trans. Intell. Vehicles 8(2), 1176–1189 (2023)
    Article Google Scholar
  4. Svenningsson, P., Fioranelli, F., Yarovoy, A.: RadarPointGNN: Graph-based object recognition for unstructured radar point-cloud data. In: Proceedings of the IEEE Radar Conference (RadarConf), pp 1–6 (2021)
  5. Han, Z., et al.: 4d millimeter-wave radar in autonomous driving: A survey. arXiv:2306.04242 (2023)
  6. Liu, J., Xiong, W., Bai, L., Xia, Y., Huang, T., Ouyang, W., Zhu, B.: Deep instance segmentation with automotive radar detection points. IEEE Trans. Intell. Vehicles 8(1), 84–94 (2023)
    Article Google Scholar
  7. Brisken, S., Ruf, F., Höhne, F.: Recent evolution of automotive imaging radar and its information content. IET Radar Sonar Navig. 12, 1078–1081 (2018)
    Article Google Scholar
  8. Caillot, A., Ouerghi, S., Vasseur, P., Boutteau, R., Dupuis, Y.: Survey on cooperative perception in an automotive context. IEEE Trans. Intell. Transp. Syst. 23(9), 14204–14223 (2022)
    Article Google Scholar
  9. Palffy, A., Pool, E., Baratam, S., Kooij, J.F., Gavrila, D.M.: Multi-class road user detection with 3+1D radar in the View-of-Delft dataset. IEEE Robot. Autom. Lett. 7(2), 4961–4968 (2022)
    Article Google Scholar
  10. Sun, S., Petropulu, A.P., Poor, H.V.: MIMO radar for advanced driver-assistance systems and autonomous driving: Advantages and challenges. IEEE Signal Process. Mag. 37(4), 98–117 (2020)
    Article Google Scholar
  11. Yin, T., Zhou, X., Krähenbühl, P.: Center-based 3d object detection and tracking. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville TN USA, 20–25 June, pp. 11784–11793 (2021)
  12. Li, G., Xiong, Y., Zhang, H., et al.: Novel 4D 79 GHz radar concept for object detection and active safety applications. In: Proceedings of the IEEE German Microwave Conference, pp. 87–90 (2019)
  13. Dreher, M., Ercelik, E., Banziger, T., Knoll, A.: Radar-based 2D car detection using deep neural networks. In: Proceedings of the IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), pp. 1–8 (2020)
  14. Svenningsson, P., Fioranelli, F., Yarovoy, A.: RadarPointGNN: Graph-based object recognition for unstructured radar point-cloud data. In: Proceedings of the IEEE Radar Conference (RadarConf21), pp. 1–6 (2021)
  15. Bai, J., Li, S., Huang, L., Chen, H.: Robust detection and tracking method for moving object based on radar and camera data fusion. IEEE Sens. J. 21, 10761–10774 (2021)
    Article Google Scholar
  16. Karangwa, J., Liu, J., Zeng, Z.: Vehicle detection for autonomous driving: A review of algorithms and datasets. IEEE Trans. Intell. Transp. Syst. 24(11), 11586–11594 (2023)
    Article Google Scholar
  17. Zhou, Y., Liu, L., Zhao, H., López-Benítez, M., Yu, L., Yue, Y.: Towards deep radar perception for autonomous driving: Datasets, methods, and challenges. Sensors 22(11), 4208 (2022)
    Article Google Scholar
  18. Fan, L., et al.: 4D mmWave radar for autonomous driving perception: a comprehensive survey. IEEE Trans. Intell. Vehicles 9(4), 4606–4620 (2024)
    Article Google Scholar
  19. Kim, Y., Shin, J., Kim, S., Lee, I.-J., Choi, J.W., Kum, D.: CRN: Camera Radar Net for Accurate, Robust, Efficient 3D Perception. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 17615–17626 (2023)
  20. Nabati, R., Qi, H.: CenterFusion: Center-based radar and camera fusion for 3D object detection. In: Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa HI USA, January, pp. 1526–1535 (2021)
  21. Kim, Y., Kim, S., Choi, J.W., Kum, D.: CRAFT: Camera-Radar 3D object detection with spatio-contextual fusion transformer. Proc. AAAI Conf. Artif. Intell. 37(1), 1160–1168 (2023)
    Google Scholar
  22. Meyer, M., Kuschk, G.: Automotive radar dataset for deep learning based 3d object detection. In: Proceedings of the 16th European Radar Conference (EuRAD), pp. 129–132 (2019)
  23. Palffy, A., Pool, E., Baratam, S., Kooij, J.F.P., Gavrila, D.M.: Multi-class road user detection with 3+1D radar in the View-of-Delft dataset. IEEE Robot. Autom. Lett. 7(2), 4961–4968 (2022)
    Article Google Scholar
  24. Zheng, L., Ma, Z., Zhu, X., Tan, B., Li, S., Long, K., Sun, W., Chen, S., Zhang, L., Wan, M., Huang, L., Bai, J.: TJ4DRadSet: A 4D radar dataset for autonomous driving. In: Proceedings of the IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), Macau China, October, pp. 493–498 (2022)
  25. Paek, D.-H., Kong, S.-H., Wijaya, K.T.: K-radar: 4d radar object detection for autonomous driving in various weather conditions. Adv. Neural. Inf. Process. Syst. 35, 3819–3829 (2022)
    Google Scholar
  26. Charles, R.Q., Su, H., Kaichun, M., Guibas, L.J.: PointNet: Deep learning on point sets for 3D classification and segmentation. In: Proceedings of CVPR, pp. 77–85 (2017)
  27. Qi, C.R., et al.: Pointnet++: Deep hierarchical feature learning on point sets in a metric space. Adv. Neural Inf. Process. Syst. 30 (2017)
  28. Ulrich, M., Braun, S., Kohler, D., Niederlöhner, D., Faion, F., Glaser, C., Blume, H.: Improved orientation estimation and detection with hybrid object detection networks for automotive radar. In: Proceedings of the IEEE International Conference on Intelligent Transportation Systems (ITSC), pp. 111–117 (2022)
  29. Danzer, A., Griebel, T., Bach, M., Dietmayer, K.: 2D car detection in radar data with PointNets. In: Proceedings of the IEEE International Conference on Intelligent Transportation Systems (ITSC), pp. 61–66 (2019)
  30. Rukhovich, D., Vorontsova, A., Konushin, A.: Imvoxelnet: Image to voxels projection for monocular and multi-view general-purpose 3D object detection. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 2397–2406 (2022)
  31. He, C., Li, R., Li, S., Zhang, L.: Voxel set transformer: A set-to-set approach to 3D object detection from point clouds. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 8417–8427 (2022)
  32. Mao, J., Mao, J.G., et al.: Voxel transformer for 3d object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3164–3173 (2021)
  33. Sun, P., Tan, M., Wang, W. et al.: Swformer: Sparse window transformer for 3d object detection in point clouds. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 426–442 (2022)
  34. Fan, L., et al.: Embracing single stride 3D object detector with sparse transformer. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 8448–8458 (2022)
  35. Liu, Z., et al.: Swin transformer: Hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 9992–10002 (2021)
  36. Tan, B., et al.: Tracking of multiple static and dynamic targets for 4D automotive millimeter-wave radar point cloud in urban environments. Remote Sens. 15(11), 2923 (2023)
    Article Google Scholar
  37. Palmer, P., Krueger, M., Altendorfer, R., Bertram, T.: Ego-motion estimation and dynamic motion separation from 3D point clouds for accumulating data and improving 3D object detection. In: Automotive meets Electronics (AmE) 2023 – GMM Symposium, pp. 86–91 (2023)
  38. Kong, S.-H., Paek, D.-H., Cho, S.: RTNH+: Enhanced 4D radar object detection network using combined CFAR-based two-level preprocessing and vertical encoding. arXiv:2310.17659 (2023)
  39. Yan, Q., Wang, Y.: Mvfan: Multi-view feature assisted network for 4d radar object detection. In: Proceedings of the International Conference on Neural Information Processing (NeurIPS), pp. 493–511 (2023)
  40. Team, O., et al.: Openpcdet: An open-source toolbox for 3d object detection from point clouds. OD Team 2020. https://github.com/open-mmlab/OpenPCDet. Accessed 22 October 2023
  41. Lang, A.H., Vora, S., Caesar, H., Zhou, L., Yang, J., Beijbom, O.: Pointpillars: Fast encoders for object detection from point clouds. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 12697–12705 (2019)
  42. Yin, T., Zhou, X., Krähenbühl, P.: Center-based 3D object detection and tracking. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11784–11793 (2021)
  43. Li, J., Luo, C., Yang, X.: PillarNeXt: Rethinking network designs for 3D object detection in LiDAR point clouds. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 17567–17576 (2023)
  44. Yan, Y., Mao, Y., Li, B.: Second: Sparsely embedded convolutional detection. Sensors 18(10), 3337 (2018)
    Article Google Scholar
  45. Cao, J., Fang, Y., Xu, J., Ling, Q.: Feature fusion and interaction network for 3d object detection based on 4d millimeter wave radars. In: Proceedings of the 43rd Chinese Control Conference (CCC), pp. 8876–8881 (2024)
  46. Xiong, W., et al.: LXL: LiDAR excluded lean 3D object detection with 4d imaging radar and camera fusion. IEEE Trans. Intell. Vehicles 9(1), 79–92 (2023)
    Article Google Scholar
  47. Liu, J., et al.: Smurf: Spatial multi-representation fusion for 3d object detection with 4d imaging radar. IEEE Trans. Intell. Vehicles 9(1), 799–812 (2023)
    Article Google Scholar
  48. Chen, X., Zhang, T., Wang, Y., Wang, Y., Zhao, H.: FUTR3D: A unified sensor fusion framework for 3d detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 172–181 (2023)
  49. Zheng, L., et al.: Rcfusion: Fusing 4-d radar and camera with bird’s-eye view features for 3-d object detection. IEEE Trans. Instrum. Meas. 72, 1–14 (2023)
    Article Google Scholar
  50. Shi, W., et al.: Smiformer: Learning spatial feature representation for 3d object detection from 4d imaging radar via multi-view interactive transformers. Sensors 23(23), 9429 (2023)
    Article Google Scholar
  51. Yan, Q., Wang, Y.: Mvfan: Multi-view feature assisted network for 4d radar object detection. In: Proceedings of the International Conference on Neural Information Processing (NeurIPS), pp. 493–511 (2023)

Download references