Vincent Casser – Machine Learning Researcher, Software Engineer
I’m a Staff Research Scientist at Waymo, the autonomous driving company formerly known as the Google Self-Driving Car Project. At Waymo, I’m developing new technologies for autonomous vehicles in areas such as reconstructive sensor simulation (NeRF/3DGS), generative sensor simulation, sensor fusion, multi-task learning and foundation models. I have deployed numerous safety-critical models to Waymo’s fully autonomous vehicle fleet, which has served millions of trips to customers across various markets. A subset of my research is published at CVPR, ICCV, CoRL, IROS and ICRA, and I hold numerous international patents in the autonomous driving domain. I have organized the AV industry’s primary academic workshop at CVPR in 2022, 2023, 2024 and 2025.
I enjoy interdisciplinary work, and have broad experience in machine learning, deep learning and computer vision. Before joining Waymo, I pursued research in domains such as computational perception, aerial robotics and biomedical imaging. Some of my previous projects were related to the study of human memory (at MIT), machine learning applications in healthcare (with Massachusetts General Hospital), astronomy (with the Harvard-Smithsonian Center) and electron microscopy (with the Harvard Lichtman Lab).
Publications
SceneCrafter: Controllable Multi-View Driving Scene Editing
Zehao Zhu, Yuliang Zou, Chiyu “Max” Jiang, Bo Sun, Vincent Casser, Xiukun Huang, Jiahao Wang, Zhenpei Yang, Ruiqi Gao, Leonidas Guibas, Mingxing Tan, Dragomir Anguelov
Conference on Computer Vision and Pattern Recognition (CVPR’25).
LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D Detection
Wayne Hung, Vincent Casser, Henrik Kretzschmar, Jyh-Jing Hwang, Dragomir Anguelov
IEEE International Conference on Robotics and Automation (ICRA’24).
Block-NeRF: Scalable Neural Rendering
Matthew Tancik, Vincent Casser, Xinchen Yan, Sabeek Pradhan, Ben Mildenhall, Pratul P. Srinivasan, Jon T. Barron, Henrik Kretzschmar
Conference on Computer Vision and Pattern Recognition (CVPR’22). Oral presentation.
Instance Segmentation with Cross-Modal Consistency
Alex Zhu, Vincent Casser, Reza Mahjourian, Henrik Kretzschmar and Soeren Pirk
International Conference on Intelligent Robots and Systems (IROS’22).
4D-Net for Learned Multi-Modal Alignment
AJ Piergiovanni, Vincent Casser, Michael Ryoo and Anelia Angelova
International Conference on Computer Vision (ICCV’21).
Taskology: Utilizing Task Relations at Scale
Yao Lu, Soeren Pirk, Jan Dlabal, Anthony Brohan, Ankita Pasad, Zhao Chen, Vincent Casser, Anelia Angelova and Ariel Gordon
Conference on Computer Vision and Pattern Recognition (CVPR’21). Oral presentation.
Unsupervised Monocular Depth Learning in Dynamic Scenes
Hanhan Li, Ariel Gordon, Hang Zhao, Vincent Casser, Anelia Angelova
Conference on Robot Learning (CoRL’20).
Multimodal Memorability: Modeling Effects of Semantics and Decay on Video Memorability
Camilo Fosco, Anelise Newman, Vincent Casser, Allen Lee, Barry McNamara and Aude Oliva
European Conference on Computer Vision (ECCV’20).
Predicting Visual Importance Across Graphic Design Types
Camilo Fosco, Vincent Casser, Amish K. Bedi, Peter O’Donovan, Aaron Hertzmann and Zoya Bylinskii
ACM User Interface Software and Technology Symposium (UIST’20).
Fast Mitochondria Segmentation for Connectomics
Vincent Casser, Kai Kang, Hanspeter Pfister and Daniel Haehn
Medical Imaging with Deep Learning (MIDL’20).
Depth Prediction Without the Sensors
Vincent Casser, Soeren Pirk, Reza Mahjourian, Anelia Angelova
Thirty-Third AAAI Conference on Artificial Intelligence (AAAI’19).
OIL: Observational Imitation Learning
Guohao Li, Matthias Mueller, Vincent Casser, Neil Smith, Dominik Michels, Bernard Ghanem
Robotics: Science and Systems (RSS’19).
Sim4CV: A Photo-Realistic Simulator for Computer Vision
Matthias Mueller, Vincent Casser, Jean Lahoud, Neil Smith, Bernard Ghanem
International Journal of Computer Vision (IJCV).