Peng Gao
Young Scientist
Shanghai AI Lab
Email: gaopeng [at] pjlab (dot) org (dot) cn
I am a Young Scientist at Shanghai AI Lab. I received my Ph.D. from the Multimedia Lab at the Chinese University of Hong Kong in 2021, where I was supervised by Xiaogang Wang and Hongsheng Li. During my Ph.D., I was lucky to intern at MERL Boston, Microsoft Seattle, AI2 Seattle, and SenseTime Beijing/Shenzhen. My research interests lie in multi-modality learning, efficient visual backbone design, and self-supervised representation learning.
If you are interested in a research intern, research engineer, or full-time researcher position at Shanghai AI Lab, or in the Ph.D. program of MMLAB at CUHK, please send me an email.
Publications
2022
PointCLIP: Point Cloud Understanding by CLIP
Ziyu Guo*, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao**, Hongsheng Li
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning
Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao
2021
A Simple Long-Tailed Recognition Baseline via Vision-Language Model
Teli Ma*, Shijie Geng, Mengmeng Wang, Jing Shao, Jiasen Lu, Hongsheng Li, Peng Gao**, Yu Qiao
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
Renrui Zhang*, Rongyao Fang*, Wei Zhang*, Peng Gao**, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
Peng Gao*, Shijie Geng*, Renrui Zhang*, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao
Container: Context Aggregation Network
Peng Gao*, Jiasen Lu, Hongsheng Li, Roozbeh Mottaghi, Aniruddha Kembhavi
Fast Convergence of DETR with Spatially Modulated Co-attention
Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li
Scalable Transformers for Neural Machine Translation
Peng Gao, Shijie Geng, Yu Qiao, Xiaogang Wang, Jifeng Dai, Hongsheng Li
Dual Stream Network for Vision Recognition
Mingyuan Mao*, Peng Gao*, Renrui Zhang*, Honghui Zheng*, Teli Ma, Yan Peng, Errui Ding, Shumin Han
End-to-End Object Detection with Adaptive Clustering Transformer
Minghang Zheng, Peng Gao, Renrui Zhang, Kunchang Li, Xiaogang Wang, Hongsheng Li, Dong Hao
Dense Contrastive Visual-Linguistic Pretraining
Lei Shi, Kai Shuang, Shijie Geng, Peng Gao, Zuohui Fu, Gerard de Melo, Yunpeng Chen, Sen Su
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers
Shijie Geng, Peng Gao, Moitreya Chatterjee, Chiori Hori, Jonathan Le Roux, Yongfeng Zhang, Hongsheng Li, Anoop Cherian
2020
Learning Where to Focus for Efficient Video Object Detection
Zhengkai Jiang, Yu Liu, Ceyuan Yang, Jihao Liu, Peng Gao, Qian Zhang, Shiming Xiang, Chunhong Pan
2019
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao, Haoxuan You, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li
Dynamic Fusion with Intra and Inter-Modality Attention Flow for Visual Question Answering
Peng Gao, Zhengkai Jiang, Haoxuan You, Pan Lu, Steven C.H. Hoi, Xiaogang Wang, Hongsheng Li
Oral Presentation
Video Object Detection with Locally-Weighted Deformable Neighbors
Zhengkai Jiang, Peng Gao, Chaoxu Guo, Qian Zhang, Shiming Xiang, Chunhong Pan
2018
Question-guided Hybrid Convolution for Visual Question Answering
Peng Gao, Hongsheng Li, Shuang Li, Pan Lu, Yikang Li, Steven C.H. Hoi, Xiaogang Wang