Popular repositories Loading
- MGM MGM Public
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Python 3.3k 282
- Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Python 2.7k 289
- Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Python 2.2k 158
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Python 1.6k 77
- LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
Python 812 42
- VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
Python 799 69