DV Lab

  1. MGM
    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
    Python, 3.3k stars, 282 forks
  2. Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
    Python, 2.7k stars, 289 forks
  3. Project Page for "LISA: Reasoning Segmentation via Large Language Model"
    Python, 2.2k stars, 158 forks
  4. Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
    Python, 1.6k stars, 77 forks
  5. LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
    Python, 812 stars, 42 forks
  6. VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
    Python, 799 stars, 69 forks