MIT HAN Lab (original) (raw)

Pinned Loading

  1. [ICLR 2024] Efficient Streaming Language Models with Attention Sinks
    Python 6.9k 382
  2. [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
    Python 3k 248
  3. Efficient vision foundation models for high-resolution generation and perception.
    Python 2.8k 218
  4. [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
    Python 2.6k 466
  5. [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
    Python 2.1k 421
  6. [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
    Python 1.9k 341