MIT HAN Lab

Pinned

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Python · 6.9k stars · 382 forks
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Python · 3k stars · 248 forks
Efficient vision foundation models for high-resolution generation and perception.
Python · 2.8k stars · 218 forks
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Python · 2.6k stars · 466 forks
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
Python · 2.1k stars · 421 forks
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
Python · 1.9k stars · 341 forks