Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM Python 3.4k 545
A high-throughput and memory-efficient inference and serving engine for LLMs Python 83.1k 18.1k
A game where players compete to draw differing prompts on a shared canvas, as scored by a computer vision model Python 8
Pytorch implementation of same-family gaussian mixture models with guardrails. Features separable parameter optimization and singularity mitigation Python 27 4
pySLAM-D is a real-time SLAM algorithm for UAV aerial stitching. Includes additional features and refactored code inspired by BU's implementation https://github.com/armandok/pySLAM-D Python 6
Push Cursor on Target messages to TAK clients with attachments and other information Python 58 13