feifeibear - Overview (original) (raw)
Skip to content
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign up
Pinned Loading
- Making large AI models cheaper, faster and more accessible
Python 40.8k 4.5k
- xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Python 1.9k 204
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
C++ 1.5k 200
- PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
Python 761 57
- Fast inference from large lauguage models via speculative decoding
Python 722 68
- USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Python 488 42