feifeibear - Overview (original) (raw)

Skip to content

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sign up

Pinned Loading

  1. Making large AI models cheaper, faster and more accessible
    Python 40.8k 4.5k
  2. xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
    Python 1.9k 204
  3. a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
    C++ 1.5k 200
  4. PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
    Python 761 57
  5. Fast inference from large lauguage models via speculative decoding
    Python 722 68
  6. USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
    Python 488 42