JamesLim-sy - Overview
limingshu JamesLim-sy
Xiaohongshu | PaddlePaddle | SJTU - MLSys
- Xiaohongshu
- Shanghai, China
Pinned
- PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (core framework of PaddlePaddle (飞桨): high-performance single-machine and distributed training, and cross-platform deployment, for deep learning and machine learning)
  C++, 23.5k stars, 5.9k forks
- Forked from NVIDIA/TensorRT-LLM
  TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
  C++
- Forked from vllm-project/vllm
  A high-throughput and memory-efficient inference and serving engine for LLMs
  Python
- SGLang is a fast serving framework for large language models and vision language models.
  Python, 21.2k stars, 3.7k forks