[v1] Hybrid Memory Allocator by heheda12345 · Pull Request #17996 · vllm-project/vllm (original) (raw)
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
…ator_a
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
WoosukKwon added the ready
ONLY add when PR is ready to merge/full CI is needed
label
Signed-off-by: Chen Zhang zhangch99@outlook.com
Signed-off-by: Chen Zhang zhangch99@outlook.com
joerunde pushed a commit to torch-spyre/sendnn-inference that referenced this pull request
vLLM v0.9.1 contains a bug that causes vllm-spyre to hang on boot-up.
The bug is not respecting num_gpu_blocks_overrides. It was introduced
in vllm-project/vllm#17996 and fixed in
vllm-project/vllm#19503.
Signed-off-by: Travis Johnson tsjohnso@us.ibm.com
leo-li-opus pushed a commit to leo-li-opus/vllm that referenced this pull request
Signed-off-by: Chen Zhang zhangch99@outlook.com
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: Chen Zhang zhangch99@outlook.com
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
[ Show hidden characters]({{ revealButtonHref }})