[Bugfix][P/D] Fix Prefix Cache Bug by NickLucche · Pull Request #18411 · vllm-project/vllm (original) (raw)
Co-authored-by: rshaw@neuralmagic.com robertgshaw2@gmail.com
Signed-off-by: nicklucche nlucches@redhat.com
Signed-off-by: nicklucche nlucches@redhat.com
NickLucche changed the title
[Bugfix][P/D] Fix Preemption + Prefix Cache Bug (#92) [Bugfix][P/D] Fix Prefix Cache Bug
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: nicklucche nlucches@redhat.com Co-authored-by: Robert Shaw 114415538+robertgshaw2-redhat@users.noreply.github.com
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
[ Show hidden characters]({{ revealButtonHref }})