[BugFix] Fix incremental detokenization perf issue by njhill · Pull Request #16963 · vllm-project/vllm (original) (raw)
max was meant to be min - could cause O(n^2) blowup in pathological cases
Signed-off-by: Nick Hill nhill@redhat.com
added the bug
Something isn't working
label
WoosukKwon added the ready
ONLY add when PR is ready to merge/full CI is needed
label
njhill deleted the fix-inc-detok branch
frieda-huang pushed a commit to frieda-huang/vllm that referenced this pull request
Signed-off-by: Nick Hill nhill@redhat.com Signed-off-by: Frieda (Jingying) Huang jingyingfhuang@gmail.com
jikunshang pushed a commit to jikunshang/vllm that referenced this pull request
Signed-off-by: Nick Hill nhill@redhat.com
lk-chen pushed a commit to lk-chen/vllm that referenced this pull request
Signed-off-by: Nick Hill nhill@redhat.com
adobrzyn pushed a commit to HabanaAI/vllm-fork that referenced this pull request
Signed-off-by: Nick Hill nhill@redhat.com Signed-off-by: Agata Dobrzyniewicz adobrzyniewicz@habana.ai
RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request
Signed-off-by: Nick Hill nhill@redhat.com Signed-off-by: Mu Huai tianbowen.tbw@antgroup.com
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: Nick Hill nhill@redhat.com
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
[ Show hidden characters]({{ revealButtonHref }})