[Frontend] Optimize beam search loop by sorting and then splicing by zhanggzh · Pull Request #19347 · vllm-project/vllm (original) (raw)
[](/apps/gemini-code-assist)
[](/apps/gemini-code-assist)
Signed-off-by: zhangguozhu zhangguozhu@360.cn
zhangguozhu added 2 commits
Signed-off-by: zhangguozhu zhangguozhu@360.cn
Signed-off-by: zhangguozhu zhangguozhu@360.cn
martinbomio added a commit to martinbomio/vllm that referenced this pull request
Signed-off-by: mgoin mgoin64@gmail.com
mgoin changed the title
[Frontend]Opt beam search [Frontend] Optimize beam search
mgoin changed the title
[Frontend] Optimize beam search [Frontend] Optimize beam search loop by sorting and then splicing
Signed-off-by: mgoin mgoin64@gmail.com
Signed-off-by: mgoin mgoin64@gmail.com
mgoin added performance
Performance-related issues
ONLY add when PR is ready to merge/full CI is needed
and removed unstale
Recieved activity after being labelled stale
labels
RunkaiTao pushed a commit to RunkaiTao/vllm that referenced this pull request
Signed-off-by: zhangguozhu zhangguozhu@360.cn Signed-off-by: mgoin mgoin64@gmail.com Co-authored-by: zhangguozhu zhangguozhu@360.cn Co-authored-by: mgoin mgoin64@gmail.com Signed-off-by: Runkai Tao rt572@physics.rutgers.edu
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request
Signed-off-by: zhangguozhu zhangguozhu@360.cn Signed-off-by: mgoin mgoin64@gmail.com Co-authored-by: zhangguozhu zhangguozhu@360.cn Co-authored-by: mgoin mgoin64@gmail.com
kitaekatt pushed a commit to kitaekatt/vllm that referenced this pull request
Signed-off-by: zhangguozhu zhangguozhu@360.cn Signed-off-by: mgoin mgoin64@gmail.com Co-authored-by: zhangguozhu zhangguozhu@360.cn Co-authored-by: mgoin mgoin64@gmail.com
mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request
Signed-off-by: zhangguozhu zhangguozhu@360.cn Signed-off-by: mgoin mgoin64@gmail.com Co-authored-by: zhangguozhu zhangguozhu@360.cn Co-authored-by: mgoin mgoin64@gmail.com
my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request
Signed-off-by: zhangguozhu zhangguozhu@360.cn Signed-off-by: mgoin mgoin64@gmail.com Co-authored-by: zhangguozhu zhangguozhu@360.cn Co-authored-by: mgoin mgoin64@gmail.com
my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request
Signed-off-by: zhangguozhu zhangguozhu@360.cn Signed-off-by: mgoin mgoin64@gmail.com Co-authored-by: zhangguozhu zhangguozhu@360.cn Co-authored-by: mgoin mgoin64@gmail.com
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: zhangguozhu zhangguozhu@360.cn Signed-off-by: mgoin mgoin64@gmail.com Co-authored-by: zhangguozhu zhangguozhu@360.cn Co-authored-by: mgoin mgoin64@gmail.com
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
[ Show hidden characters]({{ revealButtonHref }})