[Misc] reuse num_tokens_across_dp of get_dp_padding to avoid unnecessary dp all reduce in set_forward_context by izhuhaoran · Pull Request #18935 · vllm-project/vllm (original) (raw)

@izhuhaoran

…all reduce in set_forward_context

@izhuhaoran

@izhuhaoran

varun-sundar-rabindranath

varun-sundar-rabindranath

varun-sundar-rabindranath

varun-sundar-rabindranath

@izhuhaoran

varun-sundar-rabindranath

tlrmchlsmth

@izhuhaoran

@tlrmchlsmth

Signed-off-by: Tyler Michael Smith tysmith@redhat.com

tlrmchlsmth

@tlrmchlsmth tlrmchlsmth added the ready

ONLY add when PR is ready to merge/full CI is needed

label

Jun 1, 2025

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request

May 19, 2026

@izhuhaoran @tlrmchlsmth

…ary dp all reduce in set_forward_context (vllm-project#18935)

Signed-off-by: Tyler Michael Smith tysmith@redhat.com Co-authored-by: zhuhaoran zhuhaoran.zhr@alibaba-inc.com Co-authored-by: Tyler Michael Smith tysmith@redhat.com

This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters

[ Show hidden characters]({{ revealButtonHref }})