[Frontend] Add rerank support to run_batch endpoint by pooyadavoodi · Pull Request #16278 · vllm-project/vllm
Signed-off-by: Pooya Davoodi <pooya.davoodi@parasail.io>
amitm02 pushed a commit to amitm02/vllm that referenced this pull request
Signed-off-by: Pooya Davoodi <pooya.davoodi@parasail.io>
Signed-off-by: amit <amit.man@gmail.com>
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: Pooya Davoodi <pooya.davoodi@parasail.io>