[VLM] Add PP support and fix GPTQ inference for Ovis models by Isotr0py · Pull Request #18958 · vllm-project/vllm

added 10 commits

May 17, 2025 15:51

@Isotr0py

Signed-off-by: isotr0py <2037008807@qq.com>

DarkLight1337

jeejeelee

amitm02 pushed a commit to amitm02/vllm that referenced this pull request

Jun 1, 2025

@Isotr0py @amitm02

…ject#18958)

Signed-off-by: isotr0py <2037008807@qq.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
Signed-off-by: amit <amit.man@gmail.com>

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request

May 19, 2026

@Isotr0py

…ject#18958)

Signed-off-by: isotr0py <2037008807@qq.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
