[Hardware][TPU][V1] Multi-LoRA Optimisations for the V1 TPU backend by Akshat-Tripathi · Pull Request #15655 · vllm-project/vllm
added 30 commits
Signed-off-by: Akshat Tripathi akshat@krai.ai
…because xla doesn't allow partial updates
This reverts commit b78b088.
Auto-merge was automatically disabled: the head branch was pushed to by a user without write access.
amitm02 pushed a commit to amitm02/vllm that referenced this pull request
Signed-off-by: Akshat Tripathi akshat@krai.ai
Signed-off-by: Chengji Yao chengjiyao@google.com
Signed-off-by: xihajun junfan@krai.ai
Signed-off-by: Jorge de Freitas jorge.de-freitas22@imperial.ac.uk
Signed-off-by: Jorge de Freitas jorge@krai.ai
Co-authored-by: Chengji Yao chengjiyao@google.com
Co-authored-by: xihajun junfan@krai.ai
Co-authored-by: Jorge de Freitas jorge.de-freitas22@imperial.ac.uk
Co-authored-by: Jorge de Freitas jorge@krai.ai
Signed-off-by: amit amit.man@gmail.com
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: Akshat Tripathi akshat@krai.ai
Signed-off-by: Chengji Yao chengjiyao@google.com
Signed-off-by: xihajun junfan@krai.ai
Signed-off-by: Jorge de Freitas jorge.de-freitas22@imperial.ac.uk
Signed-off-by: Jorge de Freitas jorge@krai.ai
Co-authored-by: Chengji Yao chengjiyao@google.com
Co-authored-by: xihajun junfan@krai.ai
Co-authored-by: Jorge de Freitas jorge.de-freitas22@imperial.ac.uk
Co-authored-by: Jorge de Freitas jorge@krai.ai