[Bugfix] model_max_length should consider max_model_len in tokenizer_config by noooop · Pull Request #19201 · vllm-project/vllm
noooop changed the title
from "[Bugfix] fix XLMRobertaForSequenceClassification cross_encoding truncate_prompt_tokens" to "[Bugfix] fix XLMRobertaForSequenceClassification for cross_encoding using truncate_prompt_tokens"
noooop changed the title
from "[Bugfix] fix XLMRobertaForSequenceClassification for cross_encoding using truncate_prompt_tokens" to "[Bugfix] fix XLMRobertaForSequenceClassification for cross_encoding max_model_len"
noooop changed the title
from "[Bugfix] fix XLMRobertaForSequenceClassification for cross_encoding max_model_len" to "[Bugfix] model_max_length should consider max_model_len in tokenizer_config"
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request