[Bugfix] model_max_length should consider max_model_len in tokenizer_config by noooop · Pull Request #19201 · vllm-project/vllm (original) (raw)

[gemini-code-assist[bot]](/apps/gemini-code-assist)

[gemini-code-assist[bot]](/apps/gemini-code-assist)

@noooop

@noooop noooop changed the title[Bugfix] fix XLMRobertaForSequenceClassification cross_encoding truncate_prompt_tokens [Bugfix] fix XLMRobertaForSequenceClassification for cross_encoding using truncate_prompt_tokens

Jun 5, 2025

@noooop

@noooop noooop changed the title[Bugfix] fix XLMRobertaForSequenceClassification for cross_encoding using truncate_prompt_tokens [Bugfix] fix XLMRobertaForSequenceClassification for cross_encoding max_model_len

Jun 5, 2025

@noooop

@noooop

@noooop noooop changed the title[Bugfix] fix XLMRobertaForSequenceClassification for cross_encoding max_model_len [Bugfix] model_max_length should consider max_model_len in tokenizer_config

Jun 6, 2025

DarkLight1337

@noooop

DarkLight1337

@noooop

@noooop

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request

May 19, 2026

@noooop

This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters

[ Show hidden characters]({{ revealButtonHref }})