Parameters | EvalScope (original) (raw)

Run evalscope eval --help to get a complete list of parameter descriptions.

Model Parameters#

`# For example, pass arguments in the form of key=value
--model-args revision=master,precision=torch.float16,device_map=auto
--generation-config do_sample=true,temperature=0.5

Or pass more complex parameters using a JSON string

--model-args '{"revision": "master", "precision": "torch.float16", "device_map": "auto"}'
--generation-config '{"do_sample":true,"temperature":0.5,"chat_template_kwargs":{"enable_thinking": false}}'
`

The following parameters are only valid when eval-type=service:

Dataset Parameters#

# For example --datasets gsm8k arc --dataset-args '{"gsm8k": {"few_shot_num": 4, "few_shot_random": false}, "arc": {"dataset_id": "/path/to/arc"}}, "ifeval": {"filters": {"remove_until": "</think>"}}'

Evaluation Parameters#

Judge Parameters#

The LLM-as-a-Judge evaluation parameters use a judge model to determine correctness, including the following parameters:

Other Parameters#