Adding load_in_8bit to inference_hf.py by airaria · Pull Request #756 · ymcui/Chinese-LLaMA-Alpaca (original) (raw)

Adding --load_in_8bit option to inference_hf.py

Usage
Just add the --load_in_8bit to the launching command. For example:

python scripts/inference/inference_hf.py \
    --base_model path_to_merged_chinese_alpaca_plus \
    --with_prompt --interactive \
    --load_in_8bit