model_calib — Model Optimizer 0.27.1

TensorRT Model Optimizer

Calibration utilities.

Functions

max_calibrate Calibrate the model using max.
awq Apply AWQ to the model.
smoothquant Smooth-Quant variant with per-channel weight scaling.
svdquant Lite version of SVDQuant.

awq(model, algorithm='awq_lite', forward_loop=None, **kwargs)

Apply AWQ to the model.
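
Every function on this page takes a forward_loop argument. The sketch below assumes that forward_loop is a callable which receives the model and runs calibration batches through it. make_forward_loop, ToyModel, and the batch values are hypothetical names used only for illustration. The real call to awq is left as a comment because it needs an actual model prepared for quantization.

```python
from typing import Callable, Iterable, List


def make_forward_loop(batches: Iterable[List[float]]) -> Callable[[Callable], None]:
    """Build a forward_loop that feeds every calibration batch to the model."""
    batches = list(batches)  # materialize so the loop can be reused

    def forward_loop(model: Callable) -> None:
        for batch in batches:
            model(batch)  # outputs are discarded; only the forward pass matters

    return forward_loop


class ToyModel:
    """Hypothetical stand-in for a model; it only counts the batches it sees."""

    def __init__(self) -> None:
        self.seen = 0

    def __call__(self, batch: List[float]) -> List[float]:
        self.seen += 1
        return batch


calib_batches = [[0.1, -0.4], [2.0, 0.3], [-1.5, 0.9]]
forward_loop = make_forward_loop(calib_batches)

toy = ToyModel()
forward_loop(toy)
print(toy.seen)

# With the real library the same callable would be passed as, for example:
# awq(model, algorithm="awq_lite", forward_loop=forward_loop)
```

Keeping the data iteration inside the callable means the calibration routine only has to invoke forward_loop(model) and does not need to know how batches are loaded.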

Parameters:

model
algorithm (default: 'awq_lite')
forward_loop (default: None)
**kwargs

max_calibrate(model, forward_loop=None, distributed_sync=True)

Calibrate the model using max.
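
As a rough illustration of what max calibration means in general, the sketch below tracks the largest absolute value seen across calibration batches and derives a symmetric integer scale from it. This is a conceptual sketch in plain Python, not the library's implementation. MaxCalibrator, collect, scale, and quantize are hypothetical names, and the 8-bit symmetric range is an assumption.

```python
from typing import Iterable, List


class MaxCalibrator:
    """Hypothetical helper: tracks the running max of |x| over all batches."""

    def __init__(self) -> None:
        self.amax = 0.0

    def collect(self, batch: Iterable[float]) -> None:
        for x in batch:
            self.amax = max(self.amax, abs(x))

    def scale(self, num_bits: int = 8) -> float:
        if self.amax == 0.0:
            raise ValueError("no non-zero calibration data collected")
        # Symmetric signed range: for 8 bits the largest level is 127.
        qmax = 2 ** (num_bits - 1) - 1
        return self.amax / qmax


def quantize(values: List[float], scale: float, num_bits: int = 8) -> List[int]:
    """Round to the nearest integer level and clamp to the symmetric range."""
    qmax = 2 ** (num_bits - 1) - 1
    return [max(-qmax, min(qmax, round(v / scale))) for v in values]


calib = MaxCalibrator()
for batch in [[0.1, -0.4], [2.0, 0.3], [-1.27, 0.9]]:
    calib.collect(batch)

s = calib.scale()                  # amax is 2.0, so the scale is 2.0 / 127
q = quantize([2.0, -0.5, 3.0], s)  # 3.0 exceeds amax and is clamped
print(q)                           # [127, -32, 127]
```

Values beyond the calibrated maximum saturate at the largest level, which is why the calibration data should cover the range expected at inference time.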

Parameters:

model
forward_loop (default: None)
distributed_sync (default: True)

smoothquant(model, forward_loop=None, alpha=1.0)

Smooth-Quant variant with per-channel weight scaling.

The parameters are as described in SmoothQuantCalibConfig.
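
To make the role of alpha concrete, the sketch below applies the smoothing rule from the SmoothQuant paper, s_j = max|X_j|^alpha / max|W_j|^(1 - alpha), to a toy activation and weight matrix. It is an assumption that this variant follows the same rule; the page does not spell it out. smooth_scales, matmul, and the toy data are hypothetical and written in plain Python.

```python
from typing import List

Matrix = List[List[float]]


def smooth_scales(x: Matrix, w: Matrix, alpha: float = 1.0) -> List[float]:
    """Per-input-channel scales: max|X[:, j]|**alpha / max|W[j, :]|**(1 - alpha)."""
    scales = []
    for j in range(len(w)):
        act_max = max(abs(row[j]) for row in x)
        weight_max = max(abs(v) for v in w[j])
        scales.append(act_max ** alpha / weight_max ** (1.0 - alpha))
    return scales


def matmul(a: Matrix, b: Matrix) -> Matrix:
    cols = range(len(b[0]))
    return [[sum(v * b[k][j] for k, v in enumerate(row)) for j in cols] for row in a]


# Toy data: 2 tokens x 3 input channels, and 3 input channels x 2 output channels.
x = [[1.0, -8.0, 0.5], [2.0, 4.0, -0.25]]
w = [[0.5, -1.0], [0.25, 0.125], [2.0, -4.0]]

s = smooth_scales(x, w, alpha=1.0)

# Divide each activation channel by its scale; multiply the matching weight row.
x_smooth = [[v / s[j] for j, v in enumerate(row)] for row in x]
w_smooth = [[v * s[j] for v in w[j]] for j in range(len(w))]

y_ref = matmul(x, w)
y_smooth = matmul(x_smooth, w_smooth)
print(s)
print(y_ref == y_smooth)
```

With alpha=1.0 the scale of each channel is simply its activation maximum, so every smoothed activation channel peaks at 1 and the whole range difference is moved into the weights. The product is unchanged because the two scalings cancel.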

Parameters:

model
forward_loop (default: None)
alpha (default: 1.0)

svdquant(model, forward_loop=None, lowrank=32, **kwargs)

Lite version of SVDQuant.

The parameters are as described in SVDQuantConfig.

Parameters:

model
forward_loop (default: None)
lowrank (default: 32)
**kwargs