qtensor — Model Optimizer 0.27.1 (original) (raw)

Modules

modelopt.torch.quantization.qtensor.base_qtensor Base Class for Real Quantized Tensor.
modelopt.torch.quantization.qtensor.fp8_tensor Implements FP8 quantization for efficient tensor storage and computation.
modelopt.torch.quantization.qtensor.int4_tensor Implements INT4 quantization for efficient tensor storage and computation.
modelopt.torch.quantization.qtensor.nf4_tensor Implements NF4 quantization for efficient tensor storage and computation.

Tensor Class for Real Quantization.