qtensor — Model Optimizer 0.27.1 (original) (raw)
Modules
modelopt.torch.quantization.qtensor.base_qtensor | Base Class for Real Quantized Tensor. |
---|---|
modelopt.torch.quantization.qtensor.fp8_tensor | Implements FP8 quantization for efficient tensor storage and computation. |
modelopt.torch.quantization.qtensor.int4_tensor | Implements INT4 quantization for efficient tensor storage and computation. |
modelopt.torch.quantization.qtensor.nf4_tensor | Implements NF4 quantization for efficient tensor storage and computation. |
Tensor Class for Real Quantization.