Float8DynamicActivationFloat8WeightConfig — torchao main documentation (original) (raw)

class torchao.quantization.Float8DynamicActivationFloat8WeightConfig(activation_dtype: dtype = torch.float8_e4m3fn, weight_dtype: dtype = torch.float8_e4m3fn, granularity: Optional[Union[PerTensor, PerRow, List[Union[PerTensor, PerRow]]]] = None, mm_config: Optional[Float8MMConfig] = None, set_inductor_config: bool = True)[source]

Configuration for applying float8 dynamic symmetric quantization to both activations and weights of linear layers.

Parameters: