nvfp4_qat_config
Classes
fastvideo.layers.quantization.nvfp4_qat_config.NVFP4QATQuantizeMethod
Bases: QuantizeMethodBase
Source code in fastvideo/layers/quantization/nvfp4_qat_config.py
Functions
fastvideo.layers.quantization.nvfp4_qat_config.NVFP4QATQuantizeMethod.apply
Apply NVFP4 QAT quantized computation.
Source code in fastvideo/layers/quantization/nvfp4_qat_config.py
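To illustrate what a quantization-aware-training forward pass typically simulates, here is a minimal pure-Python sketch of NVFP4-style "fake quantization" (quantize, then immediately dequantize). It is not taken from the FastVideo source: the names `round_to_e2m1` and `fake_quantize_nvfp4` are hypothetical, the per-block scale is kept in full precision rather than rounded to FP8, and the second-level per-tensor scale is omitted.

```python
# Illustrative sketch only; NOT the FastVideo implementation of apply().
# Assumptions: 16-element blocks, FP4 E2M1 values, unrounded per-block scale.

E2M1_MAGNITUDES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]  # representable |x| in E2M1
BLOCK_SIZE = 16  # elements that share one scale


def round_to_e2m1(value: float) -> float:
    """Snap a value to a nearest E2M1 number, clamping at +/-6.

    Ties resolve toward the smaller magnitude here, which is a simplification
    of hardware rounding behaviour.
    """
    magnitude = min(abs(value), E2M1_MAGNITUDES[-1])
    nearest = min(E2M1_MAGNITUDES, key=lambda m: abs(m - magnitude))
    return nearest if value >= 0 else -nearest


def fake_quantize_nvfp4(values: list[float]) -> list[float]:
    """Quantize-dequantize a flat list of floats block by block."""
    out: list[float] = []
    for start in range(0, len(values), BLOCK_SIZE):
        block = values[start:start + BLOCK_SIZE]
        amax = max(abs(v) for v in block)
        if amax == 0.0:
            # An all-zero block needs no scale.
            out.extend(0.0 for _ in block)
            continue
        # Map the largest magnitude in the block onto the largest E2M1 value.
        scale = amax / E2M1_MAGNITUDES[-1]
        out.extend(round_to_e2m1(v / scale) * scale for v in block)
    return out
```

In actual training code this step would operate on tensors, and gradients would usually be passed through the rounding with a straight-through estimator; both are outside the scope of this sketch.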
fastvideo.layers.quantization.nvfp4_qat_config.NVFP4QATQuantizeMethod.create_weights
create_weights(layer: Module, input_size_per_partition: int, output_partition_sizes: list[int], input_size: int, output_size: int, params_dtype: dtype, **extra_weight_attrs)
Create weights for a linear layer. The signature matches the one defined by LinearMethodBase.
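The sketch below shows how the size arguments in this signature could relate to the shape of the weight a layer allocates. It assumes the common convention in which fused output partitions (for example a merged Q/K/V projection) are stacked along the output dimension; that layout is an assumption, not something this page confirms, and `partition_weight_shape` and `partition_offsets` are hypothetical helpers, not FastVideo functions.

```python
# Illustrative sketch only; the layout is an assumed convention, not confirmed
# by the FastVideo source.


def partition_weight_shape(
    input_size_per_partition: int,
    output_partition_sizes: list[int],
) -> tuple[int, int]:
    """Shape (out_features, in_features) of the weight held by one partition."""
    return (sum(output_partition_sizes), input_size_per_partition)


def partition_offsets(output_partition_sizes: list[int]) -> list[int]:
    """Starting row of each fused sub-projection inside the stacked weight."""
    offsets: list[int] = []
    running = 0
    for size in output_partition_sizes:
        offsets.append(running)
        running += size
    return offsets


# Hypothetical fused Q/K/V projection with a 2048-wide input on this partition.
shape = partition_weight_shape(2048, [1024, 256, 256])
offsets = partition_offsets([1024, 256, 256])
```

Under this assumption, `input_size` and `output_size` describe the full, unsharded layer, while the `*_per_partition` and `*_partition_sizes` arguments describe the slice that this rank allocates.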