Skip to content

Use fixed scale for Float8 softmax quantization instead of observer (… #390

Use fixed scale for Float8 softmax quantization instead of observer (…

Use fixed scale for Float8 softmax quantization instead of observer (… #390

Job log options

This job was skipped