Insights: pytorch/ao
Overview
- 5 Merged pull requests
- 6 Open pull requests
- 0 Closed issues
- 1 New issue
5 Pull requests merged by 5 people
- add cast config for fp8 enablement (#2328, merged Jun 10, 2025)
- Fix Per Tensor 3d reshape (#2293, merged Jun 9, 2025)
- Update Quantization docs to show newer AOConfigs (#2317, merged Jun 9, 2025)
- Enhance test_autoquant_compile to support ROCm (#2100, merged Jun 9, 2025)
- Migrate xnnpack/vulkan/boltnn pt2e from torch.ao to torchao (#11363) (#2302, merged Jun 9, 2025)
6 Pull requests opened by 5 people
- Replace debug handle with `from_node` to trace operator transformation (#2339, opened Jun 9, 2025)
- Implemented a new test case for LUT quantization (#2342, opened Jun 9, 2025)
- Inference tutorial - Part 3 of e2e series [WIP] (#2343, opened Jun 9, 2025)
- [BE] Rename qparams for tinygemm (#2344, opened Jun 9, 2025)
- Add inplace quantizer examples (#2345, opened Jun 10, 2025)
- Add Tutorial on E2E integration into VLLM and minimal Subclass (#2346, opened Jun 10, 2025)
1 Issue opened by 1 person
- Distributing ao tensor subclasses in .safetensors checkpoints (#2338, opened Jun 9, 2025)
14 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- DUMMY PR: add support for hpu in float8 base and compile test for torch ao (#2326, commented on Jun 10, 2025 • 14 new comments)
- float8 moe training conversion API prototype (#2275, commented on Jun 10, 2025 • 4 new comments)
- ROCm mx-fp8 Gemm (#2066, commented on Jun 10, 2025 • 2 new comments)
- moe quant with dedicated kernels [wip] (#2325, commented on Jun 9, 2025 • 2 new comments)
- Add support for bmm and `to` for fbgemm Tensor (#2337, commented on Jun 9, 2025 • 2 new comments)
- skip quant/dequant decomposed (#2299, commented on Jun 10, 2025 • 1 new comment)
- Add round_scales_to_power_of_2 option for float quantization (#2323, commented on Jun 9, 2025 • 1 new comment)
- [Question] Combining QAT and Sparsity Training (#2310, commented on Jun 9, 2025 • 0 new comments)
- [Windows][build] two Build failure on Windows on latest main branch (#2297, commented on Jun 9, 2025 • 0 new comments)
- int4_weight_only get plain weight are padded (#2249, commented on Jun 10, 2025 • 0 new comments)
- Eval hf models using lm_eval (#2179, commented on Jun 9, 2025 • 0 new comments)
- [WIP] Enable Int4WeightOnlyGPTQQuantizer on Intel GPU (#2200, commented on Jun 9, 2025 • 0 new comments)
- Fix failing tests on h100 (#2231, commented on Jun 9, 2025 • 0 new comments)
- [BE] Make ScalingGranularity module level so it can be rendered in API ref on docsite (#2314, commented on Jun 9, 2025 • 0 new comments)