Insights: pytorch/ao
Overview
- 11 Merged pull requests
- 6 Open pull requests
- 0 Closed issues
- 4 New issues
11 Pull requests merged by 9 people
- Add float8 MoE training readme and runnable example (#2353, merged Jun 11, 2025)
- float8 moe training conversion API prototype (#2275, merged Jun 10, 2025)
- Add static quant tutorial (#2047, merged Jun 10, 2025)
- Update QAT docs, highlight axolotl integration (#2266, merged Jun 10, 2025)
- [BE] Rename qparams for tinygemm (#2344, merged Jun 10, 2025)
- Add support for bmm and `to` for fbgemm Tensor (#2337, merged Jun 10, 2025)
- add cast config for fp8 enablement (#2328, merged Jun 10, 2025)
- Fix Per Tensor 3d reshape (#2293, merged Jun 9, 2025)
- Update Quantization docs to show newer AOConfigs (#2317, merged Jun 9, 2025)
- Enhance test_autoquant_compile to support ROCm (#2100, merged Jun 9, 2025)
- Migrate xnnpack/vulkan/boltnn pt2e from torch.ao to torchao (#11363) (#2302, merged Jun 9, 2025)
6 Pull requests opened by 5 people
- Replace debug handle with `from_node` to trace operator transformation (#2339, opened Jun 9, 2025)
- Inference tutorial - Part 3 of e2e series [WIP] (#2343, opened Jun 9, 2025)
- Add inplace quantizer examples (#2345, opened Jun 10, 2025)
- Add Tutorial on E2E integration into VLLM and minimal Subclass (#2346, opened Jun 10, 2025)
- [BE] Convert quant_primitives methods private (#2350, opened Jun 10, 2025)
- [float8] Add fnuz fp8 dtypes to Float8Layout (#2351, opened Jun 10, 2025)
4 Issues opened by 4 people
- DISABLED test_int4_weight_only_quant_subclass_grouped_5_cuda (__main__.TestSubclass) (#2352, opened Jun 10, 2025)
- Add _apply_fn_to_data in AOBaseClass (#2349, opened Jun 10, 2025)
- Distributing ao tensor subclasses in .safetensors checkpoints (#2338, opened Jun 9, 2025)
15 Unresolved conversations
Conversations sometimes continue on older items that are not yet closed. Below is a list of all Issues and Pull Requests with unresolved conversations.
- DUMMY PR: add support for hpu in float8 base and compile test for torch ao (#2326, commented on Jun 10, 2025; 14 new comments)
- moe quant with dedicated kernels [wip] (#2325, commented on Jun 9, 2025; 3 new comments)
- ROCm mx-fp8 Gemm (#2066, commented on Jun 10, 2025; 2 new comments)
- skip quant/dequant decomposed (#2299, commented on Jun 10, 2025; 1 new comment)
- Add round_scales_to_power_of_2 option for float quantization (#2323, commented on Jun 9, 2025; 1 new comment)
- BF16 stochastic rounding does not work distributed (FSDP) (#2296, commented on Jun 8, 2025; 0 new comments)
- [Question] Combining QAT and Sparsity Training (#2310, commented on Jun 9, 2025; 0 new comments)
- [Windows][build] Two build failures on Windows on latest main branch (#2297, commented on Jun 9, 2025; 0 new comments)
- int4_weight_only get plain weight are padded (#2249, commented on Jun 11, 2025; 0 new comments)
- [CPU] Enable DA8W4 on CPU (#2128, commented on Jun 8, 2025; 0 new comments)
- Eval hf models using lm_eval (#2179, commented on Jun 9, 2025; 0 new comments)
- [WIP] Enable Int4WeightOnlyGPTQQuantizer on Intel GPU (#2200, commented on Jun 10, 2025; 0 new comments)
- Fix failing tests on h100 (#2231, commented on Jun 9, 2025; 0 new comments)
- Build mxfp4 kernel for sm120a (#2285, commented on Jun 8, 2025; 0 new comments)
- [BE] Make ScalingGranularity module level so it can be rendered in API ref on docsite (#2314, commented on Jun 9, 2025; 0 new comments)