
Implemented a new test case for LUT quantization #2342


Open · wants to merge 2 commits into main

Conversation

szyszyzys
Contributor

No description provided.

@szyszyzys szyszyzys added the topic: not user facing Use this tag if you don't want this PR to show up in release notes label Jun 9, 2025

pytorch-bot bot commented Jun 9, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2342

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 276d953 with merge base d72a6d1:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 9, 2025


std::pair<std::vector<int8_t>, std::vector<int8_t>>
generate_simple_u_to_s_lut_and_indices(
Contributor

What does "simple_u_to_s" mean?

Contributor Author

"simple_u_to_s" refers to converting an unsigned index to a signed index. The example covers a simplified scenario where we only need to map a value to each LUT slot. I will rename the function to something more descriptive and readable.

Contributor

Yeah, let's come up with a better name, e.g., maybe "generate_random_int8_lut_and_indices" or something like that
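
For concreteness, a minimal sketch of what the renamed helper could look like, assuming the same return type as the current function; the parameter names and RNG choices are illustrative, not the PR's actual code:

#include <cstddef>
#include <cstdint>
#include <random>
#include <utility>
#include <vector>

// Hypothetical sketch: builds a random int8 LUT with 2^weight_nbit entries
// and a vector of unsigned positions into it (stored in int8_t, matching the
// current return type).
std::pair<std::vector<int8_t>, std::vector<int8_t>>
generate_random_int8_lut_and_indices(int weight_nbit, size_t num_indices) {
  std::mt19937 gen(std::random_device{}());
  // uniform_int_distribution is not defined for char types, so draw ints
  // and cast down to int8_t.
  std::uniform_int_distribution<int> value_dist(-128, 127);

  const size_t lut_size = size_t(1) << weight_nbit;
  std::vector<int8_t> lut(lut_size);
  for (auto& v : lut) {
    v = static_cast<int8_t>(value_dist(gen));
  }

  std::uniform_int_distribution<int> index_dist(0, static_cast<int>(lut_size) - 1);
  std::vector<int8_t> indices(num_indices);
  for (auto& idx : indices) {
    idx = static_cast<int8_t>(index_dist(gen));
  }
  return {lut, indices};
}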

* input data type (T_in) to its corresponding floating-point representation for
* each quantization group.
*
* @tparam T_in The data type of the quantized values (e.g., int8_t). This
Contributor

Where is the T_in defined in the code?

Contributor Author

I updated the implementation but forgot to update the function description. I will provide a new description.
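
For illustration, a minimal sketch (with a hypothetical name and signature) of keeping the @tparam documentation in sync with an actual template parameter:

#include <cstdint>
#include <vector>

/**
 * Builds one dequantization LUT per quantization group, mapping each
 * quantized value of type T_in to its floating-point representation.
 *
 * @tparam T_in The data type of the quantized values (e.g., int8_t).
 * @return A flattened std::vector<float> with all group LUTs concatenated.
 */
template <typename T_in>
std::vector<float> generate_dequant_luts_for_groups(
    const std::vector<float>& scales,
    const std::vector<T_in>& zeros,
    int q_min,
    int q_max);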

* @param has_zeros A flag indicating if the zero-points should be used.
* @return A flattened std::vector<float> containing all the group LUTs concatenated.
*/
std::vector<int8_t> generate_requant_lut_from_params(
Contributor

Why are we returning int8_t type, and not float32_t for LUT?

Contributor Author

In the context of test_channelwise_8bit_activation_groupwise_lowbit_weight_lut, the code requires an int8_t LUT to function correctly. I initially designed this as a second quantization step, hence the int8_t output. I agree it should be flexible enough to accommodate different data types for other scenarios; I addressed this elsewhere in the code but overlooked this function. I will make the necessary adjustments.
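
As a sketch of that flexibility, the LUT element type could be templated instead of hard-coded; the signature and parameters below are assumptions for illustration:

#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical variant: T_out = int8_t preserves the current requantization
// behavior, while T_out = float would yield a float32 LUT.
template <typename T_out>
std::vector<T_out> generate_requant_lut_from_params(
    int q_min,
    int q_max,
    const std::vector<int8_t>& zeros) {
  const size_t lut_size_per_group = static_cast<size_t>(q_max) - q_min + 1;
  const size_t num_groups = zeros.size();

  std::vector<T_out> luts(num_groups * lut_size_per_group);
  for (size_t g = 0; g < num_groups; ++g) {
    for (int q_val = q_min; q_val <= q_max; ++q_val) {
      const size_t lut_idx = g * lut_size_per_group + (q_val - q_min);
      // The LUT stores the result of (quantized_value - zero_point).
      luts[lut_idx] = static_cast<T_out>(q_val - zeros[g]);
    }
  }
  return luts;
}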

const size_t lut_size_per_group = static_cast<size_t>(q_max) - q_min + 1;
const int lut_index_offset = q_min;

const int num_groups = zeros.size();
Contributor

What is this when has_zeros=false?

Contributor Author

It could lead to errors. I should not rely on the size of zeros when has_zeros is false.
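
A minimal sketch of one possible fix, assuming scales always holds one entry per group; names are illustrative:

#include <cstddef>
#include <cstdint>
#include <vector>

std::vector<int8_t> build_luts(
    int q_min,
    int q_max,
    bool has_zeros,
    const std::vector<float>& scales,
    const std::vector<int8_t>& zeros) {
  const size_t lut_size_per_group = static_cast<size_t>(q_max) - q_min + 1;
  // Derive the group count from a vector that is always populated, and only
  // read zeros when has_zeros is true (the zero-point is 0 otherwise).
  const size_t num_groups = scales.size();

  std::vector<int8_t> luts(num_groups * lut_size_per_group);
  for (size_t g = 0; g < num_groups; ++g) {
    const int zero_point = has_zeros ? static_cast<int>(zeros[g]) : 0;
    for (int q_val = q_min; q_val <= q_max; ++q_val) {
      luts[g * lut_size_per_group + (q_val - q_min)] =
          static_cast<int8_t>(q_val - zero_point);
    }
  }
  return luts;
}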

for (int q_val = q_min; q_val <= q_max; ++q_val) {
size_t lut_idx = group_idx * lut_size_per_group + (q_val - lut_index_offset);
// The LUT stores the result of (quantized_value - zero_point)
luts[lut_idx] = static_cast<int8_t>(q_val - zero_point);
Contributor

Why is the lut output dtype int8_t?

}

// Helper to perform quantization
static std::vector<T_in> quantize_input(
Contributor

What is this for?

Can't we just generate the random LUT for testing?

Contributor Author

Yes, we can generate a random LUT for testing. My initial design aimed to extend this function to more scenarios, such as secondary quantization, so that it could work in conjunction with other components and be integrated and tested as part of a larger system.

Contributor

I'm not sure I follow how you're planning on using this. For LUT quantization, we won't be quantizing in this way.

We also shouldn't re-invent the quantization logic because we have this elsewhere in other tests, e.g., here: https://github.com/pytorch/ao/blob/main/torchao/experimental/kernels/cpu/aarch64/tests/test_utils.h#L110.

If you want to refactor that code to use non-aarch64 code, that'd be great, but I'm not sure it's related to the LUT project.

Contributor Author

Got it. Thanks!

}

// Helper to compute the reference output
static std::vector<float> compute_expected_output(
Contributor

Is this doing the matrix product of activations * dequantized_weights?

It looks like it's just dequantizing the weights?

Contributor Author

I may have overdesigned this part. My initial intention was to accommodate a more complex setup, such as integrating with other quantization methods or using different LUT designs; that's why I used GroundTruthStrategy to select the dequantization strategy. It seems unnecessary in this context.
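
For reference, a hedged sketch of a version that computes the full matrix product of activations and dequantized weights, rather than only dequantizing; the layout (row-major, groups along k) and all names are assumptions, not the PR's helper:

#include <cstddef>
#include <cstdint>
#include <vector>

std::vector<float> compute_expected_output(
    size_t m, size_t k, size_t n, size_t group_size,
    const std::vector<float>& activations,      // m x k, row-major
    const std::vector<int8_t>& weight_qvals,    // n x k, row-major
    const std::vector<float>& weight_scales,    // n x (k / group_size)
    const std::vector<int8_t>& weight_zeros) {  // n x (k / group_size)
  const size_t groups_per_row = k / group_size;
  std::vector<float> out(m * n, 0.0f);
  for (size_t i = 0; i < m; ++i) {
    for (size_t j = 0; j < n; ++j) {
      float acc = 0.0f;
      for (size_t p = 0; p < k; ++p) {
        const size_t g = j * groups_per_row + p / group_size;
        // Dequantize the weight for this group, then accumulate the dot product.
        const float w = weight_scales[g] *
            static_cast<float>(weight_qvals[j * k + p] - weight_zeros[g]);
        acc += activations[i * k + p] * w;
      }
      out[i * n + j] = acc;
    }
  }
  return out;
}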
