[Inductor][Quant] Change the QConv output scale name #124246

leslie-fang-intel · 2024-04-17T02:13:02Z

Stack from ghstack (oldest at bottom):

Summary
Rename the QConv output scale from `inv_output_scale` to `output_scale`, now that the optimization of quant/dequant has moved from the decomposition phase to the lowering phase.
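To make the naming difference concrete, here is a minimal sketch in plain Python. It assumes that `inv_output_scale` denoted the reciprocal of the quantization scale, as the name suggests; the helper functions, the `uint8` range, and the sample values are illustrative only and are not the actual QConv kernel code.

```python
def quantize_with_scale(x, output_scale, zero_point, qmin=0, qmax=255):
    """Affine quantization using the scale itself: q = round(x / scale) + zp."""
    q = round(x / output_scale) + zero_point
    return max(qmin, min(qmax, q))


def quantize_with_inv_scale(x, inv_output_scale, zero_point, qmin=0, qmax=255):
    """Same quantization, but the caller passes 1 / scale (assumed old convention)."""
    q = round(x * inv_output_scale) + zero_point
    return max(qmin, min(qmax, q))


# Hypothetical values; a power-of-two scale keeps both paths exact in floating point.
scale = 0.5
zero_point = 10
samples = [-1.0, 0.0, 0.75, 3.0, 200.0]

with_scale = [quantize_with_scale(x, scale, zero_point) for x in samples]
with_inv = [quantize_with_inv_scale(x, 1.0 / scale, zero_point) for x in samples]
```

Under that assumption both conventions describe the same quantization; the rename only changes which of the two values the caller is expected to pass.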

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @voznesenskym @penguinwu @EikanWang @Guobing-Chen @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler @amjames @desertfire @chauhang

[ghstack-poisoned]

pytorch-bot · 2024-04-17T02:13:04Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/124246

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit b291167 with merge base fdff992:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

inductor / cuda12.1-py3.10-gcc9-sm86 / test (dynamic_inductor_timm, 2, 2, linux.g5.4xlarge.nvidia.gpu) (gh) (similar failure)
sebotnet33ts_256

BROKEN TRUNK - The following job failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / linux-focal-cuda11.8-py3.10-gcc9 / test (distributed, 3, 3, linux.8xlarge.nvidia.gpu) (gh) (trunk failure)
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! KeyboardInterrupt !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: e51d53dbf004d4132c6bc97cb33c5a2a0c99c617 Pull Request resolved: #124246

peterbell10

backward_compat failure is real. Assuming these are internal operators only used by the quantization fx_passes though, you can just add an exception.

leslie-fang-intel · 2024-04-28T02:34:58Z

> backward_compat failure is real. Assuming these are internal operators only used by the quantization fx_passes though, you can just add an exception.

Hi @peterbell10, thanks for the comment. I think they are internal operators only used by the Inductor quantization fx_passes. I have modified check_forward_backward_compatibility.py so that the backward_compat test passes. Could you please take another look?
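For readers unfamiliar with this kind of exception, the sketch below shows the general idea of an allow list for schema-compatibility checks: each entry pairs an operator-name pattern with an expiry date, and a changed schema is tolerated while the entry is still valid. The entries, the expiry dates, and the `is_allowed` helper are assumptions made for illustration; the actual contents of check_forward_backward_compatibility.py and the exact entries added in this PR may differ.

```python
import datetime
import re

# Hypothetical allow-list entries: (operator-name regex, date until which
# a schema change for matching operators is tolerated).
ALLOW_LIST = [
    ("onednn::qconv1d_pointwise", datetime.date(2024, 12, 31)),
    ("onednn::qconv2d_pointwise", datetime.date(2024, 12, 31)),
]

# Compile the patterns once so repeated checks are cheap.
ALLOW_LIST_COMPILED = [(re.compile(pattern), expiry) for pattern, expiry in ALLOW_LIST]


def is_allowed(schema_str, today):
    """Return True if the schema matches an allow-list entry that has not expired."""
    for pattern, expiry in ALLOW_LIST_COMPILED:
        if today < expiry and pattern.search(schema_str):
            return True
    return False
```

An expiry date keeps the exception temporary: once it passes, the compatibility check applies to the operator again.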

ghstack-source-id: c1224c830802161e59f32f136cf2749a7b6ff2cd Pull Request resolved: #124246

ghstack-source-id: 42151e4eeb13585a8ec29c8d7d2062d2a54b9d4f Pull Request resolved: #124246

leslie-fang-intel · 2024-04-28T06:26:31Z

Hi @jerryzh168, could you please take a look at this PR and the previous one, per the discussion in #123444?

leslie-fang-intel · 2024-05-09T00:55:00Z

@pytorchbot merge

pytorchmergebot · 2024-05-09T00:56:48Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

**Summary**
Fix two regressions caused by the previous refactor:
- Fix the dequant promotion pass with dynamic quant when the dequant node uses the `tensor` overload.
- Fix a numerical issue in dynamic quant: in the previous implementation, the input was converted to the scale's dtype (which is `double`) before the quant operation.

**TestPlan**
```
clear && python -u -m pytest -s -v test/inductor/test_mkldnn_pattern_matcher.py -k test_dynamic_qlinear_input_dim_exceeds_2
clear && python -u -m pytest -s -v test/inductor/test_mkldnn_pattern_matcher.py -k test_qlinear_dequant_promotion_dynamic_cpu
```

Pull Request resolved: #125207
Approved by: https://github.com/peterbell10, https://github.com/jgong5
ghstack dependencies: #124041, #124246

This reverts commit 9ba9f7f. Reverted #124246 on behalf of https://github.com/huydhn due to Sorry for reverting your change but I think there is a land race with the change https://hud.pytorch.org/pytorch/pytorch/commit/33e6791645b5950b0f39301f55b8a4a79c0ca847 ([comment](#124041 (comment)))

pytorchmergebot · 2024-05-09T01:34:26Z

@leslie-fang-intel your PR has been successfully reverted.

leslie-fang-intel · 2024-05-09T08:41:33Z

@pytorchbot merge

pytorchmergebot · 2024-05-09T08:43:27Z

Merge started


[Inductor][Quant] Change the QConv output scale name

41cff0f

[ghstack-poisoned]

leslie-fang-intel requested review from jerryzh168, salilsdesai, kimishpatel, digantdesai and jianyuh as code owners April 17, 2024 02:13

leslie-fang-intel mentioned this pull request Apr 17, 2024

[Inductor] [Quant] Enable lowering of quant per tensor and refactor quant pattern #124041

Closed

pytorch-bot bot added ciflow/inductor module: cpu CPU specific problem (e.g., perf, algorithm) module: inductor release notes: quantization release notes category labels Apr 17, 2024

leslie-fang-intel added a commit that referenced this pull request Apr 17, 2024

[Inductor][Quant] Change the QConv output scale name

bf3d68b

ghstack-source-id: e51d53dbf004d4132c6bc97cb33c5a2a0c99c617 Pull Request resolved: #124246

leslie-fang-intel added the ciflow/trunk Trigger trunk jobs on your pull request label Apr 17, 2024

pytorchbot added the open source label Apr 17, 2024

leslie-fang-intel requested review from jgong5 and Xia-Weiwen April 18, 2024 01:18

peterbell10 reviewed Apr 24, 2024

jgong5 approved these changes Apr 25, 2024

leslie-fang-intel added a commit that referenced this pull request Apr 28, 2024

[Inductor][Quant] Change the QConv output scale name

6e0da69

ghstack-source-id: c1224c830802161e59f32f136cf2749a7b6ff2cd Pull Request resolved: #124246

leslie-fang-intel added a commit that referenced this pull request Apr 28, 2024

[Inductor][Quant] Change the QConv output scale name

89a5517

ghstack-source-id: 42151e4eeb13585a8ec29c8d7d2062d2a54b9d4f Pull Request resolved: #124246

leslie-fang-intel requested a review from peterbell10 April 28, 2024 02:55

leslie-fang-intel mentioned this pull request Apr 30, 2024

[Inductor][Quant] Fix PT2E Dynamic Quant regression #125207

Closed

peterbell10 approved these changes May 6, 2024

pytorchmergebot added the merging label May 9, 2024

pytorchmergebot added the Merged label May 9, 2024

pytorchmergebot closed this in 9ba9f7f May 9, 2024

pytorchmergebot removed the merging label May 9, 2024

pytorchmergebot added the Reverted label May 9, 2024

pytorchmergebot reopened this May 9, 2024

pytorchmergebot added the merging label May 9, 2024

pytorchmergebot closed this in c337395 May 9, 2024

pytorchmergebot removed the merging label May 9, 2024
