-
Notifications
You must be signed in to change notification settings - Fork 569
[ET-VK] Removed shared memory usage and simplied conv2d dw op shader to improve performance. #11178
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ET-VK] Removed shared memory usage and simplied conv2d dw op shader to improve performance. #11178
Conversation
…to improve performance. This diff removes shared memory usage in `conv2d_dw_output_tile.glsl` shader to improve performance. Makes sum a one dimensional array, and moves bias application before storing texel. Differential Revision: [D75499165](https://our.internmc.facebook.com/intern/diff/D75499165/) [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/11178
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 New FailureAs of commit f4fa2c3 with merge base f8a3fd8 ( NEW FAILURE - The following job has failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
…to improve performance. This diff removes shared memory usage in `conv2d_dw_output_tile.glsl` shader to improve performance. Makes sum a one dimensional array, and moves bias application before storing texel. Differential Revision: [D75499165](https://our.internmc.facebook.com/intern/diff/D75499165/) ghstack-source-id: 286577756 Pull Request resolved: #11178
This pull request was exported from Phabricator. Differential Revision: D75499165 |
… op shader to improve performance." This diff removes shared memory usage in `conv2d_dw_output_tile.glsl` shader to improve performance. Makes sum a one dimensional array, and moves bias application before storing texel. Differential Revision: [D75499165](https://our.internmc.facebook.com/intern/diff/D75499165/) [ghstack-poisoned]
…to improve performance. Pull Request resolved: #11178 This diff removes shared memory usage in `conv2d_dw_output_tile.glsl` shader to improve performance. Makes sum a one dimensional array, and moves bias application before storing texel. ghstack-source-id: 286585745 @exported-using-ghexport Differential Revision: [D75499165](https://our.internmc.facebook.com/intern/diff/D75499165/)
This pull request was exported from Phabricator. Differential Revision: D75499165 |
… op shader to improve performance." This diff removes shared memory usage in `conv2d_dw_output_tile.glsl` shader to improve performance. Makes sum a one dimensional array, and moves bias application before storing texel. Differential Revision: [D75499165](https://our.internmc.facebook.com/intern/diff/D75499165/) [ghstack-poisoned]
…to improve performance. Pull Request resolved: #11178 This diff removes shared memory usage in `conv2d_dw_output_tile.glsl` shader to improve performance. Makes sum a one dimensional array, and moves bias application before storing texel. ghstack-source-id: 286586831 @exported-using-ghexport Differential Revision: [D75499165](https://our.internmc.facebook.com/intern/diff/D75499165/)
This pull request was exported from Phabricator. Differential Revision: D75499165 |
… op shader to improve performance." This diff removes shared memory usage in `conv2d_dw_output_tile.glsl` shader to improve performance. Makes sum a one dimensional array, and moves bias application before storing texel. Differential Revision: [D75499165](https://our.internmc.facebook.com/intern/diff/D75499165/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D75499165 |
… op shader to improve performance." This diff removes shared memory usage in `conv2d_dw_output_tile.glsl` shader to improve performance. Makes sum a one dimensional array, and moves bias application before storing texel. Differential Revision: [D75499165](https://our.internmc.facebook.com/intern/diff/D75499165/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D75499165 |
e103333
into
gh/trivedivivek/102/base
…to improve performance. (#11270) This PR was created by the merge bot to help merge the original PR into the main branch. ghstack PR number: #11178 by @trivedivivek ^ Please use this as the source of truth for the PR details, comments, and reviews ghstack PR base: https://github.com/pytorch/executorch/tree/gh/trivedivivek/102/base ghstack PR head: https://github.com/pytorch/executorch/tree/gh/trivedivivek/102/head Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/trivedivivek/101/orig Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/trivedivivek/102/orig @diff-train-skip-merge --------- Co-authored-by: Vivek Trivedi <[email protected]>
Stack from ghstack (oldest at bottom):
This diff removes shared memory usage in
conv2d_dw_output_tile.glsl
shader to improve performance.Makes sum a one dimensional array, and moves bias application before storing texel.
Differential Revision: D75499165