[AMD] Reland instruction scheduling hint changes #4940
Merged
Commits on Oct 29, 2024
-
Revert "[AMD] revert optimizations (triton-lang#4919)"
This reverts commit 93de426.
Commit 4b50c48
-
Commit d53c499
-
[AMD] Fixed the bug that resulted in reverting PR #4919
Replaced the template-based implementation of `rewindUnaryOps` in `SchedInstructions.cpp` with regular for-loops. The new implementation is more robust and can handle other unary ops automatically.
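The loop-based idea can be illustrated with a toy model. This is only a sketch under stated assumptions, not the code from `SchedInstructions.cpp`: the `Op` struct is a hypothetical stand-in for an IR operation (it is not the MLIR API), and the signature of `rewindUnaryOps` here is invented for illustration. The point it shows is that a plain loop can step backwards through any single-operand op without listing the op types up front.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for an IR operation; this is not the MLIR API.
struct Op {
  std::string name;
  std::vector<const Op *> operands; // defining op of each operand
};

// Walk backwards from `start` through single-operand (unary) ops and
// return the first op named `target`. Returns nullptr if the chain ends
// (no operands) or branches (more than one operand) before reaching it.
const Op *rewindUnaryOps(const Op *start, const std::string &target) {
  for (const Op *cur = start; cur != nullptr;) {
    if (cur->name == target)
      return cur;
    if (cur->operands.size() != 1)
      return nullptr; // not a unary op, so stop rewinding
    cur = cur->operands.front();
  }
  return nullptr;
}
```

Because the loop only inspects the operand count, a newly introduced unary op in the chain needs no extra case, which matches the robustness claim in the commit message.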
Commit ce02968
-
Commit 2aecafa
-
[AMD] Fixed bug in `setNumGeneratedGlobalLoads`
* add a test for the presence of the `OpIdx` attribute
Commit 088fbd9
-
[AMD] Added an additional check to `createCKV3Schedule`
The extra check tests whether the data are loaded from HBM using `buffer_load` instructions. The CKV3 scheduling is skipped if the check fails.
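The guard described above might look roughly like the following. This is a simplified sketch, not the actual `createCKV3Schedule` code: the `Schedule` enum, both function names, and the representation of loads as strings are hypothetical, and it assumes the check requires every global load to be a `buffer_load` (the commit message does not say whether "all" or "any" is meant).

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical schedule selector; names are illustrative only.
enum class Schedule { Default, CKV3 };

// Assumption: the check passes only if there is at least one global load
// and every one of them is a buffer_load.
bool usesOnlyBufferLoads(const std::vector<std::string> &globalLoads) {
  return !globalLoads.empty() &&
         std::all_of(globalLoads.begin(), globalLoads.end(),
                     [](const std::string &op) { return op == "buffer_load"; });
}

// Fall back to the default schedule whenever the check fails.
Schedule pickSchedule(const std::vector<std::string> &globalLoads) {
  return usesOnlyBufferLoads(globalLoads) ? Schedule::CKV3 : Schedule::Default;
}
```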
Commit c72bfb9
-
Commit 68e7fac
-
Commit dd7d2c6
-
Commit a2f8874
-
Commit ae8c3c8
-
Commit 2504666
Commits on Oct 31, 2024
-
Commit 63a552f