[OPTIMIZER] Thread local reduction optimization (#2542)

Co-authored-by: Phil Tillet <phil@openai.com>
This commit is contained in:
Zahi Moudallal
2023-10-31 16:13:36 -07:00
committed by GitHub
parent 258399c114
commit 3650213218
12 changed files with 986 additions and 31 deletions

View File

@@ -148,6 +148,8 @@ def optimize_ttgir(mod, num_stages, num_warps, num_ctas, target,
if capability // 10 >= 9:
pm.add_tritongpu_fence_insertion_pass()
pm.add_tritongpu_ws_fixup_missing_attrs_pass()
pm.add_tritongpu_optimize_thread_locality_pass()
pm.add_canonicalizer_pass()
pm.run(mod)
return mod