github/ROCm - ROCm - AtHeartEngineering

mirror of https://github.com/ROCm/ROCm.git synced 2026-04-05 03:01:17 -04:00

Author	SHA1	Message	Date
Keren Zhou	74dbb2fc0a	[DOCS] Add missing ops and corresponding comments (#1699 )	2023-05-21 12:18:48 -07:00
peterbell10	deb2c71fb4	[FRONTEND] Add `tl.expand_dims` (#1614 )
This exposes `semantic.expand_dims` in the public API and builds upon it with support for expanding multiple dimensions at once. e.g.
```python
tl.expand_dims(tl.arange(0, N), (0, -1))  # shape = [1, N, 1]
```
Compared to indexing with `None`, this API is useful because the dimensions can be constexpr values rather than hard-coded into the source. As a basic example
```python
@triton.jit
def max_keepdim(value, dim):
    res = tl.max(value, dim)
    return tl.expand_dims(res, dim)
```
	2023-05-04 09:46:24 -07:00
peterbell10	0d76c4ca95	[FRONTEND] Rename `tl.reduction` -> `tl.reduce` and improve testing (#1521 ) `tl.reduction` is currently tested indirectly through the existing reduction operators, but it's good to have a direct test for the function itself. --------- Co-authored-by: Philippe Tillet <phil@openai.com>	2023-04-14 14:35:31 -07:00
Phil Tillet	0e3290963e	[DOCS] re-enabled flash attention tutorial	2023-04-13 15:49:32 -07:00
Keren Zhou	fdf1c1f2a1	[DOCS] Fix documentation workflow (#1520 ) Co-authored-by: Phil Tillet <phil@openai.com>	2023-04-13 13:49:36 -07:00
Keren Zhou	272f23457a	[DOCS] Restore the documentation workflow (#1503 ) Not sure if it works at this moment, but at least we can restore the workflow first.	2023-04-11 13:36:15 -07:00
who who who	fd0516fb90	[DOCS] Fixed typo (#1489 )	2023-04-09 16:06:34 -07:00
Xuehai Pan	5b36cb48ad	[CI][TEST] update `pre-commit` hooks and use `pre-commit` for style tests in CI (#1409 )
Ref issue:
- #1408
Changes:
- Add `.editorconfig`
- Add `pre-commit-hooks`:
```yaml
- repo: https://github.com/pre-commit/pre-commit-hooks
  rev: v4.4.0
  hooks:
    - id: check-symlinks
    - id: destroyed-symlinks
    - id: trailing-whitespace
    - id: end-of-file-fixer
    - id: check-yaml
    - id: check-toml
    - id: check-ast
    - id: check-added-large-files
    - id: check-merge-conflict
    - id: check-executables-have-shebangs
    - id: check-shebang-scripts-are-executable
    - id: detect-private-key
    - id: debug-statements
```
- Add `flake8` to `pre-commit` config and add `.flake8` file
- Use `pre-commit` for style tests in CI
- Run `pre-commit` and fix existing violations:
  - fix trailing spaces
  - fix end-of-files
  - fix mod file mode with `chmod -x`
  - run `autopep8` on existing code
  - fix `flake8` violations
	2023-03-25 14:52:16 -07:00
peterbell10	6063fccd0b	[FRONTEND][BACKEND] Lower `tl.abs` to `math::Abs{I,F}Op` (#1401 )
This generates identical PTX for floating point, but for integer types the resulting PTX is much better. For example `tl.abs` for int16 currently generates
```mlir
cvt.s32.s16 %r1, %rs2;
neg.s16 %rs4, %rs2;
setp.lt.s32 %p4, %r1, 0;
selp.b16 %rs3, %rs4, %rs2, %p4;
```
After, it becomes a single `abs.s16` instruction. This also improves LLVM's ability to optimize floats. e.g. `abs(t) * abs(t)` is optimized to `t * t` now which didn't happen before.
---------
Co-authored-by: Keren Zhou <kerenzhou@openai.com>
	2023-03-24 21:58:24 -07:00
Berke Kocaoğlu	ba91f39dbf	[DOC] Fix syntax errors, typos, formatting; increase consistency (#1357 ) This PR; - Fixes syntax errors like `.type values: dict[str, Callable[[list[Any]], Any]]` to `:type values: dict[str, Callable[[list[Any]], Any]]`, - Fixes typos, - Fixes formatting like `k ++` to ` k++`, - Increases consistency (e.g. by transforming the minority `cd dir/` to the majority `cd dir`).	2023-03-16 15:32:02 -07:00
Yen-Chen Lin	1ea08be168	[TUTORIALS] Add description for 05-layer-norm.py (#1178 ) - Add text description and equations for the tutorial. - Improve the code readability by changing variable names to align them with the equation. The actual code logic is not changed. This is a follow-up of #510. Let me know if a preview HTML is helpful for the review, I can add a link to that too.	2023-02-13 08:47:35 +00:00
Philippe Tillet	20100a7254	Merge `triton-mlir` branch - Complete rewrite of the backend from scratch (#1004 ) This PR merges the `triton-mlir` branch, in which we have been quietly rewriting the Triton backend from scratch to increase maintainability, stability and ultimately performance. Changes to the runtime are minimal, and this new version aims to remain backward-compatible with the previous commit. The legacy backend is now officially deprecated, but can still be accessed via the `legacy-backend` tag. Co-authored-by: Keren Zhou <kerenzhou@openai.com> Co-authored-by: Yan Chunwei <yanchunwei@outlook.com> Co-authored-by: goostavz <109190422+goostavz@users.noreply.github.com> Co-authored-by: Shintaro Iwasaki <siwasaki@fb.com> Co-authored-by: Yan Da <dyanab@connect.ust.hk> Co-authored-by: Jun Yang <yangjunpro@gmail.com> Co-authored-by: Ian Bearman <ianb@microsoft.com> Co-authored-by: Jason Ansel <jansel@jansel.net> Co-authored-by: Qingyi Liu <qingyil@nvidia.com> Co-authored-by: ben-zhang-609 <110140741+ben-zhang-609@users.noreply.github.com> Co-authored-by: Chenggang Zhao <lyricz@yeah.net> Co-authored-by: ben-zhang-609 <benzh609@gmail.com> Co-authored-by: dongdongl <dongdongl@nvidia.com>	2022-12-21 01:30:50 -08:00
Twizzes	ddae106c0e	[DOCS] Update installation.rst to fix windows build error (#747 )	2022-10-13 13:27:15 -07:00
Philippe Tillet	2baf333d44	[DOCS] Fixed typos (#670 )	2022-09-18 17:13:12 -07:00
Shintaro Iwasaki	c668d6596e	[DOCS] Fix spelling (#664 ) This PR applies minor spelling fix in comments and string literals to `master`. It shouldn't hurt anything.	2022-09-16 12:26:40 -07:00
Keren Zhou	d345ddf837	[DOCS] Separate atomic cas from other atomic operations since operands are very different (#559 )	2022-06-22 17:51:17 -07:00
Philippe Tillet	4941bc7001	[DOCS] Some more fixes (#455 )	2022-02-08 16:53:56 -08:00
Philippe Tillet	077d6c8ff0	[DOCS] re-activated tutorials	2022-02-08 11:42:39 -08:00
Philippe Tillet	822ddcd14b	[DOCS] Added versioning (#453 )	2022-02-08 11:28:18 -08:00
Madeleine Thompson	9801aa7b56	[DOCS] fix tutorials for v2.0 (#422 ) - Fix meta-parameter usage on tutorials. - Install tutorial dependencies on CI. - Switch from `requirements-test.txt` to `extras_require` for test dependencies, and also use it for tutorial dependencies. - Make some performance tests deterministic.	2022-01-07 12:34:38 -08:00
Philippe Tillet	b352b16567	[DOCS] Installation documentation now doesn't suggest to run regression tests	2021-09-29 18:32:33 -07:00
Min Xu	cecca90bea	[DOCS] update installation doc and add gitignore (#279 ) Co-authored-by: Min Xu <min.xu.public@gmail.com>	2021-09-12 21:11:45 -07:00
Szymon Sidor	8bedcce9be	[LANG] Added seeded random number generation - philox (#261 )	2021-09-02 22:02:40 -07:00
Philippe Tillet	f26a48a3b4	[DOCS] Various improvements (#224 ) - Added docstr for autotune, Config, heuristics - Added docstr for atomics - Hiding internal _builder argument used for built-in language primitives - Re-factor docstr to use common templates between similar functions.	2021-08-18 11:15:53 -07:00
Philippe Tillet	c45c2e9684	[DOCS] Added docs for cos/sin/sqrt (#204 )	2021-08-14 10:34:07 -07:00
Nicholas Joseph	6cd1ec3955	[DOCS] Fix formatting mistakes (#192 )	2021-08-06 12:58:43 -07:00
milesial	b7cdf670c3	[DOCS] Fix related work (#172 )	2021-08-01 11:06:37 -07:00
Reid Draper	2322d6df2a	[CI] Update `ptillet` to `openai` (#152 )	2021-07-29 11:39:50 -07:00
Philippe Tillet	41ecd96300	[DOCS] minor grammar improvements	2021-07-28 14:18:31 -07:00
Avi Radinsky	d3851d8989	[DOCS] Typo fix (#151 )	2021-07-28 12:07:12 -07:00
Philippe Tillet	4b9df06568	[CI] Bumped dev version to 1.0.1 and fixed permissions in documentation.yml (#149 )	2021-07-28 04:35:14 -07:00
Philippe Tillet	acd5e44611	[GENERAL] Some minor improvements here and there to build systems and docs (#148 )	2021-07-28 01:51:17 -07:00
Philippe Tillet	bfc0a7587d	[PYTHON] Renamed triton.core -> triton.language (#92 )	2021-07-27 12:38:49 -07:00
Philippe Tillet	29e33e50b7	[DOCS] Updates and improvements (#87 )	2021-07-27 12:38:49 -07:00
Philippe Tillet	39f4730305	Deprecation of Triton-C and Replacement by decorated Python functions (#86 ) This PR implements a major overhaul of the frontend for Triton, and replaces Triton-C by a pure Python API in which kernels are defined as @triton.jit decorated functions. The documentation and tutorials have also been updated to accommodate these changes. See documentations for more information on the new API	2021-07-27 12:38:49 -07:00
Philippe Tillet	1fdb465b71	[DOCS] Various improvements and typo fixes	2021-07-27 12:38:49 -07:00
Philippe Tillet	b352bc79e3	[CI] Changed triton-nightly to --pre triton (#78 ) The solution proposed in #77 can create namespace conflicts when triton and triton-nightly have both been pip installed. Therefore, this PR is moving nightly releases to pre-releases in the main triton index.	2021-07-27 12:38:49 -07:00
Philippe Tillet	2f80a98776	[BUILD] Added automatic nightly build releases to pip in CI; removed build-time dependence on LLVM and PyTorch (#77 ) Recently there has been more and more report about installation issues: - Installing Triton before upgrading pytorch can create some issues because Triton uses some torch headers - llvm-10-dev not available on some platform; llvm-11-dev not available on e.g. Ubuntu. absence of nightly builds This PR should fix all these issues. Some CMake tricks are used to download and install llvm at build time. Triton Python bindings were modified to remove dependence on pytorch ops. Midnight CI job added to generate binary wheels for all Triton version and update them on pypi's new triton-nightly project. This PR will also make it very easy to use LLVM forks in the future for whatever needs we have.	2021-07-27 12:38:49 -07:00
Philippe Tillet	3ad0a4d7be	[DOCS] Uncommented sphinx gallery	2021-07-27 12:38:49 -07:00
Philippe Tillet	a74919fa46	[DOCS] Improved index	2021-07-27 12:38:49 -07:00
Philippe Tillet	997e54e3bf	[DOCS] Added non-tutorial documentation pages	2021-07-27 12:38:49 -07:00
Philippe Tillet	f4fb209dad	[DOCS] Removed pip installation instruction as version on Pip is not up-to-date	2021-07-27 12:38:49 -07:00
Philippe Tillet	92242ace2c	[DOCS] Re-structured documentation hierarchy	2021-07-27 12:38:49 -07:00
Philippe Tillet	ca04da3575	[DOCS] Switched tutorials to Python and use Sphinx Gallery	2021-07-27 12:38:49 -07:00
Philippe Tillet	5172792543	[DOCS] Added .ipynb tutorials in docs	2021-07-27 12:38:49 -07:00
Philippe Tillet	0c13b8ff0e	[DOCS] Updated and improved docs (#73 )	2021-07-27 12:38:49 -07:00
Philippe Tillet	269ebc12e5	[PYTHON][TESTS][DOC] Various improvement of the API and code quality:
* Simplified `triton.kernel` API to achieve lower latency:
  > .data_ptr() must now be passed as kernel argument. No more implicit conversion from torch.tensor
  > compilation options are now constant attributes, i.e., opt.d('VAR') becomes opt.VAR
  > torch.device must now be passed explicitly to triton.kernel (no longer inferred from torch.tensor arguments)
* C++ tests moved to `python/tests/`
* C++ tutorial created in `tutorials/`
* Python tutorial created in python/tutorials/
* Version changed to 1.0alpha
* No longer copying C++ headers into the Python package
* added python/triton/ops/ package for pre-written Triton ops
	2021-07-27 12:38:48 -07:00
Philippe Tillet	32d615f8f8	[DOCS] Now specifying pip command in installation.rst	2021-07-27 12:38:48 -07:00
jack-willturner	180ed26b61	[DOCS] Transposition fix	2021-07-27 12:38:48 -07:00
jack-willturner	32819dea51	[DOCS] Matmul and vecadd working examples	2021-07-27 12:38:48 -07:00

57 Commits