ROCm/docs/python-api at 6063fccd0bced43c7acb970bd7994192d2963e40 - ROCm

mirror of https://github.com/ROCm/ROCm.git synced 2026-02-21 03:00:39 -05:00

Files

peterbell10 6063fccd0b [FRONTEND][BACKEND] Lower tl.abs to math::Abs{I,F}Op (#1401)

This generates identical PTX for floating-point types, but for integer
types the resulting PTX is much shorter. For example, `tl.abs` on int16
currently generates:

```ptx
  cvt.s32.s16 %r1, %rs2;
  neg.s16     %rs4, %rs2;
  setp.lt.s32 %p4, %r1, 0;
  selp.b16    %rs3, %rs4, %rs2, %p4;
```

After this change, it becomes a single `abs.s16` instruction.
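
To see why the two forms agree, the four-instruction sequence can be modelled in plain Python. This is only an illustrative sketch, not Triton or PTX code: the helper names (`to_s16`, `abs_via_select`, `abs_single`) are hypothetical, and the most negative int16 value is left out of the check because the commit message does not discuss how it is handled.

```python
def to_s16(x: int) -> int:
    """Wrap an integer into the signed 16-bit range, like a 16-bit register."""
    x &= 0xFFFF
    return x - 0x10000 if x & 0x8000 else x


def abs_via_select(rs2: int) -> int:
    """Model of the old sequence: widen, negate, compare, select."""
    r1 = rs2                    # cvt.s32.s16: sign-extend (value unchanged)
    rs4 = to_s16(-rs2)          # neg.s16
    p4 = r1 < 0                 # setp.lt.s32
    return rs4 if p4 else rs2   # selp.b16


def abs_single(rs2: int) -> int:
    """Model of the new single-instruction form (abs.s16)."""
    return to_s16(abs(rs2))


# Compare both models over every int16 value except -32768 (assumption: excluded).
mismatches = [v for v in range(-32767, 32768)
              if abs_via_select(v) != abs_single(v)]
```

An empty `mismatches` list indicates that, under this model, the select-based sequence and the single instruction compute the same result.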

This also improves LLVM's ability to optimize floating-point code. For
example, `abs(t) * abs(t)` is now optimized to `t * t`, which did not
happen before.
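
The rewrite is sound because the sign bits cancel when a value is multiplied by itself. A quick check in plain Python (not Triton) is sketched below; `samples` and `same_float` are names made up for illustration, and NaN is omitted because it never compares equal to itself.

```python
import math

# Hypothetical sample inputs, including signed zeros, a subnormal, and infinities.
samples = [0.0, -0.0, 1.5, -1.5, 1e-310, -1e308, math.inf, -math.inf]


def same_float(a: float, b: float) -> bool:
    """Equal values that also agree on the sign (distinguishes 0.0 from -0.0)."""
    return a == b and math.copysign(1.0, a) == math.copysign(1.0, b)


# abs(t) * abs(t) should match t * t for every sample.
all_equal = all(same_float(abs(t) * abs(t), t * t) for t in samples)
```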

---------

Co-authored-by: Keren Zhou <kerenzhou@openai.com>

2023-03-24 21:58:24 -07:00

| File | Last commit | Date |
| --- | --- | --- |
| triton.language.rst | [FRONTEND][BACKEND] Lower tl.abs to math::Abs{I,F}Op (#1401) | 2023-03-24 21:58:24 -07:00 |
| triton.rst | [DOC] Fix syntax errors, typos, formatting; increase consistency (#1357) | 2023-03-16 15:32:02 -07:00 |
| triton.testing.rst | [DOC] Fix syntax errors, typos, formatting; increase consistency (#1357) | 2023-03-16 15:32:02 -07:00 |