2460 Commits

Author SHA1 Message Date
yugang-amd
7d57bc520f Fix broken link for AMDGPU installer (#4989) 2025-07-02 10:05:48 -04:00
Pratik Basyal
f8d4957944 KMD UMD support footnote update ROCm 633 (#4972) 2025-06-26 15:33:59 -04:00
Peter Park
306d511aac [docs/6.3.3] Link to specific ROCm/vLLM readme in inference/vllm-benchmark.rst (#4922)
* point to specific rocm/vllm readme

* fix previous version url
2025-06-13 13:49:18 -04:00
Peter Park
177f5c4461 Merge pull request #4711 from peterjunpark/docs/6.3.3-vllm-benchmark-ver
[docs/6.3.3] Use full docker image tag instead of `instinct_main` alias in vllm-benchmark.rst
2025-05-06 12:01:45 -04:00
Pratik Basyal
b92b122236 6.1.5 column added (#4646) 2025-04-17 11:50:05 -04:00
Istvan Kiss
a479555bd3 Fix broken torchserve link 2025-04-08 17:22:48 +02:00
Peter Park
8430bd089b Merge pull request #4561 from peterjunpark/docs/6.3.3
Add documentation fixes to docs/6.3.3
2025-04-03 13:16:25 -04:00
Dominic Widdows
86d8bac331 Update workload.rst with small export fix (#4425)
Tiny fix that removes the "export" directive.
`export HIP_FORCE_DEV_KERNARG=1 hipblaslt-bench ...`
leads to
bash: export: `hipblaslt-bench': not a valid identifier

whereas just starting with HIP_FORCE_DEV_KERNARG=1 passes this env var to the hipblaslt-bench process, which I think is the intention here.

(cherry picked from commit 715cce53de)
2025-04-03 13:11:13 -04:00
Peter Park
214bc23817 fix link to CLR license (#4560)
(cherry picked from commit fdf24a9c40)
2025-04-03 13:11:13 -04:00
Alex Xu
6634ef8f62 update rocm-docs-core 2025-04-02 15:46:41 -04:00
Peter Park
928a972ca2 Merge pull request #4552 from peterjunpark/docs/6.3.3
Fix links in documentation
2025-04-01 15:59:47 -04:00
Peter Park
f1bbe0cef5 Fix more links in documentation (#4551)
* fix vllm engine args link

* remove RDNA subtree under system optimization in toc

* fix RDNA 2 architecture PDF link

* fix CLR LICENSE.txt link

* fix rocPyDecode license link

(cherry picked from commit ea66bf386a)
2025-04-01 15:57:45 -04:00
Peter Park
51050661af Fix links in documentation
(cherry picked from commit ac2c5e72d4)
2025-04-01 15:40:28 -04:00
Peter Park
00fa68cf1a Add MaxText training Docker doc (#4543)
Add MaxText training Docker doc

(cherry picked from commit 424e6148bd)
2025-03-28 11:40:48 -04:00
Peter Park
51be44e941 Add MaxText training Docker doc
Add MaxText training Docker doc

(cherry picked from commit 424e6148bd)
2025-03-28 11:26:23 -04:00
Pratik Basyal
bf0fcd370c ROCprofiler-SDK naming and upcoming changes corrected (#4541)
* ROCprofiler-SDK naming corrected

* Swati's feedback added

Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com>

---------

Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com>
2025-03-28 10:09:12 -04:00
Peter Park
c364eb3b53 Merge pull request #4536 from peterjunpark/docs/6.3.3
Improve "tuning guides" landing page (#4504)
2025-03-27 15:35:03 -04:00
Peter Park
5ce76d3b68 Improve "tuning guides" landing page (#4504)
* Improve "tuning guides" landing page

* Update docs/how-to/gpu-performance/mi300x.rst

Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>

* Update docs/how-to/gpu-performance/mi300x.rst

Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>

* change tuning to optimization

---------

Co-authored-by: Pratik Basyal <pratik.basyal@amd.com>
(cherry picked from commit 58d42ec50b)
2025-03-27 15:01:56 -04:00
Pratik Basyal
76796e56e5 AMD GPU Docs System optimization migration changes in ROCm 6.3.3 Docs (#4534)
* AMD GPU Docs System optimization migration changes in ROCm Docs (#296)

* System optimization migration changes in ROCm

* Linting issue fixed

* Linking corrected

* Minor change

* Link updated to Instinct.docs.amd.com

* ROCm docs grid updated by removing IOMMU.rst, pcie-atomics, and oversubscription pages

* Files removed and reference fixed

* Reference text updated

* GPU atomics removed
2025-03-26 14:55:13 -04:00
Peter Park
571578a8bd Merge pull request #4521 from peterjunpark/docs/6.3.3
[6.3.3] Update Megatron-LM doc for 25.4
2025-03-21 17:12:01 -04:00
Peter Park
2415f36078 fix pre version 2025-03-21 16:52:38 -04:00
Peter Park
7df0804c06 Update Megatron-LM doc for 25.4 (#4520)
* update megatron-lm doc

* update 'previous versions'

* add missing space

* update docker pull tag

* Update options and docker pull tag

* Add performance measurements link to megatron-lm doc

* fix previous versions table

* words

* Simplify system validation section

* minor fixes

* fix prev versions tbl

(cherry picked from commit 8f359da39e)
2025-03-21 16:50:38 -04:00
Alex Xu
820db2c544 add 6.1.5 to version list
(cherry picked from commit 388f18cf36)
2025-03-14 11:00:23 -04:00
Peter Park
e7484f25e7 Merge pull request #4495 from peterjunpark/docs/6.3.3
PyTorch training Docker update 25.4 (#4482)
2025-03-13 14:02:25 -04:00
Peter Park
061ae36a84 PyTorch training Docker update 25.4 (#4482)
* remove orphan tag

* add hugging face PEFT

* update "previous versions"

* data == ultrachat 200k

* fix "llama 2"

* add ultrachat to wordlist

* fix previous versions table

* add performance measurements

* add mi325x

* fix prev version

* change 'validation' to 'testing'

* fix dir name

* fix backtick

(cherry picked from commit 2fca094531)
2025-03-13 13:40:53 -04:00
Peter Park
7fad99073b Merge pull request #4493 from peterjunpark/docs/6.3.3
[docs/6.3.3] Update vLLM performance Docker docs (#4491)
2025-03-13 10:15:39 -04:00
Peter Park
c6ceff11f0 Update vLLM performance Docker docs (#4491)
* add links to performance results

words

* change "performance validation" to "performance testing"

* update vLLM docker 3/11

* add previous versions

add previous versions

* fix llama 3.1 8b model repo name

* words

(cherry picked from commit 9b2ce2b634)
2025-03-13 10:05:11 -04:00
Peter Park
acc7a23265 Merge pull request #4486 from peterjunpark/docs/6.3.3
[docs/6.3.3] Fix "VGPR" typo in workload tuning guide (#4484)
2025-03-12 16:19:31 -04:00
Peter Park
4a14260695 Fix "VGPR" typo in workload tuning guide (#4484)
* Fix "VGPR" typo in workload tuning guide

* fix wording
2025-03-12 16:17:17 -04:00
Istvan Kiss
f747943d36 Replace "-" on precision support page 2025-03-10 13:41:12 +01:00
Pratik Basyal
3d59247e7a Content for modprobe added to MI300X system optimization (#4434) (#4461)
Added content for modprobe
2025-03-07 15:05:42 -05:00
Peter Park
d5b1fd4389 Merge pull request #4448 from peterjunpark/docs/6.3.3
Add docs fixes to 6.3.3
2025-03-05 09:18:50 -05:00
Adel Johar
a05d9e2fa0 Docs: use custom directive to reference library versions
(cherry picked from commit cd85ccd539)
2025-03-05 09:02:15 -05:00
Peter Park
7ddb10a0fc Fix applies to linux tag for training benchmark docker pages (#4446)
(cherry picked from commit fa0e212906)
2025-03-05 09:02:08 -05:00
Peter Park
63f9bc30bd Merge pull request #4432 from peterjunpark/docs/6.3.3
Update LLM inference performance validation on AMD Instinct MI300X gu…
2025-02-28 13:02:56 -05:00
Peter Park
b174ab767e Update LLM inference performance validation on AMD Instinct MI300X guide to filter by desired model (#4424)
* WIP

(cherry picked from commit a06a5b5b959a9425e7384fb58b88c3716f380e48)

rm unneeded files

(cherry picked from commit f1d0c00056a83299bdea74a43cd17454999cf2d8)

* add sphinxcontrib.datatemplates

(cherry picked from commit d056b93a325d87b81f54f70c6eb4ae78f4fb0bc1)

* add template

(cherry picked from commit 0691d59f0a1efbda7908762b7a906e30a65c0ee1)

fix template

(cherry picked from commit 01e4bea5522aa5deeaade58c105ff850f449df8b)

WIP

(cherry picked from commit 4d8daf7445e7be92cd9ee1d39dff564bd8de41f4)

WIP

(cherry picked from commit 9eefd1f5833bc4dc8de9d777ff65a5fe5f826dbd)

update models yaml schema

(cherry picked from commit a5f0fc1e6cc51104dc2d42029bfcf3eea276d270)

add model groups functionality

(cherry picked from commit 13f49f96dd3e5a160d37c52e48a4fbcccdcf4f9e)

add selector headings and fix template

(cherry picked from commit 35f7f2314bcf74b4fd0a8ca10aaabf0de7063bb0)

update template

(cherry picked from commit 9e2dcfe0c7f6e7c2c685866ea83375fbacbc5032)

fix

(cherry picked from commit be51e32791550ddc21785effccb889228394b242)

use classes instead of data tags

(cherry picked from commit cd52d68c504f7e7435d156ae70cf4bde1dfe703e)

update template

(cherry picked from commit 9ed89fee6874b39ee3535fbde54a0a59f346ea2b)

clean up extra wip files

(cherry picked from commit a9f965a104baa966c184054638e935b011526278)

update wordlist

(cherry picked from commit f783656814e896aedd21acd1c8c87b4700c14469)

remove unused template

(cherry picked from commit cac894bd9c2b1262c9c006e5fddbcb742dc6d882)

improve script

(cherry picked from commit ca20ffd4922916616e0924d625652a815f27c35f)

fix template

(cherry picked from commit 752c61fda856fd5b244734636c036c8877e823b9)

fix standalone benchmark output path in template

(cherry picked from commit d8c04203b5ec0f6c2e2307f7890304a3dc5687be)

fix toc

(cherry picked from commit 8df42faf53488ef29f5a263d25032f3d35cd58ed)

update script to prevent flash of unstyled content

import a11y

(cherry picked from commit 46c852717f223a1d8744fab035807cebab4c5404)

add tabindex to wordlist

(cherry picked from commit 11492593f9692f5453045e7ec52c8f8ae9624ae9)

text

update script

* remove unused config option

* reorganize assets

* fix linting warning

* move js from data/ to extension/
2025-02-28 12:40:43 -05:00
Istvan Kiss
f75ef9e2c1 Fix white paper links 2025-02-28 15:03:59 +01:00
Adel Johar
e5bf76ead1 Merge pull request #4422 from ROCm/docs_6.3.3_update_fix_arch
Merge pull request #4393 from ROCm/docs_fix_arch
2025-02-28 14:09:20 +01:00
Adel Johar
5393e90a8e Merge pull request #4393 from ROCm/docs_fix_arch
Docs: Fix gpu-arch-spec.rst
2025-02-27 16:35:33 +01:00
Peter Park
fbc2815223 Merge pull request #4417 from peterjunpark/docs/6.3.3
[docs/6.3.3] Update PT and TF docker inventories in compatibility docs (#4415)
2025-02-26 09:28:30 -05:00
Peter Park
2b96a37b08 Fix tensorflow-rocm repo.radeon.com url 2025-02-25 12:58:02 -05:00
Peter Park
1e5ad14d86 Update PT and TF docker inventories in compatibility docs (#4415)
* update PyTorch docker inventories in compatibility doc

* update TF docker inventories in compatibility doc

* update text to rocm 6.3.3

(cherry picked from commit 934767322b)
2025-02-25 12:38:25 -05:00
Peter Park
f9d6bd4db8 Merge pull request #4410 from peterjunpark/docs/6.3.3
[docs/6.3.3] fix tab sync and nested tab Megatron-LM doc (#4409)
2025-02-21 17:23:06 -05:00
Peter Park
23e78c8d55 fix tab sync and nested tab Megatron-LM doc (#4409)
(cherry picked from commit 1ea1c5c6e0)
2025-02-21 17:20:15 -05:00
Peter Park
0edd31bde6 Merge pull request #4408 from peterjunpark/docs/6.3.3
Update docs on Megatron-LM and PyTorch training Dockers (#4407)
2025-02-21 13:29:10 -05:00
Peter Park
4af488e27d Update docs on Megatron-LM and PyTorch training Dockers (#4407)
* Update Megatron-LM and PyTorch Training Docker docs

Also restructure TOC

* Apply suggestions from code review

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

update "start training" text

Apply suggestions from code review

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

update conf.py

fix spacing

fix branding issue

add disable numa

reorg

remove extra text

(cherry picked from commit 389fa7071b)
2025-02-21 13:10:42 -05:00
Parag Bhandari
7ae7046301 Merge branch 'roc-6.3.x' into docs/6.3.3 2025-02-19 17:25:14 -05:00
Parag Bhandari
358092386e Merge branch 'develop' into roc-6.3.x rocm-6.3.3 2025-02-19 17:25:03 -05:00
Parag Bhandari
e071738908 Merge branch 'roc-6.3.x' into docs/6.3.3 2025-02-19 17:22:38 -05:00
pbhandar-amd
cd79403931 Update vllm-benchmark.rst 2025-02-19 17:21:29 -05:00