Commit Graph

2619 Commits

Author SHA1 Message Date
yugang-amd
fef80c324d Bump up requirement version (#4804)
* bump up requirement version

* update requirements.txt

* Use Python 3.10
2025-05-27 10:58:19 -04:00
dependabot[bot]
5ffc336620 Build(deps): Bump tornado from 6.4.2 to 6.5.1 in /docs/sphinx (#4796)
Bumps [tornado](https://github.com/tornadoweb/tornado) from 6.4.2 to 6.5.1.
- [Changelog](https://github.com/tornadoweb/tornado/blob/master/docs/releases.rst)
- [Commits](https://github.com/tornadoweb/tornado/compare/v6.4.2...v6.5.1)

---
updated-dependencies:
- dependency-name: tornado
  dependency-version: 6.5.1
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-05-26 17:29:40 -06:00
dependabot[bot]
a6c5fc4a2c Build(deps): Bump rocm-docs-core from 1.18.2 to 1.19.0 in /docs/sphinx (#4806)
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.18.2 to 1.19.0.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.18.2...v1.19.0)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-version: 1.19.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-05-26 17:23:42 -06:00
yugang-amd
32939eed40 Merge pull request #4802 from yugang-amd/link-fix
fix broken links
2025-05-26 14:42:47 -04:00
yugang-amd
454331ba59 fix links 2025-05-26 13:50:25 -04:00
yugang-amd
637174f644 Merge pull request #4791 from yugang-amd/wavefront-size
Update wavefront size
2025-05-26 10:10:40 -04:00
yugang-amd
934ce63840 fix typo 2025-05-26 10:02:14 -04:00
yugang-amd
230b01565f update wavefront size 2025-05-22 17:49:08 -04:00
Peter Park
505041d90a Document specs for Radeon RX 9070 + small fix in megatron-lm doc (#4780)
* Document specs for Radeon RX 9070

* fix wrong version in megatron-lm.rst
2025-05-22 16:28:17 -04:00
Daniel Su
3ba79f9431 Ex CI: use clr amd-mainline temporarily (#4781) 2025-05-22 16:04:44 -04:00
yugang-amd
714b5395f8 Merge pull request #4773 from yugang-amd/rocshmem-xref
update rocSHMEM xrefs
2025-05-22 15:13:50 -04:00
Peter Park
9ed65a81c4 Add Megatron-LM benchmark doc 5/2 (#4778)
* reorg files

* add tabs

* update template

* update template

* update wordlist and toc

* add previous version to doc

* add selector paragraph

* update wordlist.txt
2025-05-22 14:28:18 -04:00
Peter Park
6d9f430c70 fix 9070 XT gfx target in gpu-arch-specs table (#4775) 2025-05-22 12:11:18 -04:00
yugang-amd
7697298f5d update rocSHMEM xrefs 2025-05-22 10:41:25 -04:00
Daniel Su
854bd268bf Ex CI: enable downstream job triggers for PRIMs and RANDs (#4761)
* Remove sparseCheckout param

* Add support for downloading same-pipeline-builds for monorepo chain builds

* Make local-artifact step names more informative

* Use componentName param for artifact filenames

* Enable chain downstream triggers for PRIMs & RANDs

* Set preTargetFilter for tests' local-artifact-download call

* Set checkout: none for test jobs

* Exclude failing rocThrust scan.hip test

* Matrixize downstream jobs
2025-05-22 10:24:06 -04:00
Peter Park
f1f2b3cac2 remove HIP upcoming changes reference link (#4771) 2025-05-21 12:09:35 -07:00
alexxu-amd
e265ee53ba Merge pull request #4766 from ROCm/alexxu12/tool-update-641
Update tools & README for 6.4.1
2025-05-21 15:02:27 -04:00
Peter Park
3f56efcb3b Update documented known issues in 6.4.1 rel (#4765)
* add ROCm SMI uninstallation note

* words

* clarify amd-smi note

* add links to gh issues in known issues section

* words
2025-05-21 15:01:35 -04:00
Peter Park
98fde2bff1 Add RDNA4 OS support note in RELEASE.md and compat matrix (#4764)
* fix vllm link in release.md

* add RDNA4 note in compat matrix

* update hipcc github url to specific path in llvm-project repo

* remove non-existant HIP upcoming changes reference

* remove non-existant resolved issues internal link

* fix hip upcoming changes url

* duplicate amd smi known issue
2025-05-21 14:23:48 -04:00
Peter Park
0e8b745266 Fix toc (#4762) 2025-05-21 12:26:30 -04:00
alexxu-amd
02a8a6e5df Merge pull request #4760 from ROCm/sync-develop-from-internal
Sync develop from internal for 6.4.1 GA
2025-05-21 12:21:42 -04:00
alexxu-amd
f118318f98 Merge pull request #414 from ROCm/sync-develop-from-external
Sync develop from external
2025-05-21 12:00:22 -04:00
alexxu-amd
47e4ec8b3a Merge branch 'develop' into sync-develop-from-external 2025-05-21 11:17:10 -04:00
Alex Xu
58a62bc00e Merge remote-tracking branch 'external/develop' into sync-develop-from-external 2025-05-21 11:16:31 -04:00
Peter Park
56d258592d Finalize 6.4.1 release notes (#408)
* update URLs for production

* update historical changelog

* remove deep learning compat section from doc highlights

* update changelog.md

* Update CHANGELOG.md

Co-authored-by: yugang-amd <yugang.wang@amd.com>

* Update CHANGELOG.md

Co-authored-by: yugang-amd <yugang.wang@amd.com>

---------

Co-authored-by: yugang-amd <yugang.wang@amd.com>
2025-05-21 11:15:44 -04:00
Peter Park
8dc7016405 Add Radeon AI PRO R9700, Radeon RX 9070 XT, RX 9060 XT to gpu-arch-specs (#411)
* add Radeon AI PRO R7900, Radeon RX 9070 XT, Radeon RX 9060 XT to gpu-arch-specs.rst

* update compat matrices

* fix spacing in historical compat csv file
2025-05-21 11:04:46 -04:00
alexxu-amd
8686bca1b4 Merge pull request #412 from ROCm/alexxu-amd-patch-1
Add 6.4.1 to version list
2025-05-21 11:00:11 -04:00
alexxu-amd
82d15a09f5 Merge branch 'develop' into alexxu-amd-patch-1 2025-05-21 10:58:27 -04:00
Peter Park
42e0c0cfba [6.4.1] Add RDNA4 GPUs to docs (#410)
* add Radeon AI PRO R9700 SD2.1 known issue

* Add hardware support note for R9700, 9070 XT, 9060 XT

* words

* Add links to new 9000-series GPUs
2025-05-21 10:27:43 -04:00
alexxu-amd
ddcad120a2 Update versions.md 2025-05-21 09:52:05 -04:00
Peter Park
b8892f2c33 add amd-smi ras --cper known issue (#409) 2025-05-20 16:36:33 -04:00
Peter Park
ab384a1b6e [6.4.1] Add RCCL-UnitTests known issue 2025-05-20 07:56:50 -04:00
Peter Park
27db6ef0b3 add rccl known issue to stack known issues (#404)
* add rccl known issue to stack known issue

* remove bullet
2025-05-16 12:32:21 -04:00
Istvan Kiss
707d6c022f Merge pull request #388 from ROCm/rn_precision_sup
Precision support page update release note revert
2025-05-16 11:33:07 +02:00
Istvan Kiss
3bafe307bf Page will be not published 2025-05-16 11:29:26 +02:00
Peter Park
ca5d0d0000 [6.4.1] update llvm-project version and add RCCL known issue (#401)
* update llvm-project version

* add RCCL known issue
2025-05-15 16:20:59 -04:00
randyh62
e35efbae09 Update RELEASE.md (#402)
Update Added entry, add Changed and Optimized entries
2025-05-15 13:14:18 -07:00
Daniel Su
0d7846fbab Ex CI: enable rocPRIM sparse checkout (#4743) 2025-05-15 14:39:28 -04:00
Peter Park
92a9c88fe3 [6.4.1 release notes] Change links to internal for external review (#400)
* change installer links to internal

* change smi changelog links to internal
2025-05-15 11:48:13 -04:00
Peter Park
2a3c2fe5aa Update 6.4.1 release notes (#399)
* remove extra file

* Update wording in RELEASE.md

* Update RELEASE.md

Co-authored-by: Jeffrey Novotny <jnovotny@amd.com>

* update amdsmi changelog

* install -> installed

t

---------

Co-authored-by: Jeffrey Novotny <jnovotny@amd.com>
2025-05-14 15:41:12 -04:00
Daniel Su
156917e15d Ex CI: set absolute cmakeSourceDir paths (#4741) 2025-05-14 11:03:57 -04:00
Daniel Su
d7a9280008 Ex CI: set cmakeSourceDir for all components that set cmakeBuildDir (#4738) 2025-05-13 17:15:54 -04:00
Daniel Su
c1825ba41c Ex CI: skip docker creation on gfx942 (#4735) 2025-05-13 17:05:02 -04:00
Peter Park
0a77e7b3a5 docs: Add system health check doc under ROCm for AI (#4736)
* add initial draft

* add to toc and install page

* update wording

* improve documentation structure

* resturcture and expand content

* add to training section

* add to conf.py article_pages

* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* update wordlist.txt

* Update docs/how-to/rocm-for-ai/includes/system-health-benchmarks.rst

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

* inference --> AI workloads

* udpate toc

* update article_pages in conf.py

* Update system validation notes in training docs

* fix links in prerequisite-system-validation

* wording

* add note

* consistency

* remove extra files

* fix links

* add links to training index page

---------

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>
2025-05-13 15:54:48 -04:00
Daniel Su
a940f3f090 Ex CI: add sparse option to checkout template (#4701)
* Ex CI: add sparse option to checkout template

* replace Pipeline.Workspace with Agent.BuildDirectory for consistency
2025-05-13 14:46:48 -04:00
Daniel Su
95415d5e70 Ex CI: remove firstRenderDeviceAccess demand from all components (#4734) 2025-05-13 13:08:27 -04:00
Istvan Kiss
d1772b9ca3 Fix unsupported section structure on JAX (#4733) 2025-05-13 17:39:25 +02:00
Istvan Kiss
f65e1412df Fix compatibility list (#4731) 2025-05-13 16:26:36 +02:00
Istvan Kiss
ea1072b11d JAX compatibility page upate (#4727) 2025-05-08 19:31:13 +02:00
Peter Park
90a651d2b6 Merge pull request #4725 from peterjunpark/docs/quark-model-quantization
Add quark in model-quantization.rst
2025-05-08 10:34:39 -04:00