Commit Graph

1589 Commits

Author SHA1 Message Date
Sam Wu
17f12a11e7 Merge pull request #3234 from WBobby/roc-6.1.2-manifest
Update manifest file for ROCm6.1.2
rocm-6.1.2
2024-06-04 14:50:14 -06:00
Wang, Yanyao
b2f0f0acdf Update manifest file for ROCm6.1.2 2024-06-04 15:39:16 -05:00
Sam Wu
a11c0512e1 Merge branch 'docs/6.1.2' into roc-6.1.x 2024-06-04 14:38:59 -06:00
Sam Wu
eec71da8dd Merge pull request #3232 from ROCm/develop
Merge develop into roc-6.1.x
2024-06-04 14:36:34 -06:00
Sam Wu
39891fe185 Sync develop branch 2024-06-04 14:32:36 -06:00
Peter Park
14ee171649 Add OS support note (#91) 2024-06-04 14:11:01 -04:00
Peter Park
e7bff21d3e Add final fixes to 6.1.2 release notes and changelog (#90)
* Regenerate changelog

* Add component changelogs and known issue

Fix RELEASE.md headings

Update pub datestamp for 6.1.2

Add AMDSMI and ROCm SMI to 6.1.2 template

Add rccl and rocBLAS

Update intro blurb and headings

Add ROCm SMI fix

Add missed heading to AMDSMI

Update datestamp and release version number

Update version and release number

Add known issue re: MI300X error detection

Words

Add issue link

Rm GitHub issue link

Move known issue down

Update ki wording

Remove "this issue has been investigated ... " from known issue

Fix changelog h1

* Reorg known issue, upcoming changes, remove rocDecode tested configurations

* Add fixes from review

* Add fixed issue link

* Fix heading

* Remove known issue
2024-06-04 12:23:07 -04:00
Peter Park
6abe5b50a2 Merge pull request #3229 from peterjunpark/docs/6.1.2
docs/6.1.2: Update the links for rocminfo and rocm-bandwidth-test (#3213)
2024-06-04 08:12:15 -07:00
amitkumar-amd
df864f8f79 Update the links for rocminfo and rocm-bandwidth-test (#3213)
* Update the links for rocminfo and rocm-bandwidth-test

* Update the links for rocminfo and rocm-bandwidth-test

* Update the links for rocminfo and rocm-bandwidth-test

* Update links to intersphinx links

---------

Co-authored-by: Peter Jun Park <peter.park@amd.com>
2024-06-04 11:00:52 -04:00
amitkumar-amd
7290ce9030 Update the links for rocminfo and rocm-bandwidth-test (#3213)
* Update the links for rocminfo and rocm-bandwidth-test

* Update the links for rocminfo and rocm-bandwidth-test

* Update the links for rocminfo and rocm-bandwidth-test

* Update links to intersphinx links

---------

Co-authored-by: Peter Jun Park <peter.park@amd.com>
2024-06-04 10:59:22 -04:00
Peter Park
d6d18d7cd4 Merge pull request #3226 from peterjunpark/docs/6.1.2
docs/6.1.2: Add "Fine Tuning LLMs" how to guide (#3124)
2024-06-04 07:02:36 -07:00
Peter Park
30f10e0145 Update fine-tuning guide: title, improve readibility in code blocks, fix typos (#3222)
* Fix typo

* Add torchtune link

* Add newlines before comments in code blocks for readability

* Update title
2024-06-03 22:15:36 -04:00
Peter Park
1e55e01af3 Add "Fine Tuning LLMs" how to guide (#3124)
* Add Fine Tuning LLMs how to guide

* Reorg and refactor Fine-tuning LLMs with ROCm

Update index and headings

Fix formatting and update toc

Split out content from index to overview.rst

Add metadata

Clean up overview

Add inference sections, fix rst errors, clean up single-gpu-fine-tuning

Combine fine-tuning and inference guides

Fix some links and formatting

Update toc and add formatting fixes

Add ck kernel fusion content

Update toc

Clean up model quantization and acceleration

Add CK images

Clean up profiling

Update triton kernel performance optimization

Update llm inference frameworks guide

Disable automatic number of figures and tables in Sphinx conf

Change tabs to spaces

Change heading to end with -ing

Add link fixes and heading updates

Add rocprof/Omniperf/Omnitrace section

Update profiling and debugging guide

Add formatting fixes

Satisfy spellcheck

Fix words

Delete unused file

Finish overview

Clean up first 4 sections

Multi-gpu fine-tuning guide: slight fixes

Update toc

Remove tabs

Formatting fixes

* Minor wording updates

* Add some clean-up

* Update profiling and debugging gudie

* Fix Omnitrace link

* Update ck kernel fusion with latest

* Update CK formatting

* Fix perfetto link syntax

* Fix typos and add blurbs

* Add fixes to Triton optimization doc

* Tabify saving adapters / models section

* Fix linting errors - spellcheck

Fix spelling and grammar

Satisfy linter

Update wording in profiling guide

Add fixes to satisfy linter

More fixes for linting in Triton guide

More linting fixes

Spellcheck in CK guide

* Improve triton guide

Fix linting errors and optics

* Add occupancy / vgpr table

Change some wording

* Re-add tunableop

* Add missing indent in _toc.yml

* Remove ckProfiler references

* Add links to resources

* Add refs in CK optimization guide

* Rename files and fix internal links

* Organize tuning guides

Reorg triton

* Add compute unit diagram

* Remove AutoAWQ

* Add higher res image for Perfetto trace example

* Update link text

* Update fig nums

* Update some formatting

* Update "Inductor"

* Change "Inductor" to TorchInductor

* Add link to official TorchInductor docs
2024-06-03 22:15:13 -04:00
Peter Park
9a347aa168 Update fine-tuning guide: title, improve readibility in code blocks, fix typos (#3222)
* Fix typo

* Add torchtune link

* Add newlines before comments in code blocks for readability

* Update title
2024-06-03 22:11:19 -04:00
Peter Park
fed33835a0 Add "Fine Tuning LLMs" how to guide (#3124)
* Add Fine Tuning LLMs how to guide

* Reorg and refactor Fine-tuning LLMs with ROCm

Update index and headings

Fix formatting and update toc

Split out content from index to overview.rst

Add metadata

Clean up overview

Add inference sections, fix rst errors, clean up single-gpu-fine-tuning

Combine fine-tuning and inference guides

Fix some links and formatting

Update toc and add formatting fixes

Add ck kernel fusion content

Update toc

Clean up model quantization and acceleration

Add CK images

Clean up profiling

Update triton kernel performance optimization

Update llm inference frameworks guide

Disable automatic number of figures and tables in Sphinx conf

Change tabs to spaces

Change heading to end with -ing

Add link fixes and heading updates

Add rocprof/Omniperf/Omnitrace section

Update profiling and debugging guide

Add formatting fixes

Satisfy spellcheck

Fix words

Delete unused file

Finish overview

Clean up first 4 sections

Multi-gpu fine-tuning guide: slight fixes

Update toc

Remove tabs

Formatting fixes

* Minor wording updates

* Add some clean-up

* Update profiling and debugging gudie

* Fix Omnitrace link

* Update ck kernel fusion with latest

* Update CK formatting

* Fix perfetto link syntax

* Fix typos and add blurbs

* Add fixes to Triton optimization doc

* Tabify saving adapters / models section

* Fix linting errors - spellcheck

Fix spelling and grammar

Satisfy linter

Update wording in profiling guide

Add fixes to satisfy linter

More fixes for linting in Triton guide

More linting fixes

Spellcheck in CK guide

* Improve triton guide

Fix linting errors and optics

* Add occupancy / vgpr table

Change some wording

* Re-add tunableop

* Add missing indent in _toc.yml

* Remove ckProfiler references

* Add links to resources

* Add refs in CK optimization guide

* Rename files and fix internal links

* Organize tuning guides

Reorg triton

* Add compute unit diagram

* Remove AutoAWQ

* Add higher res image for Perfetto trace example

* Update link text

* Update fig nums

* Update some formatting

* Update "Inductor"

* Change "Inductor" to TorchInductor

* Add link to official TorchInductor docs
2024-06-03 14:04:33 -04:00
danielsu-amd
f52bc2bc68 External CI: Add rocBLAS dependency to rocSPARSE (#3216) 2024-06-03 13:41:30 -04:00
danielsu-amd
205790159d External CI: use pipelined rocm-core for rocprofiler (#3215) 2024-06-03 10:52:56 -04:00
Peter Park
9679a84a8b Add components, known issues, and fixed issues to 6.1.2 RN / CL (#87)
* Regenerate changelog

* Add component changelogs and known issue

Fix RELEASE.md headings

Update pub datestamp for 6.1.2

Add AMDSMI and ROCm SMI to 6.1.2 template

Add rccl and rocBLAS

Update intro blurb and headings

Add ROCm SMI fix

Add missed heading to AMDSMI

Update datestamp and release version number

Update version and release number

Add known issue re: MI300X error detection

Words

Add issue link

Rm GitHub issue link

Move known issue down

Update ki wording

Remove "this issue has been investigated ... " from known issue

Fix changelog h1
2024-06-03 08:51:38 -04:00
Sam Wu
d34f7d7777 Merge pull request #3210 from ROCm/dependabot/pip/docs/sphinx/requests-2.32.2
Bump requests from 2.31.0 to 2.32.2 in /docs/sphinx
2024-05-31 17:10:09 -06:00
dependabot[bot]
16fca72626 Bump requests from 2.31.0 to 2.32.2 in /docs/sphinx
Bumps [requests](https://github.com/psf/requests) from 2.31.0 to 2.32.2.
- [Release notes](https://github.com/psf/requests/releases)
- [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md)
- [Commits](https://github.com/psf/requests/compare/v2.31.0...v2.32.2)

---
updated-dependencies:
- dependency-name: requests
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-05-31 23:02:26 +00:00
Sam Wu
1a6ce7f6e0 Merge pull request #3212 from ROCm/dependabot/pip/docs/sphinx/rocm-docs-core-1.2.0
Bump rocm-docs-core from 1.1.1 to 1.2.0 in /docs/sphinx
2024-05-31 17:01:03 -06:00
dependabot[bot]
35c17fcce5 Bump rocm-docs-core from 1.1.1 to 1.2.0 in /docs/sphinx
Bumps [rocm-docs-core](https://github.com/RadeonOpenCompute/rocm-docs-core) from 1.1.1 to 1.2.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v1.1.1...v1.2.0)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-05-31 22:07:13 +00:00
Sam Wu
bf19dd1dc8 Update RTD config 2024-05-31 15:18:53 -06:00
Sam Wu
5fec2e1ca4 Update documentation requirements 2024-05-31 13:49:14 -06:00
danielsu-amd
1975889da1 External CI: Remove redundant rocm_smi_lib pipeline ID (#3211) 2024-05-31 14:25:09 -04:00
Sam Wu
b9c4490f96 Merge branch 'roc-6.1.x' into docs/6.1.2 2024-05-31 11:59:44 -06:00
Sam Wu
7fcb0f19a9 Merge pull request #3208 from ROCm/develop
Merge develop into roc-6.1.x
2024-05-31 11:49:48 -06:00
Sam Wu
625c18371c Merge branch 'roc-6.1.x' into develop 2024-05-31 11:47:19 -06:00
danielsu-amd
9dd6e42122 External CI: Dockerless + latest source for rocprofiler and rocm_bandwidth_test (#3209) 2024-05-31 13:27:47 -04:00
Joseph Macaranas
9d27863954 MIOpen External CI: Add rocprofiler-register dependency for latest source (#3203) 2024-05-31 11:23:46 -04:00
Joseph Macaranas
04561cc60f External CI: updated cmake dependencies (#3206)
Template with bash commands to update cmake with snap.
Use template for two components that want updated cmake with latest source on their default branches.
2024-05-31 11:16:36 -04:00
Joseph Macaranas
14a3e80a1b hipFFT External CI: Missing vmImage syntax for base pool (#3204) 2024-05-31 11:16:17 -04:00
abhimeda
32334fd826 Removing docker from hipBLASLt (#3202) 2024-05-30 21:12:54 -04:00
Peter Park
3e833c7366 Remove unused images and add link to usage in Deep Learning install guide (#3196) (#3201) 2024-05-30 20:20:31 -04:00
Peter Park
61d18252ab Remove unused images and add link to usage in Deep Learning install guide (#3196) 2024-05-30 19:28:13 -04:00
Sam Wu
d25045334c Disable PDF on 6.1.1 (#3199) 2024-05-30 19:14:29 -04:00
Sam Wu
2d8eba0404 Disable pdf builds (#3197) 2024-05-30 19:13:54 -04:00
Joseph Macaranas
cfaa056ae0 Add rocPRIM dependency to rocSOLVER CI build (#3195) 2024-05-30 17:33:02 -04:00
Peter Park
230790d794 Merge pull request #3193 from peterjunpark/docs/6.1.1
docs/6.1.1: Add "How to use ROCm for AI" (#3117)
2024-05-30 13:55:43 -07:00
Peter Park
af84ba09c6 Add "How to use ROCm for AI" (#3117)
* Add Using ROCm for AI:wq

Add PyTorch Docker installation images

Split doc into subtopics

Add metadata

Clean up index

Clean up hugging face guide

Clean up installation guide

Fix rST formatting

Clean up install and train-a-model

Clean up MAD

Delete unused file

Add ref anchors and clean up MAD doc

Add formatting fixes

Update toc and section index

Format some code blocks

Remove install guide and update toc

Chop installation guide

Clean up deployment and hugging face sections

Change headings to end in -ing

Fix spelling in Training a model

Delete MAD and split out install content

Fix formatting

Change words to satisfy spellcheck linter

* Add review suggestions and add helpful links

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

Add helpful links and add review suggestions

Remove fine-tuning link and links to D5 and MAGMA

Update docs/how-to/rocm-for-ai/deploy-your-model.rst

Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com>

Update DeepSpeed link

Add subheading to ML framework installation and closing blurb to hugging face models guide

* Reorder topics
2024-05-30 16:19:45 -04:00
Peter Park
6a5defb825 Add "How to use ROCm for AI" (#3117)
* Add Using ROCm for AI:wq

Add PyTorch Docker installation images

Split doc into subtopics

Add metadata

Clean up index

Clean up hugging face guide

Clean up installation guide

Fix rST formatting

Clean up install and train-a-model

Clean up MAD

Delete unused file

Add ref anchors and clean up MAD doc

Add formatting fixes

Update toc and section index

Format some code blocks

Remove install guide and update toc

Chop installation guide

Clean up deployment and hugging face sections

Change headings to end in -ing

Fix spelling in Training a model

Delete MAD and split out install content

Fix formatting

Change words to satisfy spellcheck linter

* Add review suggestions and add helpful links

Co-authored-by: Leo Paoletti <164940351+lpaoletti@users.noreply.github.com>

Add helpful links and add review suggestions

Remove fine-tuning link and links to D5 and MAGMA

Update docs/how-to/rocm-for-ai/deploy-your-model.rst

Co-authored-by: Young Hui - AMD <145490163+yhuiYH@users.noreply.github.com>

Update DeepSpeed link

Add subheading to ML framework installation and closing blurb to hugging face models guide

* Reorder topics
2024-05-30 16:17:44 -04:00
randyh62
6864f1546e hipcc doc link (#3190)
* hipcc doc link

* Update docs/what-is-rocm.rst

Co-authored-by: Istvan Kiss <neon60@gmail.com>

* Update docs/what-is-rocm.rst

Co-authored-by: Istvan Kiss <neon60@gmail.com>

---------

Co-authored-by: Istvan Kiss <neon60@gmail.com>
2024-05-30 12:52:58 -07:00
Joseph Macaranas
58f543c010 Some new external CI dependencies for latest source on default branches (#3188)
rdc: amdsmi
rocBLAS: AOCL 4.2
rocPRIM: incorrect compiler path
2024-05-30 10:56:52 -04:00
abhimeda
7504e6bc13 removing docker from external ci pipelines (#3177)
* removed docker and pointed ROCm deps to our existing builds

* removed vmImage tag for pool

* added pip to apt list and renamed from rocFFT to hipFFT

* fixed spelling mistakes in rocmDependencies

* added correct apt dep for pip

* removed leading slash in the cmake flags

* changed cxx_compiler to /rocm/bin/hipcc

* added llvm-project, ROCR-Runtime, clr, and rocminfo to rocm deps

* added rocFFT as a rocm dependency

* removed docker and added our builds for components

* removed rocFFT from rocm deps

* Fixed typo in rocFFT value

* added rocprofiler-register to rocFFT and fixed typo in the dependencies-rocm file

* changed cxx compiler to amdclang++

* fixed amdclang++ paths

* moving to faster machine

* added cmake module paths

* switched back to medium build

* added libopm-dev to apt deps

* added libomp-14-dev to apt deps

* added aomp as a rocm dep

* added aomp as a rocm dep

* added hipcc as the cxx_compiler

* reverted back to clang++ as the cxx_compiler

* removed unmentioned rocm deps from the readme

* removed docker

* added python3-pip as an apt dep

* fixed compiler paths

* added hipRAND as a rocm dep

* added print statements to see directory structure

* adding a print statement into /agent/_work/1/s/build/library

* added -Tensile_rocm_assembler as a build flag

* removed a broken script line

* added D to tensile rocm assembler

* added DROCM_PATH to build flags

* fixed typo

* changed build pool from medium to base

* changed build pool from base to low

* added env variables using josephs pr

* removed docker from hipBLASLt and added rocm dependencies that point to our builds

* added pip to the apt packages array

* changed cmake_cxx_compiler env var ro amdclang++

* changed cmake_cxx_compiler env var to amdclang++

* changed cmake_cxx_compiler env var to hipcc

* changed cmake_cxx_compiler env var to hipcc

* changed clang to amdclang

* changed all refs mentioning hipcc to amdclang

* changed cmake_cxx_compiler back to hipcc

* added a HIP_PATH env var based off Tensile/Source/FindHIP.cmake

* added hipcc to HIP_PATH

* added rocm-cmake to rocm deps

* added rocRAND as a rocm dep

* removed dcmake_module flag

* added libomp-dev as an apt dep

* added aomp as a rocm dep

* added clang as an apt dep

* reverted changes back to how they appear in develop since this branch will be submitted for review

* removed unecessary flags

* adding -DCMAKE_CXX_COMPILER=$(Agent.BuildDirectory)/rocm/llvm/bin/amdclang++         -DCMAKE_C_COMPILER=$(Agent.BuildDirectory)/rocm/llvm/bin/amdclang back to see if these are vital to a successful build

* removed newline character
2024-05-30 10:55:24 -04:00
Joseph Macaranas
7e1a1bc7c2 Change offload build to a parameter (#3187) 2024-05-29 21:50:02 -04:00
Joseph Macaranas
a2574adc73 Disable aomp offload build for initial external ci-build work (#3186)
* Disable aomp offload build for initial ci-build work

* Missing dependency for medium pool use of rocPRIM

* Latest rocBLAS source needs added ROCm dependencies
2024-05-29 21:45:34 -04:00
Joseph Macaranas
7207d815d1 ci-build scenario adjustments for aomp and rocm_smi_lib (#3185)
* Update rocm_smi_lib.yml

* Change checkout reference for aomp ci-build case
2024-05-29 19:51:06 -04:00
Peter Park
7db5854c78 Merge pull request #3180 from peterjunpark/docs/6.1.1
Add missed ROCm SMI changelog notes (#3168)
2024-05-29 14:48:37 -07:00
Sam Wu
5930282993 docs(conf.py): Update PDF version to 6.1.1 (#3184) 2024-05-29 15:11:19 -04:00
Sam Wu
012c55ab04 docs(conf.py): Update PDF version to 6.1.1 (#3183) 2024-05-29 15:10:41 -04:00