* adding preliminary compatibility matrix data for 6.2.1
* bump up some version numbers from 6.2.0 to 6.2.1
* adding kernel versions to compatibility matrix. I hate it
* add kernel version lookup table, in dropdown list
* add KFD and User space support. Also adjust some meta data keywords
* update 6.2.1 RC2 versions
* make spelling linter happy
* remove kernel versions from table, just reference LUT below
* Leave kenerel Lookup table expanded
* update kernel version table
* remove kernels from historical matrix, update footnotes
* move historical matrix into compatibility folder
* update historical matrix paths
* version bumps for RC3
* RC4 has no other version bumps. Reorder RPP alphabetically
* change How-To card hue to purple
- Add roctracer dependency to hipBLASLt build to address recent failures.
- Change build pool to ultra due to increased build times.
- Enable ccache to help with build times.
- Referred to public documentation, build instructions, source code in tests directory, and iterative runs to modify build flags.
- rdci test failures are known due to singleton nature of rocprofiler, but gtest attempting to spawn multiple instances. There is an internal ticket to track the issue.
Referred to public documentation, build instructions, and iterative debug runs to update build flags, publish new artifacts, and run tests. Test results are not parsed and graphed in Azure.
40% pass rate for this initial pass. Would like to push this through to at least change the build process and then defer fixing the remaining test failures.
- Test results are not parsed to be graphed in Azure reports.
- Added ccache to potentially improve build times, keyed against the date and hash based on amdclang++ binary.
* Add FBGEMM/FBGEMM_GPU to the Model acceleration libraries page
* Add words to wordlist and fix a typo
* Add new sections for Docker and testing
* Incorporate comments from the external review
* Some minor edits and clarifications
* Incorporate further review coments and fix test section
* Add comment to test section
* Change git clone command for FBGEMM repo
* Change Docker command
* Changes from internal review
* Fix linting issue
Replace cmake calls with bash script calls to compile the components comprising openmp-extras.
Added inline comments to describe the bash scripts from aomp repo being executed.
- Added steps for creating wheel file for torchvision.
- Tried to add torchaudio as well, but it was not reading in AMDGPU_TARGETS value in the nested cmake calls from the python setup.py execution.
- Upstream pytorch builder scripts were updated, so it broke the patching step in the job. Removed the need to patch by using a flag to skip the tests.
- Will work on adding smoke tests of pytorch and torchvision later, just getting this out to fix the nightly build.
* Add introduction and links to the new guide to the vLLM optimized Docker image on AMD Infinity Hub
* Update target link for the Docker vLLM guide
* Change target URL
* Change link target URL again