Files
b1tg 0fbc551622 train bert with fp8 (#13874)
* fp8 train

* clean

* lint

* test fix from #13439

* skip first/last layer

* rm __init__, restore unroll <=32 check

* tests

* clean test, remove unused

* multi-gpu test, clean quantize_to_fp8

* remove bert contiguous

* run script

* test: better check

* run script search

* add seed in bert data shuffle

* move script to mi350x folder

---------

Co-authored-by: chenyu <chenyu@fastmail.com>
2026-01-09 09:21:59 -05:00
..
2026-01-09 09:21:59 -05:00
2025-06-08 08:42:22 -07:00
2025-06-08 08:42:22 -07:00
2023-03-11 16:28:10 -08:00
2025-10-08 04:54:07 -04:00
2025-02-20 18:03:09 -05:00
2025-10-16 09:55:20 -04:00
2025-10-08 04:54:07 -04:00
2025-02-26 13:22:08 -05:00
2025-08-10 20:33:22 -04:00
2025-12-01 22:50:53 -08:00
2025-06-05 17:17:42 -07:00