training

2026-04-24 03:00:54 -04:00 · 2023-02-20 16:35:43 -08:00
parent 9d072c5778
commit 7be0d52d03
1 changed files with 1 additions and 1 deletions
--- a/docs/train.md
+++ b/docs/train.md
@@ -259,7 +259,7 @@ Also, if your dataset is large, you may want to end the training with a few thou

 Also, if you unlock some original layers, you may want a lower learning rate, like 2e-6.

-## Other Considerations: the sudden converge phenomenon and gradient accumulation
+## More Consideration: Sudden Converge Phenomenon and Gradient Accumulation

 ![img](../github_page/ex1.jpg)