chenyu
97b05f567e
revert the .detach() in layernorm (#4904)
* revert the .detach() in layernorm
It is only correct in LayerNorm, where the input is the data, and is not correct in GroupNorm and InstanceNorm, which reuse layernorm.
Added backward tests for weights, bias, and input for these norms.
* bigger atol for llvm
* relax backward more
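
To illustrate why a backward test on the input can catch this kind of problem, here is a minimal NumPy sketch. It is not tinygrad code, and the commit message does not say which tensor was detached. The sketch assumes, purely for illustration, that the detach stopped gradients from flowing through the normalization statistics (mean and variance). The helper names `layernorm` and `numerical_grad` are hypothetical.

```python
import numpy as np

EPS = 1e-5

def layernorm(x, eps=EPS):
    # Normalize over the last axis with population statistics (no affine part).
    mu = x.mean(axis=-1, keepdims=True)
    var = ((x - mu) ** 2).mean(axis=-1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def numerical_grad(f, x, h=1e-6):
    # Central finite differences, one element at a time.
    g = np.zeros_like(x)
    for i in np.ndindex(*x.shape):
        xp = x.copy(); xp[i] += h
        xm = x.copy(); xm[i] -= h
        g[i] = (f(xp) - f(xm)) / (2 * h)
    return g

rng = np.random.default_rng(0)
x = rng.standard_normal((2, 4))
w = rng.standard_normal((2, 4))  # arbitrary upstream weights for a scalar loss

def loss(inp):
    return (layernorm(inp) * w).sum()

# Reference gradient with respect to the input.
true_grad = numerical_grad(loss, x)

# Gradient obtained if mean and variance are treated as constants
# (the assumed effect of detaching the statistics).
var = x.var(axis=-1, keepdims=True)
detached_grad = w / np.sqrt(var + EPS)

# The two disagree, so a backward test on the input would fail.
mismatch = not np.allclose(true_grad, detached_grad, atol=1e-4)
print("input gradients differ:", mismatch)
```

Under this assumption the forward output is unchanged, so a forward-only test would still pass. The discrepancy only shows up when the gradient with respect to the input is checked.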
2024-06-10 18:02:05 -04:00