738 Commits

Author | SHA1 | Message | Date
George Hotz | 4a9882d495 | hlops | 2022-06-08 11:46:09 -07:00
George Hotz | e046a2fd9f | readme fix typos | 2022-06-08 11:43:05 -07:00
George Hotz | 4b09ca90a1 | readme: still WIP | 2022-06-08 11:41:19 -07:00
George Hotz | 91d97c8313 | mlops for cpu | 2022-06-08 11:20:10 -07:00
George Hotz | 1764a36bd6 | llops cpu passes | 2022-06-08 10:57:45 -07:00
George Hotz | 81d16d105e | llcpu convs work | 2022-06-08 10:51:09 -07:00
George Hotz | 6bdcf5ef59 | llops cpu | 2022-06-08 10:43:02 -07:00
George Hotz | 582ddef59c | 14 llops too | 2022-06-08 10:01:10 -07:00
George Hotz | 60a48455ad | still over line count, maybe test pass | 2022-06-08 09:51:28 -07:00
George Hotz | 70561f3d90 | way over the line limit | 2022-06-08 09:36:31 -07:00
George Hotz | 4f7ee235c5 | not a real test now | 2022-06-08 09:00:59 -07:00
George Hotz | ae33060dae | early float4 stuff for binary | 2022-06-08 08:59:54 -07:00
George Hotz | 82f29b5dbf | better GPU block | 2022-06-08 08:01:04 -07:00
George Hotz | 42ae78241e | only run test on GPU | 2022-06-08 07:54:40 -07:00
George Hotz | cdf4b5f142 | opencl perf test | 2022-06-08 07:49:08 -07:00
George Hotz | 892ac661e1 | enet readability | 2022-06-07 10:23:05 -07:00
George Hotz | 9b27ba650b | load new torch files | 2022-06-07 10:06:48 -07:00
George Hotz | 33c9c6c025 | float4 for unary_op | 2022-06-06 10:40:30 -07:00
George Hotz | 69a401d596 | use cl.enqueue_copy and add padding to GPUBuffers | 2022-06-06 10:27:08 -07:00
George Hotz | d8ee8a39ac | sgd threestep graph is so pretty | 2022-06-06 09:45:37 -07:00
George Hotz | aa3fb10a92 | hmm, not allowed | 2022-06-06 09:40:15 -07:00
George Hotz | c143c92828 | adam threestep | 2022-06-06 09:38:28 -07:00
George Hotz | c26ed64cef | slightly cleaner graph | 2022-06-06 09:31:48 -07:00
George Hotz | d302049e53 | don't use div | 2022-06-06 09:25:31 -07:00
George Hotz | 7229d65235 | shortcut nops | 2022-06-06 09:15:41 -07:00
George Hotz | b482e1db85 | minor cleanups, one less buffer | 2022-06-06 08:54:24 -07:00
George Hotz | 1ec786960f | fix ci with f | 2022-06-06 08:34:05 -07:00
George Hotz | f177cd550f | make binary ops respect needs_input_grad | 2022-06-06 08:29:08 -07:00
George Hotz | a1dff4061b | minor cleanups | 2022-06-06 08:14:52 -07:00
George Hotz | 613f0ca6e5 | remove new_buffer | 2022-06-06 07:57:39 -07:00
George Hotz | 30f55eaaba | rename llops to opencl | 2022-06-06 07:49:36 -07:00
George Hotz | 233c71a7ba | support requires_grad | 2022-06-06 07:47:31 -07:00
George Hotz | 9f9cf076c0 | comments | 2022-06-05 21:51:31 -07:00
George Hotz | a7ea00dae7 | remove useless | 2022-06-05 21:37:12 -07:00
George Hotz | 5526f36f44 | all buffer alloc in top level | 2022-06-05 21:34:34 -07:00
George Hotz | 85e130a0c7 | line count | 2022-06-05 21:12:02 -07:00
George Hotz | d9e57954d8 | factor out unary and binary op allocations | 2022-06-05 20:48:36 -07:00
George Hotz | bb7e7b48f9 | output edges aren't intermediates either | 2022-06-05 17:27:32 -07:00
George Hotz | 927a238dc8 | better printing for intermediate tensors | 2022-06-05 17:19:47 -07:00
George Hotz | 3dac8fa728 | this fix the gc | 2022-06-05 17:16:40 -07:00
George Hotz | 0ee21ba115 | add ViT test and car | 2022-06-05 17:12:43 -07:00
George Hotz | ebd7290f56 | simpler batchnorm | 2022-06-05 16:51:51 -07:00
George Hotz | 30025f2bc7 | bn: don't track_running_stats if you aren't training | 2022-06-05 16:41:20 -07:00
George Hotz | 1de75b67d5 | fix bug in graph with use of id | 2022-06-05 16:31:20 -07:00
George Hotz | c193088c97 | fix bug in purple edges | 2022-06-05 16:04:51 -07:00
George Hotz | 81c9438ea1 | keepdim avoids reshapes | 2022-06-05 15:56:42 -07:00
George Hotz | 60b8689ea2 | code2 was unneeded for reduce op | 2022-06-05 15:15:04 -07:00
George Hotz | 46aaac5aaf | clean up loops | 2022-06-05 15:00:21 -07:00
George Hotz | 44168c4c40 | remove all ctxes | 2022-06-05 14:55:39 -07:00
George Hotz | f01bad36c2 | factor out convs | 2022-06-05 14:48:42 -07:00