chenyu
c99fa58dd2
simplify gpt2 example ( #1973 )
...
* simplify gpt2 example
* kernel_jitted_count and jit tests
* Revert "kernel_jitted_count and jit tests"
This reverts commit 31a3c26dd0 .
* all_jitted test in test_real_world
2023-10-05 07:09:29 -07:00
George Hotz
48c8d130ae
simpler GPT2 ( #1941 )
...
* don't realize in gpt2
* simpler gpt2
2023-09-29 04:41:09 -07:00
Gijs Koning
b8ff20ffe4
Gpt2 ( #1896 )
...
* small helps
* got something working
* faster?
* faster yes
* cleanup
* cleanup
* cleanup
* Fix non jit
* Fix fp16 and some cleanup
* Fix fp16 and some cleanup
* cleanup
* similar to master
* cleanup
2023-09-22 20:14:47 +08:00
nimlgen
4c31dfafb3
add seed to gpt-2 ( #1869 )
2023-09-15 17:34:14 -04:00
chenyu
ebcda8a714
Move var_vals from ShapeTracker to LazyBuffer ( #1819 )
2023-09-08 09:25:10 -07:00
chenyu
a2745819f6
faster gpt2 jit path and gpt2 in test_real_world ( #1738 )
2023-09-02 08:39:12 -07:00
George Hotz
cd7ceed914
gpt2: print total instead of sync time
2023-08-30 10:59:42 -07:00
George Hotz
a6d842af7a
move device to ops ( #1646 )
...
* move device to ops
* mlops types
* 2 lines
2023-08-23 08:30:17 -07:00
George Hotz
643cbdfd50
make embedding and GPT-2 fast ( #1631 )
...
* make embedding fast
* jit more, variable shape support
* print mem bw
2023-08-22 15:14:38 -07:00
George Hotz
718ced296c
move state to nn/state ( #1619 )
2023-08-22 07:36:24 -07:00
George Hotz
4f459841bc
Symbolic JIT for GPT2 ( #1613 )
...
* not fast yet
* simpler
* symbolic jit
* fp16 GOPS and GB
2023-08-21 19:44:57 -07:00
George Hotz
e3c6c0c6db
add GPT2 example ( #1511 ) ( #1514 )
...
* add gpt2 to examples
* some cleanup
* fixes
* argparse + scaled_dot_product_attention
* add timing
* add to benchmark
Co-authored-by: YassineYousfi <yassine.y10@gmail.com >
2023-08-10 09:09:47 -07:00