George Hotz | a6d842af7a | move device to ops (#1646) | 2023-08-23 08:30:17 -07:00
* move device to ops
* mlops types
* 2 lines

George Hotz | 643cbdfd50 | make embedding and GPT-2 fast (#1631) | 2023-08-22 15:14:38 -07:00
* make embedding fast
* jit more, variable shape support
* print mem bw

George Hotz | 718ced296c | move state to nn/state (#1619) | 2023-08-22 07:36:24 -07:00

George Hotz | 4f459841bc | Symbolic JIT for GPT2 (#1613) | 2023-08-21 19:44:57 -07:00
* not fast yet
* simpler
* symbolic jit
* fp16 GOPS and GB

George Hotz | e3c6c0c6db | add GPT2 example (#1511) (#1514) | 2023-08-10 09:09:47 -07:00
* add gpt2 to examples
* some cleanup
* fixes
* argparse + scaled_dot_product_attention
* add timing
* add to benchmark
Co-authored-by: YassineYousfi <yassine.y10@gmail.com>