George Hotz
c60c3b467a
clean up symlinking in benchmark ( #2219 )
...
* clean up symlinking
* make torch deterministic
2023-11-05 16:46:05 -08:00
George Hotz
12dd165d38
add WINO/HALF/HIP to AMD benchmark
2023-10-25 13:22:45 -04:00
George Hotz
0e3e2bac13
amd wino: upload results
2023-09-09 13:57:14 -07:00
George Hotz
6f95c5f284
winograd speed test for AMD ( #1826 )
2023-09-09 13:56:33 -07:00
George Hotz
0f2bd10d00
add winograd CIFAR to mac tests ( #1825 )
...
* add winograd CIFAR to mac tests
* symlink already done
2023-09-09 13:45:24 -07:00
George Hotz
fb1cc6bf4b
llama jit is default, print tok/sec ( #1774 )
...
* llama jit is default, print tok/sec
* jit not default in CI
2023-09-05 10:12:16 -07:00
George Hotz
89cd380bfc
add nvidia CI ( #1737 )
...
* add nvidia
* speed(nvidia)
2023-09-01 22:02:30 -07:00
George Hotz
fdd7f282cb
Reenable tensor cores for self-hosted Mac CI ( #1717 )
...
* debug 5 matmul
* allow tensor cores in CI
* tensor cores on arm64
* put debug back
2023-08-30 07:53:04 -07:00
wozeparrot
2f768e386d
stable diffusion benchmark artifact ( #1714 )
2023-08-29 21:08:40 -04:00
George Hotz
0ea22bf249
remove DEBUG=1 from stable diffusion AMD since jit cache is fixed
2023-08-29 12:46:12 -07:00
George Hotz
ab9b9ff3e2
pipefail benchmark ( #1709 ) ( #1710 )
...
* feat: specify shell
* feat: specify shell for mac
Co-authored-by: wozeparrot <wozeparrot@gmail.com >
2023-08-29 08:15:02 -07:00
George Hotz
aa7c98722b
sd timing ( #1706 )
2023-08-28 20:22:57 -07:00
George Hotz
ad7d26c393
fix __launch_bounds__ and benchmark TC MATMUL ( #1575 )
...
* fix
* benchmark matmul
2023-08-19 10:54:39 -07:00
George Hotz
e3c6c0c6db
add GPT2 example ( #1511 ) ( #1514 )
...
* add gpt2 to examples
* some cleanup
* fixes
* argparse + scaled_dot_product_attention
* add timing
* add to benchmark
Co-authored-by: YassineYousfi <yassine.y10@gmail.com >
2023-08-10 09:09:47 -07:00
wozeparrot
351684395c
dont run on fork ( #1510 )
2023-08-09 13:06:45 -04:00
wozeparrot
88e2e0c8a3
Revert "don't try to run benchmark on forks" ( #1508 )
2023-08-09 12:59:49 -04:00
wozeparrot
65b65b760b
don't try to run benchmark on forks ( #1507 )
2023-08-09 12:59:19 -04:00
George Hotz
5fdd248617
don't download cifar ( #1472 )
2023-08-06 21:38:59 -07:00
George Hotz
d78fb8f4ed
add stable diffusion and llama ( #1471 )
...
* add stable diffusion and llama
* pretty in CI
* was CI not true
* that
* CI=true, wtf
* pythonpath
* debug=1
* oops, wrong place
* uops test broken for wgpu
* wgpu tests flaky
2023-08-06 21:31:51 -07:00
George Hotz
486a9dbfd9
speed v torch ( #1464 )
...
* speed v torch
* always print
* change print
* torch speed tee
* all exposed
2023-08-06 09:32:33 -07:00
George Hotz
2ab282bfec
run on update_benchmark too ( #1460 )
...
* run on update_benchmark too
* amd inference test
* name it better
* add 10 CIFAR training steps
2023-08-06 08:58:37 -07:00
George Hotz
943b227cb1
only on push to master
2023-08-06 00:10:07 -07:00
George Hotz
2274e3e757
Fix benchmark ( #1454 )
...
* do benchmarking
* system
* artifact
* go
* name artifact
* only on push
2023-08-05 23:44:36 -07:00
George Hotz
bf21aec81f
do benchmarking ( #1451 )
...
* do benchmarking
* system
* artifact
* go
* name artifact
2023-08-05 23:35:01 -07:00