George Hotz
6f1810af2d
with unroll, the action space goes from 161 -> 127 ( #2060 )
...
* with unroll, the action space goes from 161 -> 127
* more reliable instrumentation
* beam search is so op
* beam bugfix
2023-10-12 20:52:23 -07:00
George Hotz
c5edb3c374
train value net, improve API, add BCE ( #2047 )
...
* api cleanups, BCE losses
* valuenet
* fixup examples
* learning okay
* add valuenet runner
* net improvements
* net improvements
* 40% win rate
2023-10-12 07:56:38 -07:00
George Hotz
0ba629c7b9
add world dataset ( #2045 )
2023-10-11 15:54:30 -07:00
George Hotz
0c3b6f13a8
Latest opt ( #2044 )
...
* split out actions
* rl algorithm
2023-10-11 15:46:14 -07:00
George Hotz
41bfeb2c1e
start work on auto opt ( #2034 )
...
* start work on auto opt
* lin failure
* not beating hcopt
* greedy
* timing is fast
* codegen.search
* greedy search in handcode_opt
* track running gflops
* clean up those files
* no failure
2023-10-11 12:54:53 -07:00
chenyu
1c980517c5
s/var_vals_from_ast/vars_from_ast ( #2038 )
2023-10-10 20:21:55 -07:00
George Hotz
f139060103
Rewrite hand coded opt with action space ( #2030 )
...
* tests passing
* hand coded opt with new abstractions
* simpler opts
* split out tensor cores
2023-10-10 07:38:38 -07:00
George Hotz
16ca8410f8
op logger + replay ( #2021 )
...
* logops
* fix dtype printing
* needs inf
* ops dataset
* minor improvements
* 12k kernels
* opt can compile
* graph flops
2023-10-08 15:10:18 -07:00