Szymon Ożóg
|
1e7b7b2c3c
|
Fix flop coutning for mulacc (#4640)
* Fix flop coutning for mulacc
* add test_simple_mulacc
* Update test_uops_stats.py
* Update test_uops_stats.py
* revert test_mulacc
* Test for MULACC vs MUL+ADD
|
2024-05-20 12:06:00 -04:00 |
|
George Hotz
|
2f970a4fc2
|
all realize 2 (#4527)
* all realize 2
* tests fixup
* fix more tests
* fix openpilot
* fix tests
* unneeded
|
2024-05-10 22:43:09 -07:00 |
|
George Hotz
|
827058f030
|
update tests get_runner (#4522)
|
2024-05-10 20:09:22 -07:00 |
|
George Hotz
|
7425a0c646
|
CommandQueue is the future (#3950)
* start of command queue
* cq work
* runs
* cleanup
* outs set
* read is gone
* future buffer work
* command queue is better
* command queue works
* loadops
* delete unneeded
* command queue works
* upd
* fix tests
* use CommandQueue in compile
* delay sync
|
2024-04-01 17:35:48 -07:00 |
|
George Hotz
|
68ca4d4276
|
split to schedule.py (#3949)
* split to schedule.py
* split
|
2024-03-26 21:02:46 -07:00 |
|
George Hotz
|
150ea2eb76
|
create engine folder and move code (#3948)
* retry
* older tf
* that
|
2024-03-26 20:38:03 -07:00 |
|
George Hotz
|
1b6e890ef2
|
uops flop counter (#3373)
* factor out winograd functions
* test counter
* uops flop counter
* more correct
* ish
* correct
* cleanup
* tests for uops flop counter
* tests still fail
* fix symbolic uops flop cnt
* fix symbolic uops flop cnt
* hmm, it's an alu
* uops alu resolve
* relax that
|
2024-02-20 09:36:30 +01:00 |
|