tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-19 03:48:47 -05:00

Author	SHA1	Message	Date
George Hotz	8af8808c61	cleanup tests, bump caches (#11746 )	2025-08-19 21:21:07 -07:00
George Hotz	1d307f568c	move device tests to test/device + test cleanups (#11735 ) * move device tests to test/device * test speedups * test device * linalg to unit * upd * so pytest just works * more divide and skip * speed * test devectorize * add pillow	2025-08-19 16:02:20 -07:00
George Hotz	2ea54d7337	improve syntax of UPats using f [pr] (#11717 ) Co-authored-by: chenyu <chenyu@fastmail.com>	2025-08-18 20:49:45 -04:00
George Hotz	4afa0b86bb	hotfix: ls -lh on wheel size	2025-08-18 11:52:59 -07:00
chenyu	c10e4c4e20	print wheel build size (#11714 )	2025-08-18 14:29:47 -04:00
chenyu	d0d39885c3	onnx in tinygrad (#11675 )	2025-08-14 19:57:21 -04:00
wozeparrot	71260a5ea4	feat: only bench openpilot 0.9.9 models (#11664 )	2025-08-14 19:27:18 -04:00
chenyu	48c4033ae1	fix pylint for onnx (#11673 ) * fix pylint for onnx * too long	2025-08-14 18:48:02 -04:00
geohotstan	1e904155e3	Add Onnx Huggingface to test/models/test_onnx.py (#11468 ) * BOOM * cache extra/huggingface/models/ * why max buffer size is not 0 * override MAX_BUFFER_SIZE * less models * remove more models and change cache dir to already cached dir * only metal * less is more? * remove check ops * why is this not setting the ENVVAR * ughhhhh just test in models * only cpu and gpu * only cpu actually * just override it idk * final * move extra dependencies up top * simplification * fix print * make README better * revert ops_disk fix for now * clean up test_onnx * remove testing fashion clip model cuz sloooowwwwww * actually let METAL run this * fix comment mistake * fix download path in run_models * does this work? * cleanup setup and teardown * contextvar like this? * prove model is cached * do I need to increment DOWNLOAD_CACHE_VERSION? * see if cached with incremented DOWNLOAD_CACHE_VERSION * use warnings to see if the model exists * revert DOWNLOAD_CACHE_VERSION stuff and clean up * add retry to download * nit	2025-08-14 11:16:41 -04:00
ttomsa	ae0c3cfff6	change clang -march flag to -mcpu on arm (#10970 ) Co-authored-by: wozeparrot <wozeparrot@gmail.com>	2025-08-11 13:38:48 -04:00
nimlgen	5403a4aeaf	null dev: support offset on buffers (#11606 ) * null dev: support offset on buffers * nolimit	2025-08-10 21:58:37 +03:00
chenyu	dd3d2eb36c	add training llama3 test in ci (#11599 )	2025-08-09 22:35:39 -04:00
chenyu	b232c60def	benchmark openpilot 0.9.9 (#11575 ) * benchmark openpilot 0.9.9 not sure what to do with the 0.9.7 ones with IMAGE=2 and validate * name	2025-08-08 01:26:14 -04:00
chenyu	702e38dc19	remove FUSE_ARANGE_UINT (#11567 ) also add IGNORE_OOB=1 to bert runs. lowered BS on tinybox to 90 since 96 oom during eval without reset	2025-08-07 16:49:06 -04:00
chenyu	594cbdc66f	skip AM ResNet50 benchmark (#11565 ) hanging with FUSE_ARANGE?	2025-08-07 14:07:01 -04:00
chenyu	7ee3770961	FUSE_ARANGE=1 (#11427 ) * FUSE_ARANGE=1 * fix test --------- Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2025-08-07 13:32:34 -04:00
George Hotz	21570545d3	move view pushing to codegen, try 2 (#11534 ) * move view pushing to codegen, try 2 * fix up some linearizer tests * fix test search * fix test schedule * delete that test * fix test arange * fix a few tests * update tests * push views * ebs cleanup * fix local/reg * test and lint * fix more tests * test cleanups * skipped that one	2025-08-06 15:58:38 -07:00
George Hotz	4fe11725c6	pass through sink arg, update linearizer test (#11536 ) * pass through sink arg, update linearizer test * get_program help * bump line count * use new api	2025-08-06 09:48:48 -07:00
geohotstan	1163292759	move onnx_parser into onnx (#11530 )	2025-08-06 10:46:27 -04:00
nimlgen	1afb290027	ci: fix runner in nv (#11527 )	2025-08-06 10:38:04 +03:00
chenyu	c9225d22ce	only disable flaky test_jit_multidev_xfer (#11523 )	2025-08-05 22:17:25 -04:00
George Hotz	f58fd3143d	cleanup fix_kernel (#11520 ) * cleanup fix_kernel * early load buffer * early meta ops * move those to fix_kernel_ops * fix tests * remote metal was flaky * Revert "fix tests" This reverts commit `a27019383d`. * that hack broke things * fine for ptx	2025-08-05 18:38:43 -07:00
chenyu	3f742a5a7c	comma space lab models benchmark (#11461 )	2025-07-31 19:06:18 -04:00
wozeparrot	d3da20eca6	feat: bump mlperf workflow timeout to 6 hours (#11440 )	2025-07-30 14:12:12 -07:00
nimlgen	5fc5bb5237	ci: clear processes (#11434 ) * unified hcq_smi for managment * fix * fix * no reset for amd	2025-07-30 22:15:18 +03:00
nimlgen	4b4ba5454c	ci: move driver start higher (#11431 )	2025-07-30 10:48:38 +03:00
chenyu	204da24cfc	increase driverbenchmark timeout-minutes to 15 (#11428 )	2025-07-29 19:45:05 -04:00
nimlgen	c88e401d0e	ci: fix typos in h machine benchmarks (#11423 )	2025-07-29 22:11:47 +03:00
George Hotz	1f1f99c287	hotfix: add DEBUG=3 to driver CI	2025-07-29 11:03:47 -07:00
nimlgen	d38d285489	ci: add h machines (#11416 ) * ci: add h machines * more * fix names * names not collide * 20 * 10	2025-07-29 19:21:51 +03:00
Tom Clesius	2568bc0d99	ci: add caching for apt packages (#11162 ) * add caching for apt packages * remove 'inputs' from apt cache key, use outputs instead of env * remove unnecessary mkdir for partial --------- Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2025-07-29 09:04:56 -07:00
uuuvn	052191eae4	Remote multihost (p2p with infiniband verbs) (#9746 ) Co-authored-by: wozeparrot <wozeparrot@gmail.com>	2025-07-27 14:44:32 -07:00
uuuvn	76a2ddbd78	Move remote tests out of onnx (#11310 ) Co-authored-by: wozeparrot <wozeparrot@gmail.com>	2025-07-23 13:25:55 -07:00
chenyu	86e7504111	mypy check extra/onnx.py (#11348 ) instead of running test with 3.10, add onnx to mypy which would have caught StrEnum regression. Several type annotation failed mypy now that does not affect running the code and were skipped for now	2025-07-23 12:42:59 -04:00
chenyu	960da9319d	Remove StrEnum in onnx for python 3.10 (#11345 ) some training tests failed looks like parsing error?	2025-07-23 11:52:25 -04:00
chenyu	7a9a5cfd28	isolate test/external/external_test_am.py (#11335 ) seems to be the one crashing, also remove -n=auto for that	2025-07-22 19:02:20 -04:00
George Hotz	09431d4ad1	make DEFINE_REG behave like the others (#11273 ) * simpler define reg * cast * PTRCAT define_acc * cleanups * fix uops stats * fix linearizer tests * llvm * define reg sets const * define reg sets const * no assign * collapse that * fix test_max_pool2d_bigger_stride_dilation * use index, fix webgpu * devec * fix tests * fix webgpu * fix llvm * threads for python * fix ops_python * only for reg * acc_half is real now in the emulator * fix llvm * fix webgpu init * fix wgpu test * fix some tests * fix ptx * fix ptx bool acc * cleanups * broken, meh. will fix with ENDRANGE * line count	2025-07-22 13:53:56 -07:00
George Hotz	affd83961c	small changes from define_reg (#11327 ) * small changes from define_reg * fix webgpu	2025-07-22 11:11:48 -07:00
qazal	0c4e19f270	hotfix: disable process replay in REMOTE=1 tests (#11320 ) * hotfix: disable process replay in REMOTE=1 tests * comment	2025-07-22 10:41:58 +03:00
geohotstan	445ff8de56	ONNX onnx_parser and buffer_parse clean up (#11000 ) * start * remove onnx.load from compile4 and move np to dropout * clean up and enable test * clean up * move WebGPU ONNX test into MacOS (WebGPU) * leave test in ONNX (CPU) * fix raw_data init None, and simplify onnx_runner test a little? * THESE TESTS ARE SO UGLY UGHH * need to really think about how to structure the test * wow LLMs are quite something * not always on disk now * also add external data loading test * cleaner tests * minimize diff and add const folding tests * add external data loading too * whoops add webgpu back.. but why was it not needed in the first place? * better comment * move webgpu test to macos(webgpu)? * llm english so much better than me wow * trigger CI to check flakiness --------- Co-authored-by: chenyu <chenyu@fastmail.com>	2025-07-21 15:10:25 -04:00
chenyu	be2f4336e6	use onnx 1.18.0 in DSP test (#11279 )	2025-07-18 14:09:23 -04:00
chenyu	c5a5d74642	Revert "image_dot of 2 half inputs returns half (#11007 )" (#11274 ) This reverts commit `fa8e08f922`.	2025-07-17 17:34:18 -04:00
Utkarsh Gill	fa8e08f922	image_dot of 2 half inputs returns half (#11007 ) * cast after sum * comment out skipif * minor fix * only test IMAGE * IMAGE is supported now * simpler * simplerr * only cast if dtype is None * dont need to change base_imaeg_type * only cast when dtype is half * add explicit test * actually no, workflow seems better * actually, keep both * move test * fix indent --------- Co-authored-by: Utkarsh Gill <engelbart@Utkarshs-MacBook-Pro.local>	2025-07-17 13:47:22 -07:00
wozeparrot	5878b189b8	don't const fold shape changing bitcast (#11236 )	2025-07-14 16:42:16 -07:00
chenyu	85ddd72038	simpler grouptop in hcopt (#11219 ) * simpler grouptop in hcopt keep the only perf relevant conditions and the rest is handled by try except * update openpilot read image count	2025-07-13 16:06:09 -04:00
chenyu	2b48b961be	fix a few broken AMX tests (#11204 )	2025-07-12 21:42:38 -04:00
uuuvn	40da5f0c81	fix silent mypy failure in ci (#11201 ) Example: https://github.com/tinygrad/tinygrad/actions/runs/16215577171/job/45784110543?pr=11177#step:7:20 Caused by footguny exception in how `set -e` works: ```bash python -m mypy --strict-equality --lineprecision-report . && cat lineprecision.txt ``` Will fail (and have non-zero exit code if run in interactive mode) but because there is `&&` it won't count as script-terminating failure in a script with `set -e` and instead as a test (similar to how fail of a command in if condition won't count as a script-terminating failure despite having non-zero exit code)	2025-07-12 15:12:25 -04:00
chenyu	7ce9e45474	mypy onnx_parser (#11141 )	2025-07-08 19:50:28 -04:00
chenyu	ffcc557986	lint onnx and onnx_parser (#11134 )	2025-07-08 15:28:35 -04:00
George Hotz	397826f0b4	add a test for 1B llm (#11124 ) * add a test for 1B llm * fix mbs * add apps to release	2025-07-07 18:47:25 -07:00

1 2 3 4 5 ...

916 Commits