tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-09 15:08:02 -05:00

Author	SHA1	Message	Date
chenyu	be2f4336e6	use onnx 1.18.0 in DSP test (#11279 )	2025-07-18 14:09:23 -04:00
chenyu	c5a5d74642	Revert "image_dot of 2 half inputs returns half (#11007 )" (#11274 ) This reverts commit `fa8e08f922`.	2025-07-17 17:34:18 -04:00
Utkarsh Gill	fa8e08f922	image_dot of 2 half inputs returns half (#11007 ) * cast after sum * comment out skipif * minor fix * only test IMAGE * IMAGE is supported now * simpler * simplerr * only cast if dtype is None * dont need to change base_imaeg_type * only cast when dtype is half * add explicit test * actually no, workflow seems better * actually, keep both * move test * fix indent --------- Co-authored-by: Utkarsh Gill <engelbart@Utkarshs-MacBook-Pro.local>	2025-07-17 13:47:22 -07:00
wozeparrot	5878b189b8	don't const fold shape changing bitcast (#11236 )	2025-07-14 16:42:16 -07:00
chenyu	85ddd72038	simpler grouptop in hcopt (#11219 ) * simpler grouptop in hcopt keep the only perf relevant conditions and the rest is handled by try except * update openpilot read image count	2025-07-13 16:06:09 -04:00
uuuvn	40da5f0c81	fix silent mypy failure in ci (#11201 ) Example: https://github.com/tinygrad/tinygrad/actions/runs/16215577171/job/45784110543?pr=11177#step:7:20 Caused by footguny exception in how `set -e` works: ```bash python -m mypy --strict-equality --lineprecision-report . && cat lineprecision.txt ``` Will fail (and have non-zero exit code if run in interactive mode) but because there is `&&` it won't count as script-terminating failure in a script with `set -e` and instead as a test (similar to how fail of a command in if condition won't count as a script-terminating failure despite having non-zero exit code)	2025-07-12 15:12:25 -04:00
chenyu	7ce9e45474	mypy onnx_parser (#11141 )	2025-07-08 19:50:28 -04:00
chenyu	ffcc557986	lint onnx and onnx_parser (#11134 )	2025-07-08 15:28:35 -04:00
George Hotz	397826f0b4	add a test for 1B llm (#11124 ) * add a test for 1B llm * fix mbs * add apps to release	2025-07-07 18:47:25 -07:00
George Hotz	f7d4638e05	start LLM app, tons of clean up required. target is 200 line ollama (#11068 ) * start LLM app, tons of clean up required. target is 200 line ollama * kind of works * simpler * add k/v cache * with SYM=1, it loops * no rope cache * simpler * more cleanups * cleanups * works * argparse and comments * from gguf * generate is a function * no copy from cpu * fix max context pass in * test * improve test * ai2_arc * fix 8B, use less ram * 136 lines	2025-07-07 17:09:46 -07:00
chenyu	425d5f55c4	generate kernel dataset and upload artifact (#11063 )	2025-07-02 17:21:25 -04:00
Ahmed Harmouche	e992ed10dc	WebGPU on Windows (#10890 ) * WebGPU on Windows * Fix dawn-python install * New test * pydeps * Minor fix * Only install dawn-python on windows webgpu --------- Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2025-07-02 08:38:45 -07:00
George Hotz	0597735f28	remove TC=3 not porting this (#11045 )	2025-06-30 15:12:49 -07:00
George Hotz	5911b71404	early support for bidirectional pattern matcher (#11027 ) * early support for bidirectional pattern matcher * expose it and add a test * no bottom up arg there * disable flaky test	2025-06-29 16:54:07 -07:00
chenyu	126fcf4129	clean up AMD_LLVM in tests (#11021 )	2025-06-28 22:45:47 -04:00
qazal	44257f25e4	bump line count to 14600 (#11010 )	2025-06-27 22:48:14 +03:00
George Hotz	b4eb876d5a	kernel.py no longer permutes reduce axis [pr] (#10968 ) * kernel.py no longer permutes reduce axis [pr] * delete tests that handcode uops * regen of sops is broken... * put import back * just remove that * disable those tests	2025-06-26 17:44:58 -07:00
qazal	712980e167	fix extract_dataset + add tests to CI (#10995 ) * fix extract_dataset + tests * add CI * sops.gz itself is same as master * yml + gzip -c + ge * don't commit that * bump limit to 1000 * axis=7 * test_tiny	2025-06-27 01:51:36 +03:00
chenyu	efad567ebd	ruff check whole `examples/mlperf/` (#10979 )	2025-06-25 12:57:48 -04:00
George Hotz	0f89660ce4	Revert "change clang -march flag to -mcpu on arm (#10841 )" (#10942 ) This reverts commit `897e42fd1b`.	2025-06-23 16:48:28 -07:00
ttomsa	897e42fd1b	change clang -march flag to -mcpu on arm (#10841 ) * change clang -march flag to -mcpu with fp16 disassembly test * fix * add capstone to macos dependencies * just check no cast in test * rm import * woops * lets check * move check * llvm init before cpu chcek * try this * bump autogen llvm version * also update libclang? * revert * add comment * skip llvm test and add comment * linter	2025-06-23 16:28:48 -07:00
George Hotz	ae4d2d71b4	bump line count to 14500	2025-06-23 15:32:27 -07:00
geohotstan	4ab7d792cc	ONNX improve dtype fallback (#10800 ) * fix * add early verbose demo test * is this how to write tests :s * is definition drift even a thing? gemini says it is * clean up * better * even better * try add to CI * doesn't work quite yet * much more work to be done * whoops * partition the test heh * skipif * some nits for better names * add webgpu test for onnxrunner * fix reference links * flush for now	2025-06-21 19:29:45 -04:00
George Hotz	e2907360b7	multi is one PM [pr] (#10838 ) * multi is one PM [pr] * disable flaky tests	2025-06-16 14:52:47 -07:00
uuuvn	18d936f981	Remote multihost (#10598 )	2025-06-16 13:18:56 -07:00
George Hotz	27cf836958	split ocelot out for autogen, fix CI (#10819 ) * split ocelot out for autogen, fix CI * mac ocelot	2025-06-15 11:37:23 -07:00
chenyu	7d5c769c6b	fix compile4 (#10797 )	2025-06-12 22:28:56 -04:00
wozeparrot	53edd49a33	feat: bump to llvm20 (#10784 )	2025-06-11 16:04:18 -07:00
George Hotz	9d0383634d	bump cache and include full python version [pr] (#10768 ) * bump cache and include full python version [pr] * stupid windows * really stupid windows	2025-06-10 15:07:30 -07:00
chenyu	612cdf5146	move fuzz_shape_ops to run with other fuzzer (#10767 ) * move fuzz_shape_ops to run with other fuzzer * don't skip CPU	2025-06-10 17:43:04 -04:00
chenyu	5e7ad70aae	don't run linearize().uop tests in get_action_space test (#10766 ) * don't run linearize().uop tests in get_action_space test this part takes 2 minutes in CI and has nothing to do with action space. also not sure if the "for some reason" comment is still relevant * -n=auto test/models	2025-06-10 17:23:53 -04:00
George Hotz	0fbf3f5554	Revert "Revert "Update autogen ci runner to ubuntu 24.04 (#10736 )" (#10757 )" (#10758 ) This reverts commit `a6dba9b9d9`.	2025-06-10 09:32:27 -07:00
George Hotz	a6dba9b9d9	Revert "Update autogen ci runner to ubuntu 24.04 (#10736 )" (#10757 ) This reverts commit `1d15374c7a`.	2025-06-10 09:31:51 -07:00
uuuvn	1d15374c7a	Update autogen ci runner to ubuntu 24.04 (#10736 ) For `kfd.AMDKFD_IOC_EXPORT_DMABUF`	2025-06-10 08:33:02 -07:00
George Hotz	acf72872b3	move view left to the outer graph prereqs + testing (#10725 ) * move view left to the outer graph * global view right * dont need that one * remove comment * test kernelize * simple * split onnx, test sdxl null * fix testing * ugh, wrong one * Update test.yml	2025-06-09 20:43:25 -07:00
George Hotz	ef58ab340a	hotfix: remove n=auto from REMOTE=1 test	2025-06-09 09:19:36 -07:00
George Hotz	81b9c04574	move high level stuff to unit tests [pr] (#10708 ) * move high level stuff to unit tests [pr] * process replay on unit tests * fix pr, less compute * set omp num threads * set 200MB buffer size limit * delete junk * fix tests * faster * move test_indexing to unit * faster	2025-06-08 14:05:56 -07:00
George Hotz	4e2c3560b4	smaller tests are faster tests [pr] (#10704 ) * remove del spam from CI * more * preconstruct default buffer spec * ignore those errors * check exception * more exception check * skip stuff * smaller tests mean faster tests * a few more	2025-06-08 10:54:19 -07:00
George Hotz	7ff175c022	cache a venv to avoid pip usage (#10689 ) * try built in pip caching * try venv * export venv * set VIRTUAL_ENV * revert that * venv key * fix * ci cache hit? * fix windows	2025-06-07 20:13:41 -07:00
George Hotz	53ed64e133	ci speed work 1 (#10676 ) * skip a few slow tests * use a venv for python packages * create venv * no user, it's in venv * ignore venv * venv * new cache key * try that * this * version the python cache	2025-06-07 16:33:11 -07:00
qazal	7114b6ab31	viz browser tests (#10626 ) * viz browser tests * expect failure if js/ isn't included * back green	2025-06-04 14:58:24 +03:00
George Hotz	ee12e801a3	optional fused optimizers (#10549 ) * enumerate cases of Tensors in the JIT * optional fused optimizers * add fused optimizer test * move that there * ugh	2025-05-28 13:50:30 -07:00
Sieds Lykles	ae02a1e232	[bounty] Z3 symbolic fuzzer [pr] (#10514 ) * First version, caught a bug? * Nicely print failure to reproduce * Remove that * Put the assert back * Change fuzzing to use testing_unit so it has z3 * Test key to match * Add rule * Add test * Add test for edge case 0 * Merge patterns * update comment * consistent whitespace * whitespace * add condition * add test * update comment * use Variable * fuzzer using z3_renderer * Cleaned up printing and debugging * working new fuzzer * change some comments and printing * more formatting * fuzz failures in seperate file * fix fstring * more tests * naming * remove added line * remove comment * print number of skipped expressions * use self.assertEqual --------- Co-authored-by: chenyu <chenyu@fastmail.com>	2025-05-28 16:28:37 -04:00
uuuvn	c29c46853f	Very basic mock sqtt (#10512 ) This mockgpu sqtt emulation will just ignore basically everything and end up with a 0x1000 size trace full of zeroes, but just testing for things like register rename is better than nothing i guess	2025-05-26 14:38:28 -07:00
b1tg	a1f64af92d	ci: setup llvm for amdremote (#10507 ) Co-authored-by: b1tg <b1tg@users.noreply.github.com>	2025-05-25 21:52:27 -04:00
George Hotz	bf2a0907be	gate the mockdsp behind MOCKDSP=1 [pr] (#10486 )	2025-05-23 11:44:02 -07:00
George Hotz	f1fe1f93c1	hotfix: 14000 lines	2025-05-19 09:40:53 -07:00
uuuvn	0f825e12f2	Remote fixedvars (#10371 ) * amd mockgpu graph support For testing remote graph stuff (prompted by #10371) in ci * Remote fixedvars Somehow none of existing tests failed when fixedvars were added, looking what to add as an regression test for this --------- Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2025-05-18 09:57:13 -07:00
uuuvn	27c12be471	amd mockgpu graph support (#10385 ) For testing remote graph stuff (prompted by #10371) in ci	2025-05-18 09:43:16 -07:00
qazal	0294bfe507	simpler can_pad (#10364 ) * simpler can_pad [pr] * 3 kernels * tests * less kernels	2025-05-18 10:00:07 +03:00

... 3 4 5 6 7 ...

831 Commits