tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-01-13 17:08:11 -05:00

Author	SHA1	Message	Date
Chen-Yu Yang	04b6260994	raise if Tensor._buffer is called during jit	2026-01-12 16:38:14 -05:00
C T	a8c821f45e	add Tensor.log10 with test\test_ops.py::TestOps::test_log10 (#14113 )	2026-01-12 13:45:47 -05:00
chenyu	6b0a9f5ee6	don't strip sink in to_uops_list [pr] (#14111 )	2026-01-12 11:19:03 -05:00
chenyu	cad7feec02	more onnx ops (#14104 ) HannWindow, HammingWindow, BlackmanWindow, Hardmax, LpNormalization	2026-01-12 09:11:13 -05:00
chenyu	9973a81356	add channels_last to QLinearGlobalAveragePool (#14094 ) and other minor cleanups	2026-01-10 18:38:19 -05:00
chenyu	35c9701df0	update outdated tests and comments (#14090 )	2026-01-10 01:00:48 -05:00
chenyu	92246ea731	update tests, `WEBGPU=1 pytest .` passes (#14089 ) * update tests, `WEBGPU=1 pytest .` passes * minor update	2026-01-10 00:03:02 -05:00
chenyu	c34c6d9468	fix wgsl packed_store can drop valid (#14088 ) * fix wgsl packed_store can drop valid * fix	2026-01-09 15:22:06 -05:00
chenyu	eacccc5ace	more disk assign tests (#14087 ) covers more edge cases	2026-01-09 14:14:52 -05:00
chenyu	ed295e74dc	don't skip gguf test if ggml is not installed (#14086 ) * don't skip gguf test if ggml is not installed should just let it fail * fix	2026-01-09 12:05:58 -05:00
chenyu	cff33c8d78	add some disk assign tests (#14085 )	2026-01-09 11:50:59 -05:00
chenyu	74fa3c7d09	decomp pow for LVP (#14084 ) test failed due to undefined behavior, so use decomp instead	2026-01-09 10:50:28 -05:00
b1tg	0fbc551622	train bert with fp8 (#13874 ) * fp8 train * clean * lint * test fix from #13439 * skip first/last layer * rm __init__, restore unroll <=32 check * tests * clean test, remove unused * multi-gpu test, clean quantize_to_fp8 * remove bert contiguous * run script * test: better check * run script search * add seed in bert data shuffle * move script to mi350x folder --------- Co-authored-by: chenyu <chenyu@fastmail.com>	2026-01-09 09:21:59 -05:00
chenyu	efcb32f6a9	unique const when requires_grad is set to True (#14075 ) * unique const when requires_grad is set to True * fix pyrender	2026-01-08 16:30:45 -05:00
chenyu	b34c637767	support bfloat16 for CL (#14073 )	2026-01-08 14:14:29 -05:00
Garret Castro	16b652302e	skip bf16 test if not supported by device (#14070 )	2026-01-08 13:37:24 -05:00
wozeparrot	027b935269	tk: fix grouped load store (#14035 )	2026-01-07 22:38:02 -08:00
chenyu	3caa1e2c98	fix cast HALF with PYTHON backend (#14058 )	2026-01-07 16:52:05 -05:00
chenyu	5f1ede7f7e	clean up test_dtype (#14055 ) use less lambda	2026-01-07 15:45:42 -05:00
chenyu	2833c5a54b	few more jit tests with multi tensor inputs (#14047 )	2026-01-06 22:05:22 -05:00
chenyu	72a3f78d19	jit includes tensor inputs in containers (#14043 ) * jit includes tensor inputs in containers * cleanup	2026-01-06 19:42:06 -05:00
chenyu	c714881832	don't allow jit input to be const (#14045 ) * don't allow jit input to be unbuffered like const * just const to fix multi * fix rnnt	2026-01-06 18:15:22 -05:00
chenyu	a8896f28e1	test_unrealized_const_input_frozen (#14044 ) unrealized const is not replaced in jit	2026-01-06 14:17:43 -05:00
nimlgen	325f4006ff	amd: copies w/o sdma (#14036 ) * amd: copies w/o sdma * as_args * fixes * f	2026-01-06 21:15:58 +03:00
chenyu	7fb18f7e47	raise when jit fxn returns non-Tensor output (#14042 )	2026-01-06 12:59:20 -05:00
chenyu	4491ec0c9e	JitError (#14041 ) * JitError * test_symbolic_jit	2026-01-06 12:19:50 -05:00
chenyu	6ddddc68af	test jit tolist failure (#14040 ) also moved tests to test_jit_footguns	2026-01-06 11:16:57 -05:00
chenyu	b699b9f763	test case for jit a function with item call (#14039 ) * test case for jit a function with item call output is silently wrong now * no dtype	2026-01-06 10:40:43 -05:00
qazal	3170365a5b	visualize SQTT with the same cfg infrastructure (#13870 ) * start * rough sketch * post render dag * art * intro g key * work * custom color scale * colors * more blue * better * smaller * use for loop in test	2026-01-06 14:53:20 +09:00
chenyu	83063cc3e4	onnx TensorScatter (#14024 )	2026-01-05 09:05:22 -05:00
chenyu	9497ec00f2	fix onnx attention permute (#14025 ) * fix onnx attention permute * skip test_attention_4d_fp16_cpu too	2026-01-05 08:58:50 -05:00
chenyu	7a81a3cb98	more passed onnx tests (#14022 )	2026-01-05 07:46:27 -05:00
Christopher Milan	b2a0b9c551	autogen: dump patch in CI (#14010 ) * autogen: don't fast-fail, produce patch artifact on differences All verification steps now use continue-on-error to run completely. Each job generates a patch artifact containing all differences found. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * add gen from header test * fix tests * fail if diff * add forward decl autogen test * remove confusing/wrong comments * macos unittests set LIBCLANG_PATH --------- Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-04 22:38:12 -05:00
chenyu	aae08b20e0	enable passed onnx tests (#14017 )	2026-01-04 22:12:50 -05:00
chenyu	f6a78a29e0	support einsum trace (#14012 ) * support einsum trace * test_einsum_scalar_cpu	2026-01-04 19:27:27 -05:00
wozeparrot	f550f9204c	fa: failing test for bwd jit (#14009 ) * tk: failing test for bwd jit * feat: mark expectedFailure * clean: spaces	2026-01-04 16:57:43 -05:00
George Hotz	7abf4591ba	use bitsize on dtype (#14011 ) * use bitsize on dtype [pr] * bitsize * bitsize in js export, but might be wrong * reverts * revert that	2026-01-04 12:16:21 -08:00
chenyu	cfb8bf5814	faster image load (#13977 ) sometimes image load does not need to init with NAN	2026-01-04 13:09:59 -05:00
qazal	bdb421f13e	process_replay: passthrough sink arg for Ops.PROGRAM input (#14000 )	2026-01-04 13:09:39 +09:00
chenyu	8003db2a28	test case of NOOP store load folding (#13997 )	2026-01-03 14:39:26 -05:00
qazal	2cc64d71b0	simplify mi350x gemm / viz asm tests (#13984 ) * mi350x gemm cleanup * asm tests work * simpler asm tests	2026-01-03 11:11:07 +09:00
Christopher Milan	9dc524536f	IMAGE=1 creates "dynamic" images (#13769 ) * remove image from BufferSpec * cl tiny_gemm (64) works * mypy * padding * openpilot CL * reshape properly * remove extra qcom checks * pad output * mypy * update compile test * move undo * TestImageCopy valid images * TestImageRealization valid images * TestImageDType valid images * cleanups * test_renderer_failures * ruff * mypy * simplify ops_qcom * bump step time * Revert "bump step time" This reverts commit `75a037c7d0`. * "dynamic textures" are optional * a start * IMAGE=1 works, no FLOAT16 * fast but wrong * mypy * some fixes * better * works * refactor * oops	2026-01-02 16:22:39 -05:00
chenyu	2e2b5fed12	fix misspellings (#13976 )	2026-01-02 10:37:38 -05:00
b1tg	a78fcc55a4	amd tc 1616128 (#13439 ) * amd tc 1616128 * fix test * remove hardcoded check in test	2026-01-02 09:01:05 -05:00
wozeparrot	ecbac8a338	tk: fa cleanups + causal test (#13963 )	2026-01-01 18:05:00 -08:00
chenyu	af0392efea	only set DiskDevice.size if it opens successfully (#13962 )	2026-01-01 19:33:26 -05:00
chenyu	e036d6df89	properly fix DiskDevice reuse (#13961 )	2026-01-01 18:08:23 -05:00
chenyu	cb7c76a3bd	update test_fuzz_failure to not contruct full UOp (#13960 )	2026-01-01 15:09:58 -05:00
chenyu	51398edf9c	fix indirect import (#13958 ) also deleted old external tests	2026-01-01 14:22:45 -05:00
chenyu	8e416df438	simpler InvalidType [pr] (#13957 ) simpler singleton pattern	2026-01-01 13:55:51 -05:00

1 2 3 4 5 ...

4855 Commits