Commit Graph

  • 9df100d9e7 Deployed 2b01ca5 with MkDocs version: 1.6.1 gh-pages github-actions[bot] 2026-04-07 05:49:19 +00:00
  • 2b01ca59dd USB driver for custom ASM firmware (#15597) master George Hotz 2026-04-07 13:45:41 +08:00
  • 810d7c00cd llama: unify scripts (#15628) wozeparrot 2026-04-07 11:28:08 +08:00
  • 19e96497ee interface in DEV (#15620) Christopher Milan 2026-04-06 16:59:28 -07:00
  • 9ec3a409e9 tests update_benchmark Christopher Milan 2026-04-06 16:14:07 -07:00
  • 461ad9a0c4 Merge branch 'master' into dev-iface Christopher Milan 2026-04-06 16:08:15 -07:00
  • 8ba58304f7 viz: reenable tests (#15626) qazal 2026-04-07 01:52:44 +03:00
  • 2f7d085450 shared _normalize_indices for getitem (#15625) chenyu 2026-04-06 17:45:36 -04:00
  • 66ec188d50 more activations to mixin (#15624) chenyu 2026-04-06 15:41:41 -04:00
  • 1483f7e71c support shift by Tensor (#15623) chenyu 2026-04-06 15:14:57 -04:00
  • 8a64917a91 ugh Christopher Milan 2026-04-06 11:27:39 -07:00
  • 6e30a5f5ea update shifts in torch backend (#15622) chenyu 2026-04-06 14:08:33 -04:00
  • 9c0882ade9 fix Christopher Milan 2026-04-06 11:03:00 -07:00
  • 98f4c7a65c oops Christopher Milan 2026-04-06 10:37:31 -07:00
  • aa627238d5 benchmarks Christopher Milan 2026-04-06 10:36:24 -07:00
  • 8a353f679e test runner Christopher Milan 2026-04-06 10:33:20 -07:00
  • 77b1167c67 tests Christopher Milan 2026-04-06 10:32:34 -07:00
  • fc9c4ac1ca docs Christopher Milan 2026-04-06 10:25:26 -07:00
  • b6d319b7b0 interface in DEV Christopher Milan 2026-04-06 09:59:19 -07:00
  • a444be172d lower fuzz_symbolic_symbolic_div timeout (#15619) chenyu 2026-04-06 12:58:29 -04:00
  • 01b49c8647 support int operand for shifts (#15618) chenyu 2026-04-06 12:32:12 -04:00
  • e2700475cf mlx: cleaner (#15617) nimlgen 2026-04-06 17:49:47 +03:00
  • 86c4431d74 add gpu_family detection to Metal, target MSL 4.0 on macOS 26+ (#15079) Valtteri Valo 2026-04-06 01:51:38 +03:00
  • ff0c941548 remove redundant iteration and toposort in _deepwalk (#15532) 13Perrius 2026-04-05 15:38:45 -07:00
  • e39cfe685a validate lr, momentum, weight_decay in optimizers (#15576) Andrew Cappelli 2026-04-05 18:37:34 -04:00
  • 6a334ceb27 hotfix: fix bert (#15613) nimlgen 2026-04-05 23:41:21 +03:00
  • e3986a6b74 mlx: init runtime (#15612) nimlgen 2026-04-05 22:52:29 +03:00
  • e0988dbae5 hcq: support non for signal_t and compute_t (#15611) nimlgen 2026-04-05 18:56:47 +03:00
  • 5e134aa087 hcq: add write/poll_bit commands (#15610) nimlgen 2026-04-05 18:09:44 +03:00
  • 604cdbf2f7 am: large allocs aligned to 2mb to use 2mb pages (#15609) nimlgen 2026-04-05 18:01:31 +03:00
  • b2d5b29f45 assembly/amd: validate dsl keyword args (#15608) qazal 2026-04-05 17:00:24 +03:00
  • 056fcd7758 viz: web work from rdna4 gemm (#15607) qazal 2026-04-05 13:14:16 +03:00
  • 7e54992bf6 fp8 llama (#15588) wozeparrot 2026-04-05 09:24:57 +08:00
  • 4d36366717 assembly/amd: match rdna4 hw gidx init in emulator (#15604) qazal 2026-04-04 20:28:18 +03:00
  • 2ba5a6ddc8 remove detach in selu (#15602) chenyu 2026-04-04 11:04:29 -04:00
  • f7aed180e4 viz/cli: add Other row in profiler (#15600) qazal 2026-04-04 16:40:53 +03:00
  • 74ecf6d3e6 opaque structs are also c.Struct (#15596) Christopher Milan 2026-04-03 16:40:43 -07:00
  • 645d45d968 DEV has arch (#15577) Christopher Milan 2026-04-03 16:17:19 -07:00
  • 902edc3781 hcq: hcqbuf in copy (#15595) nimlgen 2026-04-03 22:47:36 +03:00
  • 2c4271209e hcq: peer groups for remote (#15594) nimlgen 2026-04-03 19:03:07 +03:00
  • 8fdef2d3e4 mean/std/var to mixin (#15593) chenyu 2026-04-03 10:42:41 -04:00
  • 9920b42b5e hotfix: renderer.target.arch in disasm (#15592) qazal 2026-04-03 16:23:51 +03:00
  • 237084b276 remote: support several hosts (#15585) nimlgen 2026-04-03 11:22:15 +03:00
  • 0ed8d9271d Renderers accept Target or nothing (#15590) Christopher Milan 2026-04-02 22:09:41 -07:00
  • 3a26920141 feat: framework ci (#15589) wozeparrot 2026-04-03 13:03:51 +08:00
  • 830a147a52 Revert "good stuff in USB" fancy_usb George Hotz 2026-04-03 12:19:57 +08:00
  • 736fea8412 select_first_inited cleanup and better errors (#15587) Christopher Milan 2026-04-02 16:27:58 -07:00
  • 8c50da800d [pr] cleanup unused ctx's in codegen (#15586) Christopher Milan 2026-04-02 16:06:58 -07:00
  • 694dc5a717 install script in benchmark (#15584) nimlgen 2026-04-02 18:15:58 +03:00
  • 046c3f1240 mlx: add loopback with send/recv (#15583) nimlgen 2026-04-02 18:15:46 +03:00
  • c64226e97c fix CreationMixin doc (#15582) chenyu 2026-04-02 09:46:28 -04:00
  • d8c2836099 good stuff in USB George Hotz 2026-04-02 18:34:03 +08:00
  • fefb0ebc2a gemm/asm: fp8 cleanups (#15580) qazal 2026-04-02 13:02:38 +03:00
  • 61bc91aa8c Tensor cumalu cleanups (#15579) chenyu 2026-04-02 05:23:22 -04:00
  • 4c654024bc good stuff in USB George Hotz 2026-04-02 11:23:30 +08:00
  • 1aa04eab08 simple CreationMixin (#15567) chenyu 2026-04-01 23:00:56 -04:00
  • 5b2a3251c4 mlperf system json for mi350 (#15575) wozeparrot 2026-04-02 06:30:33 +08:00
  • 6c67bd4c14 better error message when invalid renderer is specified (#15573) Christopher Milan 2026-04-01 14:12:55 -07:00
  • 0d6fbc2355 remove flaky and redundant image test (#15574) Christopher Milan 2026-04-01 13:33:13 -07:00
  • 20f7f0be8e nir renderers use arch (#15556) Christopher Milan 2026-04-01 13:32:51 -07:00
  • 148ad09559 am: do not use dbell for ih (#15571) nimlgen 2026-04-01 21:34:21 +03:00
  • 93a85c7348 am: raise when using more sdma engines (#15569) nimlgen 2026-04-01 21:33:42 +03:00
  • da12c2ea16 better install msg (#15570) nimlgen 2026-04-01 20:09:37 +03:00
  • 20497f2840 fold BIND to CONST when min==max (#15568) b1tg 2026-04-01 23:19:04 +08:00
  • 9275f283e5 viz: update flag and display names (#15566) qazal 2026-04-01 15:48:37 +03:00
  • f5c0794df2 fix Tensor.const_like (#15565) chenyu 2026-04-01 08:35:19 -04:00
  • 09f60d80fd llama: fix FP8=1 FAKEDATA=1 (#15564) qazal 2026-04-01 14:53:03 +03:00
  • 6d1e992e89 copyout sharded w/o ioring (#15562) nimlgen 2026-04-01 14:47:29 +03:00
  • 150c456977 add OSError to suppress_finalizing (#15558) nimlgen 2026-04-01 12:33:59 +03:00
  • fc5b94b902 fix UOp.where(const, const) (#15560) chenyu 2026-04-01 05:28:49 -04:00
  • 5aeb2273db add amd_copy_matmul.py to CI (#15555) chenyu 2026-03-31 22:39:18 -04:00
  • 034f617971 NVCCRenderer is separate from CUDARenderer (#15554) Christopher Milan 2026-03-31 18:26:13 -07:00
  • 8b5b9a0e90 llama: run_and_time (#15533) wozeparrot 2026-04-01 06:46:16 +08:00
  • acf239e4d2 specify renderer in DEV, <dev>_<ren>=1 is deprecated (#15551) Christopher Milan 2026-03-31 15:35:14 -07:00
  • 5181c8e23a llm: fix nan in kvcache (#15552) nimlgen 2026-04-01 00:38:45 +03:00
  • 3af25ccdb4 docs: minor tinygpu changes (#15550) nimlgen 2026-03-31 21:29:15 +03:00
  • 477d194630 hipcomgr and tinygpu scripts (#15549) nimlgen 2026-03-31 20:07:52 +03:00
  • 83085f103c tinygpu docs (#15545) nimlgen 2026-03-31 19:49:38 +03:00
  • ca89215a59 nv: use nvcc over nak by default (#15547) nimlgen 2026-03-31 18:54:56 +03:00
  • a15345a53e viz/cli: improve --help message (#15546) qazal 2026-03-31 16:31:33 +03:00
  • 10d570b3d5 signed tinygpu (#15541) nimlgen 2026-03-31 14:55:09 +03:00
  • 4ac2552642 improve ReduceMixin.all (#15544) chenyu 2026-03-31 07:54:27 -04:00
  • 89ec22131a tests to show double negation in min is not cancelled (#15543) chenyu 2026-03-31 06:59:13 -04:00
  • 8feb8edc68 gemm/asm: add fp8 support to cdna asm_gemm (#15542) qazal 2026-03-31 13:32:54 +03:00
  • 2939ae8b22 more mixin (#15540) chenyu 2026-03-31 05:46:55 -04:00
  • e69f5f9f69 more movement methods to mixin (#15536) chenyu 2026-03-31 05:16:47 -04:00
  • ceb63c8c2f new bundle id (#15307) nimlgen 2026-03-31 12:16:03 +03:00
  • 467c0af8aa viz: skip flaky sever tests (#15538) qazal 2026-03-31 11:20:30 +03:00
  • f88e255cea gemm/asm: split and parameterize dtype in llama gemm tests (#15408) qazal 2026-03-31 11:12:44 +03:00
  • a63392a565 llm: pairwise ranking topk for MoE expert selection (#15499) b1tg 2026-03-31 12:46:39 +08:00
  • 79cccf3003 write sz output to file (#15534) wozeparrot 2026-03-31 11:16:17 +08:00
  • 6fb038d109 replace CompilerSet with list (#15530) Christopher Milan 2026-03-30 20:07:52 -07:00
  • bc866a93f0 viz: rename exec to sqtt (#15527) qazal 2026-03-31 02:06:51 +03:00
  • adbfd82d1d DEV is ContextVar, setting Device.DEFAULT is deprecated (#15508) Christopher Milan 2026-03-30 14:10:49 -07:00
  • 9583489068 add mlx driver to extra (#15526) nimlgen 2026-03-30 20:28:49 +03:00
  • ad6347f6d8 sqtt: allow mapping sopk to IMMEDIATE packets (#15525) qazal 2026-03-30 17:12:17 +03:00
  • 301b2cea57 move matmul to mixin (#15524) chenyu 2026-03-30 07:39:09 -04:00
  • f0eaac4235 reduce mixin (#15523) chenyu 2026-03-30 05:23:58 -04:00
  • f485d0b664 UOp.sum -> usum, prod -> uprod [pr] (#15522) chenyu 2026-03-29 04:51:55 -04:00
  • 36a925e2a2 viz: color wmma, one color map for cli and web (#15519) qazal 2026-03-28 21:53:01 +02:00