nimlgen
372d9e5753
hcq: helper for visible devices ( #12950 )
...
* hcq: helper for visible devices
* fix
* f
2025-10-28 02:27:56 +08:00
nimlgen
b63bd02969
update runtime docs ( #12191 )
2025-09-15 17:46:20 +03:00
qazal
a388d2cb1a
remove PROFILE=1 option, it's just VIZ=1 [pr] ( #12176 )
...
* remove PROFILE=1 option, it's just VIZ=1 [pr]
* sqtt
* sqtt 2
* return last
* rename
2025-09-15 12:51:50 +03:00
chenyu
0e266f376c
ops_gpu -> ops_cl ( #12103 )
2025-09-10 15:15:48 -04:00
nimlgen
1c6c42715f
unify cpu and llvm ( #11982 )
...
* try unify cpu and llvm
* fixes
* fix
* ops
* no llvm
* fix
* rm
* lvmm is ot
* oops
* override
* no llvm
* ignore
* skip llvm
* ooops
2025-09-09 13:54:44 +03:00
George Hotz
5ef7c5923f
docs: remove unused METAL_XCODE env var ( #10421 )
2025-06-06 18:39:54 -04:00
Alexey Zaytsev
78a6af3da7
Use $CUDA_PATH/include for CUDA headers ( #9858 )
2025-04-13 16:20:19 +01:00
Ignacio Sica
876a8be97a
Debug env var breakdown ( #9663 )
...
* add debug level breakdown
* hotfix
* Update env_vars.md
2025-04-02 14:34:07 +08:00
chenyu
2e7c2780a9
CLANG -> CPU ( #9189 )
2025-02-20 18:03:09 -05:00
Ahmed Harmouche
0f94b98646
Force WebGPU backend type [pr] ( #9164 )
...
* Force webgpu backend type
* Mypy fix
* Rename to WEBGPU_BACKEND
* Add it to env_vars docs
* Remove link
2025-02-19 17:19:39 +08:00
ignaciosica
d2234e308a
tf32 tc for nv and ptx ( #8635 )
...
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com >
2025-01-17 17:43:57 -08:00
nimlgen
5cb9443ebb
PROFILE is enabled when VIZ is enabled ( #8516 )
2025-01-06 19:47:16 +03:00
chenyu
51afc3cc88
update env_vars doc on VIZ link ( #7689 )
...
existing one throws 404 because mkdocs does not allow traverse above doc root (i think?). so for now just stick the github link to it
2024-11-13 17:28:14 -05:00
Kinvert
6a0ed46b1c
adding viz to env_vars docs ( #7630 )
2024-11-11 21:28:27 +08:00
George Hotz
3169cb386d
remove graph [pr] ( #7085 )
2024-10-16 11:40:07 +08:00
nimlgen
604fb60143
docs: fix link to jit in env_vars ( #5474 )
2024-07-14 16:08:16 +03:00
Simone Margaritelli
03c3b14cc2
docs: addded JIT description to dos/env_vars.md ( #5445 )
...
* docs: addded JIT description to dos/env_vars.md
* docs: rephrased JIT=2 in env_vars.md
2024-07-13 07:07:11 -07:00
nimlgen
6604d2b2c3
amd/nv respect visible devs ( #5409 )
...
* nv/amd respect visible devices
* linter
* sort amd gpus
* env docs
2024-07-12 20:02:12 +03:00
nimlgen
69f116a7e1
nv/amd profiler ( #4718 )
...
* nv/amd profiler
* fix
* fix
* profile copies
* profile logger
* fixes
* more fixes
* less lines and fixes
* fixes
* some linter
* back sync, no related change
* fix gpu2cpu time def
* simpler
* linter
* linter
* docs
* add add_event api
2024-06-23 17:10:12 +03:00
nimlgen
654a8b9ef7
retire hsa ( #4885 )
...
* retire hsa
* EMULATE_AMD
2024-06-09 11:33:03 +03:00
chenyu
f8f97562e0
remove File Specific Variables from env_vars.md ( #4684 )
2024-05-22 17:00:14 -04:00
George Hotz
9425973bc7
docs cleanup and move ( #4593 )
...
* cleanup and move
* docs-legacy is gone
* don't update setup.py
2024-05-14 20:44:59 -07:00
George Hotz
8f749ae0eb
New docs are in mkdocs ( #4178 )
...
* start mkdocs
* simple docs for tensor
* more docs
* move those back
* more docs
* copy markdown extensions
* docs legacy
* docs building workflow
* fix showcase links
* only that?
* install tinygrad
* add docs to setup.py
* Delete examples/llm.c/data
2024-04-16 10:59:51 +04:00
qazal
27af37f2ad
misc: remove unused env vars ( #3963 )
...
* remove unused env vars
* delete CPU
2024-03-27 16:08:15 -04:00
chenyu
83f39a8ceb
env var to change default float ( #3902 )
...
* env var to change default float to fp16 or bf16
looking for standard names for these. we have FLOAT16 that does something to IMAGE and HALF to convert weights.
working on default bf16 too.
```
RuntimeError: compile failed: <null>(6): error: identifier "__bf16" is undefined
__bf16 cast0 = (nv_bfloat16)(val0);
```
remove that in cifar
* DEFAULT_FLOAT
* default of default
* unit test
* don't check default
* tests work on linux
2024-03-24 20:33:57 -04:00
nimlgen
3fb13ff892
HIP -> HSA in docs/env_vars ( #3824 )
2024-03-19 22:53:33 +03:00
George Hotz
e4528543fa
remove LLVMOPT
2024-01-15 16:01:09 -08:00
chenyu
e39cd3e7f2
update env_vars.md ( #3127 )
...
mostly removed deprecated ones. not clear how to maintain this especially for extra/examples
2024-01-15 01:06:56 -05:00
Yingbo Ma
d43485ae9e
Fix graph_uops ( #2457 )
...
* Load networkx when we need to graph uops
* Document GRAPHUOPS
* import nx in `graph_uops`
2023-11-27 18:42:48 -08:00
George Hotz
4da2ddea6e
Interpreted cleanups ( #2312 )
...
* move the compiler out of ops
* don't return realized
* var_vals filter, fix custom
* typing
2023-11-15 09:02:23 -08:00
chenyu
5d5921d2c8
small doc env update ( #2112 )
2023-10-18 14:49:25 -07:00
George Hotz
c36d306606
KOPT is over, BEAM is upstream ( #2071 )
...
* create cache for q learning
* make linter happy
* global beam
* where it belongs
* bugfix
* ditch the kopt, use the beam
* faster lin and DEBUG=2 okay
* remove kopt, move search to features
2023-10-16 09:46:03 -07:00
Roelof van Dijk
972d9ea215
fix: PRUNEGRAPH is unused ( #1985 )
2023-10-05 14:28:43 -07:00
chenyu
b8fde6bb0f
Test KOPT in CI ( #1744 )
...
* test kopt in ci
* getenv takes dtype from default
2023-09-03 14:37:20 -07:00
wozeparrot
f61d0657d1
document new envvars ( #1676 )
...
* feat: document some new envvars
* feat: actually put values
* feat: no more cifar torch
* feat: no fakedata
2023-08-26 20:17:02 -04:00
George Hotz
a6d842af7a
move device to ops ( #1646 )
...
* move device to ops
* mlops types
* 2 lines
2023-08-23 08:30:17 -07:00
Alex Telon
b66361843a
Timing and Context can now be used as decorators ( #1385 )
...
* Context and Timing can now be used as decorators
* Using Timing decorator in quickstart.md
The time formating is better and is a useful tool to learn.
Old: Time: 3.5260659999912605
New: Time: 3526.14 ms
* Updated env_vars documentation for Context
* Added test for Context decorator
* Put new import on same line as others
2023-08-01 17:16:10 -07:00
terafo
aa60feda48
Fix naming conflict with huggingface datasets ( #1161 )
...
* Rename in files
* Move files
* Moved to extra/datasets as suggested
* Changes to files
* Fixed stupid mistake
---------
Co-authored-by: terafo <terafo@protonmail.com >
2023-07-07 10:43:44 -07:00
George Hotz
18892242b0
global -> group ( #1007 )
...
* global -> group
* allow None for local_size in custom function
* lil local
* comment on shape
* fix cuda
* smart local cast
* better local heuristic
* fix ptx, and work_dim cleanup
* fix metal
* fix ops test
* fix openpilot jit
* no more optlocal
* might fix metal tests
* try metal now
* see generated metal code
* test free removal. REVERT THIS
* mergable
2023-06-21 11:50:43 -07:00
Pasan Perera
b6102ba4ac
added CUDA and PTX to env_vars.md ( #1009 )
2023-06-19 08:47:44 -07:00
Casey Primozic
651d6ea457
Minor improvements + cleanup to ops_gpu.py ( #1006 )
...
* Minor improvements + cleanup to `ops_gpu.py`
* Add some previously undocumented environment variables from `ops_gpu.py` to `env_vars.md`
* Update debug print for OpenCL to print the devices that will be used post-filtering with `CL_EXCLUDE`
* Remove a couple unused or superfluous variables and assignments
* Use `fromimport` shorthand to shave off a couple precious LOC
* Couple small whitespace changes to clean things up
* Revert change to ordering of OpenCL devices
* Small refactor for OpenCL context creation
2023-06-18 21:26:40 -07:00
Nicklas Boman
5c7248c72d
imagenet download and prepare ( #928 )
...
Changing if not exist to the exist_ok=True parameter and adding a variable check if you want to download training data also
adding variable to env_vars.md
2023-06-08 12:55:33 -07:00
wozeparrot
e9c1ae3825
Add a quick start guide ( #900 )
...
* feat: initial quick start guide
* fix: fix link
* feat: add note about jit
* feat: add note about load/store ops
* feat: add link to discord
* feat: add note about saving and loading models
* fix: correct code for saving and loading
* feat: overhaul docs
* fix: fix link
* feat: wording
* feat: add link to discord
* feat: contributing guidelines
* feat: make contributing section more doc focused
* feat: add link to env_vars from readme
* fix: wording
* feat: move community to bottom
* feat: showcase
* feat: linebreak
* feat: redesigned header
* feat: tweaks
* feat: tweaks
* feat: badge for lines of code
* feat: move installation instructions to repo readme
* feat: readme overhaul number 2
* feat: move visualization to quick start guide
* feat: readme 2 electric boogaloo
* fix: grammar
* fix: formatting
* feat: no ugly line
* feat: add line back
* feat: new load method
* feat: split adding accelerator docs out
* feat: showcase whisper
* feat: smaller tweaks
* feat: bring back oneliner
2023-06-04 08:51:20 -07:00