mirror of
https://github.com/tinygrad/tinygrad.git
synced 2026-01-09 06:58:11 -05:00
* dtypes alu test
* those types don't exist in torch
* floats
* more tests
* disable those
* a couple unary tests
* skip float16 tests in CI for GPU
* fix LLVM bool add True+True=1+1=2 which truncates to False in native LLVM
* remove hardcoded float for LLVM ALU fns
* less sensitive atol for fp32, 1e-10 is flaky and sometimes failed even if you revert the merge commit for non-fp32 math, nothing has changed in our kernels for fp32.
* return on overflows
* fix CUDA exp2
* compute results of op regardless of bounds in a python backend
* skip fp16 in GPU and CUDACPU
* fuzz a smaller range in the float_midcast_int32 test

  I sampled this and we overflow ~70% of the time. because numpy behaves differently on different devices for overflows and Metal seems to do the same, I'm opting to eliminate the non-determinism here
* remove CUDA exp2 overload

  it's already there now

---------

Co-authored-by: George Hotz <geohot@gmail.com>
51 lines
721 B
Plaintext
__pycache__
.venv/
.vscode
.DS_Store
notebooks
.*.swp
.*.swo
*.pyc
*.so
*.txt
build
/dist
*.egg-info
/env
a.out
boxes.jpg
pandecode.dump
vertex.bin
recognize*
.idea
disassemblers/applegpu
disassemblers/cuda_ioctl_sniffer
*.prof
extra/datasets/cifar-10-python.tar.gz
extra/datasets/librispeech/
extra/datasets/imagenet/
extra/datasets/kits19/
extra/datasets/squad/
extra/datasets/img_align_celeba*
extra/datasets/open-images-v6-mlperf
extra/datasets/kits/
extra/datasets/COCO/
extra/datasets/audio*
extra/weights
venv
examples/**/net.*[js,json]
examples/**/*.safetensors
node_modules
package.json
package-lock.json
temp
*.csv
.coverage
coverage.xml
htmlcov
outputs_yolov8
wandb
model.safetensors
quickstart.py
.hypothesis