disk tensor load contains big offset and is not meant to be run by gpu. repro steps ``` time ./extra/optimization/generate_dataset.sh gzip /tmp/sops mv /tmp/sops.gz extra/datasets/ ```
added back the log ast function and removed hacks that work around the old dataset
* logops * fix dtype printing * needs inf * ops dataset * minor improvements * 12k kernels * opt can compile * graph flops