tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-02-09 06:05:11 -05:00

Author	SHA1	Message	Date
JaSpa99	d3d58a37e5	Bert: use Tensor.scaled_dot_product_attention (#1528) * use scaled attn from Tensor * add a test for bert * linter * no more tokenizer * without loading weights * remove prints * tribute to linter lords * smaller input and less runs * small bert	2023-08-12 08:46:04 -07:00