mirror of
https://github.com/tinygrad/tinygrad.git
synced 2026-01-10 15:38:29 -05:00
would merge if it's also ~1 minute. btw why is gpt2 beam not slower in the first beam run?