Files
tinygrad/test
George Hotz f7d4638e05 start LLM app, tons of clean up required. target is 200 line ollama (#11068)
* start LLM app, tons of clean up required. target is 200 line ollama

* kind of works

* simpler

* add k/v cache

* with SYM=1, it loops

* no rope cache

* simpler

* more cleanups

* cleanups

* works

* argparse and comments

* from gguf

* generate is a function

* no copy from cpu

* fix max context pass in

* test

* improve test

* ai2_arc

* fix 8B, use less ram

* 136 lines
2025-07-07 17:09:46 -07:00
..
2025-05-26 14:38:28 -07:00
2025-06-17 19:39:34 +03:00
2020-12-15 23:44:08 -08:00
2024-11-11 20:18:04 +08:00
2025-06-27 13:48:48 +03:00
2025-06-20 15:22:28 -07:00
2025-06-08 08:42:22 -07:00
2025-02-20 18:03:09 -05:00
2025-02-18 15:26:58 +08:00
2025-06-16 16:46:12 -07:00
2025-06-08 08:42:22 -07:00
2025-06-20 15:22:28 -07:00
2025-05-07 11:41:41 -07:00
2025-02-20 18:03:09 -05:00
2025-07-07 16:21:26 -04:00
2025-06-20 15:22:28 -07:00
2025-06-08 08:42:22 -07:00
2025-06-16 13:18:56 -07:00
2025-07-02 15:10:24 -07:00
2025-06-29 09:06:10 -07:00
2025-06-08 08:42:22 -07:00
2025-06-20 15:22:28 -07:00