tinygrad/test/external
George Hotz f7d4638e05 start LLM app, tons of clean up required. target is 200 line ollama (#11068)
* start LLM app, tons of clean up required. target is 200 line ollama

* kind of works

* simpler

* add k/v cache

* with SYM=1, it loops

* no rope cache

* simpler

* more cleanups

* cleanups

* works

* argparse and comments

* from gguf

* generate is a function

* no copy from cpu

* fix max context pass in

* test

* improve test

* ai2_arc

* fix 8B, use less ram

* 136 lines
2025-07-07 17:09:46 -07:00