tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-06-13 00:15:35 +08:00

Author	SHA1	Message	Date
George Hotz	7e6d617935	addrspace cleanups (#16565 ) * addrspace cleanups * bumps * eh, relax a little	2026-06-10 15:57:18 -07:00
George Hotz	a9b6cfece0	refactor llm into files (#15780 ) * refactor llm into files * chat.html * tokenizer cleanup * cleanup * tests	2026-04-17 12:33:11 +08:00
George Hotz	ec00cefa5b	llm is the only app (#15779 ) * tinygrad/llm is the only app * upd pyproject * claude refs * scoping * min diff	2026-04-17 10:44:48 +08:00
b1tg	4e88d875ba	llm: glm 4.7 flash (#15738 ) * glm 4.7 * test * temperature, server enable_thinking * --no-think * remove think stuff	2026-04-16 22:42:04 +08:00
George Hotz	b5a9465b13	llm: add support for moonlight (deepseek MLA) (#15466 ) * add gguf Q5_0 * it works * rebase * simpler test * class * less diff * dicts * normal names * simplify * this * simpler * work * work	2026-04-11 10:32:48 +08:00
b1tg	9ab1415937	llm: fix streaming UTF-8 decode (#15653 )	2026-04-10 17:01:02 +08:00
George Hotz	fe2690399b	llm: support assistant prefill + refactor to TransformerConfig (#15457 ) * llm: support assistant prefill * refactor to ModelConfig * TransformerConfig * more	2026-03-25 10:50:48 +08:00
George Hotz	a33ac869aa	llm server: temperature + test client (#15444 ) * improvements to the llm server * eval script * eval llm * better eval gets 58.71 * cleanups * add temperature, but multinomial is absurdly slow * claude is so smart * lint * remove slop * no more stop	2026-03-24 21:07:15 +08:00
George Hotz	8a82b26522	llm: print the prefill cache size (#15146 ) * print the llm prefill cache size * mock that too	2026-03-05 12:13:28 +08:00
George Hotz	d59e6e7a37	move more tests to test/null, split some existing ones (#14512 ) * move more tests to test/null, split some existing ones * null work * null work * move more * fixes * move PIL * PIL in CLIP * don't move that	2026-02-03 20:20:20 +08:00

10 Commits