tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-06-13 08:28:55 +08:00

Author	SHA1	Message	Date
nimlgen	77965a22e5	local optimize as rewrite (#15953 ) * local optimize as rewrite * better * x * slighly rename * fix * ugh * remove * x * remove * not weak	2026-04-28 22:51:04 +03:00
nimlgen	4164666c72	programinfo (#15942 ) * programinfo * fix * m * x * x * changes * x * fix * rm	2026-04-27 23:12:03 +03:00
nimlgen	e0ff6cc15c	remove old schedule (#15930 ) * remove old schedule * tests * r * x	2026-04-25 16:46:36 +03:00
nimlgen	a5e9ea7a60	remove schedule batch 4 (#15927 ) * remove schedule batch 4 * fini	2026-04-25 12:36:55 +03:00
nimlgen	f2751955cb	remove linear_to_schedule from tests (#15912 ) * remove linear_to_schedule from tests * x	2026-04-24 20:02:10 +03:00
nimlgen	e5891acab2	jit: precompile (#15848 ) * x * jit: precompile as sep step * x * s * x * x * x * ? * ? * x * x * viz * f * x * u * x * x	2026-04-23 00:23:32 +03:00
nimlgen	ae9b84d32f	rm beam uop (#15844 )	2026-04-21 13:10:26 +03:00
nimlgen	01ac1c8c15	remove all run_schedule from tests (#15846 )	2026-04-21 12:02:10 +03:00
Christopher Milan	6adf4c3cd9	MOCKGPU interfaces (#15796 )	2026-04-17 21:56:29 -04:00
George Hotz	ec00cefa5b	llm is the only app (#15779 ) * tinygrad/llm is the only app * upd pyproject * claude refs * scoping * min diff	2026-04-17 10:44:48 +08:00
qazal	12c653a743	remove opts arg in get_program, everything uses opts_to_apply [pr] (#15767 ) * check Ops.BEAM in process replay * remove opts from the get_program api * lint * simplify * cleanup	2026-04-16 22:42:43 +03:00
qazal	96092d110c	fix process_replay Ops.BEAM [pr] (#15752 )	2026-04-16 07:35:28 +09:00
George Hotz	1ae6528bb6	move schedule into schedule (#15736 ) * move schedule into schedule * callify to root * sched docs	2026-04-15 11:03:25 +08:00
chenyu	3394d18066	size*itemsize -> nbytes (#15729 ) and some UOp.size removal to prep for size to mixin change	2026-04-14 16:27:54 -04:00
George Hotz	f930579b7a	llm: change the default port to 8000 so you can remember it (match vLLM)	2026-04-08 11:25:38 +08:00
chenyu	a444be172d	lower fuzz_symbolic_symbolic_div timeout (#15619 ) mitigate timeout crash due to high total time	2026-04-06 12:58:29 -04:00
nimlgen	604cdbf2f7	am: large allocs aligned to 2mb to use 2mb pages (#15609 )	2026-04-05 18:01:31 +03:00
Christopher Milan	645d45d968	DEV has arch (#15577 ) Co-authored-by: Comma Device <device@comma.ai>	2026-04-03 19:17:19 -04:00
nimlgen	902edc3781	hcq: hcqbuf in copy (#15595 )	2026-04-03 22:47:36 +03:00
Christopher Milan	acf239e4d2	specify renderer in DEV, <dev>_<ren>=1 is deprecated (#15551 )	2026-03-31 18:35:14 -04:00
Christopher Milan	adbfd82d1d	DEV is ContextVar, setting Device.DEFAULT is deprecated (#15508 )	2026-03-30 17:10:49 -04:00
chenyu	f485d0b664	UOp.sum -> usum, prod -> uprod [pr] (#15522 ) rename to prep reduce mixin	2026-03-29 04:51:55 -04:00
Christopher Milan	bc180a963c	deprecate <dev>=1 in favor of DEV=<dev> (#15467 ) * start work on target * add test * update actions to use DEV * update docs * update readmes * tests need that too * update example * update tests (comments) * fix that test * ruff * mypy * oops * remove getenvs * don't add Target yet * and the test * lint * and docs * more stuff * assert * few more fixes * test assert	2026-03-26 03:48:03 -04:00
George Hotz	fe2690399b	llm: support assistant prefill + refactor to TransformerConfig (#15457 ) * llm: support assistant prefill * refactor to ModelConfig * TransformerConfig * more	2026-03-25 10:50:48 +08:00
George Hotz	a33ac869aa	llm server: temperature + test client (#15444 ) * improvements to the llm server * eval script * eval llm * better eval gets 58.71 * cleanups * add temperature, but multinomial is absurdly slow * claude is so smart * lint * remove slop * no more stop	2026-03-24 21:07:15 +08:00
nimlgen	9656d97d97	jit: captures linears, not execitems (#15399 ) * jit: captures linears, not execitems * x * um * etsts * mockcuda	2026-03-21 16:32:12 +08:00
chenyu	da1700e16b	dtypes.index -> dtypes.weakint (#15377 )	2026-03-20 01:08:46 -04:00
nimlgen	d720d50e12	memory: traverse all valid ranges only (#15338 ) * memory: traverse all valid ranges only * x	2026-03-18 14:03:39 +08:00
Christopher Milan	864d3917d5	add openpilot onnx parser test (#15334 )	2026-03-18 00:12:02 -04:00
nimlgen	4b42bb54aa	am: reset sdma to start from 0 (#15109 )	2026-03-03 18:14:46 +03:00
nimlgen	ccbbca05ef	beam: add dev_timeout for am (#15063 ) * beam: add dev_timeout for am * all covered * fk * x * fuzz * reset * f	2026-03-01 16:57:29 +03:00
nimlgen	9b3450c9da	test gpu crash on cdna (#15062 )	2026-02-28 13:17:59 +03:00
nimlgen	faa66e0a61	mi350 hive_reset am repro (#15014 )	2026-02-25 21:30:18 +03:00
George Hotz	2611907afb	start ripping out old scheduler -- no maps (#14909 ) * start ripping out old scheduler -- no maps * no more metadata	2026-02-20 21:05:04 +08:00
George Hotz	fc5677c28b	resnet dataloader + more test cleanups (#14899 ) * resnet dataloader * tests	2026-02-20 10:05:47 +08:00
George Hotz	f081f154ae	parameterize the CDNA asm gemm (#14813 ) * parameterize the CDNA asm gemm * fix llama test * fix * add more gemmt ests * confirm all match * test these asm gemms	2026-02-17 11:35:18 +08:00
George Hotz	bc3487d607	VIZ display cleanups (#14811 ) * exclude reshape/expand broadcasts from viz * limit src lines	2026-02-17 10:03:08 +08:00
qazal	9da7f5e733	disable process replay for AMD emulator renderer [pr] (#14766 ) * disable process replay for AMD emulator renderer [pr] * line * skip	2026-02-15 18:52:37 +09:00
nimlgen	3bee6638e3	external_test_hive_reset (#14729 ) * external_test_hive_reset * add fault	2026-02-13 19:08:36 +03:00
George Hotz	4680247e35	renderer/amd: move in tree (#14702 ) * renderer/amd: move in tree * fix paths in tests * 24000 lines * no delete for amd files	2026-02-12 18:09:16 +08:00
George Hotz	befc1e800c	assembly/amd: disasm is test only (#14694 ) * assembly/amd: disasm is test only * viz uses str	2026-02-12 12:33:46 +08:00
George Hotz	c331798201	move tests to test/backend (#14691 ) * move tests to test/backend * fix imports * fix CI * revert that one * Fix formatting in README for test command	2026-02-12 11:09:44 +08:00
George Hotz	4565958792	some lil speedups (#14679 )	2026-02-11 10:01:58 +08:00
George Hotz	2d4ad9e739	add a waitlist for graph rewrite (#14678 ) * add a waitlist for graph rewrite * cleaner * one context on spec check	2026-02-11 09:30:13 +08:00
chenyu	884592f6c8	pin z3-solver version (#14605 ) found exact input that crashes z3 4.15.4	2026-02-06 22:49:31 -05:00
George Hotz	7a2a3b5c71	Remove Ops.KERNEL, it's all Ops.CALL now (#14603 )	2026-02-07 10:21:54 +08:00
chenyu	b9fe8b7591	fix opt in process replay [pr] (#14599 )	2026-02-06 16:49:56 -05:00
chenyu	197ebcbbbc	log seed with flush=True in fuzz_symbolic (#14597 ) * log seed with flush=True in fuzz_symbolic i think z3 can crash. added reading seed from argv to see if we repro later * fuzz_symbolic_symbolic_div	2026-02-06 15:03:57 -05:00
chenyu	d57d24c7d4	Buffer.as_buffer -> Buffer.as_memoryview [pr] (#14535 ) it casts to memoryview. also inline the as_typed_buffer checks to Tensor._data	2026-02-04 11:31:11 -05:00
nimlgen	2f55005ad9	qcom: sync cpu cache when from_blob (#14518 ) * um * fx * d * x * x * x * x * f * ren	2026-02-03 21:51:03 +03:00

1 2 3 4 5 ...

1001 Commits