tinygrad

mirror of https://github.com/tinygrad/tinygrad.git synced 2026-06-15 17:40:13 +08:00

Author	SHA1	Message	Date
George Hotz	8612385ccb	add all codegen stages to spec_tensor	2026-05-12 10:23:03 -07:00
chenyu	f3e3c3851f	explicit args to Tensor.rand (#16161 ) added requires_grad, other kwargs were silently dropped	2026-05-12 12:53:39 -04:00
nimlgen	e5729935c6	time_call (#16152 ) * time_call * x * fix caches	2026-05-12 16:58:28 +03:00
qazal	fe39cf148a	add Ops.SOURCE test (#16155 ) * simple failing test * raises * change	2026-05-12 22:49:32 +09:00
chenyu	09fd80fba6	fix randperm and _multi_like drop requires_grad (#16150 )	2026-05-11 23:23:34 -04:00
George Hotz	8294d105a7	Update the spec in spec.py to match the current state (#16132 ) * start work on specv2 * more spec * more spec * fix amd emulator * more spec * more * fix test_uop_graph * move those * spec=2 * skip those questionable tests * ptx fix * more spec=2 * store * allow custom function in tensor * spec 2 * fix beam search for tensor cores * delete the old specs * fix import	2026-05-11 20:07:47 -07:00
chenyu	0b02fb6797	Revert "[pr] match torch rmsnorm (#16122 )" (#16144 ) This reverts commit `692257dd70`.	2026-05-11 17:53:42 -04:00
Joshua James Venter	692257dd70	[pr] match torch rmsnorm (#16122 ) * [pr] match rmsnorm torch Signed-off-by: Joshua James Venter <venter.joshua@gmail.com> * 1e-5 * ops.md --------- Signed-off-by: Joshua James Venter <venter.joshua@gmail.com> Co-authored-by: chenyu <chenyu@fastmail.com>	2026-05-11 14:36:41 -04:00
Pawan	4dd6ad3514	gradient: add TRUNC backward (#15925 ) * gradient: add TRUNC backward * test: move round quantization gradient to test_ops	2026-05-08 16:27:55 -07:00
chenyu	235044c9d8	Ops.IDIV -> Ops.CDIV, Ops.MOD -> Ops.CMOD (#16093 ) * Ops.IDIV -> Ops.CDIV, Ops.MOD -> Ops.CMOD * ruff	2026-05-07 23:18:15 -04:00
chenyu	072db9924c	div to mixin (#16078 ) also deleted idiv method	2026-05-07 12:52:37 -04:00
bigyoshi	4024d8438f	runtime/graph: avoid core_id runtimevar merge conflicts (#16026 ) Co-authored-by: bigyoshi51 <269989564+bigyoshi51@users.noreply.github.com>	2026-05-03 19:16:02 +03:00
chenyu	782d1ff80f	Tensor.fmod (#16014 ) c-style mod matches torch	2026-05-01 16:02:18 -04:00
qazal	8b147a9ed5	minimal repro for llama copies 2 (#16011 )	2026-05-01 22:23:47 +09:00
qazal	a29dd7b19b	Revert "cleanup: untrack wait Metal buffers (#15954 )" (#16010 ) * Revert "cleanup: untrack wait Metal buffers (#15954)" This reverts commit `5eb1fd5d3c`. * regression test fixes	2026-05-01 21:18:19 +09:00
qazal	65879fe1b7	metal synchronize regression test (#16008 ) * add test for metal wait=True * add self.assertRaises	2026-05-01 20:10:57 +09:00
George Hotz	4506688285	split render to render.py (#16002 ) * split render to render.py * move more print	2026-04-30 19:41:14 -07:00
chenyu	52c92e15ae	no replacement multinomial (#15995 ) * no replacement multinomial Efraimidis–Spirakis * num_samples == 1 can use fast path	2026-04-30 17:35:26 -04:00
chenyu	e0b09f288f	input validation for rand functions (#15990 )	2026-04-30 14:00:44 -04:00
nimlgen	11e1a2b89f	cleaner and faster run_linear (#15987 ) * cleaner and faster run_linear * x * assert for now * x * x * sym_infer * remove sink	2026-04-30 20:15:22 +03:00
qazal	58b34e71bd	failing test for llama useless copies (#15989 )	2026-05-01 00:55:29 +09:00
nimlgen	dfd2d07005	remove CompiledRunner (#15970 ) * rm usage of CompiledRunner * more tests * last * linter * sink * remove * linter	2026-04-29 22:45:48 +03:00
George Hotz	5f441ecffc	unify reduce + reduce_axis (#15973 ) * unify reduce + reduce_axis * fix all tests * lil cleanups	2026-04-29 10:29:56 -07:00
nimlgen	7787f76dcc	get_runner -> get_runtime (#15967 ) * get_runner -> get_runtime * do not use get_runner * fix * remove get_tunner * remove * fix * x	2026-04-29 18:29:49 +03:00
nimlgen	77965a22e5	local optimize as rewrite (#15953 ) * local optimize as rewrite * better * x * slighly rename * fix * ugh * remove * x * remove * not weak	2026-04-28 22:51:04 +03:00
nimlgen	4164666c72	programinfo (#15942 ) * programinfo * fix * m * x * x * changes * x * fix * rm	2026-04-27 23:12:03 +03:00
nimlgen	96165ff0d1	validate_with_cpu as rewrite (#15938 ) * validate_with_cpu as rewrite * compil * x * linter * moved * fix	2026-04-26 19:58:53 +03:00
nimlgen	117e9e22dd	estimates from graph (#15937 ) * estimates from graph * test * x	2026-04-26 18:22:53 +03:00
nimlgen	e0ff6cc15c	remove old schedule (#15930 ) * remove old schedule * tests * r * x	2026-04-25 16:46:36 +03:00
nimlgen	a5e9ea7a60	remove schedule batch 4 (#15927 ) * remove schedule batch 4 * fini	2026-04-25 12:36:55 +03:00
nimlgen	d2ab6ea7a6	remove schedule batch 3 (#15924 ) * remove shcedule batch 3 * batch 6 * batch 7	2026-04-25 11:53:16 +03:00
nimlgen	3c8a2db870	remove schedule() from tests batch 2 (#15923 ) * remove schedule() from tests batch 2 * batch 4	2026-04-25 10:44:41 +03:00
nimlgen	f2751955cb	remove linear_to_schedule from tests (#15912 ) * remove linear_to_schedule from tests * x	2026-04-24 20:02:10 +03:00
chenyu	7a1adfd2aa	update Tensor.allclose to return Tensor (#15904 ) matches jax	2026-04-24 08:27:17 -04:00
nimlgen	c0f77c2e1c	hcq graph to linear (#15888 ) * hcq * f * f * linter	2026-04-24 12:42:49 +03:00
nimlgen	5cf4ad2fb6	fix resolve param (#15889 )	2026-04-23 17:41:44 +03:00
George Hotz	0c3260d5d9	rename VECTORIZE to STACK (#15880 )	2026-04-23 10:43:42 +08:00
chenyu	f911a63a6b	don't allow negative num_classes in one_hot (#15859 ) no auto infer num_classes, matches jax	2026-04-21 19:39:29 -04:00
Christopher Milan	99a0debd62	Device.count() (#15842 )	2026-04-21 16:46:38 -04:00
chenyu	9192c93b7e	Tensor.invalid -> Tesnor.invalids (#15849 ) matches ones and zeros, and to not share name with UOp.invalid	2026-04-21 11:19:51 -04:00
nimlgen	ae9b84d32f	rm beam uop (#15844 )	2026-04-21 13:10:26 +03:00
nimlgen	01ac1c8c15	remove all run_schedule from tests (#15846 )	2026-04-21 12:02:10 +03:00
nimlgen	c0d7135b5f	do not use jit_cache in test (#15823 ) * do not use jit_cache in test * fix	2026-04-20 11:45:17 +03:00
oxrinz	f551a4bded	add threefry const folding (#15787 ) * prim threefry * test fix * clean test * cleanup * cleanup 2 * cleanup 3 * fix conflict markers in test_const_folding.py * update test * fix lint * use const instead of value for test	2026-04-20 09:30:03 +08:00
Christopher Milan	6adf4c3cd9	MOCKGPU interfaces (#15796 )	2026-04-17 21:56:29 -04:00
chenyu	0191cc73dc	update arange range check (#15794 ) it was not checking negative steps correctly	2026-04-17 16:07:50 -04:00
nimlgen	23ca680a3a	run_linear (#15784 ) * run_linear try 2 * x * f * tests * ctx, cleaner * r * x	2026-04-17 22:44:16 +03:00
Christopher Milan	9f4b7bed25	add pickled jit regression test (#15774 )	2026-04-16 16:59:09 -04:00
qazal	12c653a743	remove opts arg in get_program, everything uses opts_to_apply [pr] (#15767 ) * check Ops.BEAM in process replay * remove opts from the get_program api * lint * simplify * cleanup	2026-04-16 22:42:43 +03:00
George Hotz	d1cce7a476	put the ranges on store instead of after (#15759 ) * put the ranges on store instead of after * better assert * fix stuff * comment out slow rules i don't understand * simpler rule * closer * return false for store * fix loop * only a few schedule failures remain * remove stores to self * all tests pass locally * remove junk * regression test and fix * better test, bump broken torch count * bugfix with regression test * new fusion is better	2026-04-16 19:06:40 +08:00

168 Commits