George Hotz
e9569b8799
fix dsp
2026-06-03 12:09:46 -07:00
chenyu
8a4203638a
make full with buffer=False deviceless ( #16483 )
...
affects arange and eye
2026-06-03 12:35:59 -04:00
George Hotz
ffadd7a315
remove intel and amx support ( #16482 )
2026-06-02 18:53:05 -07:00
Christopher Milan
80eeb4dd21
mockgpu: use autogen.libc ( #16479 )
2026-06-02 19:59:36 -04:00
chenyu
a43b55d480
deviceless const folding schedule test ( #16477 )
2026-06-02 18:46:30 -04:00
chenyu
6b7d2b91df
update test_uop_graph ( #16470 )
...
use UOp methods instead of constructing UOp directly, some of it violated spec
2026-06-02 08:53:54 -04:00
George Hotz
20242fdf1d
update test + spec from shrink_in_render ( #16467 )
...
* update test + spec from shrink_in_render
* cast
2026-06-01 19:24:43 -07:00
Christopher Milan
2d0f132a3b
ci: cleanup more duplicate tests ( #16462 )
2026-06-01 18:56:29 -04:00
George Hotz
124d2f8227
anon addrspace from new renderer ( #16461 )
...
* anon addrspace from new renderer
* use max_numel in python renderer
* add sizes to ptrs in tests
* more
* correct fix
2026-06-01 14:42:02 -07:00
chenyu
7e7b481ba7
less CONST(DEVICE) ( #16452 )
...
* less CONST(DEVICE)
no DEVICE for single device in const_like, multi has other issues
* maybe
* that?
2026-06-01 15:55:12 -04:00
George Hotz
556defa0f7
minor updates from vec removal ( #16456 )
2026-05-31 09:48:51 -07:00
Javier De Jesus
989f713c1b
support negative pads in circular pad mode ( #16448 )
2026-05-31 09:28:45 -07:00
nimlgen
2c2cb339e0
fix word wrap ( #16450 )
2026-05-30 23:21:24 +03:00
qazal
29b47a0057
llama: update local amax implementation after ParamArgs change ( #16446 )
...
* local amax failing test
* update _local_abs_max_fxn
2026-05-30 16:55:43 +09:00
Christopher Milan
434cfa96a3
ci: no fetch in backend tests ( #16438 )
...
should make for less actions cache thrashing
2026-05-29 17:11:16 -04:00
chenyu
b7280705a7
limit CONST(UNIQUE) to invalids only ( #16432 )
2026-05-29 16:02:06 -04:00
qazal
81bc00c006
do not require clearing method_cache in viz tests ( #16428 )
...
* update
* update test_dedup
2026-05-29 18:12:34 +09:00
qazal
54cfb794b8
viz: addrspace little colored box ( #16427 )
...
* return addrspace
* layout
* render
* addrspace encodes color
* update colors
* in input_ast all are params are green
* update stroke
2026-05-29 17:25:07 +09:00
Christopher Milan
69aa2054f6
rename clangjit to clang ( #16423 )
2026-05-28 22:41:58 -04:00
George Hotz
1e7f1dcf49
add ParamArgs [pr] ( #16421 )
...
* add ParamArgs
* fix export
* cleanups
* fixes
* simpler
2026-05-28 19:17:17 -07:00
Christopher Milan
c8af163d2b
disable process replay by default ( #16419 )
...
enable process replay with [pr] and assert with [PR]
process replay no longer captures on master
2026-05-28 17:36:28 -04:00
George Hotz
edca5df25a
flip offset and shape in pad and shrink ( #16414 )
...
* flip offset and shape in pad and shrink
* dumb test
2026-05-28 11:58:19 -07:00
chenyu
d72d8ee065
.const() should not ignore dtype ( #16412 )
...
fixed a bug in postrange, also cleaner
2026-05-28 10:49:15 -04:00
Christopher Milan
171401e8df
skip modulo by zero in test_dtype_alu ( #16404 )
2026-05-27 17:09:05 -04:00
chenyu
c33b767407
bring back test and torch backend change for unique const ( #16403 )
2026-05-27 15:16:08 -04:00
chenyu
6da785562b
test_custom_kernel_precompile_multidevice ( #16401 )
...
add a test to show what invalids need
2026-05-27 11:19:16 -04:00
chenyu
3e80f375ee
skip test_setitem_fancy_on_unrealized_view ( #16400 )
...
crashes in linux llvm ci
2026-05-27 09:50:26 -04:00
chenyu
945ed4f689
revert const unique changes ( #16395 )
2026-05-27 00:06:41 -04:00
chenyu
fa14cde05c
test update for arange and eye ( #16394 )
...
these will need explicit clone to make a buffer
2026-05-26 22:48:34 -04:00
George Hotz
156a4438d9
rename BUFFER_VIEW to SLICE ( #16391 )
...
* rename BUFFER_VIEW to SLICE
* fix comments
2026-05-26 18:15:00 -07:00
Christopher Milan
3adf7f5d95
disable flaky cl test ( #16388 )
2026-05-26 19:56:57 -04:00
Christopher Milan
d23659d38b
cleanup some old test skips ( #16384 )
2026-05-26 19:07:22 -04:00
George Hotz
fd963038a0
remove allow_any_len from store ( #16385 )
...
* remove allow_any_len from store
* a few more
* no bv there
* more fixes
* fixes
* oh that
2026-05-26 15:26:53 -07:00
chenyu
0b88827482
remove CONST(UNIQUE) ( #16383 )
2026-05-26 14:45:22 -04:00
chenyu
d861c50dce
remove unique_const ( #16382 )
2026-05-26 13:53:31 -04:00
George Hotz
bac82d4949
fix emu bug in gfx950 ( #16381 )
...
* fix emu bug in gfx950
* fix renderer
2026-05-26 10:32:03 -07:00
chenyu
9b00defc8c
Revert "remove unique_const ( #16372 )" ( #16380 )
...
This reverts commit 09019d6761 .
2026-05-26 12:30:07 -04:00
chenyu
09019d6761
remove unique_const ( #16372 )
...
* remove unique_const
* fix SDWA thing
* that?
2026-05-26 12:18:03 -04:00
George Hotz
7f1b02854e
bufferview offset is units of input dtype ( #16378 )
2026-05-26 08:49:31 -07:00
qazal
846a809af7
viz: add +- toggle for hidden UOps ( #16368 )
...
* first
* remove
* move src toggles to client side
* line
* update viz server tests
* remove those
* logic
* cleanup
* call matches
* fix const arg
* add labels
* keep changes
* the stack on movement ops hiding change
* structure
* rename to expandedNodes
* work
* test intention
2026-05-26 22:31:54 +09:00
wozeparrot
76fc39ccc0
gather to single device ( #16354 )
2026-05-25 17:27:08 -07:00
Christopher Milan
8ddd1328df
remove getenv(CI) ( #16365 )
...
gone everywhere except test_interop, because torch MPS does not work in actions
2026-05-25 20:23:33 -04:00
George Hotz
689ab6a49f
move buffer view offset to src ( #16364 )
...
* this work?
* failed
2026-05-25 17:07:55 -07:00
Christopher Milan
d8f86be613
webgpu: shader-f16 support in arch ( #16370 )
2026-05-25 19:20:59 -04:00
qazal
b73d2d17b9
viz/cli: add --interval ( #16363 )
...
* interval support
* add test_interval
* llama uses interval
2026-05-26 03:35:06 +09:00
chenyu
5d5e02871f
remove Tensor.from_uop ( #16344 )
...
and no device for const in Tensor init
2026-05-24 18:53:09 -04:00
chenyu
926d125a63
update test_stack ( #16345 )
...
also skip COMPILE_ONLY, it was comparing 0==0
2026-05-23 10:42:35 -04:00
chenyu
149a87dac2
deviceless const cleanups ( #16341 )
2026-05-22 20:11:01 -04:00
Christopher Milan
451f38155c
start cleanup of the slowest tests ( #16339 )
2026-05-22 18:39:36 -04:00
qazal
bbfe4f80ec
quantize_fp8 kernels in uops ( #16288 )
...
* add tests
* simple UOp kernel is n^2
* fast kernel matching c++, opts_to_apply=()
* remove cpp
* simple o(n) kernel, two passes
* fuse the loops
* works on DEV=CPU
* multi regression test
* fix multi, this can possibly be its own bugfix
* test cleanups
* minimal diff
* match C in UOps
* Revert "match C in UOps"
This reverts commit 0bef740c30 .
* edit test
* match speed with C try 2
* needs_second_gpu
* cleanup
2026-05-22 20:54:06 +09:00