qazal
54f00e1013
sqtt: correct rdna4 structs ( #15948 )
2026-04-28 07:35:50 +09:00
qazal
c58fd85a99
sqtt: add needs_rocprof decorator ( #15947 )
* sqtt: add needs_rocprof decorator
* version string
2026-04-28 06:22:50 +09:00
qazal
8c174bdad4
viz/sqtt: correct exec pipes ( #15885 )
* wmma
* p2
* test
* left
* work
* pickle
* handwritten failing tests
* start work
* test the pipes
* empirical evidence
* update rdna4 enum types
* VALU pipe 1
* TRANSCENDENTAL pipe
* transcendental function units
* reorder
* wmma pipe
* cleanup and notes
* smaller
* work
* diff cleanup
* pickle
* use se:1
* int
2026-04-28 05:05:49 +09:00
qazal
9a23de7d27
viz/cli: unify profile and rewrites, -s ALL default ( #15931 )
* work
* workg
* better
* cleanup
* better defaults
* --ls
* better
* work
* update llama
* update
2026-04-25 22:31:24 +09:00
qazal
5eb641395a
viz/cli: select kernel events in -s DEV ( #15909 )
* simple test
* pass
2026-04-24 21:03:34 +09:00
qazal
afc3904e58
viz/cli: unit tests in CI ( #15788 )
* simple failing test
* test stdout
* cleanup sqttmap
2026-04-17 22:34:44 +09:00
qazal
71c83cc3f6
viz: put OTHER_ on the wave row ( #15650 )
* viz: put OTHER_ on the wave row
* update tests
* cleanup cli
2026-04-08 23:13:44 +09:00
qazal
3ac16b3bea
viz: add wmma row, update exec duration logic ( #15646 )
* viz: split wmma to its own row, fix duration logic
* regs
* decrease number of loops, add pickle
* assert overlaps
2026-04-08 20:24:23 +09:00
qazal
a15345a53e
viz/cli: improve --help message ( #15546 )
* viz/cli: improve --help message
* not the default
* more work
* -s
* respect colored
2026-03-31 22:31:33 +09:00
qazal
ccaa6bfc19
viz/cli cleanups ( #15511 )
* one less function
* work
* layout
* better handling of rewrites
* mypy passes
2026-03-28 08:50:38 +09:00
qazal
dcc2a5d23b
viz/cli: simplify to --source and --item flags ( #15510 )
* viz/cli: simplify to --source and --item flags
* update viz cli test
2026-03-28 04:46:39 +09:00
qazal
586c49642f
viz/cli: test in CI ( #15501 )
* viz cli work
* baseline test
* make cli test work without subprocess
* more checks
* check itrace
* s/return/return None
* change
* minimal
* colored
2026-03-27 06:47:15 +09:00
qazal
3f9f0fa846
viz: yield sqtt alt events ( #15500 )
* yield other
* less
* work
* less
2026-03-27 04:43:41 +09:00
qazal
1b3d00d6ac
viz/cli: remove --offset and --limit flags ( #15439 )
* work
* also no more no-color
* reorder
* update llama
* sqtt readme
* itertools
* rm that
* signals back
2026-03-25 09:52:27 +09:00
qazal
109472c37e
sqtt: new s_barrier pickles, handle rdna4 barriers in emulator ( #15437 )
2026-03-24 03:25:28 +09:00
qazal
c7b18e6108
viz: sqtt printer in viz/cli.py ( #15411 )
* work
* sqtt timeline in CLI
* format all printers nicely
* s/Showed/Printed
* ansistrip
* sys.exit
* keep colors in list
* work from amd_copy_matmul
* has_more always gets returned
* linter
* don't print colors
* more colors
* wow this is so deep
* work
* minor details
* selected
* improve progress bar
* remove it
* 22, global_load_vaddr is so long
2026-03-23 00:17:05 +09:00
qazal
709fc52d7b
viz: fix auto zoom range in sqtt, include endpgm packet ( #15349 )
* viz: fix automatic zoom range in sqtt packets
* it's x+width
* include s_endpgm
* endpgm also doesn't have exec
2026-03-18 22:52:32 +09:00
qazal
33bd33e783
sqtt: add CDNA ops enum, show in viz ( #15140 )
2026-03-17 09:38:42 +09:00
qazal
4445f50356
viz: variable duration rdna barriers ( #15277 )
* viz: variable length rdna barriers
* work
* tiny changes
* simple wave simd test
* small wave sync test
* good multi barrier bug find
* simple fix
* wave_sync asserts
* rdna4 work
* more rdna4
* find more bugs in my model
* it's so much simpler
* wave_sync tests duration
* r4
* should just call this rdna4
2026-03-16 06:06:19 +09:00
qazal
7b6211fdd7
sqtt: remove discover_ops script ( #15279 )
2026-03-15 22:17:06 +09:00
qazal
d3eef70162
viz: render shader clock frequency graph ( #15197 )
2026-03-12 01:32:49 +09:00
qazal
a408d90f4f
viz: always detect sqtt packet overlaps, add timeline tests ( #15211 )
* test
* work
* it's called CALL, better assert
* qol
* row_ends
2026-03-11 05:32:38 +09:00
qazal
33a1970045
sqtt: simplify inst mapping, validate JUMP processing in CI ( #15139 )
* jump cleanup
* assert there's a JUMP
* new example for JUMP
* regenerate examples
* rdna4 work
* new packets
* work
* less for branch handling
* less verbose
* fix err message
2026-03-05 09:53:12 +09:00
qazal
c7f908b788
sqtt: fix rdna4 structs ( #15111 )
* work
* DEBUG=2
2026-03-03 23:32:14 +09:00
qazal
b8a55d5f68
sqtt: new packet types, add discovery script ( #14960 )
2026-02-28 04:27:27 +09:00
qazal
16ae96fa58
finish rdna4 sqtt ( #14903 )
* unskip
* it's a wave pair in rdna4
* work
* that
* hidden archive
* generic s_delay, mystery InstOpRDNA4.UNK_60
* branch failing test
* UNK_60 is OTHER_VMEM_STORE
* rdna4 has both s_delay_alu and s_wait_alu
* real branch failing test
* rdna4 doesn't have JUMP_NO, it's NEXT with a flag for no jump
* make inst_delay skips recursive
* all rdna4 tests pass
* simm16 unwraps
* that has a name
2026-02-20 16:06:13 +09:00
qazal
52b51a0324
test fixes from rdna4 sqtt ( #14902 )
2026-02-20 14:42:33 +09:00
qazal
32f569b573
viz/sqtt: decoder fixes pre rdna4/cdna4 work ( #14900 )
* viz/sqtt: decoder fixes pre rdna4/cdna4 work
* fix
* branch_inst + more tests
* smaller
2026-02-20 12:10:15 +09:00
qazal
ac62d28ddc
viz: amdgpu arch cleanup ( #14790 )
* viz: amdgpu arch cleanup
* don't do that
* simpler sqttmap
* work
* self.arch
2026-02-16 16:48:12 +09:00
George Hotz
4680247e35
renderer/amd: move in tree ( #14702 )
* renderer/amd: move in tree
* fix paths in tests
* 24000 lines
* no delete for amd files
2026-02-12 18:09:16 +08:00