qazal
|
bfb2d1f89a
|
Revert "fp8 gemm speedup (#16236)" (#16245)
This reverts commit d95bf394e1.
|
2026-05-19 02:01:44 +09:00 |
|
qazal
|
d95bf394e1
|
fp8 gemm speedup (#16236)
* add asm_gemm option
* milestone
* work
* edit
* only the fast kernel
* diff
|
2026-05-17 04:58:28 +09:00 |
|
wozeparrot
|
528d35e306
|
llama speed 4 (#15993)
|
2026-04-30 17:14:41 -07:00 |
|
wozeparrot
|
9e60e4a7e7
|
llama: native fp8 (#15733)
|
2026-04-16 22:16:05 -07:00 |
|
wozeparrot
|
7e54992bf6
|
fp8 llama (#15588)
Co-authored-by: qazal <qazal.software@gmail.com>
|
2026-04-04 18:24:57 -07:00 |
|
qazal
|
8feb8edc68
|
gemm/asm: add fp8 support to cdna asm_gemm (#15542)
* work
* hmm, mixins
* rhs_transposed
* also fix the dtype
* check for hipcc
* Exception
* select dev
* default
|
2026-03-31 19:32:54 +09:00 |
|