mirror of
https://github.com/tinygrad/tinygrad.git
synced 2026-06-13 08:28:55 +08:00
* update llama attention casting updated scaled_dot_product_attention middle cast and removed hard-coded half in llama attention. * fix that