tinygrad/examples
Arseny Kapoulkine cb6e7b57a6 examples: Fix parameter bandwidth accounting for quantized LLama (#3930)
Instead of assuming every parameter is 2 bytes, just add up tensor sizes
in bytes
2024-03-25 18:41:05 -04:00
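
The accounting change described in the commit message can be illustrated with a minimal sketch. This is not the code from the commit: `TensorSpec`, `param_bytes_assumed_fp16`, `param_bytes_actual`, and `bandwidth_gb_per_s` are hypothetical names introduced here, and the example weights are made up. It only shows why summing tensor sizes in bytes gives a different figure than assuming 2 bytes per parameter once some weights are quantized.

```python
from dataclasses import dataclass
from math import prod


@dataclass(frozen=True)
class TensorSpec:
    """Hypothetical stand-in for a model weight: a shape plus bytes per element."""
    shape: tuple
    itemsize: int  # bytes per element, e.g. 2 for float16, 1 for int8

    def numel(self) -> int:
        return prod(self.shape)

    def nbytes(self) -> int:
        return self.numel() * self.itemsize


def param_bytes_assumed_fp16(tensors):
    # Earlier style of estimate: every parameter is assumed to occupy 2 bytes.
    return sum(t.numel() * 2 for t in tensors)


def param_bytes_actual(tensors):
    # Estimate in the spirit of the commit message: add up tensor sizes in bytes.
    return sum(t.nbytes() for t in tensors)


def bandwidth_gb_per_s(total_bytes, seconds):
    # Parameter bytes read in one pass divided by the time that pass took.
    return total_bytes / seconds / 1e9


# Illustrative (made-up) weights: an int8 matrix and its float16 scale vector.
weights = [TensorSpec((4096, 4096), 1), TensorSpec((4096,), 2)]
print(param_bytes_assumed_fp16(weights))  # 33562624
print(param_bytes_actual(weights))        # 16785408
```

With these example weights the 2-byte assumption roughly doubles the byte count, so a bandwidth figure derived from it would be overstated by about the same factor.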