77a8919349
* UV+DTR model * DTR model.. again. * fix naviGPS * fix radar... * fix.. * test * fix.. * carrot serv * fix.. * fix.. fleet * fix.. radar * fix atc * Steam Powered model.. * fix.. radarLatFactor range.. 200->500 * fix.. dbc.. * side * SP v2 * brake light * fix brakelight * fix.. * add datetime... * fix.. * fix.. * fix.. * fix.. * blind spot * fix tz * fix.. * ff * radarLatFactor * fix.. bsd * Revert "fix.. bsd" This reverts commit 1d0d1434470e1b92c65eaffaeb8dd7cd779f85ee. * fix.. bsd side.. * test * fix.. e2e conditions * Revert "test" This reverts commit 0ce791dbd66c17260366ed1a4df2626c602dbb7d. * TR16 * fix cut-in detect threshold 3.4 -> 2.6 * fix.. jerk_l limit 5->10 * fix.. * fix.. gm * fix.. OPTIMA_H mass * fix.. radar.. * fix radar.. * fix.. * Radar... * fix.. * fix.. * fix.. * fix.. radartrack 3 * fix.. * fix.. * fix.. * merge.. * fix.. canfd * fix.. * fix.. * fix.. * fix.. radard * new cut_in * Revert "new cut_in" This reverts commit b9b6e9b33318fe1ce7d626468139b17848efcdcd. * fix.. * new cut_in detect... * fix.. disp.. * fix.. * fix.. * fix.. center radar.. * fix.. radar y_sane.. * fix.. * fix.. * hkg jerk 10 -> 5 * fix.. * fix.. * fix.. radar dbc.. * fix.. * fix.. jLead filter.. * test new radar interface.. * fix.. * fix.. * test time... * Revert "test time..." This reverts commit 63e9187736985c4dc4b4f3736674ba7cda6adc3f. * fix radar.. * fix.. * FireHose model.. * tinygrad * Update interface.py * fix.. * fix.. nff toyota corolla_tss2 * fix.. * fix.. * fix.. radar * fix.. * fix.. radar, y_gate * fix.. radar.. * fix.. for clone.. * scc radar enable at low speed.. * fix.. settings.. * fix. * fix.. * fix.. radarTimeStep. * TR16 model again.. * RELEASE.md * fix cut-in detection... * fix.. registeration timeout 15sec.. * fix.. * fix.. radar processing. * fix.. * fix.. * fix.. * fix.. * fix.. * fix..
39 lines
1.2 KiB
Python
39 lines
1.2 KiB
Python
import time
|
|
from tinygrad import Tensor, TinyJit, Device, Context
|
|
from tinygrad.helpers import Profiling, Timing, GlobalCounters
|
|
|
|
# python3 test/speed/external_test_speed_v_torch.py TestSpeed.test_add_a
|
|
|
|
@TinyJit
|
|
def plus(a:Tensor, b:Tensor): return a+b
|
|
|
|
if __name__ == "__main__":
|
|
a = Tensor([1]).realize()
|
|
b = Tensor([1]).realize()
|
|
for i in range(5):
|
|
with Timing(prefix=f"{i}:"):
|
|
c = plus(a,b)
|
|
Device[c.device].synchronize()
|
|
assert c.item() == 2
|
|
for i in range(5):
|
|
st = time.perf_counter()
|
|
c = plus(a,b)
|
|
et = time.perf_counter() - st
|
|
Device[c.device].synchronize()
|
|
print(f"nosync {i}: {et*1e6:.2f} us")
|
|
for i in range(5):
|
|
st = time.perf_counter()
|
|
c = plus(a,b)
|
|
Device[c.device].synchronize()
|
|
et = time.perf_counter() - st
|
|
print(f"precise {i}: {et*1e6:.2f} us")
|
|
assert GlobalCounters.time_sum_s == 0
|
|
with Context(DEBUG=2):
|
|
st = time.perf_counter()
|
|
c = plus(a,b)
|
|
Device[c.device].synchronize()
|
|
et = time.perf_counter() - st
|
|
print(f"kernel {GlobalCounters.time_sum_s*1e3:.2f} ms / full {et*1e3:.2f} ms -- {et/(GlobalCounters.time_sum_s+1e-12):.2f} x")
|
|
with Profiling():
|
|
c = plus(a,b)
|