Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Test] fix inline TDParams kwargs for nontensordata #1095

Merged
merged 1 commit into from
Nov 20, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 20, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 20, 2024
ghstack-source-id: da8b7f40d05715170a3e9f0b47763efe356afe5e
Pull Request resolved: #1095
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 20, 2024
@vmoens vmoens merged commit ac6f24c into gh/vmoens/35/base Nov 20, 2024
11 of 21 checks passed
vmoens added a commit that referenced this pull request Nov 20, 2024
ghstack-source-id: da8b7f40d05715170a3e9f0b47763efe356afe5e
Pull Request resolved: #1095
@vmoens vmoens deleted the gh/vmoens/35/head branch November 20, 2024 09:46
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}8$. Worsened: $\large\color{#d91a1a}14$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 43.8620μs 17.9340μs 55.7601 KOps/s 55.2984 KOps/s $\color{#35bf28}+0.83\%$
test_plain_set_stack_nested 63.7190μs 18.1485μs 55.1011 KOps/s 54.9763 KOps/s $\color{#35bf28}+0.23\%$
test_plain_set_nested_inplace 53.5810μs 20.1140μs 49.7165 KOps/s 50.6457 KOps/s $\color{#d91a1a}-1.83\%$
test_plain_set_stack_nested_inplace 72.0950μs 20.0894μs 49.7776 KOps/s 50.3473 KOps/s $\color{#d91a1a}-1.13\%$
test_items 49.4560μs 4.1177μs 242.8545 KOps/s 237.7996 KOps/s $\color{#35bf28}+2.13\%$
test_items_nested 0.5039ms 0.3368ms 2.9692 KOps/s 2.9365 KOps/s $\color{#35bf28}+1.11\%$
test_items_nested_locked 0.4851ms 0.3381ms 2.9576 KOps/s 2.9314 KOps/s $\color{#35bf28}+0.89\%$
test_items_nested_leaf 0.1309ms 72.3516μs 13.8214 KOps/s 14.0073 KOps/s $\color{#d91a1a}-1.33\%$
test_items_stack_nested 0.5138ms 0.3389ms 2.9507 KOps/s 2.9117 KOps/s $\color{#35bf28}+1.34\%$
test_items_stack_nested_leaf 0.1429ms 73.9790μs 13.5173 KOps/s 13.5366 KOps/s $\color{#d91a1a}-0.14\%$
test_items_stack_nested_locked 0.5030ms 0.3414ms 2.9294 KOps/s 2.7899 KOps/s $\color{#35bf28}+5.00\%$
test_keys 34.3040μs 3.5067μs 285.1702 KOps/s 284.4103 KOps/s $\color{#35bf28}+0.27\%$
test_keys_nested 0.2085ms 0.1379ms 7.2529 KOps/s 7.2375 KOps/s $\color{#35bf28}+0.21\%$
test_keys_nested_locked 1.8755ms 0.1434ms 6.9722 KOps/s 7.0479 KOps/s $\color{#d91a1a}-1.07\%$
test_keys_nested_leaf 0.2265ms 0.1184ms 8.4441 KOps/s 8.6752 KOps/s $\color{#d91a1a}-2.66\%$
test_keys_stack_nested 0.2320ms 0.1390ms 7.1919 KOps/s 7.3400 KOps/s $\color{#d91a1a}-2.02\%$
test_keys_stack_nested_leaf 0.2281ms 0.1194ms 8.3748 KOps/s 8.4624 KOps/s $\color{#d91a1a}-1.04\%$
test_keys_stack_nested_locked 0.2468ms 0.1431ms 6.9903 KOps/s 7.1312 KOps/s $\color{#d91a1a}-1.98\%$
test_values 9.4742μs 1.0478μs 954.3499 KOps/s 952.1755 KOps/s $\color{#35bf28}+0.23\%$
test_values_nested 0.1112ms 55.6048μs 17.9841 KOps/s 18.1102 KOps/s $\color{#d91a1a}-0.70\%$
test_values_nested_locked 0.1084ms 55.3474μs 18.0677 KOps/s 18.3053 KOps/s $\color{#d91a1a}-1.30\%$
test_values_nested_leaf 0.1281ms 60.8281μs 16.4398 KOps/s 16.7748 KOps/s $\color{#d91a1a}-2.00\%$
test_values_stack_nested 0.1208ms 56.7602μs 17.6180 KOps/s 16.8679 KOps/s $\color{#35bf28}+4.45\%$
test_values_stack_nested_leaf 0.1247ms 61.4458μs 16.2745 KOps/s 16.7327 KOps/s $\color{#d91a1a}-2.74\%$
test_values_stack_nested_locked 0.1248ms 56.6523μs 17.6515 KOps/s 18.0221 KOps/s $\color{#d91a1a}-2.06\%$
test_membership 21.5500μs 0.8946μs 1.1178 MOps/s 1.3526 MOps/s $\textbf{\color{#d91a1a}-17.35\%}$
test_membership_nested 57.5380μs 2.7860μs 358.9361 KOps/s 362.4034 KOps/s $\color{#d91a1a}-0.96\%$
test_membership_nested_leaf 46.8370μs 2.7744μs 360.4407 KOps/s 367.5318 KOps/s $\color{#d91a1a}-1.93\%$
test_membership_stacked_nested 50.9940μs 2.7644μs 361.7399 KOps/s 366.3861 KOps/s $\color{#d91a1a}-1.27\%$
test_membership_stacked_nested_leaf 40.8170μs 2.7645μs 361.7342 KOps/s 365.1289 KOps/s $\color{#d91a1a}-0.93\%$
test_membership_nested_last 58.7000μs 4.0498μs 246.9285 KOps/s 249.0123 KOps/s $\color{#d91a1a}-0.84\%$
test_membership_nested_leaf_last 50.8750μs 4.0747μs 245.4139 KOps/s 251.3527 KOps/s $\color{#d91a1a}-2.36\%$
test_membership_stacked_nested_last 27.7720μs 4.0863μs 244.7217 KOps/s 253.0952 KOps/s $\color{#d91a1a}-3.31\%$
test_membership_stacked_nested_leaf_last 59.1610μs 4.0508μs 246.8649 KOps/s 251.3192 KOps/s $\color{#d91a1a}-1.77\%$
test_nested_getleaf 46.2370μs 10.6146μs 94.2097 KOps/s 95.1646 KOps/s $\color{#d91a1a}-1.00\%$
test_nested_get 53.8310μs 10.1834μs 98.1994 KOps/s 100.0835 KOps/s $\color{#d91a1a}-1.88\%$
test_stacked_getleaf 58.0890μs 10.4620μs 95.5840 KOps/s 95.4275 KOps/s $\color{#35bf28}+0.16\%$
test_stacked_get 37.0090μs 10.1301μs 98.7155 KOps/s 100.3956 KOps/s $\color{#d91a1a}-1.67\%$
test_nested_getitemleaf 55.7740μs 11.1347μs 89.8093 KOps/s 91.4605 KOps/s $\color{#d91a1a}-1.81\%$
test_nested_getitem 54.0510μs 10.4961μs 95.2739 KOps/s 97.0390 KOps/s $\color{#d91a1a}-1.82\%$
test_stacked_getitemleaf 61.8550μs 11.2202μs 89.1246 KOps/s 91.4283 KOps/s $\color{#d91a1a}-2.52\%$
test_stacked_getitem 61.9360μs 10.3903μs 96.2437 KOps/s 98.4734 KOps/s $\color{#d91a1a}-2.26\%$
test_lock_nested 2.7775ms 0.4559ms 2.1936 KOps/s 1.8339 KOps/s $\textbf{\color{#35bf28}+19.61\%}$
test_lock_stack_nested 0.7728ms 0.4200ms 2.3808 KOps/s 2.4341 KOps/s $\color{#d91a1a}-2.19\%$
test_unlock_nested 0.8515ms 0.3658ms 2.7335 KOps/s 2.7351 KOps/s $\color{#d91a1a}-0.06\%$
test_unlock_stack_nested 0.5084ms 0.3381ms 2.9575 KOps/s 3.0404 KOps/s $\color{#d91a1a}-2.73\%$
test_flatten_speed 0.1841ms 94.3219μs 10.6020 KOps/s 11.1309 KOps/s $\color{#d91a1a}-4.75\%$
test_unflatten_speed 0.6641ms 0.4792ms 2.0866 KOps/s 2.1240 KOps/s $\color{#d91a1a}-1.76\%$
test_common_ops 1.6416ms 0.7789ms 1.2839 KOps/s 1.2945 KOps/s $\color{#d91a1a}-0.81\%$
test_creation 0.1523ms 2.0922μs 477.9758 KOps/s 497.5719 KOps/s $\color{#d91a1a}-3.94\%$
test_creation_empty 39.0130μs 11.1554μs 89.6430 KOps/s 86.9170 KOps/s $\color{#35bf28}+3.14\%$
test_creation_nested_1 54.0310μs 13.9624μs 71.6211 KOps/s 69.9810 KOps/s $\color{#35bf28}+2.34\%$
test_creation_nested_2 54.3720μs 18.0418μs 55.4268 KOps/s 53.6659 KOps/s $\color{#35bf28}+3.28\%$
test_clone 0.1197ms 13.7276μs 72.8459 KOps/s 77.9354 KOps/s $\textbf{\color{#d91a1a}-6.53\%}$
test_getitem[int] 0.8629ms 12.6013μs 79.3568 KOps/s 79.5633 KOps/s $\color{#d91a1a}-0.26\%$
test_getitem[slice_int] 0.1494ms 23.9134μs 41.8175 KOps/s 42.3578 KOps/s $\color{#d91a1a}-1.28\%$
test_getitem[range] 0.3861ms 50.2638μs 19.8950 KOps/s 20.4316 KOps/s $\color{#d91a1a}-2.63\%$
test_getitem[tuple] 0.1546ms 19.7837μs 50.5468 KOps/s 50.6822 KOps/s $\color{#d91a1a}-0.27\%$
test_getitem[list] 0.3218ms 44.8784μs 22.2824 KOps/s 22.4687 KOps/s $\color{#d91a1a}-0.83\%$
test_setitem_dim[int] 59.6610μs 26.2656μs 38.0726 KOps/s 39.3874 KOps/s $\color{#d91a1a}-3.34\%$
test_setitem_dim[slice_int] 95.1380μs 52.8002μs 18.9393 KOps/s 19.9898 KOps/s $\textbf{\color{#d91a1a}-5.25\%}$
test_setitem_dim[range] 0.1877ms 77.2554μs 12.9441 KOps/s 13.1556 KOps/s $\color{#d91a1a}-1.61\%$
test_setitem_dim[tuple] 76.0420μs 41.3729μs 24.1704 KOps/s 24.9648 KOps/s $\color{#d91a1a}-3.18\%$
test_setitem 0.1090ms 20.8538μs 47.9528 KOps/s 48.4318 KOps/s $\color{#d91a1a}-0.99\%$
test_set 0.1950ms 20.2174μs 49.4624 KOps/s 49.8733 KOps/s $\color{#d91a1a}-0.82\%$
test_set_shared 4.3688ms 0.1706ms 5.8625 KOps/s 5.8632 KOps/s $\color{#d91a1a}-0.01\%$
test_update 0.1531ms 23.5093μs 42.5363 KOps/s 42.8801 KOps/s $\color{#d91a1a}-0.80\%$
test_update_nested 0.1470ms 33.4768μs 29.8714 KOps/s 30.9563 KOps/s $\color{#d91a1a}-3.50\%$
test_update__nested 0.1619ms 33.0247μs 30.2804 KOps/s 31.0708 KOps/s $\color{#d91a1a}-2.54\%$
test_set_nested 0.1562ms 22.6118μs 44.2248 KOps/s 45.6606 KOps/s $\color{#d91a1a}-3.14\%$
test_set_nested_new 0.2541ms 27.4687μs 36.4051 KOps/s 37.2415 KOps/s $\color{#d91a1a}-2.25\%$
test_select 0.1879ms 42.1140μs 23.7451 KOps/s 23.3913 KOps/s $\color{#35bf28}+1.51\%$
test_select_nested 0.1350ms 60.5989μs 16.5019 KOps/s 16.6599 KOps/s $\color{#d91a1a}-0.95\%$
test_exclude_nested 0.1398ms 74.8315μs 13.3634 KOps/s 13.5508 KOps/s $\color{#d91a1a}-1.38\%$
test_empty[True] 0.5743ms 0.3504ms 2.8542 KOps/s 2.9003 KOps/s $\color{#d91a1a}-1.59\%$
test_empty[False] 12.3458μs 1.2498μs 800.1555 KOps/s 803.0305 KOps/s $\color{#d91a1a}-0.36\%$
test_unbind_speed 0.4690ms 0.2656ms 3.7651 KOps/s 3.8350 KOps/s $\color{#d91a1a}-1.82\%$
test_unbind_speed_stack0 0.5317ms 0.2636ms 3.7933 KOps/s 3.9597 KOps/s $\color{#d91a1a}-4.20\%$
test_unbind_speed_stack1 0.1099s 0.7887ms 1.2679 KOps/s 1.4297 KOps/s $\textbf{\color{#d91a1a}-11.32\%}$
test_split 0.1202s 1.7349ms 576.3941 Ops/s 568.4599 Ops/s $\color{#35bf28}+1.40\%$
test_chunk 0.1118s 1.7361ms 575.9931 Ops/s 573.4761 Ops/s $\color{#35bf28}+0.44\%$
test_consolidate_njt[False-None] 11.2029ms 8.2072ms 121.8441 Ops/s 124.6948 Ops/s $\color{#d91a1a}-2.29\%$
test_creation[device0] 3.5816ms 93.9037μs 10.6492 KOps/s 10.7889 KOps/s $\color{#d91a1a}-1.30\%$
test_creation_from_tensor 0.2863ms 95.1506μs 10.5097 KOps/s 10.4875 KOps/s $\color{#35bf28}+0.21\%$
test_add_one[memmap_tensor0] 0.1611ms 4.9635μs 201.4717 KOps/s 207.2031 KOps/s $\color{#d91a1a}-2.77\%$
test_contiguous[memmap_tensor0] 40.1850μs 0.5235μs 1.9102 MOps/s 1.8354 MOps/s $\color{#35bf28}+4.07\%$
test_stack[memmap_tensor0] 28.1020μs 3.3857μs 295.3557 KOps/s 297.1902 KOps/s $\color{#d91a1a}-0.62\%$
test_memmaptd_index 0.9588ms 0.2416ms 4.1388 KOps/s 4.3122 KOps/s $\color{#d91a1a}-4.02\%$
test_memmaptd_index_astensor 0.7183ms 0.3241ms 3.0856 KOps/s 3.2498 KOps/s $\textbf{\color{#d91a1a}-5.05\%}$
test_memmaptd_index_op 0.9952ms 0.6010ms 1.6639 KOps/s 1.7078 KOps/s $\color{#d91a1a}-2.57\%$
test_serialize_model 0.1252s 0.1177s 8.4997 Ops/s 7.4606 Ops/s $\textbf{\color{#35bf28}+13.93\%}$
test_serialize_model_pickle 0.4450s 0.3965s 2.5222 Ops/s 2.5214 Ops/s $\color{#35bf28}+0.03\%$
test_serialize_weights 0.2592s 0.1334s 7.4974 Ops/s 8.6703 Ops/s $\textbf{\color{#d91a1a}-13.53\%}$
test_serialize_weights_returnearly 0.1787s 0.1605s 6.2287 Ops/s 6.2847 Ops/s $\color{#d91a1a}-0.89\%$
test_serialize_weights_pickle 0.4598s 0.3939s 2.5389 Ops/s 2.4511 Ops/s $\color{#35bf28}+3.58\%$
test_serialize_weights_filesystem 0.1566s 0.1463s 6.8363 Ops/s 6.9446 Ops/s $\color{#d91a1a}-1.56\%$
test_serialize_model_filesystem 0.1704s 0.1511s 6.6202 Ops/s 6.5421 Ops/s $\color{#35bf28}+1.19\%$
test_reshape_pytree 0.1032ms 27.8804μs 35.8674 KOps/s 37.6214 KOps/s $\color{#d91a1a}-4.66\%$
test_reshape_td 0.1172ms 32.6201μs 30.6559 KOps/s 30.7217 KOps/s $\color{#d91a1a}-0.21\%$
test_view_pytree 72.9670μs 27.4434μs 36.4386 KOps/s 37.3674 KOps/s $\color{#d91a1a}-2.49\%$
test_view_td 97.3930μs 38.5720μs 25.9256 KOps/s 27.1835 KOps/s $\color{#d91a1a}-4.63\%$
test_unbind_pytree 79.8490μs 30.5438μs 32.7399 KOps/s 34.3216 KOps/s $\color{#d91a1a}-4.61\%$
test_unbind_td 0.3331ms 39.4451μs 25.3517 KOps/s 26.4647 KOps/s $\color{#d91a1a}-4.21\%$
test_split_pytree 75.6420μs 30.1751μs 33.1399 KOps/s 34.1195 KOps/s $\color{#d91a1a}-2.87\%$
test_split_td 0.5476ms 43.7646μs 22.8495 KOps/s 22.7812 KOps/s $\color{#35bf28}+0.30\%$
test_add_pytree 93.2050μs 37.1238μs 26.9369 KOps/s 28.4705 KOps/s $\textbf{\color{#d91a1a}-5.39\%}$
test_add_td 0.1397ms 60.5533μs 16.5144 KOps/s 18.0152 KOps/s $\textbf{\color{#d91a1a}-8.33\%}$
test_compile_add_one_nested[tensordict-compile] 0.1283ms 63.2462μs 15.8112 KOps/s 16.1884 KOps/s $\color{#d91a1a}-2.33\%$
test_compile_add_one_nested[tensordict-eager] 0.4102ms 0.1630ms 6.1349 KOps/s 6.1442 KOps/s $\color{#d91a1a}-0.15\%$
test_compile_add_one_nested[pytree-compile] 0.1353ms 47.0607μs 21.2491 KOps/s 22.0737 KOps/s $\color{#d91a1a}-3.74\%$
test_compile_add_one_nested[pytree-eager] 0.2714ms 0.1207ms 8.2848 KOps/s 8.4742 KOps/s $\color{#d91a1a}-2.23\%$
test_compile_copy_nested[tensordict-compile] 81.2820μs 26.5069μs 37.7260 KOps/s 39.2819 KOps/s $\color{#d91a1a}-3.96\%$
test_compile_copy_nested[tensordict-eager] 0.1458ms 53.7538μs 18.6033 KOps/s 18.6513 KOps/s $\color{#d91a1a}-0.26\%$
test_compile_copy_nested[pytree-compile] 0.2182ms 82.2032μs 12.1650 KOps/s 12.6211 KOps/s $\color{#d91a1a}-3.61\%$
test_compile_copy_nested[pytree-eager] 0.1300ms 69.3000μs 14.4300 KOps/s 14.5654 KOps/s $\color{#d91a1a}-0.93\%$
test_compile_add_one_flat[tensordict-compile] 0.2263ms 0.1073ms 9.3229 KOps/s 9.4908 KOps/s $\color{#d91a1a}-1.77\%$
test_compile_add_one_flat[tensordict-eager] 0.3778ms 0.2063ms 4.8471 KOps/s 5.1044 KOps/s $\textbf{\color{#d91a1a}-5.04\%}$
test_compile_add_one_flat[tensorclass-compile] 0.1129ms 46.0416μs 21.7195 KOps/s 20.9039 KOps/s $\color{#35bf28}+3.90\%$
test_compile_add_one_flat[tensorclass-eager] 0.4976ms 63.0450μs 15.8617 KOps/s 16.4363 KOps/s $\color{#d91a1a}-3.50\%$
test_compile_add_one_flat[pytree-compile] 0.1878ms 0.1040ms 9.6114 KOps/s 9.7547 KOps/s $\color{#d91a1a}-1.47\%$
test_compile_add_one_flat[pytree-eager] 0.3678ms 0.2015ms 4.9625 KOps/s 4.9519 KOps/s $\color{#35bf28}+0.21\%$
test_compile_add_self_flat[tensordict-eager] 0.4625ms 0.2180ms 4.5877 KOps/s 4.8195 KOps/s $\color{#d91a1a}-4.81\%$
test_compile_add_self_flat[tensordict-compile] 0.1860ms 0.1083ms 9.2350 KOps/s 9.4625 KOps/s $\color{#d91a1a}-2.40\%$
test_compile_add_self_flat[tensorclass-eager] 0.1886ms 55.1376μs 18.1365 KOps/s 18.6782 KOps/s $\color{#d91a1a}-2.90\%$
test_compile_add_self_flat[tensorclass-compile] 0.1179ms 48.2648μs 20.7190 KOps/s 22.0454 KOps/s $\textbf{\color{#d91a1a}-6.02\%}$
test_compile_add_self_flat[pytree-eager] 0.6568ms 0.1595ms 6.2700 KOps/s 6.2871 KOps/s $\color{#d91a1a}-0.27\%$
test_compile_add_self_flat[pytree-compile] 0.1827ms 0.1054ms 9.4905 KOps/s 9.7655 KOps/s $\color{#d91a1a}-2.82\%$
test_compile_copy_flat[tensordict-compile] 58.8700μs 21.3237μs 46.8962 KOps/s 47.3159 KOps/s $\color{#d91a1a}-0.89\%$
test_compile_copy_flat[tensordict-eager] 0.1599ms 59.5415μs 16.7950 KOps/s 17.5390 KOps/s $\color{#d91a1a}-4.24\%$
test_compile_copy_flat[pytree-compile] 0.1579ms 82.9121μs 12.0610 KOps/s 12.2524 KOps/s $\color{#d91a1a}-1.56\%$
test_compile_copy_flat[pytree-eager] 0.1311ms 70.2728μs 14.2303 KOps/s 14.5802 KOps/s $\color{#d91a1a}-2.40\%$
test_compile_assign_and_add[tensordict-compile] 0.3087ms 0.2116ms 4.7263 KOps/s 4.8213 KOps/s $\color{#d91a1a}-1.97\%$
test_compile_assign_and_add[tensordict-eager] 1.5868ms 1.3034ms 767.2083 Ops/s 794.4541 Ops/s $\color{#d91a1a}-3.43\%$
test_compile_assign_and_add[pytree-compile] 0.3027ms 0.2039ms 4.9033 KOps/s 4.9213 KOps/s $\color{#d91a1a}-0.37\%$
test_compile_assign_and_add[pytree-eager] 0.9473ms 0.7736ms 1.2926 KOps/s 1.2959 KOps/s $\color{#d91a1a}-0.25\%$
test_compile_assign_and_add_stack[compile] 0.5696ms 0.4621ms 2.1640 KOps/s 2.1991 KOps/s $\color{#d91a1a}-1.59\%$
test_compile_assign_and_add_stack[eager] 2.9487ms 2.6541ms 376.7686 Ops/s 379.3485 Ops/s $\color{#d91a1a}-0.68\%$
test_compile_indexing[tensor-tensordict-compile] 0.1094ms 37.9441μs 26.3546 KOps/s 28.1154 KOps/s $\textbf{\color{#d91a1a}-6.26\%}$
test_compile_indexing[tensor-tensordict-eager] 0.5684ms 33.3265μs 30.0061 KOps/s 30.8751 KOps/s $\color{#d91a1a}-2.81\%$
test_compile_indexing[tensor-tensorclass-compile] 82.1530μs 29.7435μs 33.6208 KOps/s 34.8490 KOps/s $\color{#d91a1a}-3.52\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1155ms 24.3844μs 41.0098 KOps/s 43.8273 KOps/s $\textbf{\color{#d91a1a}-6.43\%}$
test_compile_indexing[tensor-pytree-compile] 0.1033ms 30.7950μs 32.4728 KOps/s 32.3024 KOps/s $\color{#35bf28}+0.53\%$
test_compile_indexing[tensor-pytree-eager] 64.9610μs 24.1856μs 41.3470 KOps/s 43.8769 KOps/s $\textbf{\color{#d91a1a}-5.77\%}$
test_compile_indexing[slice-tensordict-compile] 0.1189ms 53.0657μs 18.8445 KOps/s 19.2627 KOps/s $\color{#d91a1a}-2.17\%$
test_compile_indexing[slice-tensordict-eager] 0.5410ms 19.1616μs 52.1877 KOps/s 51.0555 KOps/s $\color{#35bf28}+2.22\%$
test_compile_indexing[slice-tensorclass-compile] 98.5340μs 45.2870μs 22.0814 KOps/s 22.4599 KOps/s $\color{#d91a1a}-1.69\%$
test_compile_indexing[slice-tensorclass-eager] 71.6650μs 18.9818μs 52.6821 KOps/s 53.4058 KOps/s $\color{#d91a1a}-1.36\%$
test_compile_indexing[slice-pytree-compile] 99.7670μs 46.1445μs 21.6711 KOps/s 22.3275 KOps/s $\color{#d91a1a}-2.94\%$
test_compile_indexing[slice-pytree-eager] 71.3630μs 19.1612μs 52.1889 KOps/s 53.1344 KOps/s $\color{#d91a1a}-1.78\%$
test_compile_indexing[int-tensordict-compile] 0.1520ms 54.3585μs 18.3964 KOps/s 18.8213 KOps/s $\color{#d91a1a}-2.26\%$
test_compile_indexing[int-tensordict-eager] 0.8831ms 19.4089μs 51.5228 KOps/s 51.5137 KOps/s $\color{#35bf28}+0.02\%$
test_compile_indexing[int-tensorclass-compile] 0.1076ms 45.8195μs 21.8248 KOps/s 21.7014 KOps/s $\color{#35bf28}+0.57\%$
test_compile_indexing[int-tensorclass-eager] 0.3051ms 19.1767μs 52.1465 KOps/s 53.7397 KOps/s $\color{#d91a1a}-2.96\%$
test_compile_indexing[int-pytree-compile] 0.1081ms 46.0496μs 21.7157 KOps/s 22.1139 KOps/s $\color{#d91a1a}-1.80\%$
test_compile_indexing[int-pytree-eager] 62.7880μs 19.0787μs 52.4146 KOps/s 53.6118 KOps/s $\color{#d91a1a}-2.23\%$
test_mod_add[eager] 75.5520μs 26.6887μs 37.4690 KOps/s 37.4828 KOps/s $\color{#d91a1a}-0.04\%$
test_mod_add[compile] 0.1488ms 45.9723μs 21.7522 KOps/s 22.1880 KOps/s $\color{#d91a1a}-1.96\%$
test_mod_add[compile-overhead] 94.8270μs 45.7028μs 21.8805 KOps/s 21.8757 KOps/s $\color{#35bf28}+0.02\%$
test_mod_wrap[eager] 0.4625ms 0.2146ms 4.6608 KOps/s 4.6552 KOps/s $\color{#35bf28}+0.12\%$
test_mod_wrap[compile] 1.8766ms 0.2060ms 4.8535 KOps/s 4.8925 KOps/s $\color{#d91a1a}-0.80\%$
test_mod_wrap[compile-overhead] 1.8639ms 0.2038ms 4.9076 KOps/s 4.9248 KOps/s $\color{#d91a1a}-0.35\%$
test_mod_wrap_and_backward[eager] 18.2272ms 12.0994ms 82.6485 Ops/s 79.1816 Ops/s $\color{#35bf28}+4.38\%$
test_mod_wrap_and_backward[compile] 17.3069ms 12.6939ms 78.7779 Ops/s 76.0960 Ops/s $\color{#35bf28}+3.52\%$
test_mod_wrap_and_backward[compile-overhead] 22.3787ms 12.5948ms 79.3981 Ops/s 73.1973 Ops/s $\textbf{\color{#35bf28}+8.47\%}$
test_seq_add[eager] 0.1807ms 90.7294μs 11.0218 KOps/s 10.8686 KOps/s $\color{#35bf28}+1.41\%$
test_seq_add[compile] 0.1447ms 61.6854μs 16.2113 KOps/s 16.1335 KOps/s $\color{#35bf28}+0.48\%$
test_seq_add[compile-overhead] 0.1284ms 59.6996μs 16.7505 KOps/s 16.4261 KOps/s $\color{#35bf28}+1.98\%$
test_seq_wrap[eager] 0.5343ms 0.3894ms 2.5681 KOps/s 2.4990 KOps/s $\color{#35bf28}+2.76\%$
test_seq_wrap[compile] 0.3790ms 0.2284ms 4.3785 KOps/s 4.3195 KOps/s $\color{#35bf28}+1.37\%$
test_seq_wrap[compile-overhead] 0.4615ms 0.2290ms 4.3677 KOps/s 4.3548 KOps/s $\color{#35bf28}+0.30\%$
test_func_call_runtime[False-eager] 0.9551ms 0.5670ms 1.7637 KOps/s 1.8288 KOps/s $\color{#d91a1a}-3.56\%$
test_func_call_runtime[False-compile] 0.8065ms 0.4270ms 2.3421 KOps/s 2.3144 KOps/s $\color{#35bf28}+1.20\%$
test_func_call_runtime[False-compile-overhead] 0.5453ms 0.4291ms 2.3304 KOps/s 2.3199 KOps/s $\color{#35bf28}+0.45\%$
test_func_call_runtime[True-eager] 1.4707ms 0.7702ms 1.2983 KOps/s 1.3266 KOps/s $\color{#d91a1a}-2.13\%$
test_func_call_runtime[True-compile] 0.6894ms 0.4696ms 2.1296 KOps/s 2.1183 KOps/s $\color{#35bf28}+0.53\%$
test_func_call_runtime[True-compile-overhead] 0.9957ms 0.4694ms 2.1302 KOps/s 2.1204 KOps/s $\color{#35bf28}+0.46\%$
test_func_call_cm_runtime[False-eager] 0.8823ms 0.5493ms 1.8206 KOps/s 1.8617 KOps/s $\color{#d91a1a}-2.21\%$
test_func_call_cm_runtime[False-compile] 0.6557ms 0.4266ms 2.3444 KOps/s 2.3365 KOps/s $\color{#35bf28}+0.34\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5684ms 0.4258ms 2.3484 KOps/s 2.3167 KOps/s $\color{#35bf28}+1.37\%$
test_func_call_cm_runtime[True-eager] 1.1177ms 0.9001ms 1.1110 KOps/s 1.1362 KOps/s $\color{#d91a1a}-2.22\%$
test_func_call_cm_runtime[True-compile] 0.6466ms 0.4927ms 2.0297 KOps/s 2.0068 KOps/s $\color{#35bf28}+1.14\%$
test_func_call_cm_runtime[True-compile-overhead] 0.6395ms 0.4949ms 2.0204 KOps/s 2.0252 KOps/s $\color{#d91a1a}-0.23\%$
test_vmap_func_call_cm_runtime[eager] 2.3872ms 1.8950ms 527.7053 Ops/s 525.7537 Ops/s $\color{#35bf28}+0.37\%$
test_vmap_func_call_cm_runtime[compile] 0.8736ms 0.5213ms 1.9182 KOps/s 1.9279 KOps/s $\color{#d91a1a}-0.51\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.7128ms 0.5182ms 1.9298 KOps/s 1.9197 KOps/s $\color{#35bf28}+0.53\%$
test_distributed 0.3696ms 0.1288ms 7.7658 KOps/s 7.7139 KOps/s $\color{#35bf28}+0.67\%$
test_tdmodule 84.0170μs 18.5201μs 53.9954 KOps/s 50.9784 KOps/s $\textbf{\color{#35bf28}+5.92\%}$
test_tdmodule_dispatch 58.5090μs 37.2233μs 26.8649 KOps/s 26.6808 KOps/s $\color{#35bf28}+0.69\%$
test_tdseq 46.5580μs 21.1325μs 47.3204 KOps/s 45.3442 KOps/s $\color{#35bf28}+4.36\%$
test_tdseq_dispatch 70.9230μs 42.5881μs 23.4808 KOps/s 23.2448 KOps/s $\color{#35bf28}+1.02\%$
test_instantiation_functorch 2.5090ms 1.5557ms 642.8087 Ops/s 660.1514 Ops/s $\color{#d91a1a}-2.63\%$
test_exec_functorch 0.3669ms 0.1799ms 5.5600 KOps/s 5.6494 KOps/s $\color{#d91a1a}-1.58\%$
test_exec_functional_call 0.3409ms 0.1758ms 5.6867 KOps/s 5.8365 KOps/s $\color{#d91a1a}-2.57\%$
test_exec_td_decorator 0.5763ms 0.2339ms 4.2762 KOps/s 4.4893 KOps/s $\color{#d91a1a}-4.75\%$
test_vmap_mlp_speed_decorator[True-True] 1.1868ms 0.6457ms 1.5486 KOps/s 1.5533 KOps/s $\color{#d91a1a}-0.30\%$
test_vmap_mlp_speed_decorator[True-False] 0.9064ms 0.6441ms 1.5525 KOps/s 1.5797 KOps/s $\color{#d91a1a}-1.72\%$
test_vmap_mlp_speed_decorator[False-True] 0.7548ms 0.5239ms 1.9088 KOps/s 1.9163 KOps/s $\color{#d91a1a}-0.39\%$
test_vmap_mlp_speed_decorator[False-False] 0.8183ms 0.5227ms 1.9130 KOps/s 1.9293 KOps/s $\color{#d91a1a}-0.85\%$
test_to_module_speed[True] 1.5795ms 1.3008ms 768.7465 Ops/s 781.8737 Ops/s $\color{#d91a1a}-1.68\%$
test_to_module_speed[False] 1.6952ms 1.2665ms 789.5959 Ops/s 805.6185 Ops/s $\color{#d91a1a}-1.99\%$
test_tc_init 76.1820μs 44.6762μs 22.3833 KOps/s 21.9063 KOps/s $\color{#35bf28}+2.18\%$
test_tc_init_nested 0.1636ms 86.0076μs 11.6269 KOps/s 11.0431 KOps/s $\textbf{\color{#35bf28}+5.29\%}$
test_tc_first_layer_tensor 49.7630μs 1.5039μs 664.9528 KOps/s 655.3203 KOps/s $\color{#35bf28}+1.47\%$
test_tc_first_layer_nontensor 42.0490μs 4.6881μs 213.3054 KOps/s 210.7062 KOps/s $\color{#35bf28}+1.23\%$
test_tc_second_layer_tensor 48.3910μs 2.7622μs 362.0341 KOps/s 355.4126 KOps/s $\color{#35bf28}+1.86\%$
test_tc_second_layer_nontensor 48.8820μs 5.9709μs 167.4794 KOps/s 165.5854 KOps/s $\color{#35bf28}+1.14\%$
test_unbind 0.2405s 13.6172ms 73.4363 Ops/s 81.2529 Ops/s $\textbf{\color{#d91a1a}-9.62\%}$
test_full_like 9.1469ms 8.0832ms 123.7127 Ops/s 120.6627 Ops/s $\color{#35bf28}+2.53\%$
test_zeros_like 4.7666ms 3.2053ms 311.9855 Ops/s 128.2216 Ops/s $\textbf{\color{#35bf28}+143.32\%}$
test_ones_like 4.5179ms 3.5896ms 278.5789 Ops/s 124.4009 Ops/s $\textbf{\color{#35bf28}+123.94\%}$
test_clone 7.3750ms 5.8459ms 171.0593 Ops/s 102.2872 Ops/s $\textbf{\color{#35bf28}+67.23\%}$
test_squeeze 61.6950μs 11.8120μs 84.6595 KOps/s 81.2878 KOps/s $\color{#35bf28}+4.15\%$
test_unsqueeze 0.3777ms 89.3514μs 11.1918 KOps/s 11.4862 KOps/s $\color{#d91a1a}-2.56\%$
test_split 0.3142ms 0.1875ms 5.3322 KOps/s 5.3882 KOps/s $\color{#d91a1a}-1.04\%$
test_permute 0.3364ms 0.2170ms 4.6092 KOps/s 4.5729 KOps/s $\color{#35bf28}+0.79\%$
test_stack 33.3708ms 27.0305ms 36.9952 Ops/s 38.3619 Ops/s $\color{#d91a1a}-3.56\%$
test_cat 29.6924ms 26.6155ms 37.5721 Ops/s 38.3826 Ops/s $\color{#d91a1a}-2.11\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}19$. Worsened: $\large\color{#d91a1a}17$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 33.7700μs 10.1066μs 98.9451 KOps/s 101.0562 KOps/s $\color{#d91a1a}-2.09\%$
test_plain_set_stack_nested 41.1710μs 10.1560μs 98.4638 KOps/s 100.6353 KOps/s $\color{#d91a1a}-2.16\%$
test_plain_set_nested_inplace 53.2510μs 10.8981μs 91.7593 KOps/s 93.0856 KOps/s $\color{#d91a1a}-1.42\%$
test_plain_set_stack_nested_inplace 48.4510μs 10.9557μs 91.2766 KOps/s 92.3535 KOps/s $\color{#d91a1a}-1.17\%$
test_items 44.5010μs 2.8592μs 349.7440 KOps/s 347.4538 KOps/s $\color{#35bf28}+0.66\%$
test_items_nested 0.3787ms 0.3206ms 3.1191 KOps/s 3.1513 KOps/s $\color{#d91a1a}-1.02\%$
test_items_nested_locked 0.4177ms 0.3228ms 3.0980 KOps/s 3.1230 KOps/s $\color{#d91a1a}-0.80\%$
test_items_nested_leaf 89.9810μs 58.1363μs 17.2010 KOps/s 17.3623 KOps/s $\color{#d91a1a}-0.93\%$
test_items_stack_nested 0.3686ms 0.3236ms 3.0901 KOps/s 3.1432 KOps/s $\color{#d91a1a}-1.69\%$
test_items_stack_nested_leaf 94.3920μs 58.8688μs 16.9869 KOps/s 16.8466 KOps/s $\color{#35bf28}+0.83\%$
test_items_stack_nested_locked 0.4624ms 0.3251ms 3.0763 KOps/s 3.1125 KOps/s $\color{#d91a1a}-1.16\%$
test_keys 42.7410μs 3.7452μs 267.0094 KOps/s 289.5231 KOps/s $\textbf{\color{#d91a1a}-7.78\%}$
test_keys_nested 0.1163ms 70.0365μs 14.2783 KOps/s 14.4842 KOps/s $\color{#d91a1a}-1.42\%$
test_keys_nested_locked 0.7232ms 74.9271μs 13.3463 KOps/s 13.3140 KOps/s $\color{#35bf28}+0.24\%$
test_keys_nested_leaf 0.1046ms 60.7757μs 16.4540 KOps/s 16.4606 KOps/s $\color{#d91a1a}-0.04\%$
test_keys_stack_nested 0.1377ms 70.2078μs 14.2434 KOps/s 14.2143 KOps/s $\color{#35bf28}+0.20\%$
test_keys_stack_nested_leaf 0.1081ms 61.5763μs 16.2400 KOps/s 16.2337 KOps/s $\color{#35bf28}+0.04\%$
test_keys_stack_nested_locked 0.1138ms 75.1316μs 13.3100 KOps/s 13.2908 KOps/s $\color{#35bf28}+0.14\%$
test_values 5.5035μs 0.8453μs 1.1831 MOps/s 1.1904 MOps/s $\color{#d91a1a}-0.62\%$
test_values_nested 63.7620μs 31.2351μs 32.0153 KOps/s 32.4039 KOps/s $\color{#d91a1a}-1.20\%$
test_values_nested_locked 67.1710μs 32.6379μs 30.6392 KOps/s 30.9784 KOps/s $\color{#d91a1a}-1.10\%$
test_values_nested_leaf 83.5810μs 33.6773μs 29.6936 KOps/s 30.0512 KOps/s $\color{#d91a1a}-1.19\%$
test_values_stack_nested 67.0410μs 31.8077μs 31.4390 KOps/s 31.7513 KOps/s $\color{#d91a1a}-0.98\%$
test_values_stack_nested_leaf 63.0810μs 35.3078μs 28.3224 KOps/s 29.4322 KOps/s $\color{#d91a1a}-3.77\%$
test_values_stack_nested_locked 80.4710μs 33.1708μs 30.1470 KOps/s 30.3064 KOps/s $\color{#d91a1a}-0.53\%$
test_membership 1.8780μs 0.5073μs 1.9712 MOps/s 1.9897 MOps/s $\color{#d91a1a}-0.93\%$
test_membership_nested 11.5703μs 1.7779μs 562.4669 KOps/s 540.1189 KOps/s $\color{#35bf28}+4.14\%$
test_membership_nested_leaf 11.7500μs 1.8053μs 553.9223 KOps/s 550.2419 KOps/s $\color{#35bf28}+0.67\%$
test_membership_stacked_nested 30.6200μs 1.9014μs 525.9375 KOps/s 516.4041 KOps/s $\color{#35bf28}+1.85\%$
test_membership_stacked_nested_leaf 30.0010μs 1.9067μs 524.4798 KOps/s 520.8034 KOps/s $\color{#35bf28}+0.71\%$
test_membership_nested_last 41.2400μs 2.7150μs 368.3245 KOps/s 365.5284 KOps/s $\color{#35bf28}+0.76\%$
test_membership_nested_leaf_last 27.8510μs 2.7227μs 367.2857 KOps/s 365.7732 KOps/s $\color{#35bf28}+0.41\%$
test_membership_stacked_nested_last 39.4810μs 2.7240μs 367.1038 KOps/s 266.6801 KOps/s $\textbf{\color{#35bf28}+37.66\%}$
test_membership_stacked_nested_leaf_last 41.4210μs 2.7625μs 361.9968 KOps/s 271.7149 KOps/s $\textbf{\color{#35bf28}+33.23\%}$
test_nested_getleaf 38.4410μs 5.9518μs 168.0175 KOps/s 167.5508 KOps/s $\color{#35bf28}+0.28\%$
test_nested_get 45.8610μs 5.6856μs 175.8818 KOps/s 175.6776 KOps/s $\color{#35bf28}+0.12\%$
test_stacked_getleaf 34.9400μs 5.9824μs 167.1567 KOps/s 168.6540 KOps/s $\color{#d91a1a}-0.89\%$
test_stacked_get 49.8710μs 5.6635μs 176.5702 KOps/s 176.7370 KOps/s $\color{#d91a1a}-0.09\%$
test_nested_getitemleaf 32.9110μs 6.0170μs 166.1963 KOps/s 164.4237 KOps/s $\color{#35bf28}+1.08\%$
test_nested_getitem 46.5810μs 5.7446μs 174.0769 KOps/s 174.9893 KOps/s $\color{#d91a1a}-0.52\%$
test_stacked_getitemleaf 25.5310μs 6.0491μs 165.3145 KOps/s 166.3183 KOps/s $\color{#d91a1a}-0.60\%$
test_stacked_getitem 38.8410μs 5.7674μs 173.3892 KOps/s 175.6400 KOps/s $\color{#d91a1a}-1.28\%$
test_lock_nested 9.1854ms 0.3688ms 2.7115 KOps/s 2.7840 KOps/s $\color{#d91a1a}-2.61\%$
test_lock_stack_nested 0.3737ms 0.3287ms 3.0426 KOps/s 3.0875 KOps/s $\color{#d91a1a}-1.46\%$
test_unlock_nested 0.6788ms 0.2994ms 3.3399 KOps/s 3.3753 KOps/s $\color{#d91a1a}-1.05\%$
test_unlock_stack_nested 0.3119ms 0.2695ms 3.7107 KOps/s 3.8077 KOps/s $\color{#d91a1a}-2.55\%$
test_flatten_speed 0.1112ms 73.9062μs 13.5307 KOps/s 13.8540 KOps/s $\color{#d91a1a}-2.33\%$
test_unflatten_speed 0.3456ms 0.2911ms 3.4351 KOps/s 3.5065 KOps/s $\color{#d91a1a}-2.04\%$
test_common_ops 1.7800ms 0.5606ms 1.7837 KOps/s 1.8316 KOps/s $\color{#d91a1a}-2.62\%$
test_creation 0.1666ms 1.4158μs 706.2920 KOps/s 717.2876 KOps/s $\color{#d91a1a}-1.53\%$
test_creation_empty 41.6310μs 6.3866μs 156.5779 KOps/s 166.5433 KOps/s $\textbf{\color{#d91a1a}-5.98\%}$
test_creation_nested_1 34.5700μs 7.7211μs 129.5150 KOps/s 136.5119 KOps/s $\textbf{\color{#d91a1a}-5.13\%}$
test_creation_nested_2 36.0910μs 10.2626μs 97.4412 KOps/s 102.3517 KOps/s $\color{#d91a1a}-4.80\%$
test_clone 72.5710μs 10.2203μs 97.8446 KOps/s 96.0197 KOps/s $\color{#35bf28}+1.90\%$
test_getitem[int] 92.1308ms 15.4318μs 64.8013 KOps/s 99.6048 KOps/s $\textbf{\color{#d91a1a}-34.94\%}$
test_getitem[slice_int] 0.1090ms 19.6300μs 50.9424 KOps/s 52.3854 KOps/s $\color{#d91a1a}-2.75\%$
test_getitem[range] 0.1315ms 35.1260μs 28.4689 KOps/s 28.0798 KOps/s $\color{#35bf28}+1.39\%$
test_getitem[tuple] 0.1041ms 17.0893μs 58.5160 KOps/s 59.0331 KOps/s $\color{#d91a1a}-0.88\%$
test_getitem[list] 0.2648ms 31.9446μs 31.3042 KOps/s 31.1617 KOps/s $\color{#35bf28}+0.46\%$
test_setitem_dim[int] 38.9310μs 18.3564μs 54.4770 KOps/s 56.3097 KOps/s $\color{#d91a1a}-3.25\%$
test_setitem_dim[slice_int] 65.5120μs 36.1571μs 27.6571 KOps/s 27.9980 KOps/s $\color{#d91a1a}-1.22\%$
test_setitem_dim[range] 72.5110μs 51.4655μs 19.4305 KOps/s 20.1583 KOps/s $\color{#d91a1a}-3.61\%$
test_setitem_dim[tuple] 71.4120μs 31.0135μs 32.2440 KOps/s 34.0109 KOps/s $\textbf{\color{#d91a1a}-5.20\%}$
test_setitem 92.6620μs 13.9533μs 71.6677 KOps/s 74.5719 KOps/s $\color{#d91a1a}-3.89\%$
test_set 94.7320μs 13.2699μs 75.3583 KOps/s 76.0947 KOps/s $\color{#d91a1a}-0.97\%$
test_set_shared 1.6675ms 0.1439ms 6.9503 KOps/s 6.9527 KOps/s $\color{#d91a1a}-0.04\%$
test_update 1.0601ms 15.4911μs 64.5532 KOps/s 67.0900 KOps/s $\color{#d91a1a}-3.78\%$
test_update_nested 0.1074ms 20.4441μs 48.9139 KOps/s 51.6661 KOps/s $\textbf{\color{#d91a1a}-5.33\%}$
test_update__nested 0.2426ms 23.5628μs 42.4398 KOps/s 42.1216 KOps/s $\color{#35bf28}+0.76\%$
test_set_nested 94.8820μs 14.4033μs 69.4284 KOps/s 70.2609 KOps/s $\color{#d91a1a}-1.18\%$
test_set_nested_new 73.1720μs 16.6177μs 60.1767 KOps/s 61.0746 KOps/s $\color{#d91a1a}-1.47\%$
test_select 95.9720μs 27.1065μs 36.8916 KOps/s 35.6940 KOps/s $\color{#35bf28}+3.36\%$
test_select_nested 0.1024ms 40.8520μs 24.4786 KOps/s 24.6299 KOps/s $\color{#d91a1a}-0.61\%$
test_exclude_nested 89.4820μs 57.1335μs 17.5029 KOps/s 17.7630 KOps/s $\color{#d91a1a}-1.46\%$
test_empty[True] 0.2888ms 0.2532ms 3.9488 KOps/s 4.0174 KOps/s $\color{#d91a1a}-1.71\%$
test_empty[False] 3.3971μs 0.7448μs 1.3426 MOps/s 1.3581 MOps/s $\color{#d91a1a}-1.13\%$
test_to 83.2620μs 53.5840μs 18.6623 KOps/s 18.6747 KOps/s $\color{#d91a1a}-0.07\%$
test_to_nonblocking 92.0620μs 43.3175μs 23.0854 KOps/s 22.2782 KOps/s $\color{#35bf28}+3.62\%$
test_unbind_speed 0.2539ms 0.2259ms 4.4271 KOps/s 4.4603 KOps/s $\color{#d91a1a}-0.74\%$
test_unbind_speed_stack0 0.2826ms 0.2281ms 4.3849 KOps/s 4.4107 KOps/s $\color{#d91a1a}-0.58\%$
test_unbind_speed_stack1 92.6684ms 0.6476ms 1.5442 KOps/s 1.7189 KOps/s $\textbf{\color{#d91a1a}-10.16\%}$
test_split 93.1191ms 1.5380ms 650.1805 Ops/s 643.6402 Ops/s $\color{#35bf28}+1.02\%$
test_chunk 97.0212ms 1.7226ms 580.5201 Ops/s 645.9870 Ops/s $\textbf{\color{#d91a1a}-10.13\%}$
test_consolidate[False-None] 2.6107ms 2.5385ms 393.9388 Ops/s 357.9407 Ops/s $\textbf{\color{#35bf28}+10.06\%}$
test_consolidate[default-None] 1.7440ms 1.6610ms 602.0390 Ops/s 609.9128 Ops/s $\color{#d91a1a}-1.29\%$
test_consolidate[reduce-overhead-None] 1.7598ms 1.6936ms 590.4553 Ops/s 606.4998 Ops/s $\color{#d91a1a}-2.65\%$
test_consolidate_njt[False-None] 6.4892ms 6.4046ms 156.1378 Ops/s 156.1502 Ops/s $-0.01\%$
test_to[False-False-None] 1.7672ms 1.7005ms 588.0726 Ops/s 584.4730 Ops/s $\color{#35bf28}+0.62\%$
test_to[True-False-None] 1.4416ms 1.2424ms 804.8917 Ops/s 776.1304 Ops/s $\color{#35bf28}+3.71\%$
test_to[within-False-None] 0.2957s 5.0348ms 198.6165 Ops/s 253.3865 Ops/s $\textbf{\color{#d91a1a}-21.62\%}$
test_to[True-default-None] 5.1825ms 4.9154ms 203.4426 Ops/s 192.3786 Ops/s $\textbf{\color{#35bf28}+5.75\%}$
test_to_njt[False-False-None] 7.1969ms 6.8704ms 145.5515 Ops/s 147.0774 Ops/s $\color{#d91a1a}-1.04\%$
test_to_njt[True-False-None] 5.5948ms 5.2732ms 189.6384 Ops/s 183.6468 Ops/s $\color{#35bf28}+3.26\%$
test_to_njt[within-False-None] 12.0097ms 11.8786ms 84.1851 Ops/s 85.1511 Ops/s $\color{#d91a1a}-1.13\%$
test_creation[device0] 0.5632ms 80.0428μs 12.4933 KOps/s 12.8936 KOps/s $\color{#d91a1a}-3.10\%$
test_creation_from_tensor 0.6142ms 83.9002μs 11.9189 KOps/s 12.1108 KOps/s $\color{#d91a1a}-1.58\%$
test_add_one[memmap_tensor0] 0.4360ms 6.6246μs 150.9530 KOps/s 149.4076 KOps/s $\color{#35bf28}+1.03\%$
test_contiguous[memmap_tensor0] 1.6720μs 0.3898μs 2.5653 MOps/s 2.5177 MOps/s $\color{#35bf28}+1.89\%$
test_stack[memmap_tensor0] 26.5410μs 4.3274μs 231.0873 KOps/s 236.6312 KOps/s $\color{#d91a1a}-2.34\%$
test_memmaptd_index 2.0513ms 0.2397ms 4.1719 KOps/s 4.1937 KOps/s $\color{#d91a1a}-0.52\%$
test_memmaptd_index_astensor 0.7901ms 0.2962ms 3.3764 KOps/s 3.4533 KOps/s $\color{#d91a1a}-2.23\%$
test_memmaptd_index_op 0.9592ms 0.5463ms 1.8304 KOps/s 1.8653 KOps/s $\color{#d91a1a}-1.87\%$
test_serialize_model 0.1310s 0.1296s 7.7136 Ops/s 7.6845 Ops/s $\color{#35bf28}+0.38\%$
test_serialize_model_pickle 1.3494s 1.1904s 0.8401 Ops/s 0.8255 Ops/s $\color{#35bf28}+1.76\%$
test_serialize_weights 0.1310s 0.1298s 7.7015 Ops/s 7.7440 Ops/s $\color{#d91a1a}-0.55\%$
test_serialize_weights_returnearly 49.6778ms 40.2118ms 24.8683 Ops/s 14.9409 Ops/s $\textbf{\color{#35bf28}+66.44\%}$
test_serialize_weights_pickle 1.3480s 1.1837s 0.8448 Ops/s 0.8212 Ops/s $\color{#35bf28}+2.87\%$
test_reshape_pytree 71.1320μs 21.2007μs 47.1683 KOps/s 44.9407 KOps/s $\color{#35bf28}+4.96\%$
test_reshape_td 49.2310μs 25.8565μs 38.6750 KOps/s 38.3621 KOps/s $\color{#35bf28}+0.82\%$
test_view_pytree 47.9510μs 21.3969μs 46.7357 KOps/s 45.7724 KOps/s $\color{#35bf28}+2.10\%$
test_view_td 59.4010μs 28.2227μs 35.4325 KOps/s 31.9766 KOps/s $\textbf{\color{#35bf28}+10.81\%}$
test_unbind_pytree 56.0810μs 27.1252μs 36.8661 KOps/s 35.9667 KOps/s $\color{#35bf28}+2.50\%$
test_unbind_td 0.5322ms 34.2025μs 29.2376 KOps/s 29.0629 KOps/s $\color{#35bf28}+0.60\%$
test_split_pytree 57.7810μs 29.9670μs 33.3700 KOps/s 33.9585 KOps/s $\color{#d91a1a}-1.73\%$
test_split_td 0.1642ms 36.6405μs 27.2922 KOps/s 26.6352 KOps/s $\color{#35bf28}+2.47\%$
test_add_pytree 69.1710μs 34.3185μs 29.1388 KOps/s 28.8915 KOps/s $\color{#35bf28}+0.86\%$
test_add_td 0.1348ms 44.5208μs 22.4614 KOps/s 24.0436 KOps/s $\textbf{\color{#d91a1a}-6.58\%}$
test_compile_add_one_nested[tensordict-compile] 0.1710ms 0.1177ms 8.4928 KOps/s 8.3041 KOps/s $\color{#35bf28}+2.27\%$
test_compile_add_one_nested[tensordict-eager] 0.2142ms 0.1214ms 8.2349 KOps/s 8.0449 KOps/s $\color{#35bf28}+2.36\%$
test_compile_add_one_nested[pytree-compile] 0.1430ms 94.6001μs 10.5708 KOps/s 10.3929 KOps/s $\color{#35bf28}+1.71\%$
test_compile_add_one_nested[pytree-eager] 1.1621ms 0.1489ms 6.7172 KOps/s 6.6704 KOps/s $\color{#35bf28}+0.70\%$
test_compile_copy_nested[tensordict-compile] 0.2298ms 24.4313μs 40.9311 KOps/s 44.5347 KOps/s $\textbf{\color{#d91a1a}-8.09\%}$
test_compile_copy_nested[tensordict-eager] 0.1032ms 26.3268μs 37.9842 KOps/s 37.9864 KOps/s $-0.01\%$
test_compile_copy_nested[pytree-compile] 1.3677ms 63.7287μs 15.6915 KOps/s 15.4374 KOps/s $\color{#35bf28}+1.65\%$
test_compile_copy_nested[pytree-eager] 81.1110μs 49.1636μs 20.3402 KOps/s 19.9244 KOps/s $\color{#35bf28}+2.09\%$
test_compile_add_one_flat[tensordict-compile] 0.2068ms 0.1405ms 7.1176 KOps/s 7.1096 KOps/s $\color{#35bf28}+0.11\%$
test_compile_add_one_flat[tensordict-eager] 0.2957ms 0.2055ms 4.8671 KOps/s 4.8373 KOps/s $\color{#35bf28}+0.62\%$
test_compile_add_one_flat[tensorclass-compile] 0.1337ms 96.6202μs 10.3498 KOps/s 10.4108 KOps/s $\color{#d91a1a}-0.59\%$
test_compile_add_one_flat[tensorclass-eager] 0.1028ms 49.6590μs 20.1373 KOps/s 20.3735 KOps/s $\color{#d91a1a}-1.16\%$
test_compile_add_one_flat[pytree-compile] 0.1814ms 0.1338ms 7.4761 KOps/s 7.4102 KOps/s $\color{#35bf28}+0.89\%$
test_compile_add_one_flat[pytree-eager] 0.5297ms 0.4821ms 2.0743 KOps/s 2.0434 KOps/s $\color{#35bf28}+1.51\%$
test_compile_add_self_flat[tensordict-eager] 0.3687ms 0.2463ms 4.0595 KOps/s 4.0642 KOps/s $\color{#d91a1a}-0.11\%$
test_compile_add_self_flat[tensordict-compile] 0.2899ms 0.1516ms 6.5980 KOps/s 7.1468 KOps/s $\textbf{\color{#d91a1a}-7.68\%}$
test_compile_add_self_flat[tensorclass-eager] 0.1810ms 61.6208μs 16.2283 KOps/s 16.8397 KOps/s $\color{#d91a1a}-3.63\%$
test_compile_add_self_flat[tensorclass-compile] 0.5153ms 0.1029ms 9.7140 KOps/s 10.4691 KOps/s $\textbf{\color{#d91a1a}-7.21\%}$
test_compile_add_self_flat[pytree-eager] 0.5745ms 0.4240ms 2.3585 KOps/s 2.4100 KOps/s $\color{#d91a1a}-2.13\%$
test_compile_add_self_flat[pytree-compile] 0.2030ms 0.1392ms 7.1860 KOps/s 7.4602 KOps/s $\color{#d91a1a}-3.68\%$
test_compile_copy_flat[tensordict-compile] 67.5120μs 18.0107μs 55.5225 KOps/s 54.4784 KOps/s $\color{#35bf28}+1.92\%$
test_compile_copy_flat[tensordict-eager] 80.2710μs 26.6597μs 37.5098 KOps/s 37.0631 KOps/s $\color{#35bf28}+1.21\%$
test_compile_copy_flat[pytree-compile] 0.1041ms 69.7072μs 14.3457 KOps/s 14.4362 KOps/s $\color{#d91a1a}-0.63\%$
test_compile_copy_flat[pytree-eager] 94.0210μs 52.0582μs 19.2093 KOps/s 19.2692 KOps/s $\color{#d91a1a}-0.31\%$
test_compile_assign_and_add[tensordict-compile] 1.6084ms 0.3876ms 2.5802 KOps/s 2.2006 KOps/s $\textbf{\color{#35bf28}+17.25\%}$
test_compile_assign_and_add[tensordict-eager] 2.6655ms 2.5921ms 385.7814 Ops/s 381.5329 Ops/s $\color{#35bf28}+1.11\%$
test_compile_assign_and_add[pytree-compile] 1.5827ms 0.3775ms 2.6487 KOps/s 2.2948 KOps/s $\textbf{\color{#35bf28}+15.42\%}$
test_compile_assign_and_add[pytree-eager] 2.7870ms 2.6528ms 376.9564 Ops/s 373.6686 Ops/s $\color{#35bf28}+0.88\%$
test_compile_indexing[tensor-tensordict-compile] 0.1730ms 0.1124ms 8.8953 KOps/s 8.6837 KOps/s $\color{#35bf28}+2.44\%$
test_compile_indexing[tensor-tensordict-eager] 0.5560ms 76.8015μs 13.0206 KOps/s 12.8608 KOps/s $\color{#35bf28}+1.24\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1962ms 0.1059ms 9.4425 KOps/s 9.4586 KOps/s $\color{#d91a1a}-0.17\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1122ms 66.9019μs 14.9473 KOps/s 14.0386 KOps/s $\textbf{\color{#35bf28}+6.47\%}$
test_compile_indexing[tensor-pytree-compile] 0.1655ms 0.1057ms 9.4595 KOps/s 8.9776 KOps/s $\textbf{\color{#35bf28}+5.37\%}$
test_compile_indexing[tensor-pytree-eager] 0.1071ms 67.1725μs 14.8870 KOps/s 14.2445 KOps/s $\color{#35bf28}+4.51\%$
test_compile_indexing[slice-tensordict-compile] 0.1476ms 99.4319μs 10.0571 KOps/s 10.1378 KOps/s $\color{#d91a1a}-0.80\%$
test_compile_indexing[slice-tensordict-eager] 0.1448ms 16.3101μs 61.3115 KOps/s 59.2490 KOps/s $\color{#35bf28}+3.48\%$
test_compile_indexing[slice-tensorclass-compile] 0.1396ms 94.3261μs 10.6015 KOps/s 10.6285 KOps/s $\color{#d91a1a}-0.25\%$
test_compile_indexing[slice-tensorclass-eager] 51.9510μs 15.2662μs 65.5042 KOps/s 64.0622 KOps/s $\color{#35bf28}+2.25\%$
test_compile_indexing[slice-pytree-compile] 0.1414ms 94.4488μs 10.5877 KOps/s 10.4439 KOps/s $\color{#35bf28}+1.38\%$
test_compile_indexing[slice-pytree-eager] 43.4910μs 15.1334μs 66.0791 KOps/s 56.8154 KOps/s $\textbf{\color{#35bf28}+16.30\%}$
test_compile_indexing[int-tensordict-compile] 0.1503ms 99.1855μs 10.0821 KOps/s 10.0427 KOps/s $\color{#35bf28}+0.39\%$
test_compile_indexing[int-tensordict-eager] 0.6007ms 16.1443μs 61.9414 KOps/s 60.3703 KOps/s $\color{#35bf28}+2.60\%$
test_compile_indexing[int-tensorclass-compile] 0.2003ms 94.6336μs 10.5671 KOps/s 10.1301 KOps/s $\color{#35bf28}+4.31\%$
test_compile_indexing[int-tensorclass-eager] 43.2610μs 15.2659μs 65.5054 KOps/s 64.5419 KOps/s $\color{#35bf28}+1.49\%$
test_compile_indexing[int-pytree-compile] 0.1403ms 94.4642μs 10.5860 KOps/s 10.5056 KOps/s $\color{#35bf28}+0.77\%$
test_compile_indexing[int-pytree-eager] 44.4300μs 15.2955μs 65.3789 KOps/s 64.3759 KOps/s $\color{#35bf28}+1.56\%$
test_mod_add[eager] 60.2420μs 29.4072μs 34.0052 KOps/s 33.9218 KOps/s $\color{#35bf28}+0.25\%$
test_mod_add[compile] 0.1258ms 77.6735μs 12.8744 KOps/s 12.8150 KOps/s $\color{#35bf28}+0.46\%$
test_mod_add[compile-overhead] 0.3139ms 0.1621ms 6.1689 KOps/s 5.7009 KOps/s $\textbf{\color{#35bf28}+8.21\%}$
test_mod_wrap[eager] 0.3237ms 0.2481ms 4.0310 KOps/s 4.1713 KOps/s $\color{#d91a1a}-3.36\%$
test_mod_wrap[compile] 1.5909ms 0.2793ms 3.5805 KOps/s 3.5750 KOps/s $\color{#35bf28}+0.15\%$
test_mod_wrap[compile-overhead] 7.2562ms 3.7909ms 263.7881 Ops/s 261.3155 Ops/s $\color{#35bf28}+0.95\%$
test_mod_wrap_and_backward[eager] 1.4743ms 1.3482ms 741.7352 Ops/s 687.9240 Ops/s $\textbf{\color{#35bf28}+7.82\%}$
test_mod_wrap_and_backward[compile] 1.3789ms 1.2443ms 803.6745 Ops/s 735.6997 Ops/s $\textbf{\color{#35bf28}+9.24\%}$
test_mod_wrap_and_backward[compile-overhead] 1.5100ms 1.0160ms 984.2035 Ops/s 978.0523 Ops/s $\color{#35bf28}+0.63\%$
test_seq_add[eager] 0.1375ms 92.7043μs 10.7870 KOps/s 10.5315 KOps/s $\color{#35bf28}+2.43\%$
test_seq_add[compile] 0.2097ms 84.9248μs 11.7751 KOps/s 11.3308 KOps/s $\color{#35bf28}+3.92\%$
test_seq_add[compile-overhead] 0.1641ms 0.1261ms 7.9275 KOps/s 7.9660 KOps/s $\color{#d91a1a}-0.48\%$
test_seq_wrap[eager] 0.4306ms 0.3674ms 2.7218 KOps/s 2.7028 KOps/s $\color{#35bf28}+0.70\%$
test_seq_wrap[compile] 0.3920ms 0.2940ms 3.4017 KOps/s 3.2604 KOps/s $\color{#35bf28}+4.33\%$
test_seq_wrap[compile-overhead] 0.2617ms 0.2190ms 4.5671 KOps/s 4.6144 KOps/s $\color{#d91a1a}-1.03\%$
test_func_call_runtime[False-eager] 0.8388ms 0.7382ms 1.3547 KOps/s 1.3514 KOps/s $\color{#35bf28}+0.25\%$
test_func_call_runtime[False-compile] 0.9248ms 0.7274ms 1.3747 KOps/s 1.3595 KOps/s $\color{#35bf28}+1.12\%$
test_func_call_runtime[False-compile-overhead] 0.3960ms 0.3524ms 2.8378 KOps/s 2.7802 KOps/s $\color{#35bf28}+2.07\%$
test_func_call_runtime[True-eager] 0.9521ms 0.8938ms 1.1188 KOps/s 1.0958 KOps/s $\color{#35bf28}+2.10\%$
test_func_call_runtime[True-compile] 0.8714ms 0.7482ms 1.3366 KOps/s 1.3147 KOps/s $\color{#35bf28}+1.67\%$
test_func_call_runtime[True-compile-overhead] 0.4446ms 0.3741ms 2.6733 KOps/s 2.6772 KOps/s $\color{#d91a1a}-0.15\%$
test_func_call_cm_runtime[False-eager] 0.8392ms 0.7552ms 1.3242 KOps/s 1.3611 KOps/s $\color{#d91a1a}-2.72\%$
test_func_call_cm_runtime[False-compile] 0.9595ms 0.7260ms 1.3774 KOps/s 1.3372 KOps/s $\color{#35bf28}+3.00\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4012ms 0.3540ms 2.8248 KOps/s 2.8155 KOps/s $\color{#35bf28}+0.33\%$
test_func_call_cm_runtime[True-eager] 1.1269ms 0.9920ms 1.0080 KOps/s 992.9092 Ops/s $\color{#35bf28}+1.52\%$
test_func_call_cm_runtime[True-compile] 0.8244ms 0.7741ms 1.2918 KOps/s 1.2904 KOps/s $\color{#35bf28}+0.11\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4688ms 0.3981ms 2.5117 KOps/s 2.5039 KOps/s $\color{#35bf28}+0.31\%$
test_vmap_func_call_cm_runtime[eager] 2.5266ms 2.0734ms 482.2958 Ops/s 483.5957 Ops/s $\color{#d91a1a}-0.27\%$
test_vmap_func_call_cm_runtime[compile] 0.9173ms 0.8110ms 1.2331 KOps/s 1.2452 KOps/s $\color{#d91a1a}-0.97\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4493ms 0.4033ms 2.4797 KOps/s 2.4285 KOps/s $\color{#35bf28}+2.11\%$
test_distributed 2.0639ms 0.2034ms 4.9164 KOps/s 8.7577 KOps/s $\textbf{\color{#d91a1a}-43.86\%}$
test_tdmodule 49.1010μs 12.6857μs 78.8290 KOps/s 73.2409 KOps/s $\textbf{\color{#35bf28}+7.63\%}$
test_tdmodule_dispatch 45.5610μs 25.0117μs 39.9812 KOps/s 39.2085 KOps/s $\color{#35bf28}+1.97\%$
test_tdseq 33.6200μs 14.2553μs 70.1494 KOps/s 65.4772 KOps/s $\textbf{\color{#35bf28}+7.14\%}$
test_tdseq_dispatch 47.6610μs 28.3326μs 35.2951 KOps/s 34.3043 KOps/s $\color{#35bf28}+2.89\%$
test_instantiation_functorch 1.6153ms 1.5176ms 658.9449 Ops/s 665.1934 Ops/s $\color{#d91a1a}-0.94\%$
test_exec_functorch 0.2237ms 0.1444ms 6.9257 KOps/s 6.8552 KOps/s $\color{#35bf28}+1.03\%$
test_exec_functional_call 0.1711ms 0.1360ms 7.3504 KOps/s 7.0054 KOps/s $\color{#35bf28}+4.93\%$
test_exec_td_decorator 0.3619ms 0.1782ms 5.6111 KOps/s 5.2502 KOps/s $\textbf{\color{#35bf28}+6.88\%}$
test_vmap_mlp_speed_decorator[True-True] 0.7266ms 0.6646ms 1.5046 KOps/s 1.4721 KOps/s $\color{#35bf28}+2.21\%$
test_vmap_mlp_speed_decorator[True-False] 0.7470ms 0.6619ms 1.5107 KOps/s 1.4453 KOps/s $\color{#35bf28}+4.53\%$
test_vmap_mlp_speed_decorator[False-True] 0.6989ms 0.5891ms 1.6974 KOps/s 1.6294 KOps/s $\color{#35bf28}+4.17\%$
test_vmap_mlp_speed_decorator[False-False] 0.6937ms 0.5878ms 1.7014 KOps/s 1.6213 KOps/s $\color{#35bf28}+4.94\%$
test_vmap_transformer_speed_decorator[True-True] 19.2046ms 19.1193ms 52.3031 Ops/s 52.6008 Ops/s $\color{#d91a1a}-0.57\%$
test_vmap_transformer_speed_decorator[True-False] 19.3081ms 19.1906ms 52.1087 Ops/s 52.1896 Ops/s $\color{#d91a1a}-0.16\%$
test_vmap_transformer_speed_decorator[False-True] 19.1225ms 19.0170ms 52.5845 Ops/s 53.1443 Ops/s $\color{#d91a1a}-1.05\%$
test_vmap_transformer_speed_decorator[False-False] 19.1176ms 19.0190ms 52.5791 Ops/s 52.7687 Ops/s $\color{#d91a1a}-0.36\%$
test_to_module_speed[True] 1.0169ms 0.9176ms 1.0899 KOps/s 1.0861 KOps/s $\color{#35bf28}+0.34\%$
test_to_module_speed[False] 1.3745ms 0.9000ms 1.1111 KOps/s 1.1067 KOps/s $\color{#35bf28}+0.40\%$
test_tc_init 67.7510μs 32.9399μs 30.3583 KOps/s 31.4709 KOps/s $\color{#d91a1a}-3.54\%$
test_tc_init_nested 0.1052ms 68.0844μs 14.6876 KOps/s 15.6343 KOps/s $\textbf{\color{#d91a1a}-6.05\%}$
test_tc_first_layer_tensor 3.8559μs 0.6905μs 1.4482 MOps/s 1.4505 MOps/s $\color{#d91a1a}-0.16\%$
test_tc_first_layer_nontensor 29.4200μs 2.2705μs 440.4394 KOps/s 440.0301 KOps/s $\color{#35bf28}+0.09\%$
test_tc_second_layer_tensor 11.4502μs 1.3760μs 726.7645 KOps/s 722.1855 KOps/s $\color{#35bf28}+0.63\%$
test_tc_second_layer_nontensor 22.8610μs 2.9772μs 335.8883 KOps/s 337.4197 KOps/s $\color{#d91a1a}-0.45\%$
test_unbind 0.2212s 9.6290ms 103.8528 Ops/s 153.0871 Ops/s $\textbf{\color{#d91a1a}-32.16\%}$
test_full_like 10.2782ms 9.0929ms 109.9755 Ops/s 109.0385 Ops/s $\color{#35bf28}+0.86\%$
test_zeros_like 9.2212ms 7.1422ms 140.0127 Ops/s 114.6833 Ops/s $\textbf{\color{#35bf28}+22.09\%}$
test_ones_like 9.1546ms 7.1761ms 139.3516 Ops/s 232.0332 Ops/s $\textbf{\color{#d91a1a}-39.94\%}$
test_clone 6.9204ms 6.3248ms 158.1087 Ops/s 158.7270 Ops/s $\color{#d91a1a}-0.39\%$
test_squeeze 58.7010μs 9.1772μs 108.9657 KOps/s 106.8381 KOps/s $\color{#35bf28}+1.99\%$
test_unsqueeze 0.1200ms 67.2405μs 14.8720 KOps/s 14.4790 KOps/s $\color{#35bf28}+2.71\%$
test_split 0.3793ms 0.1512ms 6.6150 KOps/s 6.2786 KOps/s $\textbf{\color{#35bf28}+5.36\%}$
test_permute 0.2062ms 0.1673ms 5.9757 KOps/s 5.7698 KOps/s $\color{#35bf28}+3.57\%$
test_stack 50.7186ms 50.3964ms 19.8427 Ops/s 19.5520 Ops/s $\color{#35bf28}+1.49\%$
test_cat 50.4059ms 50.2208ms 19.9121 Ops/s 19.8482 Ops/s $\color{#35bf28}+0.32\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants