[BugFix] Use 'spawn' mp context in all tests #1111

Merged (1 commit) on Nov 26, 2024

@vmoens (Contributor) commented Nov 26, 2024

[ghstack-poisoned]
@facebook-github-bot added the "CLA Signed" label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Nov 26, 2024
vmoens added a commit that referenced this pull request Nov 26, 2024
ghstack-source-id: a7d786fe77c2c12d5c8c85579123a64ef5c87cf2
Pull Request resolved: #1111
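
The diff itself is not reproduced in this conversation, so the following is only a minimal sketch of what selecting the 'spawn' multiprocessing context looks like with the standard library. The helper name `get_test_mp_context` is hypothetical and is not taken from the PR.

```python
import multiprocessing as mp


def get_test_mp_context():
    """Return an explicit 'spawn' context (hypothetical helper, not from the PR).

    Asking for a context by name avoids depending on the platform default
    start method, which differs between operating systems.
    """
    return mp.get_context("spawn")


ctx = get_test_mp_context()

# Objects created from the context (ctx.Process, ctx.Queue, ctx.Pool, ...)
# use the context's start method rather than the global default.
print(ctx.get_start_method())  # prints "spawn"
```

With 'spawn', a child process starts a fresh interpreter instead of inheriting the parent's memory, so any target function passed to `ctx.Process` has to be importable from a module.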

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}11$. Worsened: $\large\color{#d91a1a}9$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 34.4040μs 17.5806μs 56.8808 KOps/s 56.5388 KOps/s $\color{#35bf28}+0.60\%$
test_plain_set_stack_nested 79.6390μs 17.6364μs 56.7010 KOps/s 55.4510 KOps/s $\color{#35bf28}+2.25\%$
test_plain_set_nested_inplace 44.6330μs 19.2760μs 51.8779 KOps/s 50.3339 KOps/s $\color{#35bf28}+3.07\%$
test_plain_set_stack_nested_inplace 75.8520μs 19.2161μs 52.0396 KOps/s 50.3511 KOps/s $\color{#35bf28}+3.35\%$
test_items 22.5020μs 4.1996μs 238.1173 KOps/s 239.8507 KOps/s $\color{#d91a1a}-0.72\%$
test_items_nested 0.5023ms 0.4035ms 2.4780 KOps/s 2.4754 KOps/s $\color{#35bf28}+0.11\%$
test_items_nested_locked 0.6058ms 0.4052ms 2.4678 KOps/s 2.5003 KOps/s $\color{#d91a1a}-1.30\%$
test_items_nested_leaf 0.1309ms 72.3919μs 13.8137 KOps/s 13.8291 KOps/s $\color{#d91a1a}-0.11\%$
test_items_stack_nested 0.5215ms 0.4164ms 2.4014 KOps/s 2.5000 KOps/s $\color{#d91a1a}-3.94\%$
test_items_stack_nested_leaf 0.1354ms 72.5052μs 13.7921 KOps/s 13.5087 KOps/s $\color{#35bf28}+2.10\%$
test_items_stack_nested_locked 0.5650ms 0.4077ms 2.4525 KOps/s 2.5175 KOps/s $\color{#d91a1a}-2.58\%$
test_keys 24.3450μs 3.6683μs 272.6079 KOps/s 282.5404 KOps/s $\color{#d91a1a}-3.52\%$
test_keys_nested 0.2395ms 0.1364ms 7.3321 KOps/s 7.2058 KOps/s $\color{#35bf28}+1.75\%$
test_keys_nested_locked 1.7050ms 0.1434ms 6.9731 KOps/s 7.0997 KOps/s $\color{#d91a1a}-1.78\%$
test_keys_nested_leaf 0.2396ms 0.1171ms 8.5377 KOps/s 8.6573 KOps/s $\color{#d91a1a}-1.38\%$
test_keys_stack_nested 0.2414ms 0.1361ms 7.3477 KOps/s 7.3265 KOps/s $\color{#35bf28}+0.29\%$
test_keys_stack_nested_leaf 0.2022ms 0.1170ms 8.5445 KOps/s 8.5223 KOps/s $\color{#35bf28}+0.26\%$
test_keys_stack_nested_locked 0.2384ms 0.1425ms 7.0169 KOps/s 6.9810 KOps/s $\color{#35bf28}+0.51\%$
test_values 9.1668μs 1.0292μs 971.6351 KOps/s 920.8436 KOps/s $\textbf{\color{#35bf28}+5.52\%}$
test_values_nested 99.1450μs 55.2471μs 18.1005 KOps/s 17.9664 KOps/s $\color{#35bf28}+0.75\%$
test_values_nested_locked 0.1183ms 55.1449μs 18.1340 KOps/s 18.2241 KOps/s $\color{#d91a1a}-0.49\%$
test_values_nested_leaf 0.1106ms 60.5879μs 16.5050 KOps/s 16.3979 KOps/s $\color{#35bf28}+0.65\%$
test_values_stack_nested 0.1128ms 56.1054μs 17.8236 KOps/s 17.9090 KOps/s $\color{#d91a1a}-0.48\%$
test_values_stack_nested_leaf 0.1102ms 60.5931μs 16.5035 KOps/s 16.6050 KOps/s $\color{#d91a1a}-0.61\%$
test_values_stack_nested_locked 0.1165ms 55.6356μs 17.9741 KOps/s 17.7434 KOps/s $\color{#35bf28}+1.30\%$
test_membership 2.0704μs 0.7398μs 1.3518 MOps/s 1.1430 MOps/s $\textbf{\color{#35bf28}+18.27\%}$
test_membership_nested 43.2730μs 2.9627μs 337.5317 KOps/s 342.7316 KOps/s $\color{#d91a1a}-1.52\%$
test_membership_nested_leaf 31.5790μs 2.9650μs 337.2700 KOps/s 335.8827 KOps/s $\color{#35bf28}+0.41\%$
test_membership_stacked_nested 47.9000μs 2.9669μs 337.0536 KOps/s 345.5982 KOps/s $\color{#d91a1a}-2.47\%$
test_membership_stacked_nested_leaf 21.4400μs 2.9478μs 339.2405 KOps/s 335.4989 KOps/s $\color{#35bf28}+1.12\%$
test_membership_nested_last 42.9500μs 4.3217μs 231.3905 KOps/s 237.1208 KOps/s $\color{#d91a1a}-2.42\%$
test_membership_nested_leaf_last 36.5590μs 4.3524μs 229.7608 KOps/s 237.1721 KOps/s $\color{#d91a1a}-3.12\%$
test_membership_stacked_nested_last 54.5720μs 4.2734μs 234.0077 KOps/s 241.9535 KOps/s $\color{#d91a1a}-3.28\%$
test_membership_stacked_nested_leaf_last 25.0070μs 4.3177μs 231.6065 KOps/s 238.1017 KOps/s $\color{#d91a1a}-2.73\%$
test_nested_getleaf 56.7860μs 10.7662μs 92.8832 KOps/s 92.2595 KOps/s $\color{#35bf28}+0.68\%$
test_nested_get 47.8090μs 10.2797μs 97.2789 KOps/s 97.9306 KOps/s $\color{#d91a1a}-0.67\%$
test_stacked_getleaf 31.1380μs 10.7301μs 93.1962 KOps/s 94.2494 KOps/s $\color{#d91a1a}-1.12\%$
test_stacked_get 46.5170μs 10.1753μs 98.2771 KOps/s 97.6859 KOps/s $\color{#35bf28}+0.61\%$
test_nested_getitemleaf 40.1650μs 11.5258μs 86.7617 KOps/s 89.4319 KOps/s $\color{#d91a1a}-2.99\%$
test_nested_getitem 55.4640μs 10.5215μs 95.0438 KOps/s 95.1088 KOps/s $\color{#d91a1a}-0.07\%$
test_stacked_getitemleaf 57.0260μs 11.4404μs 87.4093 KOps/s 89.3769 KOps/s $\color{#d91a1a}-2.20\%$
test_stacked_getitem 33.8230μs 10.7146μs 93.3307 KOps/s 95.7687 KOps/s $\color{#d91a1a}-2.55\%$
test_lock_nested 3.1533ms 0.4508ms 2.2185 KOps/s 2.2576 KOps/s $\color{#d91a1a}-1.73\%$
test_lock_stack_nested 0.5457ms 0.4169ms 2.3988 KOps/s 2.4022 KOps/s $\color{#d91a1a}-0.14\%$
test_unlock_nested 1.0902ms 0.3619ms 2.7635 KOps/s 2.7435 KOps/s $\color{#35bf28}+0.73\%$
test_unlock_stack_nested 0.5077ms 0.3335ms 2.9985 KOps/s 3.0209 KOps/s $\color{#d91a1a}-0.74\%$
test_flatten_speed 0.1998ms 97.8751μs 10.2171 KOps/s 10.4191 KOps/s $\color{#d91a1a}-1.94\%$
test_unflatten_speed 0.8074ms 0.5062ms 1.9756 KOps/s 1.9862 KOps/s $\color{#d91a1a}-0.54\%$
test_common_ops 1.6030ms 0.7913ms 1.2638 KOps/s 1.2992 KOps/s $\color{#d91a1a}-2.73\%$
test_creation 19.3450μs 2.0492μs 487.9922 KOps/s 481.0562 KOps/s $\color{#35bf28}+1.44\%$
test_creation_empty 34.1440μs 10.0409μs 99.5929 KOps/s 94.7517 KOps/s $\textbf{\color{#35bf28}+5.11\%}$
test_creation_nested_1 47.9190μs 12.8775μs 77.6546 KOps/s 73.3356 KOps/s $\textbf{\color{#35bf28}+5.89\%}$
test_creation_nested_2 44.4130μs 17.7412μs 56.3660 KOps/s 57.3535 KOps/s $\color{#d91a1a}-1.72\%$
test_clone 1.5589ms 13.3441μs 74.9396 KOps/s 78.1417 KOps/s $\color{#d91a1a}-4.10\%$
test_getitem[int] 1.0284ms 13.0987μs 76.3436 KOps/s 79.5256 KOps/s $\color{#d91a1a}-4.00\%$
test_getitem[slice_int] 0.1399ms 25.3370μs 39.4679 KOps/s 40.7585 KOps/s $\color{#d91a1a}-3.17\%$
test_getitem[range] 0.1752ms 49.3840μs 20.2495 KOps/s 20.5155 KOps/s $\color{#d91a1a}-1.30\%$
test_getitem[tuple] 0.1325ms 21.0960μs 47.4024 KOps/s 50.2786 KOps/s $\textbf{\color{#d91a1a}-5.72\%}$
test_getitem[list] 0.1693ms 45.1416μs 22.1525 KOps/s 22.6998 KOps/s $\color{#d91a1a}-2.41\%$
test_setitem_dim[int] 48.5500μs 25.6210μs 39.0304 KOps/s 39.6335 KOps/s $\color{#d91a1a}-1.52\%$
test_setitem_dim[slice_int] 88.0440μs 53.9030μs 18.5518 KOps/s 18.9771 KOps/s $\color{#d91a1a}-2.24\%$
test_setitem_dim[range] 0.1429ms 74.6075μs 13.4035 KOps/s 13.3561 KOps/s $\color{#35bf28}+0.35\%$
test_setitem_dim[tuple] 79.1480μs 42.0473μs 23.7827 KOps/s 24.5230 KOps/s $\color{#d91a1a}-3.02\%$
test_setitem 88.0040μs 20.5274μs 48.7153 KOps/s 51.2368 KOps/s $\color{#d91a1a}-4.92\%$
test_set 81.4410μs 19.7424μs 50.6525 KOps/s 52.3441 KOps/s $\color{#d91a1a}-3.23\%$
test_set_shared 3.0874ms 0.1721ms 5.8090 KOps/s 5.9049 KOps/s $\color{#d91a1a}-1.62\%$
test_update 0.1349ms 22.4905μs 44.4632 KOps/s 46.5208 KOps/s $\color{#d91a1a}-4.42\%$
test_update_nested 98.1330μs 32.5729μs 30.7004 KOps/s 31.8615 KOps/s $\color{#d91a1a}-3.64\%$
test_update__nested 0.4291ms 33.1733μs 30.1448 KOps/s 30.9532 KOps/s $\color{#d91a1a}-2.61\%$
test_set_nested 76.0120μs 21.8476μs 45.7717 KOps/s 46.2149 KOps/s $\color{#d91a1a}-0.96\%$
test_set_nested_new 76.3530μs 26.3595μs 37.9370 KOps/s 38.0418 KOps/s $\color{#d91a1a}-0.28\%$
test_select 0.2165ms 43.0204μs 23.2448 KOps/s 23.6868 KOps/s $\color{#d91a1a}-1.87\%$
test_select_nested 0.1257ms 60.4242μs 16.5497 KOps/s 16.7342 KOps/s $\color{#d91a1a}-1.10\%$
test_exclude_nested 0.1677ms 80.9235μs 12.3573 KOps/s 12.6308 KOps/s $\color{#d91a1a}-2.16\%$
test_empty[True] 0.5123ms 0.3888ms 2.5719 KOps/s 2.5943 KOps/s $\color{#d91a1a}-0.87\%$
test_empty[False] 8.6338μs 1.2388μs 807.2426 KOps/s 788.2483 KOps/s $\color{#35bf28}+2.41\%$
test_unbind_speed 0.3522ms 0.2644ms 3.7820 KOps/s 3.8423 KOps/s $\color{#d91a1a}-1.57\%$
test_unbind_speed_stack0 0.6454ms 0.2605ms 3.8389 KOps/s 3.9054 KOps/s $\color{#d91a1a}-1.70\%$
test_unbind_speed_stack1 0.1013s 0.7662ms 1.3052 KOps/s 1.5857 KOps/s $\textbf{\color{#d91a1a}-17.69\%}$
test_split 0.1009s 1.7573ms 569.0635 Ops/s 573.0410 Ops/s $\color{#d91a1a}-0.69\%$
test_chunk 0.1013s 1.7621ms 567.5198 Ops/s 577.0731 Ops/s $\color{#d91a1a}-1.66\%$
test_consolidate_njt[False-None] 10.3993ms 8.1297ms 123.0056 Ops/s 121.9177 Ops/s $\color{#35bf28}+0.89\%$
test_creation[device0] 3.5141ms 93.5232μs 10.6925 KOps/s 10.6897 KOps/s $\color{#35bf28}+0.03\%$
test_creation_from_tensor 0.2281ms 94.0934μs 10.6277 KOps/s 10.3312 KOps/s $\color{#35bf28}+2.87\%$
test_add_one[memmap_tensor0] 0.1953ms 5.3138μs 188.1890 KOps/s 207.5055 KOps/s $\textbf{\color{#d91a1a}-9.31\%}$
test_contiguous[memmap_tensor0] 14.6280μs 0.5257μs 1.9021 MOps/s 1.9308 MOps/s $\color{#d91a1a}-1.49\%$
test_stack[memmap_tensor0] 36.8890μs 3.5817μs 279.1953 KOps/s 296.9024 KOps/s $\textbf{\color{#d91a1a}-5.96\%}$
test_memmaptd_index 1.0527ms 0.2419ms 4.1332 KOps/s 4.3413 KOps/s $\color{#d91a1a}-4.79\%$
test_memmaptd_index_astensor 0.6594ms 0.3193ms 3.1315 KOps/s 3.2312 KOps/s $\color{#d91a1a}-3.09\%$
test_memmaptd_index_op 1.0102ms 0.5745ms 1.7406 KOps/s 1.7491 KOps/s $\color{#d91a1a}-0.48\%$
test_serialize_model 0.1288s 0.1153s 8.6734 Ops/s 7.3885 Ops/s $\textbf{\color{#35bf28}+17.39\%}$
test_serialize_model_pickle 0.4473s 0.3924s 2.5484 Ops/s 2.5777 Ops/s $\color{#d91a1a}-1.13\%$
test_serialize_weights 0.1235s 0.1128s 8.8676 Ops/s 8.8405 Ops/s $\color{#35bf28}+0.31\%$
test_serialize_weights_returnearly 0.1675s 0.1560s 6.4097 Ops/s 6.2944 Ops/s $\color{#35bf28}+1.83\%$
test_serialize_weights_pickle 0.5863s 0.4241s 2.3580 Ops/s 2.3731 Ops/s $\color{#d91a1a}-0.64\%$
test_serialize_weights_filesystem 0.1466s 0.1417s 7.0589 Ops/s 6.4293 Ops/s $\textbf{\color{#35bf28}+9.79\%}$
test_serialize_model_filesystem 0.1626s 0.1525s 6.5573 Ops/s 6.6994 Ops/s $\color{#d91a1a}-2.12\%$
test_reshape_pytree 57.4070μs 27.2208μs 36.7366 KOps/s 37.9657 KOps/s $\color{#d91a1a}-3.24\%$
test_reshape_td 0.1080ms 33.3635μs 29.9728 KOps/s 31.1942 KOps/s $\color{#d91a1a}-3.92\%$
test_view_pytree 66.6340μs 27.7228μs 36.0714 KOps/s 37.7117 KOps/s $\color{#d91a1a}-4.35\%$
test_view_td 77.8650μs 38.2032μs 26.1758 KOps/s 27.3928 KOps/s $\color{#d91a1a}-4.44\%$
test_unbind_pytree 75.6810μs 29.9733μs 33.3630 KOps/s 33.7364 KOps/s $\color{#d91a1a}-1.11\%$
test_unbind_td 0.3398ms 38.6483μs 25.8744 KOps/s 26.6399 KOps/s $\color{#d91a1a}-2.87\%$
test_split_pytree 68.4170μs 29.9417μs 33.3982 KOps/s 34.0918 KOps/s $\color{#d91a1a}-2.03\%$
test_split_td 0.5247ms 45.2427μs 22.1030 KOps/s 23.1399 KOps/s $\color{#d91a1a}-4.48\%$
test_add_pytree 91.8310μs 36.8763μs 27.1177 KOps/s 28.4212 KOps/s $\color{#d91a1a}-4.59\%$
test_add_td 0.1306ms 53.4645μs 18.7040 KOps/s 19.1164 KOps/s $\color{#d91a1a}-2.16\%$
test_compile_add_one_nested[tensordict-compile] 0.1291ms 62.0599μs 16.1135 KOps/s 15.7052 KOps/s $\color{#35bf28}+2.60\%$
test_compile_add_one_nested[tensordict-eager] 0.3509ms 0.1628ms 6.1410 KOps/s 6.1986 KOps/s $\color{#d91a1a}-0.93\%$
test_compile_add_one_nested[pytree-compile] 0.1095ms 45.5813μs 21.9388 KOps/s 21.3738 KOps/s $\color{#35bf28}+2.64\%$
test_compile_add_one_nested[pytree-eager] 0.2356ms 0.1213ms 8.2461 KOps/s 8.2534 KOps/s $\color{#d91a1a}-0.09\%$
test_compile_copy_nested[tensordict-compile] 0.1439ms 26.4001μs 37.8786 KOps/s 37.6419 KOps/s $\color{#35bf28}+0.63\%$
test_compile_copy_nested[tensordict-eager] 0.2328ms 55.1934μs 18.1181 KOps/s 18.5561 KOps/s $\color{#d91a1a}-2.36\%$
test_compile_copy_nested[pytree-compile] 0.1427ms 81.8654μs 12.2152 KOps/s 12.5173 KOps/s $\color{#d91a1a}-2.41\%$
test_compile_copy_nested[pytree-eager] 0.1224ms 69.2542μs 14.4396 KOps/s 14.5025 KOps/s $\color{#d91a1a}-0.43\%$
test_compile_add_one_flat[tensordict-compile] 0.1835ms 0.1044ms 9.5826 KOps/s 9.3972 KOps/s $\color{#35bf28}+1.97\%$
test_compile_add_one_flat[tensordict-eager] 0.3584ms 0.2019ms 4.9534 KOps/s 5.0104 KOps/s $\color{#d91a1a}-1.14\%$
test_compile_add_one_flat[tensorclass-compile] 90.6870μs 44.8485μs 22.2973 KOps/s 21.8086 KOps/s $\color{#35bf28}+2.24\%$
test_compile_add_one_flat[tensorclass-eager] 0.5005ms 61.7318μs 16.1991 KOps/s 16.1800 KOps/s $\color{#35bf28}+0.12\%$
test_compile_add_one_flat[pytree-compile] 0.2140ms 0.1029ms 9.7185 KOps/s 9.6356 KOps/s $\color{#35bf28}+0.86\%$
test_compile_add_one_flat[pytree-eager] 0.3826ms 0.2084ms 4.7974 KOps/s 4.8919 KOps/s $\color{#d91a1a}-1.93\%$
test_compile_add_self_flat[tensordict-eager] 0.3007ms 0.2156ms 4.6379 KOps/s 4.7146 KOps/s $\color{#d91a1a}-1.63\%$
test_compile_add_self_flat[tensordict-compile] 0.2253ms 0.1055ms 9.4824 KOps/s 9.4460 KOps/s $\color{#35bf28}+0.39\%$
test_compile_add_self_flat[tensorclass-eager] 0.1352ms 54.6982μs 18.2821 KOps/s 18.1377 KOps/s $\color{#35bf28}+0.80\%$
test_compile_add_self_flat[tensorclass-compile] 0.1114ms 46.4809μs 21.5142 KOps/s 20.8819 KOps/s $\color{#35bf28}+3.03\%$
test_compile_add_self_flat[pytree-eager] 0.6182ms 0.1637ms 6.1103 KOps/s 6.0466 KOps/s $\color{#35bf28}+1.05\%$
test_compile_add_self_flat[pytree-compile] 0.1920ms 0.1059ms 9.4386 KOps/s 9.6200 KOps/s $\color{#d91a1a}-1.89\%$
test_compile_copy_flat[tensordict-compile] 78.2560μs 22.2982μs 44.8467 KOps/s 46.0662 KOps/s $\color{#d91a1a}-2.65\%$
test_compile_copy_flat[tensordict-eager] 0.1108ms 58.8060μs 17.0051 KOps/s 17.0419 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_copy_flat[pytree-compile] 0.1769ms 84.6557μs 11.8126 KOps/s 11.9522 KOps/s $\color{#d91a1a}-1.17\%$
test_compile_copy_flat[pytree-eager] 0.1476ms 72.4227μs 13.8078 KOps/s 13.8395 KOps/s $\color{#d91a1a}-0.23\%$
test_compile_assign_and_add[tensordict-compile] 0.2927ms 0.2026ms 4.9348 KOps/s 4.7858 KOps/s $\color{#35bf28}+3.11\%$
test_compile_assign_and_add[tensordict-eager] 1.5663ms 1.3056ms 765.9243 Ops/s 771.6167 Ops/s $\color{#d91a1a}-0.74\%$
test_compile_assign_and_add[pytree-compile] 0.2974ms 0.2004ms 4.9907 KOps/s 4.9052 KOps/s $\color{#35bf28}+1.74\%$
test_compile_assign_and_add[pytree-eager] 0.9480ms 0.7965ms 1.2555 KOps/s 1.2740 KOps/s $\color{#d91a1a}-1.45\%$
test_compile_assign_and_add_stack[compile] 0.8130ms 0.4530ms 2.2077 KOps/s 2.1794 KOps/s $\color{#35bf28}+1.30\%$
test_compile_assign_and_add_stack[eager] 3.9865ms 2.6491ms 377.4874 Ops/s 391.1901 Ops/s $\color{#d91a1a}-3.50\%$
test_compile_indexing[tensor-tensordict-compile] 89.1660μs 36.7825μs 27.1868 KOps/s 27.3379 KOps/s $\color{#d91a1a}-0.55\%$
test_compile_indexing[tensor-tensordict-eager] 0.5230ms 34.1949μs 29.2442 KOps/s 29.9677 KOps/s $\color{#d91a1a}-2.41\%$
test_compile_indexing[tensor-tensorclass-compile] 98.4430μs 29.8399μs 33.5122 KOps/s 34.3107 KOps/s $\color{#d91a1a}-2.33\%$
test_compile_indexing[tensor-tensorclass-eager] 64.2600μs 23.4431μs 42.6565 KOps/s 42.2109 KOps/s $\color{#35bf28}+1.06\%$
test_compile_indexing[tensor-pytree-compile] 90.5090μs 30.7436μs 32.5271 KOps/s 32.8903 KOps/s $\color{#d91a1a}-1.10\%$
test_compile_indexing[tensor-pytree-eager] 0.1199ms 23.8168μs 41.9872 KOps/s 42.6207 KOps/s $\color{#d91a1a}-1.49\%$
test_compile_indexing[slice-tensordict-compile] 0.1274ms 52.3345μs 19.1078 KOps/s 19.2134 KOps/s $\color{#d91a1a}-0.55\%$
test_compile_indexing[slice-tensordict-eager] 0.6105ms 21.0768μs 47.4456 KOps/s 49.8675 KOps/s $\color{#d91a1a}-4.86\%$
test_compile_indexing[slice-tensorclass-compile] 0.1064ms 44.3922μs 22.5265 KOps/s 22.3784 KOps/s $\color{#35bf28}+0.66\%$
test_compile_indexing[slice-tensorclass-eager] 81.8750μs 18.9683μs 52.7196 KOps/s 51.9010 KOps/s $\color{#35bf28}+1.58\%$
test_compile_indexing[slice-pytree-compile] 89.8480μs 45.0545μs 22.1954 KOps/s 21.8940 KOps/s $\color{#35bf28}+1.38\%$
test_compile_indexing[slice-pytree-eager] 81.4520μs 19.4321μs 51.4614 KOps/s 52.5383 KOps/s $\color{#d91a1a}-2.05\%$
test_compile_indexing[int-tensordict-compile] 0.1118ms 52.2381μs 19.1431 KOps/s 18.8726 KOps/s $\color{#35bf28}+1.43\%$
test_compile_indexing[int-tensordict-eager] 0.9690ms 20.5183μs 48.7369 KOps/s 50.7942 KOps/s $\color{#d91a1a}-4.05\%$
test_compile_indexing[int-tensorclass-compile] 0.1242ms 45.0088μs 22.2179 KOps/s 22.3612 KOps/s $\color{#d91a1a}-0.64\%$
test_compile_indexing[int-tensorclass-eager] 82.8340μs 19.3401μs 51.7061 KOps/s 52.6596 KOps/s $\color{#d91a1a}-1.81\%$
test_compile_indexing[int-pytree-compile] 0.1108ms 45.1499μs 22.1485 KOps/s 22.1824 KOps/s $\color{#d91a1a}-0.15\%$
test_compile_indexing[int-pytree-eager] 59.9120μs 19.0608μs 52.4638 KOps/s 53.3284 KOps/s $\color{#d91a1a}-1.62\%$
test_mod_add[eager] 75.8910μs 34.4284μs 29.0458 KOps/s 30.3733 KOps/s $\color{#d91a1a}-4.37\%$
test_mod_add[compile] 0.1190ms 46.9617μs 21.2939 KOps/s 20.1062 KOps/s $\textbf{\color{#35bf28}+5.91\%}$
test_mod_add[compile-overhead] 0.1249ms 48.3697μs 20.6741 KOps/s 19.7800 KOps/s $\color{#35bf28}+4.52\%$
test_mod_wrap[eager] 0.4839ms 0.2319ms 4.3127 KOps/s 4.4088 KOps/s $\color{#d91a1a}-2.18\%$
test_mod_wrap[compile] 0.3224ms 0.2089ms 4.7874 KOps/s 4.7028 KOps/s $\color{#35bf28}+1.80\%$
test_mod_wrap[compile-overhead] 0.3974ms 0.2054ms 4.8681 KOps/s 4.7472 KOps/s $\color{#35bf28}+2.55\%$
test_mod_wrap_and_backward[eager] 16.2540ms 11.6403ms 85.9088 Ops/s 81.4684 Ops/s $\textbf{\color{#35bf28}+5.45\%}$
test_mod_wrap_and_backward[compile] 13.9757ms 11.5073ms 86.9010 Ops/s 81.2963 Ops/s $\textbf{\color{#35bf28}+6.89\%}$
test_mod_wrap_and_backward[compile-overhead] 12.9030ms 11.9902ms 83.4015 Ops/s 80.5341 Ops/s $\color{#35bf28}+3.56\%$
test_seq_add[eager] 0.2176ms 0.1123ms 8.9050 KOps/s 9.0911 KOps/s $\color{#d91a1a}-2.05\%$
test_seq_add[compile] 0.1472ms 62.6583μs 15.9596 KOps/s 15.9384 KOps/s $\color{#35bf28}+0.13\%$
test_seq_add[compile-overhead] 0.1530ms 60.2860μs 16.5876 KOps/s 16.3331 KOps/s $\color{#35bf28}+1.56\%$
test_seq_wrap[eager] 0.8023ms 0.4369ms 2.2888 KOps/s 2.2785 KOps/s $\color{#35bf28}+0.45\%$
test_seq_wrap[compile] 0.3567ms 0.2293ms 4.3612 KOps/s 4.2418 KOps/s $\color{#35bf28}+2.82\%$
test_seq_wrap[compile-overhead] 0.4246ms 0.2306ms 4.3374 KOps/s 4.2659 KOps/s $\color{#35bf28}+1.67\%$
test_func_call_runtime[False-eager] 0.8888ms 0.5653ms 1.7690 KOps/s 1.8040 KOps/s $\color{#d91a1a}-1.94\%$
test_func_call_runtime[False-compile] 0.6158ms 0.4322ms 2.3137 KOps/s 2.3233 KOps/s $\color{#d91a1a}-0.41\%$
test_func_call_runtime[False-compile-overhead] 0.5567ms 0.4363ms 2.2918 KOps/s 2.3290 KOps/s $\color{#d91a1a}-1.59\%$
test_func_call_runtime[True-eager] 1.4969ms 0.7861ms 1.2722 KOps/s 1.3076 KOps/s $\color{#d91a1a}-2.71\%$
test_func_call_runtime[True-compile] 0.5877ms 0.4748ms 2.1061 KOps/s 2.1112 KOps/s $\color{#d91a1a}-0.24\%$
test_func_call_runtime[True-compile-overhead] 0.6931ms 0.4739ms 2.1103 KOps/s 2.1245 KOps/s $\color{#d91a1a}-0.67\%$
test_func_call_cm_runtime[False-eager] 0.9020ms 0.5648ms 1.7705 KOps/s 1.8417 KOps/s $\color{#d91a1a}-3.87\%$
test_func_call_cm_runtime[False-compile] 0.6350ms 0.4346ms 2.3011 KOps/s 2.3274 KOps/s $\color{#d91a1a}-1.13\%$
test_func_call_cm_runtime[False-compile-overhead] 0.9076ms 0.4301ms 2.3249 KOps/s 2.3172 KOps/s $\color{#35bf28}+0.33\%$
test_func_call_cm_runtime[True-eager] 1.5009ms 0.9307ms 1.0744 KOps/s 1.0988 KOps/s $\color{#d91a1a}-2.22\%$
test_func_call_cm_runtime[True-compile] 0.6275ms 0.5000ms 1.9999 KOps/s 1.9969 KOps/s $\color{#35bf28}+0.15\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5987ms 0.4994ms 2.0023 KOps/s 2.0120 KOps/s $\color{#d91a1a}-0.48\%$
test_vmap_func_call_cm_runtime[eager] 2.4328ms 1.9163ms 521.8477 Ops/s 513.5480 Ops/s $\color{#35bf28}+1.62\%$
test_vmap_func_call_cm_runtime[compile] 1.1735ms 0.5328ms 1.8768 KOps/s 1.8809 KOps/s $\color{#d91a1a}-0.22\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.8590ms 0.5505ms 1.8164 KOps/s 1.8842 KOps/s $\color{#d91a1a}-3.60\%$
test_distributed 0.3201ms 0.1273ms 7.8572 KOps/s 7.7637 KOps/s $\color{#35bf28}+1.20\%$
test_tdmodule 87.8140μs 25.8561μs 38.6756 KOps/s 39.9251 KOps/s $\color{#d91a1a}-3.13\%$
test_tdmodule_dispatch 83.5760μs 47.7495μs 20.9426 KOps/s 21.3109 KOps/s $\color{#d91a1a}-1.73\%$
test_tdseq 43.7810μs 25.2763μs 39.5627 KOps/s 38.9521 KOps/s $\color{#35bf28}+1.57\%$
test_tdseq_dispatch 75.0600μs 49.7107μs 20.1164 KOps/s 20.1252 KOps/s $\color{#d91a1a}-0.04\%$
test_instantiation_functorch 2.4143ms 1.5589ms 641.4637 Ops/s 633.0954 Ops/s $\color{#35bf28}+1.32\%$
test_exec_functorch 0.4254ms 0.1841ms 5.4312 KOps/s 5.4111 KOps/s $\color{#35bf28}+0.37\%$
test_exec_functional_call 0.3156ms 0.1790ms 5.5869 KOps/s 5.6459 KOps/s $\color{#d91a1a}-1.05\%$
test_exec_td_decorator 0.5344ms 0.2360ms 4.2372 KOps/s 4.2568 KOps/s $\color{#d91a1a}-0.46\%$
test_vmap_mlp_speed_decorator[True-True] 1.1638ms 0.6618ms 1.5111 KOps/s 1.4983 KOps/s $\color{#35bf28}+0.85\%$
test_vmap_mlp_speed_decorator[True-False] 0.9015ms 0.6555ms 1.5256 KOps/s 1.5147 KOps/s $\color{#35bf28}+0.71\%$
test_vmap_mlp_speed_decorator[False-True] 0.8295ms 0.5349ms 1.8696 KOps/s 1.8812 KOps/s $\color{#d91a1a}-0.62\%$
test_vmap_mlp_speed_decorator[False-False] 0.7650ms 0.5355ms 1.8676 KOps/s 1.8962 KOps/s $\color{#d91a1a}-1.51\%$
test_to_module_speed[True] 1.6669ms 1.3007ms 768.7928 Ops/s 768.1274 Ops/s $\color{#35bf28}+0.09\%$
test_to_module_speed[False] 2.0345ms 1.2791ms 781.8055 Ops/s 800.4740 Ops/s $\color{#d91a1a}-2.33\%$
test_tc_init 89.9980μs 48.9173μs 20.4427 KOps/s 22.9565 KOps/s $\textbf{\color{#d91a1a}-10.95\%}$
test_tc_init_nested 0.1933ms 98.2263μs 10.1806 KOps/s 11.6690 KOps/s $\textbf{\color{#d91a1a}-12.76\%}$
test_tc_first_layer_tensor 21.8510μs 1.6103μs 620.9898 KOps/s 612.9044 KOps/s $\color{#35bf28}+1.32\%$
test_tc_first_layer_nontensor 41.9180μs 4.9740μs 201.0467 KOps/s 204.4354 KOps/s $\color{#d91a1a}-1.66\%$
test_tc_second_layer_tensor 24.3550μs 3.0009μs 333.2306 KOps/s 330.3610 KOps/s $\color{#35bf28}+0.87\%$
test_tc_second_layer_nontensor 46.3060μs 6.4579μs 154.8503 KOps/s 158.3126 KOps/s $\color{#d91a1a}-2.19\%$
test_unbind 0.2279s 13.7982ms 72.4732 Ops/s 78.6314 Ops/s $\textbf{\color{#d91a1a}-7.83\%}$
test_full_like 9.5102ms 8.5034ms 117.6007 Ops/s 131.0931 Ops/s $\textbf{\color{#d91a1a}-10.29\%}$
test_zeros_like 3.7421ms 3.2043ms 312.0780 Ops/s 351.5265 Ops/s $\textbf{\color{#d91a1a}-11.22\%}$
test_ones_like 4.1349ms 3.6554ms 273.5659 Ops/s 164.4256 Ops/s $\textbf{\color{#35bf28}+66.38\%}$
test_clone 6.3703ms 5.6238ms 177.8170 Ops/s 127.9916 Ops/s $\textbf{\color{#35bf28}+38.93\%}$
test_squeeze 79.9990μs 11.9912μs 83.3945 KOps/s 84.8835 KOps/s $\color{#d91a1a}-1.75\%$
test_unsqueeze 0.1629ms 90.5350μs 11.0455 KOps/s 11.0907 KOps/s $\color{#d91a1a}-0.41\%$
test_split 0.4949ms 0.1992ms 5.0209 KOps/s 5.2187 KOps/s $\color{#d91a1a}-3.79\%$
test_permute 0.3195ms 0.2199ms 4.5482 KOps/s 4.5376 KOps/s $\color{#35bf28}+0.23\%$
test_stack 30.1682ms 25.2434ms 39.6142 Ops/s 39.0674 Ops/s $\color{#35bf28}+1.40\%$
test_cat 29.7636ms 25.2266ms 39.6407 Ops/s 39.4385 Ops/s $\color{#35bf28}+0.51\%$
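
The Change column above appears to be the relative difference between the Ops column and the Ops on Repo HEAD column. The sketch below recomputes it for three rows copied from the table. The 5% threshold used to classify a row as improved or worsened is an assumption inferred from which rows are printed in bold and from the summary counts; the benchmark bot's actual rule is not stated here. The function names are illustrative.

```python
def percent_change(ops_pr, ops_head):
    """Relative change of PR throughput versus repo HEAD, in percent."""
    return (ops_pr - ops_head) / ops_head * 100.0


def classify(change_pct, threshold=5.0):
    """Label a row; the 5% threshold is an assumption, not a documented rule."""
    if change_pct > threshold:
        return "improved"
    if change_pct < -threshold:
        return "worsened"
    return "unchanged"


# (Ops, Ops on Repo HEAD) pairs copied from the CPU table; both values in a
# pair share the same unit, so the ratio is unit-free.
rows = {
    "test_plain_set_nested": (56.8808, 56.5388),    # KOps/s
    "test_membership": (1.3518, 1.1430),            # MOps/s
    "test_unbind_speed_stack1": (1.3052, 1.5857),   # KOps/s
}

for name, (ops_pr, ops_head) in rows.items():
    change = percent_change(ops_pr, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Because the table values are already rounded, a recomputed percentage can differ from the reported one in the last digit.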


$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}6$. Worsened: $\large\color{#d91a1a}21$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 32.9110μs 10.5692μs 94.6145 KOps/s 98.8888 KOps/s $\color{#d91a1a}-4.32\%$
test_plain_set_stack_nested 34.1700μs 10.5183μs 95.0728 KOps/s 99.4130 KOps/s $\color{#d91a1a}-4.37\%$
test_plain_set_nested_inplace 47.6800μs 11.3195μs 88.3432 KOps/s 90.8469 KOps/s $\color{#d91a1a}-2.76\%$
test_plain_set_stack_nested_inplace 39.6600μs 11.3743μs 87.9177 KOps/s 91.2833 KOps/s $\color{#d91a1a}-3.69\%$
test_items 40.9100μs 2.8610μs 349.5259 KOps/s 348.1706 KOps/s $\color{#35bf28}+0.39\%$
test_items_nested 0.4483ms 0.3545ms 2.8209 KOps/s 2.8612 KOps/s $\color{#d91a1a}-1.41\%$
test_items_nested_locked 0.4066ms 0.3568ms 2.8023 KOps/s 2.8420 KOps/s $\color{#d91a1a}-1.40\%$
test_items_nested_leaf 86.8910μs 59.0817μs 16.9257 KOps/s 17.2095 KOps/s $\color{#d91a1a}-1.65\%$
test_items_stack_nested 0.4049ms 0.3584ms 2.7905 KOps/s 2.8252 KOps/s $\color{#d91a1a}-1.23\%$
test_items_stack_nested_leaf 97.9910μs 60.7325μs 16.4656 KOps/s 16.8193 KOps/s $\color{#d91a1a}-2.10\%$
test_items_stack_nested_locked 0.4175ms 0.3633ms 2.7528 KOps/s 2.8373 KOps/s $\color{#d91a1a}-2.98\%$
test_keys 31.1910μs 3.4611μs 288.9228 KOps/s 291.6953 KOps/s $\color{#d91a1a}-0.95\%$
test_keys_nested 0.1058ms 70.1476μs 14.2557 KOps/s 14.2432 KOps/s $\color{#35bf28}+0.09\%$
test_keys_nested_locked 0.6476ms 75.2305μs 13.2925 KOps/s 13.1809 KOps/s $\color{#35bf28}+0.85\%$
test_keys_nested_leaf 0.1012ms 61.6084μs 16.2316 KOps/s 16.4263 KOps/s $\color{#d91a1a}-1.19\%$
test_keys_stack_nested 96.1710μs 71.1942μs 14.0461 KOps/s 14.2200 KOps/s $\color{#d91a1a}-1.22\%$
test_keys_stack_nested_leaf 91.1200μs 62.3908μs 16.0280 KOps/s 16.2501 KOps/s $\color{#d91a1a}-1.37\%$
test_keys_stack_nested_locked 0.1045ms 76.1173μs 13.1376 KOps/s 13.1273 KOps/s $\color{#35bf28}+0.08\%$
test_values 5.6102μs 0.8420μs 1.1876 MOps/s 1.1868 MOps/s $\color{#35bf28}+0.07\%$
test_values_nested 59.6800μs 31.8363μs 31.4107 KOps/s 32.1044 KOps/s $\color{#d91a1a}-2.16\%$
test_values_nested_locked 60.4210μs 33.5750μs 29.7840 KOps/s 30.6008 KOps/s $\color{#d91a1a}-2.67\%$
test_values_nested_leaf 68.7700μs 34.0874μs 29.3363 KOps/s 30.0422 KOps/s $\color{#d91a1a}-2.35\%$
test_values_stack_nested 78.4400μs 32.3996μs 30.8646 KOps/s 31.8415 KOps/s $\color{#d91a1a}-3.07\%$
test_values_stack_nested_leaf 85.4210μs 34.5246μs 28.9649 KOps/s 29.4861 KOps/s $\color{#d91a1a}-1.77\%$
test_values_stack_nested_locked 75.0400μs 33.9171μs 29.4836 KOps/s 30.1536 KOps/s $\color{#d91a1a}-2.22\%$
test_membership 2.3605μs 0.5120μs 1.9532 MOps/s 1.9793 MOps/s $\color{#d91a1a}-1.32\%$
test_membership_nested 16.4405μs 1.9298μs 518.2017 KOps/s 520.8107 KOps/s $\color{#d91a1a}-0.50\%$
test_membership_nested_leaf 17.6800μs 1.9388μs 515.7895 KOps/s 511.5195 KOps/s $\color{#35bf28}+0.83\%$
test_membership_stacked_nested 29.0300μs 2.0272μs 493.3017 KOps/s 498.3835 KOps/s $\color{#d91a1a}-1.02\%$
test_membership_stacked_nested_leaf 44.2800μs 2.0271μs 493.3147 KOps/s 492.6622 KOps/s $\color{#35bf28}+0.13\%$
test_membership_nested_last 31.9100μs 2.8671μs 348.7894 KOps/s 350.8533 KOps/s $\color{#d91a1a}-0.59\%$
test_membership_nested_leaf_last 39.7710μs 2.9236μs 342.0494 KOps/s 348.7489 KOps/s $\color{#d91a1a}-1.92\%$
test_membership_stacked_nested_last 29.9800μs 2.9529μs 338.6458 KOps/s 348.5700 KOps/s $\color{#d91a1a}-2.85\%$
test_membership_stacked_nested_leaf_last 30.8800μs 2.8843μs 346.7056 KOps/s 346.7190 KOps/s $-0.00\%$
test_nested_getleaf 54.2800μs 6.0914μs 164.1665 KOps/s 164.0045 KOps/s $\color{#35bf28}+0.10\%$
test_nested_get 29.1800μs 5.8275μs 171.5995 KOps/s 171.8707 KOps/s $\color{#d91a1a}-0.16\%$
test_stacked_getleaf 28.7400μs 6.1126μs 163.5962 KOps/s 164.8808 KOps/s $\color{#d91a1a}-0.78\%$
test_stacked_get 46.7200μs 5.7950μs 172.5640 KOps/s 173.0097 KOps/s $\color{#d91a1a}-0.26\%$
test_nested_getitemleaf 40.5010μs 6.1870μs 161.6288 KOps/s 161.7343 KOps/s $\color{#d91a1a}-0.07\%$
test_nested_getitem 35.8200μs 5.9149μs 169.0657 KOps/s 170.0955 KOps/s $\color{#d91a1a}-0.61\%$
test_stacked_getitemleaf 36.7500μs 6.1588μs 162.3703 KOps/s 161.2010 KOps/s $\color{#35bf28}+0.73\%$
test_stacked_getitem 39.8710μs 5.8773μs 170.1463 KOps/s 170.4638 KOps/s $\color{#d91a1a}-0.19\%$
test_lock_nested 9.4038ms 0.3780ms 2.6456 KOps/s 2.7023 KOps/s $\color{#d91a1a}-2.10\%$
test_lock_stack_nested 0.4359ms 0.3374ms 2.9638 KOps/s 2.9976 KOps/s $\color{#d91a1a}-1.13\%$
test_unlock_nested 0.7030ms 0.3097ms 3.2288 KOps/s 3.2637 KOps/s $\color{#d91a1a}-1.07\%$
test_unlock_stack_nested 0.3077ms 0.2768ms 3.6132 KOps/s 3.6552 KOps/s $\color{#d91a1a}-1.15\%$
test_flatten_speed 0.1019ms 74.4590μs 13.4302 KOps/s 13.0313 KOps/s $\color{#35bf28}+3.06\%$
test_unflatten_speed 0.3549ms 0.3005ms 3.3277 KOps/s 3.3246 KOps/s $\color{#35bf28}+0.10\%$
test_common_ops 1.6448ms 0.5723ms 1.7473 KOps/s 1.8126 KOps/s $\color{#d91a1a}-3.60\%$
test_creation 95.4910μs 1.4240μs 702.2447 KOps/s 704.5640 KOps/s $\color{#d91a1a}-0.33\%$
test_creation_empty 43.8610μs 7.2384μs 138.1516 KOps/s 158.1738 KOps/s $\textbf{\color{#d91a1a}-12.66\%}$
test_creation_nested_1 45.5210μs 8.6675μs 115.3738 KOps/s 127.6870 KOps/s $\textbf{\color{#d91a1a}-9.64\%}$
test_creation_nested_2 41.3000μs 11.0496μs 90.5013 KOps/s 97.8647 KOps/s $\textbf{\color{#d91a1a}-7.52\%}$
test_clone 68.2600μs 10.2163μs 97.8830 KOps/s 101.5947 KOps/s $\color{#d91a1a}-3.65\%$
test_getitem[int] 1.2022ms 11.3168μs 88.3643 KOps/s 94.4526 KOps/s $\textbf{\color{#d91a1a}-6.45\%}$
test_getitem[slice_int] 0.1122ms 21.0008μs 47.6172 KOps/s 48.9088 KOps/s $\color{#d91a1a}-2.64\%$
test_getitem[range] 0.1356ms 35.1809μs 28.4245 KOps/s 28.0036 KOps/s $\color{#35bf28}+1.50\%$
test_getitem[tuple] 0.1142ms 18.5219μs 53.9901 KOps/s 55.1146 KOps/s $\color{#d91a1a}-2.04\%$
test_getitem[list] 0.2251ms 31.5660μs 31.6797 KOps/s 31.4314 KOps/s $\color{#35bf28}+0.79\%$
test_setitem_dim[int] 38.0400μs 18.1252μs 55.1717 KOps/s 57.3470 KOps/s $\color{#d91a1a}-3.79\%$
test_setitem_dim[slice_int] 67.2010μs 36.8622μs 27.1281 KOps/s 27.5926 KOps/s $\color{#d91a1a}-1.68\%$
test_setitem_dim[range] 73.4000μs 50.3854μs 19.8470 KOps/s 19.5808 KOps/s $\color{#35bf28}+1.36\%$
test_setitem_dim[tuple] 51.7200μs 31.5419μs 31.7039 KOps/s 31.5283 KOps/s $\color{#35bf28}+0.56\%$
test_setitem 89.1110μs 14.1797μs 70.5234 KOps/s 75.0175 KOps/s $\textbf{\color{#d91a1a}-5.99\%}$
test_set 85.2810μs 13.5399μs 73.8556 KOps/s 77.1704 KOps/s $\color{#d91a1a}-4.30\%$
test_set_shared 1.6017ms 0.1459ms 6.8528 KOps/s 6.9455 KOps/s $\color{#d91a1a}-1.33\%$
test_update 0.3445ms 16.1513μs 61.9147 KOps/s 67.5526 KOps/s $\textbf{\color{#d91a1a}-8.35\%}$
test_update_nested 81.6510μs 21.2405μs 47.0799 KOps/s 49.1120 KOps/s $\color{#d91a1a}-4.14\%$
test_update__nested 0.9420ms 23.4347μs 42.6717 KOps/s 42.7909 KOps/s $\color{#d91a1a}-0.28\%$
test_set_nested 71.6200μs 14.7768μs 67.6737 KOps/s 72.4322 KOps/s $\textbf{\color{#d91a1a}-6.57\%}$
test_set_nested_new 91.6300μs 16.7152μs 59.8259 KOps/s 63.4025 KOps/s $\textbf{\color{#d91a1a}-5.64\%}$
test_select 84.8710μs 28.4110μs 35.1976 KOps/s 37.2293 KOps/s $\textbf{\color{#d91a1a}-5.46\%}$
test_select_nested 73.0310μs 40.7307μs 24.5515 KOps/s 24.2862 KOps/s $\color{#35bf28}+1.09\%$
test_exclude_nested 96.4710μs 58.1817μs 17.1875 KOps/s 16.9420 KOps/s $\color{#35bf28}+1.45\%$
test_empty[True] 0.3183ms 0.2701ms 3.7019 KOps/s 3.6846 KOps/s $\color{#35bf28}+0.47\%$
test_empty[False] 3.9551μs 0.7375μs 1.3559 MOps/s 1.3580 MOps/s $\color{#d91a1a}-0.15\%$
test_to 87.1210μs 54.3706μs 18.3923 KOps/s 18.2399 KOps/s $\color{#35bf28}+0.84\%$
test_to_nonblocking 0.9787ms 45.3473μs 22.0520 KOps/s 22.1990 KOps/s $\color{#d91a1a}-0.66\%$
test_unbind_speed 1.6079ms 0.2364ms 4.2303 KOps/s 4.3327 KOps/s $\color{#d91a1a}-2.36\%$
test_unbind_speed_stack0 0.2867ms 0.2364ms 4.2307 KOps/s 4.2151 KOps/s $\color{#35bf28}+0.37\%$
test_unbind_speed_stack1 93.7962ms 0.6620ms 1.5107 KOps/s 1.5360 KOps/s $\color{#d91a1a}-1.65\%$
test_split 94.4445ms 1.6588ms 602.8367 Ops/s 579.9932 Ops/s $\color{#35bf28}+3.94\%$
test_chunk 96.9491ms 1.6573ms 603.3910 Ops/s 691.4343 Ops/s $\textbf{\color{#d91a1a}-12.73\%}$
test_consolidate[False-None] 97.1493ms 2.9016ms 344.6383 Ops/s 357.1532 Ops/s $\color{#d91a1a}-3.50\%$
test_consolidate[default-None] 1.7879ms 1.7239ms 580.0811 Ops/s 588.9860 Ops/s $\color{#d91a1a}-1.51\%$
test_consolidate[reduce-overhead-None] 1.8093ms 1.7478ms 572.1550 Ops/s 591.2723 Ops/s $\color{#d91a1a}-3.23\%$
test_consolidate_njt[False-None] 6.7876ms 6.3225ms 158.1643 Ops/s 161.9896 Ops/s $\color{#d91a1a}-2.36\%$
test_to[False-False-None] 1.8044ms 1.6986ms 588.7039 Ops/s 616.1676 Ops/s $\color{#d91a1a}-4.46\%$
test_to[True-False-None] 1.5204ms 1.2710ms 786.7701 Ops/s 808.8923 Ops/s $\color{#d91a1a}-2.73\%$
test_to[within-False-None] 4.2828ms 4.0739ms 245.4674 Ops/s 257.7204 Ops/s $\color{#d91a1a}-4.75\%$
test_to[True-default-None] 5.5503ms 5.2590ms 190.1497 Ops/s 193.3854 Ops/s $\color{#d91a1a}-1.67\%$
test_to_njt[False-False-None] 7.1652ms 6.8415ms 146.1677 Ops/s 145.5265 Ops/s $\color{#35bf28}+0.44\%$
test_to_njt[True-False-None] 5.6325ms 5.4261ms 184.2929 Ops/s 183.7922 Ops/s $\color{#35bf28}+0.27\%$
test_to_njt[within-False-None] 12.1838ms 11.7439ms 85.1508 Ops/s 83.9037 Ops/s $\color{#35bf28}+1.49\%$
test_creation[device0] 0.4698ms 77.7378μs 12.8638 KOps/s 12.7874 KOps/s $\color{#35bf28}+0.60\%$
test_creation_from_tensor 0.4970ms 81.2598μs 12.3062 KOps/s 12.1012 KOps/s $\color{#35bf28}+1.69\%$
test_add_one[memmap_tensor0] 0.4110ms 6.6453μs 150.4814 KOps/s 160.2328 KOps/s $\textbf{\color{#d91a1a}-6.09\%}$
test_contiguous[memmap_tensor0] 1.8085μs 0.3975μs 2.5156 MOps/s 2.4964 MOps/s $\color{#35bf28}+0.77\%$
test_stack[memmap_tensor0] 45.3200μs 4.8327μs 206.9219 KOps/s 225.0931 KOps/s $\textbf{\color{#d91a1a}-8.07\%}$
test_memmaptd_index 1.7904ms 0.2638ms 3.7906 KOps/s 4.1312 KOps/s $\textbf{\color{#d91a1a}-8.25\%}$
test_memmaptd_index_astensor 0.9356ms 0.3179ms 3.1461 KOps/s 3.3598 KOps/s $\textbf{\color{#d91a1a}-6.36\%}$
test_memmaptd_index_op 1.0048ms 0.5785ms 1.7285 KOps/s 1.9080 KOps/s $\textbf{\color{#d91a1a}-9.41\%}$
test_serialize_model 0.1297s 0.1283s 7.7972 Ops/s 7.6890 Ops/s $\color{#35bf28}+1.41\%$
test_serialize_model_pickle 1.3777s 1.1903s 0.8401 Ops/s 0.8180 Ops/s $\color{#35bf28}+2.70\%$
test_serialize_weights 0.1296s 0.1282s 7.8012 Ops/s 5.5165 Ops/s $\textbf{\color{#35bf28}+41.41\%}$
test_serialize_weights_returnearly 0.7150s 68.2325ms 14.6558 Ops/s 23.5226 Ops/s $\textbf{\color{#d91a1a}-37.69\%}$
test_serialize_weights_pickle 1.3770s 1.2219s 0.8184 Ops/s 0.8209 Ops/s $\color{#d91a1a}-0.31\%$
test_reshape_pytree 51.6800μs 22.3427μs 44.7573 KOps/s 46.1799 KOps/s $\color{#d91a1a}-3.08\%$
test_reshape_td 49.1700μs 26.1092μs 38.3006 KOps/s 39.9128 KOps/s $\color{#d91a1a}-4.04\%$
test_view_pytree 53.4910μs 21.9497μs 45.5586 KOps/s 47.1549 KOps/s $\color{#d91a1a}-3.39\%$
test_view_td 59.6510μs 28.7934μs 34.7302 KOps/s 34.2453 KOps/s $\color{#35bf28}+1.42\%$
test_unbind_pytree 52.9910μs 28.2506μs 35.3974 KOps/s 36.2837 KOps/s $\color{#d91a1a}-2.44\%$
test_unbind_td 0.7612ms 35.9503μs 27.8162 KOps/s 28.6887 KOps/s $\color{#d91a1a}-3.04\%$
test_split_pytree 63.5210μs 30.2604μs 33.0465 KOps/s 33.8855 KOps/s $\color{#d91a1a}-2.48\%$
test_split_td 0.1521s 51.8927μs 19.2705 KOps/s 26.3670 KOps/s $\textbf{\color{#d91a1a}-26.91\%}$
test_add_pytree 66.0600μs 33.5234μs 29.8299 KOps/s 30.9993 KOps/s $\color{#d91a1a}-3.77\%$
test_add_td 0.1632ms 43.7360μs 22.8644 KOps/s 23.9812 KOps/s $\color{#d91a1a}-4.66\%$
test_compile_add_one_nested[tensordict-compile] 0.1699ms 0.1194ms 8.3757 KOps/s 8.2754 KOps/s $\color{#35bf28}+1.21\%$
test_compile_add_one_nested[tensordict-eager] 0.2130ms 0.1209ms 8.2690 KOps/s 8.2121 KOps/s $\color{#35bf28}+0.69\%$
test_compile_add_one_nested[pytree-compile] 0.1461ms 98.8576μs 10.1156 KOps/s 10.1828 KOps/s $\color{#d91a1a}-0.66\%$
test_compile_add_one_nested[pytree-eager] 1.3983ms 0.1515ms 6.5989 KOps/s 6.8885 KOps/s $\color{#d91a1a}-4.20\%$
test_compile_copy_nested[tensordict-compile] 0.1306ms 23.6027μs 42.3681 KOps/s 48.1754 KOps/s $\textbf{\color{#d91a1a}-12.05\%}$
test_compile_copy_nested[tensordict-eager] 84.9710μs 26.3719μs 37.9192 KOps/s 37.6370 KOps/s $\color{#35bf28}+0.75\%$
test_compile_copy_nested[pytree-compile] 0.4510ms 64.9269μs 15.4019 KOps/s 15.3268 KOps/s $\color{#35bf28}+0.49\%$
test_compile_copy_nested[pytree-eager] 75.4300μs 49.0323μs 20.3947 KOps/s 20.1403 KOps/s $\color{#35bf28}+1.26\%$
test_compile_add_one_flat[tensordict-compile] 0.1854ms 0.1418ms 7.0515 KOps/s 7.0137 KOps/s $\color{#35bf28}+0.54\%$
test_compile_add_one_flat[tensordict-eager] 0.2956ms 0.2048ms 4.8835 KOps/s 4.9198 KOps/s $\color{#d91a1a}-0.74\%$
test_compile_add_one_flat[tensorclass-compile] 0.1453ms 96.4836μs 10.3645 KOps/s 10.4249 KOps/s $\color{#d91a1a}-0.58\%$
test_compile_add_one_flat[tensorclass-eager] 0.1044ms 50.0052μs 19.9979 KOps/s 19.9137 KOps/s $\color{#35bf28}+0.42\%$
test_compile_add_one_flat[pytree-compile] 0.1849ms 0.1383ms 7.2331 KOps/s 7.3561 KOps/s $\color{#d91a1a}-1.67\%$
test_compile_add_one_flat[pytree-eager] 0.5642ms 0.4977ms 2.0093 KOps/s 2.1103 KOps/s $\color{#d91a1a}-4.79\%$
test_compile_add_self_flat[tensordict-eager] 0.3896ms 0.2464ms 4.0583 KOps/s 4.0602 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_add_self_flat[tensordict-compile] 0.1959ms 0.1438ms 6.9563 KOps/s 6.7968 KOps/s $\color{#35bf28}+2.35\%$
test_compile_add_self_flat[tensorclass-eager] 0.1565ms 59.8213μs 16.7164 KOps/s 16.1181 KOps/s $\color{#35bf28}+3.71\%$
test_compile_add_self_flat[tensorclass-compile] 0.1423ms 98.0769μs 10.1961 KOps/s 9.7906 KOps/s $\color{#35bf28}+4.14\%$
test_compile_add_self_flat[pytree-eager] 0.5507ms 0.4234ms 2.3620 KOps/s 2.4704 KOps/s $\color{#d91a1a}-4.39\%$
test_compile_add_self_flat[pytree-compile] 0.2369ms 0.1346ms 7.4320 KOps/s 7.3588 KOps/s $\color{#35bf28}+0.99\%$
test_compile_copy_flat[tensordict-compile] 45.4000μs 18.3253μs 54.5695 KOps/s 42.1735 KOps/s $\textbf{\color{#35bf28}+29.39\%}$
test_compile_copy_flat[tensordict-eager] 61.2600μs 27.9455μs 35.7839 KOps/s 36.9458 KOps/s $\color{#d91a1a}-3.15\%$
test_compile_copy_flat[pytree-compile] 0.1164ms 69.8172μs 14.3231 KOps/s 14.1957 KOps/s $\color{#35bf28}+0.90\%$
test_compile_copy_flat[pytree-eager] 77.6800μs 51.4840μs 19.4235 KOps/s 19.4125 KOps/s $\color{#35bf28}+0.06\%$
test_compile_assign_and_add[tensordict-compile] 1.6018ms 0.3884ms 2.5750 KOps/s 2.2358 KOps/s $\textbf{\color{#35bf28}+15.17\%}$
test_compile_assign_and_add[tensordict-eager] 2.7308ms 2.5810ms 387.4469 Ops/s 392.4989 Ops/s $\color{#d91a1a}-1.29\%$
test_compile_assign_and_add[pytree-compile] 1.6030ms 0.4328ms 2.3106 KOps/s 2.3046 KOps/s $\color{#35bf28}+0.26\%$
test_compile_assign_and_add[pytree-eager] 2.7517ms 2.6464ms 377.8764 Ops/s 382.3764 Ops/s $\color{#d91a1a}-1.18\%$
test_compile_indexing[tensor-tensordict-compile] 0.1711ms 0.1140ms 8.7707 KOps/s 8.7403 KOps/s $\color{#35bf28}+0.35\%$
test_compile_indexing[tensor-tensordict-eager] 0.5709ms 80.3321μs 12.4483 KOps/s 12.8214 KOps/s $\color{#d91a1a}-2.91\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1742ms 0.1091ms 9.1675 KOps/s 9.7391 KOps/s $\textbf{\color{#d91a1a}-5.87\%}$
test_compile_indexing[tensor-tensorclass-eager] 0.1141ms 69.4367μs 14.4016 KOps/s 15.0446 KOps/s $\color{#d91a1a}-4.27\%$
test_compile_indexing[tensor-pytree-compile] 0.1720ms 0.1091ms 9.1684 KOps/s 9.4850 KOps/s $\color{#d91a1a}-3.34\%$
test_compile_indexing[tensor-pytree-eager] 0.1526ms 68.8279μs 14.5290 KOps/s 14.9665 KOps/s $\color{#d91a1a}-2.92\%$
test_compile_indexing[slice-tensordict-compile] 0.2207ms 0.1004ms 9.9589 KOps/s 9.9051 KOps/s $\color{#35bf28}+0.54\%$
test_compile_indexing[slice-tensordict-eager] 0.1436ms 17.2932μs 57.8261 KOps/s 57.7432 KOps/s $\color{#35bf28}+0.14\%$
test_compile_indexing[slice-tensorclass-compile] 0.1856ms 93.9870μs 10.6398 KOps/s 10.3128 KOps/s $\color{#35bf28}+3.17\%$
test_compile_indexing[slice-tensorclass-eager] 50.6510μs 15.9389μs 62.7397 KOps/s 64.9799 KOps/s $\color{#d91a1a}-3.45\%$
test_compile_indexing[slice-pytree-compile] 0.1458ms 95.7671μs 10.4420 KOps/s 10.2733 KOps/s $\color{#35bf28}+1.64\%$
test_compile_indexing[slice-pytree-eager] 49.0100μs 16.0606μs 62.2643 KOps/s 61.0402 KOps/s $\color{#35bf28}+2.01\%$
test_compile_indexing[int-tensordict-compile] 0.2091ms 0.1017ms 9.8351 KOps/s 9.8295 KOps/s $\color{#35bf28}+0.06\%$
test_compile_indexing[int-tensordict-eager] 0.5932ms 17.5216μs 57.0723 KOps/s 59.2634 KOps/s $\color{#d91a1a}-3.70\%$
test_compile_indexing[int-tensorclass-compile] 0.1534ms 97.2088μs 10.2871 KOps/s 10.1761 KOps/s $\color{#35bf28}+1.09\%$
test_compile_indexing[int-tensorclass-eager] 0.1129ms 15.8083μs 63.2579 KOps/s 64.7669 KOps/s $\color{#d91a1a}-2.33\%$
test_compile_indexing[int-pytree-compile] 0.1458ms 98.0219μs 10.2018 KOps/s 10.4084 KOps/s $\color{#d91a1a}-1.98\%$
test_compile_indexing[int-pytree-eager] 43.4110μs 15.8549μs 63.0720 KOps/s 64.7880 KOps/s $\color{#d91a1a}-2.65\%$
test_mod_add[eager] 85.2710μs 35.3830μs 28.2622 KOps/s 28.5732 KOps/s $\color{#d91a1a}-1.09\%$
test_mod_add[compile] 0.1225ms 79.1031μs 12.6417 KOps/s 12.6259 KOps/s $\color{#35bf28}+0.13\%$
test_mod_add[compile-overhead] 0.3298ms 0.1681ms 5.9477 KOps/s 5.7750 KOps/s $\color{#35bf28}+2.99\%$
test_mod_wrap[eager] 0.3516ms 0.2390ms 4.1842 KOps/s 4.0902 KOps/s $\color{#35bf28}+2.30\%$
test_mod_wrap[compile] 0.3977ms 0.2856ms 3.5018 KOps/s 3.5301 KOps/s $\color{#d91a1a}-0.80\%$
test_mod_wrap[compile-overhead] 7.1199ms 3.7499ms 266.6740 Ops/s 277.0118 Ops/s $\color{#d91a1a}-3.73\%$
test_mod_wrap_and_backward[eager] 1.4670ms 1.3390ms 746.8523 Ops/s 709.2335 Ops/s $\textbf{\color{#35bf28}+5.30\%}$
test_mod_wrap_and_backward[compile] 1.3769ms 1.2364ms 808.8114 Ops/s 739.6925 Ops/s $\textbf{\color{#35bf28}+9.34\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3642ms 0.9054ms 1.1045 KOps/s 976.4067 Ops/s $\textbf{\color{#35bf28}+13.12\%}$
test_seq_add[eager] 0.1499ms 0.1086ms 9.2053 KOps/s 9.2852 KOps/s $\color{#d91a1a}-0.86\%$
test_seq_add[compile] 0.5410ms 88.3609μs 11.3172 KOps/s 11.0886 KOps/s $\color{#35bf28}+2.06\%$
test_seq_add[compile-overhead] 0.1781ms 0.1291ms 7.7486 KOps/s 7.8591 KOps/s $\color{#d91a1a}-1.41\%$
test_seq_wrap[eager] 0.5152ms 0.4028ms 2.4825 KOps/s 2.3742 KOps/s $\color{#35bf28}+4.56\%$
test_seq_wrap[compile] 0.4068ms 0.2976ms 3.3602 KOps/s 3.3729 KOps/s $\color{#d91a1a}-0.38\%$
test_seq_wrap[compile-overhead] 0.2735ms 0.2213ms 4.5194 KOps/s 4.5137 KOps/s $\color{#35bf28}+0.13\%$
test_func_call_runtime[False-eager] 0.7730ms 0.7017ms 1.4250 KOps/s 1.3974 KOps/s $\color{#35bf28}+1.98\%$
test_func_call_runtime[False-compile] 1.0457ms 0.7262ms 1.3770 KOps/s 1.3518 KOps/s $\color{#35bf28}+1.87\%$
test_func_call_runtime[False-compile-overhead] 0.4868ms 0.3608ms 2.7715 KOps/s 2.8111 KOps/s $\color{#d91a1a}-1.41\%$
test_func_call_runtime[True-eager] 1.0531ms 0.8740ms 1.1442 KOps/s 1.1485 KOps/s $\color{#d91a1a}-0.37\%$
test_func_call_runtime[True-compile] 0.9209ms 0.7614ms 1.3133 KOps/s 1.3219 KOps/s $\color{#d91a1a}-0.65\%$
test_func_call_runtime[True-compile-overhead] 0.5228ms 0.3868ms 2.5855 KOps/s 2.6313 KOps/s $\color{#d91a1a}-1.74\%$
test_func_call_cm_runtime[False-eager] 0.8500ms 0.7080ms 1.4124 KOps/s 1.3607 KOps/s $\color{#35bf28}+3.80\%$
test_func_call_cm_runtime[False-compile] 1.1891ms 0.7467ms 1.3392 KOps/s 1.3526 KOps/s $\color{#d91a1a}-0.99\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5266ms 0.3668ms 2.7263 KOps/s 2.7763 KOps/s $\color{#d91a1a}-1.80\%$
test_func_call_cm_runtime[True-eager] 1.1515ms 0.9788ms 1.0217 KOps/s 1.0244 KOps/s $\color{#d91a1a}-0.27\%$
test_func_call_cm_runtime[True-compile] 1.2609ms 0.7994ms 1.2510 KOps/s 1.2694 KOps/s $\color{#d91a1a}-1.45\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5524ms 0.4138ms 2.4169 KOps/s 2.4738 KOps/s $\color{#d91a1a}-2.30\%$
test_vmap_func_call_cm_runtime[eager] 2.4949ms 2.0238ms 494.1189 Ops/s 498.4394 Ops/s $\color{#d91a1a}-0.87\%$
test_vmap_func_call_cm_runtime[compile] 0.8738ms 0.7878ms 1.2693 KOps/s 1.2198 KOps/s $\color{#35bf28}+4.06\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4623ms 0.4061ms 2.4627 KOps/s 2.4527 KOps/s $\color{#35bf28}+0.41\%$
test_distributed 2.4851ms 0.1785ms 5.6036 KOps/s 8.7491 KOps/s $\textbf{\color{#d91a1a}-35.95\%}$
test_tdmodule 0.2888ms 18.4853μs 54.0970 KOps/s 55.8436 KOps/s $\color{#d91a1a}-3.13\%$
test_tdmodule_dispatch 75.5000μs 32.1799μs 31.0753 KOps/s 31.1447 KOps/s $\color{#d91a1a}-0.22\%$
test_tdseq 37.9110μs 18.3521μs 54.4898 KOps/s 56.7637 KOps/s $\color{#d91a1a}-4.01\%$
test_tdseq_dispatch 54.4910μs 34.9375μs 28.6225 KOps/s 29.5063 KOps/s $\color{#d91a1a}-3.00\%$
test_instantiation_functorch 1.6986ms 1.5360ms 651.0372 Ops/s 657.5932 Ops/s $\color{#d91a1a}-1.00\%$
test_exec_functorch 0.1979ms 0.1402ms 7.1303 KOps/s 7.2140 KOps/s $\color{#d91a1a}-1.16\%$
test_exec_functional_call 0.1864ms 0.1321ms 7.5701 KOps/s 7.7173 KOps/s $\color{#d91a1a}-1.91\%$
test_exec_td_decorator 0.3644ms 0.1809ms 5.5293 KOps/s 5.7335 KOps/s $\color{#d91a1a}-3.56\%$
test_vmap_mlp_speed_decorator[True-True] 0.8387ms 0.6630ms 1.5084 KOps/s 1.5247 KOps/s $\color{#d91a1a}-1.07\%$
test_vmap_mlp_speed_decorator[True-False] 1.0513ms 0.6662ms 1.5012 KOps/s 1.5209 KOps/s $\color{#d91a1a}-1.30\%$
test_vmap_mlp_speed_decorator[False-True] 0.6793ms 0.5653ms 1.7689 KOps/s 1.7452 KOps/s $\color{#35bf28}+1.35\%$
test_vmap_mlp_speed_decorator[False-False] 0.6759ms 0.5692ms 1.7569 KOps/s 1.7443 KOps/s $\color{#35bf28}+0.72\%$
test_vmap_transformer_speed_decorator[True-True] 18.5498ms 18.4698ms 54.1425 Ops/s 53.8444 Ops/s $\color{#35bf28}+0.55\%$
test_vmap_transformer_speed_decorator[True-False] 18.8101ms 18.4767ms 54.1222 Ops/s 53.6802 Ops/s $\color{#35bf28}+0.82\%$
test_vmap_transformer_speed_decorator[False-True] 18.9641ms 18.6833ms 53.5238 Ops/s 54.2497 Ops/s $\color{#d91a1a}-1.34\%$
test_vmap_transformer_speed_decorator[False-False] 18.8456ms 18.6727ms 53.5541 Ops/s 54.0729 Ops/s $\color{#d91a1a}-0.96\%$
test_to_module_speed[True] 1.1797ms 0.9284ms 1.0772 KOps/s 1.0737 KOps/s $\color{#35bf28}+0.32\%$
test_to_module_speed[False] 1.5262ms 0.9174ms 1.0900 KOps/s 1.0894 KOps/s $\color{#35bf28}+0.05\%$
test_tc_init 72.2110μs 33.8629μs 29.5309 KOps/s 30.1633 KOps/s $\color{#d91a1a}-2.10\%$
test_tc_init_nested 0.1044ms 68.6034μs 14.5765 KOps/s 14.8108 KOps/s $\color{#d91a1a}-1.58\%$
test_tc_first_layer_tensor 4.8044μs 0.6881μs 1.4532 MOps/s 1.4487 MOps/s $\color{#35bf28}+0.31\%$
test_tc_first_layer_nontensor 32.5600μs 2.3275μs 429.6453 KOps/s 411.9113 KOps/s $\color{#35bf28}+4.31\%$
test_tc_second_layer_tensor 9.6550μs 1.4168μs 705.7961 KOps/s 703.1309 KOps/s $\color{#35bf28}+0.38\%$
test_tc_second_layer_nontensor 0.1380ms 3.0853μs 324.1206 KOps/s 315.2526 KOps/s $\color{#35bf28}+2.81\%$
test_unbind 0.2471s 10.4020ms 96.1356 Ops/s 152.9994 Ops/s $\textbf{\color{#d91a1a}-37.17\%}$
test_full_like 11.6367ms 9.3924ms 106.4690 Ops/s 107.3651 Ops/s $\color{#d91a1a}-0.83\%$
test_zeros_like 5.4117ms 4.3509ms 229.8388 Ops/s 230.0868 Ops/s $\color{#d91a1a}-0.11\%$
test_ones_like 4.9605ms 4.3551ms 229.6148 Ops/s 230.7390 Ops/s $\color{#d91a1a}-0.49\%$
test_clone 6.6970ms 6.4485ms 155.0755 Ops/s 156.0118 Ops/s $\color{#d91a1a}-0.60\%$
test_squeeze 58.2410μs 9.0156μs 110.9186 KOps/s 108.8202 KOps/s $\color{#35bf28}+1.93\%$
test_unsqueeze 0.2340ms 70.4755μs 14.1893 KOps/s 14.4183 KOps/s $\color{#d91a1a}-1.59\%$
test_split 0.4541ms 0.1569ms 6.3718 KOps/s 6.3635 KOps/s $\color{#35bf28}+0.13\%$
test_permute 0.5952ms 0.1798ms 5.5605 KOps/s 5.7219 KOps/s $\color{#d91a1a}-2.82\%$
test_stack 51.5898ms 51.0262ms 19.5978 Ops/s 19.6007 Ops/s $\color{#d91a1a}-0.02\%$
test_cat 51.2064ms 50.8400ms 19.6696 Ops/s 19.7803 Ops/s $\color{#d91a1a}-0.56\%$
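
For readers checking the table by hand: the Change column appears to be the relative difference between the Ops column and the Ops on Repo HEAD column, after converting both to the same unit. The sketch below reproduces two rows under that assumption. The helper names (`UNIT_SCALE`, `to_ops_per_second`, `relative_change`) are hypothetical and are not taken from the benchmark bot, whose implementation is not shown here.

```python
# Assumed unit multipliers for the throughput columns in the table.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def to_ops_per_second(value: float, unit: str) -> float:
    """Convert a (value, unit) pair such as (1.1045, "KOps/s") to plain Ops/s."""
    return value * UNIT_SCALE[unit]


def relative_change(ops_pr: tuple, ops_head: tuple) -> float:
    """Percentage change of the PR throughput relative to repo HEAD (assumed formula)."""
    pr = to_ops_per_second(*ops_pr)
    head = to_ops_per_second(*ops_head)
    return (pr - head) / head * 100.0


# test_chunk row: both columns are already in Ops/s.
chunk = relative_change((603.3910, "Ops/s"), (691.4343, "Ops/s"))

# test_mod_wrap_and_backward[compile-overhead] row: the columns use different units.
backward = relative_change((1.1045, "KOps/s"), (976.4067, "Ops/s"))

print(f"test_chunk: {chunk:+.2f}%")
print(f"test_mod_wrap_and_backward[compile-overhead]: {backward:+.2f}%")
```

Because the table prints rounded throughput values, recomputing a row this way can differ from the reported figure in the last decimal place.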

vmoens merged commit 4d57176 into gh/vmoens/35/base Nov 26, 2024
49 of 50 checks passed
vmoens added a commit that referenced this pull request Nov 26, 2024
ghstack-source-id: a7d786fe77c2c12d5c8c85579123a64ef5c87cf2
Pull Request resolved: #1111
vmoens deleted the gh/vmoens/35/head branch November 26, 2024 10:34
vmoens added the bug (Something isn't working) label Nov 26, 2024