Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Tests] Skip deprecation warning tests on FB fbcode #1128

Merged
merged 1 commit into from
Dec 4, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Dec 4, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: fb0cc381a670377667194324f5b019076b8e762d
Pull Request resolved: #1128
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 4, 2024
@vmoens vmoens merged commit f563108 into gh/vmoens/35/base Dec 4, 2024
26 of 37 checks passed
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: fb0cc381a670377667194324f5b019076b8e762d
Pull Request resolved: #1128
@vmoens vmoens deleted the gh/vmoens/35/head branch December 4, 2024 13:18
Copy link

github-actions bot commented Dec 4, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}9$. Worsened: $\large\color{#d91a1a}19$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 39.9240μs 18.3699μs 54.4368 KOps/s 54.0308 KOps/s $\color{#35bf28}+0.75\%$
test_plain_set_stack_nested 45.8260μs 18.2413μs 54.8207 KOps/s 53.2733 KOps/s $\color{#35bf28}+2.90\%$
test_plain_set_nested_inplace 40.3350μs 20.1710μs 49.5762 KOps/s 48.7044 KOps/s $\color{#35bf28}+1.79\%$
test_plain_set_stack_nested_inplace 66.4140μs 19.7754μs 50.5679 KOps/s 48.5606 KOps/s $\color{#35bf28}+4.13\%$
test_items 24.2750μs 4.2290μs 236.4631 KOps/s 233.9212 KOps/s $\color{#35bf28}+1.09\%$
test_items_nested 0.5394ms 0.4010ms 2.4936 KOps/s 2.5222 KOps/s $\color{#d91a1a}-1.13\%$
test_items_nested_locked 0.7297ms 0.4022ms 2.4860 KOps/s 2.5320 KOps/s $\color{#d91a1a}-1.82\%$
test_items_nested_leaf 0.1112ms 71.7098μs 13.9451 KOps/s 14.0862 KOps/s $\color{#d91a1a}-1.00\%$
test_items_stack_nested 0.4837ms 0.4033ms 2.4796 KOps/s 2.5346 KOps/s $\color{#d91a1a}-2.17\%$
test_items_stack_nested_leaf 0.1546ms 74.1588μs 13.4846 KOps/s 13.5721 KOps/s $\color{#d91a1a}-0.65\%$
test_items_stack_nested_locked 0.6035ms 0.4015ms 2.4905 KOps/s 2.5313 KOps/s $\color{#d91a1a}-1.61\%$
test_keys 35.2960μs 3.4782μs 287.5041 KOps/s 288.2401 KOps/s $\color{#d91a1a}-0.26\%$
test_keys_nested 0.2567ms 0.1365ms 7.3238 KOps/s 7.4814 KOps/s $\color{#d91a1a}-2.11\%$
test_keys_nested_locked 1.7110ms 0.1412ms 7.0821 KOps/s 7.1242 KOps/s $\color{#d91a1a}-0.59\%$
test_keys_nested_leaf 0.2237ms 0.1179ms 8.4849 KOps/s 8.6559 KOps/s $\color{#d91a1a}-1.97\%$
test_keys_stack_nested 0.2168ms 0.1343ms 7.4455 KOps/s 7.4472 KOps/s $\color{#d91a1a}-0.02\%$
test_keys_stack_nested_leaf 0.1670ms 0.1146ms 8.7257 KOps/s 8.6010 KOps/s $\color{#35bf28}+1.45\%$
test_keys_stack_nested_locked 0.2900ms 0.1399ms 7.1499 KOps/s 7.1256 KOps/s $\color{#35bf28}+0.34\%$
test_values 8.5780μs 1.0428μs 958.9769 KOps/s 962.5130 KOps/s $\color{#d91a1a}-0.37\%$
test_values_nested 0.1062ms 54.9612μs 18.1947 KOps/s 18.1354 KOps/s $\color{#35bf28}+0.33\%$
test_values_nested_locked 0.1101ms 54.6724μs 18.2908 KOps/s 18.3182 KOps/s $\color{#d91a1a}-0.15\%$
test_values_nested_leaf 0.1189ms 59.3672μs 16.8443 KOps/s 16.5035 KOps/s $\color{#35bf28}+2.07\%$
test_values_stack_nested 99.9570μs 56.5162μs 17.6940 KOps/s 17.8624 KOps/s $\color{#d91a1a}-0.94\%$
test_values_stack_nested_leaf 0.1109ms 60.0265μs 16.6593 KOps/s 15.4143 KOps/s $\textbf{\color{#35bf28}+8.08\%}$
test_values_stack_nested_locked 0.1050ms 56.5886μs 17.6714 KOps/s 17.9924 KOps/s $\color{#d91a1a}-1.78\%$
test_membership 11.8320μs 0.8813μs 1.1347 MOps/s 1.1631 MOps/s $\color{#d91a1a}-2.44\%$
test_membership_nested 34.7010μs 2.9130μs 343.2943 KOps/s 348.8559 KOps/s $\color{#d91a1a}-1.59\%$
test_membership_nested_leaf 32.0800μs 2.8949μs 345.4410 KOps/s 341.7166 KOps/s $\color{#35bf28}+1.09\%$
test_membership_stacked_nested 43.0000μs 2.8701μs 348.4213 KOps/s 347.2882 KOps/s $\color{#35bf28}+0.33\%$
test_membership_stacked_nested_leaf 30.8070μs 2.9245μs 341.9422 KOps/s 345.3107 KOps/s $\color{#d91a1a}-0.98\%$
test_membership_nested_last 36.0770μs 4.1885μs 238.7514 KOps/s 241.6311 KOps/s $\color{#d91a1a}-1.19\%$
test_membership_nested_leaf_last 52.4780μs 4.1612μs 240.3138 KOps/s 239.7412 KOps/s $\color{#35bf28}+0.24\%$
test_membership_stacked_nested_last 42.4890μs 13.4064μs 74.5910 KOps/s 209.2210 KOps/s $\textbf{\color{#d91a1a}-64.35\%}$
test_membership_stacked_nested_leaf_last 66.3640μs 13.0915μs 76.3853 KOps/s 207.8900 KOps/s $\textbf{\color{#d91a1a}-63.26\%}$
test_nested_getleaf 53.5400μs 10.7272μs 93.2213 KOps/s 93.8143 KOps/s $\color{#d91a1a}-0.63\%$
test_nested_get 57.0460μs 10.3236μs 96.8651 KOps/s 97.8481 KOps/s $\color{#d91a1a}-1.00\%$
test_stacked_getleaf 59.1800μs 10.7609μs 92.9288 KOps/s 92.8785 KOps/s $\color{#35bf28}+0.05\%$
test_stacked_get 51.3460μs 10.2628μs 97.4395 KOps/s 98.9065 KOps/s $\color{#d91a1a}-1.48\%$
test_nested_getitemleaf 54.4110μs 11.3841μs 87.8418 KOps/s 89.6896 KOps/s $\color{#d91a1a}-2.06\%$
test_nested_getitem 39.0830μs 10.5577μs 94.7174 KOps/s 95.4708 KOps/s $\color{#d91a1a}-0.79\%$
test_stacked_getitemleaf 62.0160μs 11.1689μs 89.5343 KOps/s 90.1469 KOps/s $\color{#d91a1a}-0.68\%$
test_stacked_getitem 44.1030μs 10.3630μs 96.4969 KOps/s 94.8799 KOps/s $\color{#35bf28}+1.70\%$
test_lock_nested 3.0045ms 0.4505ms 2.2198 KOps/s 2.2853 KOps/s $\color{#d91a1a}-2.87\%$
test_lock_stack_nested 0.6175ms 0.4088ms 2.4462 KOps/s 2.4388 KOps/s $\color{#35bf28}+0.30\%$
test_unlock_nested 1.1060ms 0.3648ms 2.7416 KOps/s 2.7761 KOps/s $\color{#d91a1a}-1.25\%$
test_unlock_stack_nested 0.5710ms 0.3257ms 3.0706 KOps/s 3.0268 KOps/s $\color{#35bf28}+1.45\%$
test_flatten_speed 0.1811ms 95.4945μs 10.4718 KOps/s 10.5961 KOps/s $\color{#d91a1a}-1.17\%$
test_unflatten_speed 0.7085ms 0.4998ms 2.0010 KOps/s 2.0518 KOps/s $\color{#d91a1a}-2.48\%$
test_common_ops 4.3448ms 0.8125ms 1.2307 KOps/s 1.2688 KOps/s $\color{#d91a1a}-3.00\%$
test_creation 14.3260μs 2.0539μs 486.8715 KOps/s 480.5497 KOps/s $\color{#35bf28}+1.32\%$
test_creation_empty 33.1420μs 11.6814μs 85.6065 KOps/s 80.3729 KOps/s $\textbf{\color{#35bf28}+6.51\%}$
test_creation_nested_1 46.6670μs 14.8913μs 67.1531 KOps/s 64.1135 KOps/s $\color{#35bf28}+4.74\%$
test_creation_nested_2 50.1040μs 19.0238μs 52.5656 KOps/s 51.0202 KOps/s $\color{#35bf28}+3.03\%$
test_clone 67.2250μs 13.1790μs 75.8784 KOps/s 78.5852 KOps/s $\color{#d91a1a}-3.44\%$
test_getitem[int] 1.2078ms 12.7339μs 78.5304 KOps/s 81.8125 KOps/s $\color{#d91a1a}-4.01\%$
test_getitem[slice_int] 0.1635ms 24.7201μs 40.4530 KOps/s 42.4677 KOps/s $\color{#d91a1a}-4.74\%$
test_getitem[range] 0.1708ms 49.1725μs 20.3366 KOps/s 21.4485 KOps/s $\textbf{\color{#d91a1a}-5.18\%}$
test_getitem[tuple] 0.1377ms 20.4808μs 48.8263 KOps/s 52.0069 KOps/s $\textbf{\color{#d91a1a}-6.12\%}$
test_getitem[list] 0.1698ms 43.7702μs 22.8466 KOps/s 23.5708 KOps/s $\color{#d91a1a}-3.07\%$
test_setitem_dim[int] 50.1530μs 25.2120μs 39.6637 KOps/s 39.8231 KOps/s $\color{#d91a1a}-0.40\%$
test_setitem_dim[slice_int] 94.0960μs 53.0067μs 18.8656 KOps/s 19.8151 KOps/s $\color{#d91a1a}-4.79\%$
test_setitem_dim[range] 0.1319ms 74.7399μs 13.3797 KOps/s 13.7965 KOps/s $\color{#d91a1a}-3.02\%$
test_setitem_dim[tuple] 70.9620μs 41.0521μs 24.3593 KOps/s 25.5074 KOps/s $\color{#d91a1a}-4.50\%$
test_setitem 0.1088ms 21.0673μs 47.4670 KOps/s 50.1093 KOps/s $\textbf{\color{#d91a1a}-5.27\%}$
test_set 80.6200μs 20.8487μs 47.9645 KOps/s 50.8420 KOps/s $\textbf{\color{#d91a1a}-5.66\%}$
test_set_shared 1.1459ms 0.1690ms 5.9155 KOps/s 5.9844 KOps/s $\color{#d91a1a}-1.15\%$
test_update 0.1315ms 23.1025μs 43.2853 KOps/s 44.0436 KOps/s $\color{#d91a1a}-1.72\%$
test_update_nested 0.1137ms 33.3586μs 29.9773 KOps/s 30.6863 KOps/s $\color{#d91a1a}-2.31\%$
test_update__nested 0.9836ms 32.1862μs 31.0692 KOps/s 32.0805 KOps/s $\color{#d91a1a}-3.15\%$
test_set_nested 69.8400μs 22.8011μs 43.8576 KOps/s 47.2400 KOps/s $\textbf{\color{#d91a1a}-7.16\%}$
test_set_nested_new 81.4220μs 27.6357μs 36.1850 KOps/s 38.3352 KOps/s $\textbf{\color{#d91a1a}-5.61\%}$
test_select 0.1240ms 44.4094μs 22.5178 KOps/s 23.8894 KOps/s $\textbf{\color{#d91a1a}-5.74\%}$
test_select_nested 0.1113ms 59.0024μs 16.9485 KOps/s 17.2259 KOps/s $\color{#d91a1a}-1.61\%$
test_exclude_nested 0.1406ms 77.0358μs 12.9810 KOps/s 13.1382 KOps/s $\color{#d91a1a}-1.20\%$
test_empty[True] 0.6939ms 0.3872ms 2.5825 KOps/s 2.6603 KOps/s $\color{#d91a1a}-2.93\%$
test_empty[False] 11.4337μs 1.2054μs 829.5855 KOps/s 822.5157 KOps/s $\color{#35bf28}+0.86\%$
test_unbind_speed 0.3635ms 0.2635ms 3.7946 KOps/s 3.8539 KOps/s $\color{#d91a1a}-1.54\%$
test_unbind_speed_stack0 0.5302ms 0.2526ms 3.9593 KOps/s 3.9413 KOps/s $\color{#35bf28}+0.46\%$
test_unbind_speed_stack1 0.1119s 0.7500ms 1.3333 KOps/s 1.4490 KOps/s $\textbf{\color{#d91a1a}-7.98\%}$
test_split 1.7734ms 1.5620ms 640.1914 Ops/s 576.1230 Ops/s $\textbf{\color{#35bf28}+11.12\%}$
test_chunk 0.1150s 1.9403ms 515.3967 Ops/s 575.2311 Ops/s $\textbf{\color{#d91a1a}-10.40\%}$
test_consolidate_njt[False-None] 10.0422ms 8.0839ms 123.7028 Ops/s 123.2167 Ops/s $\color{#35bf28}+0.39\%$
test_creation[device0] 0.2109ms 91.3441μs 10.9476 KOps/s 11.0501 KOps/s $\color{#d91a1a}-0.93\%$
test_creation_from_tensor 3.6283ms 95.5099μs 10.4701 KOps/s 10.2207 KOps/s $\color{#35bf28}+2.44\%$
test_add_one[memmap_tensor0] 0.1729ms 4.9581μs 201.6919 KOps/s 197.2967 KOps/s $\color{#35bf28}+2.23\%$
test_contiguous[memmap_tensor0] 13.4260μs 0.5193μs 1.9256 MOps/s 1.9589 MOps/s $\color{#d91a1a}-1.70\%$
test_stack[memmap_tensor0] 39.3830μs 3.3991μs 294.1949 KOps/s 297.5145 KOps/s $\color{#d91a1a}-1.12\%$
test_memmaptd_index 1.1819ms 0.2373ms 4.2141 KOps/s 4.2712 KOps/s $\color{#d91a1a}-1.34\%$
test_memmaptd_index_astensor 0.7844ms 0.3171ms 3.1531 KOps/s 3.2185 KOps/s $\color{#d91a1a}-2.03\%$
test_memmaptd_index_op 1.0069ms 0.5945ms 1.6822 KOps/s 1.6767 KOps/s $\color{#35bf28}+0.32\%$
test_serialize_model 0.1320s 0.1211s 8.2598 Ops/s 7.4107 Ops/s $\textbf{\color{#35bf28}+11.46\%}$
test_serialize_model_pickle 0.4613s 0.4004s 2.4977 Ops/s 2.5436 Ops/s $\color{#d91a1a}-1.80\%$
test_serialize_weights 0.1261s 0.1172s 8.5320 Ops/s 8.6332 Ops/s $\color{#d91a1a}-1.17\%$
test_serialize_weights_returnearly 0.1727s 0.1597s 6.2610 Ops/s 6.4267 Ops/s $\color{#d91a1a}-2.58\%$
test_serialize_weights_pickle 0.5581s 0.4417s 2.2641 Ops/s 2.4872 Ops/s $\textbf{\color{#d91a1a}-8.97\%}$
test_serialize_weights_filesystem 0.1450s 0.1401s 7.1380 Ops/s 6.2786 Ops/s $\textbf{\color{#35bf28}+13.69\%}$
test_serialize_model_filesystem 0.1581s 0.1487s 6.7266 Ops/s 6.5017 Ops/s $\color{#35bf28}+3.46\%$
test_reshape_pytree 69.8400μs 26.1944μs 38.1760 KOps/s 38.0040 KOps/s $\color{#35bf28}+0.45\%$
test_reshape_td 67.9970μs 33.8071μs 29.5796 KOps/s 31.5394 KOps/s $\textbf{\color{#d91a1a}-6.21\%}$
test_view_pytree 86.2610μs 26.2015μs 38.1658 KOps/s 38.0473 KOps/s $\color{#35bf28}+0.31\%$
test_view_td 0.1412ms 39.3926μs 25.3855 KOps/s 27.2716 KOps/s $\textbf{\color{#d91a1a}-6.92\%}$
test_unbind_pytree 61.9960μs 29.7887μs 33.5697 KOps/s 33.9818 KOps/s $\color{#d91a1a}-1.21\%$
test_unbind_td 0.3530ms 38.4827μs 25.9857 KOps/s 26.2661 KOps/s $\color{#d91a1a}-1.07\%$
test_split_pytree 90.5050μs 29.1326μs 34.3258 KOps/s 33.9920 KOps/s $\color{#35bf28}+0.98\%$
test_split_td 0.5610ms 45.0662μs 22.1896 KOps/s 22.8831 KOps/s $\color{#d91a1a}-3.03\%$
test_add_pytree 85.2300μs 35.9917μs 27.7842 KOps/s 28.3220 KOps/s $\color{#d91a1a}-1.90\%$
test_add_td 0.1337ms 57.6860μs 17.3352 KOps/s 17.7046 KOps/s $\color{#d91a1a}-2.09\%$
test_compile_add_one_nested[tensordict-compile] 0.1365ms 64.4677μs 15.5116 KOps/s 15.9592 KOps/s $\color{#d91a1a}-2.80\%$
test_compile_add_one_nested[tensordict-eager] 12.0546ms 0.1643ms 6.0862 KOps/s 6.1914 KOps/s $\color{#d91a1a}-1.70\%$
test_compile_add_one_nested[pytree-compile] 0.1130ms 47.3371μs 21.1251 KOps/s 22.0205 KOps/s $\color{#d91a1a}-4.07\%$
test_compile_add_one_nested[pytree-eager] 0.2988ms 0.1191ms 8.3959 KOps/s 8.5704 KOps/s $\color{#d91a1a}-2.04\%$
test_compile_copy_nested[tensordict-compile] 62.3160μs 26.3607μs 37.9353 KOps/s 38.3803 KOps/s $\color{#d91a1a}-1.16\%$
test_compile_copy_nested[tensordict-eager] 0.1232ms 53.9912μs 18.5215 KOps/s 18.4839 KOps/s $\color{#35bf28}+0.20\%$
test_compile_copy_nested[pytree-compile] 0.1471ms 78.3872μs 12.7572 KOps/s 12.6623 KOps/s $\color{#35bf28}+0.75\%$
test_compile_copy_nested[pytree-eager] 0.1335ms 67.9694μs 14.7125 KOps/s 14.7617 KOps/s $\color{#d91a1a}-0.33\%$
test_compile_add_one_flat[tensordict-compile] 0.2420ms 0.1067ms 9.3719 KOps/s 9.6013 KOps/s $\color{#d91a1a}-2.39\%$
test_compile_add_one_flat[tensordict-eager] 0.3640ms 0.1983ms 5.0420 KOps/s 5.0107 KOps/s $\color{#35bf28}+0.62\%$
test_compile_add_one_flat[tensorclass-compile] 0.1208ms 45.5520μs 21.9529 KOps/s 22.5938 KOps/s $\color{#d91a1a}-2.84\%$
test_compile_add_one_flat[tensorclass-eager] 0.4917ms 63.5002μs 15.7480 KOps/s 16.2084 KOps/s $\color{#d91a1a}-2.84\%$
test_compile_add_one_flat[pytree-compile] 0.2409ms 0.1047ms 9.5534 KOps/s 9.7377 KOps/s $\color{#d91a1a}-1.89\%$
test_compile_add_one_flat[pytree-eager] 0.2878ms 0.1986ms 5.0348 KOps/s 4.9553 KOps/s $\color{#35bf28}+1.60\%$
test_compile_add_self_flat[tensordict-eager] 0.2966ms 0.2100ms 4.7614 KOps/s 4.7938 KOps/s $\color{#d91a1a}-0.68\%$
test_compile_add_self_flat[tensordict-compile] 0.2276ms 0.1093ms 9.1455 KOps/s 9.5395 KOps/s $\color{#d91a1a}-4.13\%$
test_compile_add_self_flat[tensorclass-eager] 0.2259ms 55.9798μs 17.8636 KOps/s 18.9271 KOps/s $\textbf{\color{#d91a1a}-5.62\%}$
test_compile_add_self_flat[tensorclass-compile] 96.3600μs 47.1542μs 21.2070 KOps/s 21.5779 KOps/s $\color{#d91a1a}-1.72\%$
test_compile_add_self_flat[pytree-eager] 0.6163ms 0.1569ms 6.3721 KOps/s 6.2494 KOps/s $\color{#35bf28}+1.96\%$
test_compile_add_self_flat[pytree-compile] 0.1879ms 0.1067ms 9.3750 KOps/s 9.7564 KOps/s $\color{#d91a1a}-3.91\%$
test_compile_copy_flat[tensordict-compile] 62.8980μs 21.1875μs 47.1977 KOps/s 47.2847 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_copy_flat[tensordict-eager] 0.1229ms 58.6847μs 17.0402 KOps/s 16.7225 KOps/s $\color{#35bf28}+1.90\%$
test_compile_copy_flat[pytree-compile] 0.1529ms 78.8977μs 12.6746 KOps/s 12.2556 KOps/s $\color{#35bf28}+3.42\%$
test_compile_copy_flat[pytree-eager] 0.1285ms 67.2524μs 14.8694 KOps/s 14.4814 KOps/s $\color{#35bf28}+2.68\%$
test_compile_assign_and_add[tensordict-compile] 0.3490ms 0.2047ms 4.8863 KOps/s 4.8408 KOps/s $\color{#35bf28}+0.94\%$
test_compile_assign_and_add[tensordict-eager] 1.7106ms 1.2959ms 771.6709 Ops/s 792.5105 Ops/s $\color{#d91a1a}-2.63\%$
test_compile_assign_and_add[pytree-compile] 0.3891ms 0.2030ms 4.9271 KOps/s 4.9410 KOps/s $\color{#d91a1a}-0.28\%$
test_compile_assign_and_add[pytree-eager] 0.8638ms 0.7692ms 1.3000 KOps/s 1.2859 KOps/s $\color{#35bf28}+1.09\%$
test_compile_assign_and_add_stack[compile] 0.6056ms 0.4554ms 2.1957 KOps/s 2.1972 KOps/s $\color{#d91a1a}-0.07\%$
test_compile_assign_and_add_stack[eager] 4.3364ms 2.6769ms 373.5734 Ops/s 375.6398 Ops/s $\color{#d91a1a}-0.55\%$
test_compile_indexing[tensor-tensordict-compile] 0.1166ms 37.2706μs 26.8308 KOps/s 27.3072 KOps/s $\color{#d91a1a}-1.74\%$
test_compile_indexing[tensor-tensordict-eager] 0.5429ms 33.1547μs 30.1617 KOps/s 30.7804 KOps/s $\color{#d91a1a}-2.01\%$
test_compile_indexing[tensor-tensorclass-compile] 93.8950μs 29.8387μs 33.5135 KOps/s 34.8581 KOps/s $\color{#d91a1a}-3.86\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1042ms 23.1200μs 43.2526 KOps/s 43.9182 KOps/s $\color{#d91a1a}-1.52\%$
test_compile_indexing[tensor-pytree-compile] 72.8650μs 30.0212μs 33.3098 KOps/s 33.9393 KOps/s $\color{#d91a1a}-1.85\%$
test_compile_indexing[tensor-pytree-eager] 80.8610μs 23.4801μs 42.5892 KOps/s 43.0725 KOps/s $\color{#d91a1a}-1.12\%$
test_compile_indexing[slice-tensordict-compile] 0.1083ms 51.0610μs 19.5844 KOps/s 19.3595 KOps/s $\color{#35bf28}+1.16\%$
test_compile_indexing[slice-tensordict-eager] 0.5966ms 20.3262μs 49.1976 KOps/s 50.5643 KOps/s $\color{#d91a1a}-2.70\%$
test_compile_indexing[slice-tensorclass-compile] 0.1247ms 43.9044μs 22.7768 KOps/s 22.5276 KOps/s $\color{#35bf28}+1.11\%$
test_compile_indexing[slice-tensorclass-eager] 62.6670μs 18.4407μs 54.2278 KOps/s 52.7706 KOps/s $\color{#35bf28}+2.76\%$
test_compile_indexing[slice-pytree-compile] 99.1150μs 44.5882μs 22.4275 KOps/s 22.0145 KOps/s $\color{#35bf28}+1.88\%$
test_compile_indexing[slice-pytree-eager] 62.6170μs 18.6119μs 53.7291 KOps/s 52.8883 KOps/s $\color{#35bf28}+1.59\%$
test_compile_indexing[int-tensordict-compile] 0.1378ms 53.2289μs 18.7868 KOps/s 18.7380 KOps/s $\color{#35bf28}+0.26\%$
test_compile_indexing[int-tensordict-eager] 1.0171ms 20.0926μs 49.7695 KOps/s 50.3873 KOps/s $\color{#d91a1a}-1.23\%$
test_compile_indexing[int-tensorclass-compile] 97.8420μs 44.2505μs 22.5986 KOps/s 22.4823 KOps/s $\color{#35bf28}+0.52\%$
test_compile_indexing[int-tensorclass-eager] 63.4380μs 18.5354μs 53.9509 KOps/s 53.4517 KOps/s $\color{#35bf28}+0.93\%$
test_compile_indexing[int-pytree-compile] 97.9930μs 44.4280μs 22.5083 KOps/s 22.2333 KOps/s $\color{#35bf28}+1.24\%$
test_compile_indexing[int-pytree-eager] 67.7470μs 18.4058μs 54.3307 KOps/s 53.2115 KOps/s $\color{#35bf28}+2.10\%$
test_mod_add[eager] 91.3910μs 35.2096μs 28.4013 KOps/s 28.1044 KOps/s $\color{#35bf28}+1.06\%$
test_mod_add[compile] 0.1081ms 47.3117μs 21.1364 KOps/s 20.8210 KOps/s $\color{#35bf28}+1.51\%$
test_mod_add[compile-overhead] 99.9570μs 46.7183μs 21.4049 KOps/s 20.6605 KOps/s $\color{#35bf28}+3.60\%$
test_mod_wrap[eager] 0.3652ms 0.2248ms 4.4485 KOps/s 4.4251 KOps/s $\color{#35bf28}+0.53\%$
test_mod_wrap[compile] 0.4396ms 0.2085ms 4.7973 KOps/s 4.7585 KOps/s $\color{#35bf28}+0.82\%$
test_mod_wrap[compile-overhead] 0.4113ms 0.2065ms 4.8424 KOps/s 4.8049 KOps/s $\color{#35bf28}+0.78\%$
test_mod_wrap_and_backward[eager] 17.7090ms 12.2698ms 81.5006 Ops/s 75.1033 Ops/s $\textbf{\color{#35bf28}+8.52\%}$
test_mod_wrap_and_backward[compile] 17.2776ms 13.4619ms 74.2840 Ops/s 73.5251 Ops/s $\color{#35bf28}+1.03\%$
test_mod_wrap_and_backward[compile-overhead] 23.2788ms 13.1413ms 76.0961 Ops/s 76.0696 Ops/s $\color{#35bf28}+0.03\%$
test_seq_add[eager] 0.1890ms 0.1128ms 8.8628 KOps/s 8.6461 KOps/s $\color{#35bf28}+2.51\%$
test_seq_add[compile] 0.1238ms 62.6916μs 15.9511 KOps/s 15.8339 KOps/s $\color{#35bf28}+0.74\%$
test_seq_add[compile-overhead] 0.1291ms 59.1388μs 16.9094 KOps/s 16.2609 KOps/s $\color{#35bf28}+3.99\%$
test_seq_wrap[eager] 0.8299ms 0.4447ms 2.2487 KOps/s 2.2128 KOps/s $\color{#35bf28}+1.62\%$
test_seq_wrap[compile] 0.4029ms 0.2297ms 4.3532 KOps/s 4.2855 KOps/s $\color{#35bf28}+1.58\%$
test_seq_wrap[compile-overhead] 0.4204ms 0.2277ms 4.3921 KOps/s 4.3319 KOps/s $\color{#35bf28}+1.39\%$
test_func_call_runtime[False-eager] 0.7434ms 0.5445ms 1.8364 KOps/s 1.8001 KOps/s $\color{#35bf28}+2.02\%$
test_func_call_runtime[False-compile] 0.5306ms 0.4291ms 2.3306 KOps/s 2.3684 KOps/s $\color{#d91a1a}-1.60\%$
test_func_call_runtime[False-compile-overhead] 0.7945ms 0.4306ms 2.3223 KOps/s 2.3548 KOps/s $\color{#d91a1a}-1.38\%$
test_func_call_runtime[True-eager] 1.5981ms 0.7501ms 1.3332 KOps/s 1.3105 KOps/s $\color{#35bf28}+1.73\%$
test_func_call_runtime[True-compile] 0.7392ms 0.4656ms 2.1478 KOps/s 2.1283 KOps/s $\color{#35bf28}+0.92\%$
test_func_call_runtime[True-compile-overhead] 0.5668ms 0.4651ms 2.1500 KOps/s 2.1222 KOps/s $\color{#35bf28}+1.31\%$
test_func_call_cm_runtime[False-eager] 0.8955ms 0.5455ms 1.8333 KOps/s 1.8320 KOps/s $\color{#35bf28}+0.07\%$
test_func_call_cm_runtime[False-compile] 0.7699ms 0.4289ms 2.3314 KOps/s 2.3356 KOps/s $\color{#d91a1a}-0.18\%$
test_func_call_cm_runtime[False-compile-overhead] 0.8975ms 0.4273ms 2.3405 KOps/s 2.3351 KOps/s $\color{#35bf28}+0.23\%$
test_func_call_cm_runtime[True-eager] 1.0358ms 0.8889ms 1.1249 KOps/s 1.0979 KOps/s $\color{#35bf28}+2.46\%$
test_func_call_cm_runtime[True-compile] 0.8772ms 0.4922ms 2.0317 KOps/s 2.0210 KOps/s $\color{#35bf28}+0.53\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5986ms 0.4915ms 2.0348 KOps/s 2.0162 KOps/s $\color{#35bf28}+0.92\%$
test_vmap_func_call_cm_runtime[eager] 2.6776ms 1.8665ms 535.7478 Ops/s 524.9593 Ops/s $\color{#35bf28}+2.06\%$
test_vmap_func_call_cm_runtime[compile] 0.8918ms 0.5193ms 1.9256 KOps/s 1.9243 KOps/s $\color{#35bf28}+0.06\%$
test_vmap_func_call_cm_runtime[compile-overhead] 1.0558ms 0.5287ms 1.8914 KOps/s 1.9267 KOps/s $\color{#d91a1a}-1.83\%$
test_distributed 0.2221ms 0.1266ms 7.9016 KOps/s 7.7515 KOps/s $\color{#35bf28}+1.94\%$
test_tdmodule 87.3430μs 25.4763μs 39.2522 KOps/s 36.7552 KOps/s $\textbf{\color{#35bf28}+6.79\%}$
test_tdmodule_dispatch 85.7300μs 46.3240μs 21.5871 KOps/s 20.0633 KOps/s $\textbf{\color{#35bf28}+7.59\%}$
test_tdseq 51.0960μs 26.2668μs 38.0708 KOps/s 36.6547 KOps/s $\color{#35bf28}+3.86\%$
test_tdseq_dispatch 82.5240μs 49.7056μs 20.1185 KOps/s 19.1253 KOps/s $\textbf{\color{#35bf28}+5.19\%}$
test_instantiation_functorch 2.4193ms 1.5527ms 644.0463 Ops/s 638.5479 Ops/s $\color{#35bf28}+0.86\%$
test_exec_functorch 0.3135ms 0.1833ms 5.4545 KOps/s 5.5539 KOps/s $\color{#d91a1a}-1.79\%$
test_exec_functional_call 0.2811ms 0.1716ms 5.8265 KOps/s 5.8200 KOps/s $\color{#35bf28}+0.11\%$
test_exec_td_decorator 0.5060ms 0.2317ms 4.3159 KOps/s 4.3076 KOps/s $\color{#35bf28}+0.19\%$
test_vmap_mlp_speed_decorator[True-True] 1.0071ms 0.6711ms 1.4901 KOps/s 1.4923 KOps/s $\color{#d91a1a}-0.15\%$
test_vmap_mlp_speed_decorator[True-False] 0.8590ms 0.6463ms 1.5472 KOps/s 1.5302 KOps/s $\color{#35bf28}+1.11\%$
test_vmap_mlp_speed_decorator[False-True] 0.6886ms 0.5187ms 1.9280 KOps/s 1.8930 KOps/s $\color{#35bf28}+1.85\%$
test_vmap_mlp_speed_decorator[False-False] 0.8260ms 0.5223ms 1.9145 KOps/s 1.8874 KOps/s $\color{#35bf28}+1.43\%$
test_to_module_speed[True] 1.4275ms 1.2689ms 788.0788 Ops/s 773.0217 Ops/s $\color{#35bf28}+1.95\%$
test_to_module_speed[False] 1.7182ms 1.2411ms 805.7153 Ops/s 797.1555 Ops/s $\color{#35bf28}+1.07\%$
test_tc_init 83.6460μs 48.0804μs 20.7985 KOps/s 21.5787 KOps/s $\color{#d91a1a}-3.62\%$
test_tc_init_nested 0.1421ms 97.4340μs 10.2634 KOps/s 10.7809 KOps/s $\color{#d91a1a}-4.80\%$
test_tc_first_layer_tensor 49.3680μs 1.5039μs 664.9180 KOps/s 644.9792 KOps/s $\color{#35bf28}+3.09\%$
test_tc_first_layer_nontensor 23.6040μs 4.6728μs 214.0034 KOps/s 214.7428 KOps/s $\color{#d91a1a}-0.34\%$
test_tc_second_layer_tensor 23.4740μs 2.7983μs 357.3586 KOps/s 354.9637 KOps/s $\color{#35bf28}+0.67\%$
test_tc_second_layer_nontensor 23.0230μs 6.0139μs 166.2814 KOps/s 166.0140 KOps/s $\color{#35bf28}+0.16\%$
test_unbind 0.2268s 13.2596ms 75.4169 Ops/s 76.2390 Ops/s $\color{#d91a1a}-1.08\%$
test_full_like 15.9520ms 13.5961ms 73.5506 Ops/s 124.6884 Ops/s $\textbf{\color{#d91a1a}-41.01\%}$
test_zeros_like 15.6117ms 7.7675ms 128.7408 Ops/s 323.5209 Ops/s $\textbf{\color{#d91a1a}-60.21\%}$
test_ones_like 13.4488ms 8.1621ms 122.5181 Ops/s 288.6180 Ops/s $\textbf{\color{#d91a1a}-57.55\%}$
test_clone 12.6356ms 10.0526ms 99.4768 Ops/s 173.1938 Ops/s $\textbf{\color{#d91a1a}-42.56\%}$
test_squeeze 59.4710μs 11.6620μs 85.7487 KOps/s 86.2377 KOps/s $\color{#d91a1a}-0.57\%$
test_unsqueeze 0.2831ms 89.4672μs 11.1773 KOps/s 11.5619 KOps/s $\color{#d91a1a}-3.33\%$
test_split 0.3207ms 0.1939ms 5.1585 KOps/s 5.2205 KOps/s $\color{#d91a1a}-1.19\%$
test_permute 0.4433ms 0.2227ms 4.4908 KOps/s 4.6972 KOps/s $\color{#d91a1a}-4.39\%$
test_stack 33.6165ms 27.7447ms 36.0429 Ops/s 36.3084 Ops/s $\color{#d91a1a}-0.73\%$
test_cat 31.1772ms 27.5181ms 36.3398 Ops/s 36.6274 Ops/s $\color{#d91a1a}-0.79\%$

Copy link

github-actions bot commented Dec 4, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}37$. Worsened: $\large\color{#d91a1a}16$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 37.5920μs 10.9931μs 90.9663 KOps/s 96.5287 KOps/s $\textbf{\color{#d91a1a}-5.76\%}$
test_plain_set_stack_nested 37.3720μs 10.9843μs 91.0393 KOps/s 96.3851 KOps/s $\textbf{\color{#d91a1a}-5.55\%}$
test_plain_set_nested_inplace 45.5530μs 11.8502μs 84.3865 KOps/s 88.5236 KOps/s $\color{#d91a1a}-4.67\%$
test_plain_set_stack_nested_inplace 44.9630μs 11.8788μs 84.1836 KOps/s 88.9868 KOps/s $\textbf{\color{#d91a1a}-5.40\%}$
test_items 35.5320μs 2.8960μs 345.3049 KOps/s 341.9895 KOps/s $\color{#35bf28}+0.97\%$
test_items_nested 0.4223ms 0.3529ms 2.8339 KOps/s 2.8034 KOps/s $\color{#35bf28}+1.09\%$
test_items_nested_locked 0.5236ms 0.3531ms 2.8318 KOps/s 2.7489 KOps/s $\color{#35bf28}+3.01\%$
test_items_nested_leaf 0.2103ms 58.0282μs 17.2330 KOps/s 17.0753 KOps/s $\color{#35bf28}+0.92\%$
test_items_stack_nested 0.4091ms 0.3552ms 2.8155 KOps/s 2.7980 KOps/s $\color{#35bf28}+0.62\%$
test_items_stack_nested_leaf 0.1134ms 59.4139μs 16.8311 KOps/s 16.8892 KOps/s $\color{#d91a1a}-0.34\%$
test_items_stack_nested_locked 0.4564ms 0.3561ms 2.8082 KOps/s 2.7760 KOps/s $\color{#35bf28}+1.16\%$
test_keys 29.2120μs 3.4438μs 290.3754 KOps/s 285.0213 KOps/s $\color{#35bf28}+1.88\%$
test_keys_nested 0.1291ms 69.7550μs 14.3359 KOps/s 14.0981 KOps/s $\color{#35bf28}+1.69\%$
test_keys_nested_locked 0.6954ms 75.4073μs 13.2613 KOps/s 13.1227 KOps/s $\color{#35bf28}+1.06\%$
test_keys_nested_leaf 90.7340μs 61.2490μs 16.3268 KOps/s 16.1587 KOps/s $\color{#35bf28}+1.04\%$
test_keys_stack_nested 0.1079ms 70.9312μs 14.0982 KOps/s 13.9991 KOps/s $\color{#35bf28}+0.71\%$
test_keys_stack_nested_leaf 99.8550μs 62.3266μs 16.0445 KOps/s 16.0285 KOps/s $\color{#35bf28}+0.10\%$
test_keys_stack_nested_locked 0.2196ms 75.7899μs 13.1944 KOps/s 12.9958 KOps/s $\color{#35bf28}+1.53\%$
test_values 7.0787μs 0.8405μs 1.1898 MOps/s 1.1746 MOps/s $\color{#35bf28}+1.30\%$
test_values_nested 97.0450μs 31.1803μs 32.0715 KOps/s 32.0915 KOps/s $\color{#d91a1a}-0.06\%$
test_values_nested_locked 68.3330μs 32.7247μs 30.5580 KOps/s 30.4168 KOps/s $\color{#35bf28}+0.46\%$
test_values_nested_leaf 65.4940μs 33.6257μs 29.7391 KOps/s 29.6890 KOps/s $\color{#35bf28}+0.17\%$
test_values_stack_nested 0.1389ms 31.9601μs 31.2890 KOps/s 31.5594 KOps/s $\color{#d91a1a}-0.86\%$
test_values_stack_nested_leaf 69.2140μs 34.1115μs 29.3157 KOps/s 29.0610 KOps/s $\color{#35bf28}+0.88\%$
test_values_stack_nested_locked 0.1006ms 33.2897μs 30.0393 KOps/s 30.0941 KOps/s $\color{#d91a1a}-0.18\%$
test_membership 2.1786μs 0.5101μs 1.9604 MOps/s 1.9913 MOps/s $\color{#d91a1a}-1.55\%$
test_membership_nested 25.4720μs 2.0559μs 486.4110 KOps/s 491.0199 KOps/s $\color{#d91a1a}-0.94\%$
test_membership_nested_leaf 35.5215μs 1.9971μs 500.7272 KOps/s 486.5386 KOps/s $\color{#35bf28}+2.92\%$
test_membership_stacked_nested 54.5820μs 2.0563μs 486.3175 KOps/s 479.4191 KOps/s $\color{#35bf28}+1.44\%$
test_membership_stacked_nested_leaf 38.9120μs 2.0723μs 482.5537 KOps/s 468.5837 KOps/s $\color{#35bf28}+2.98\%$
test_membership_nested_last 36.6010μs 2.9854μs 334.9597 KOps/s 327.9673 KOps/s $\color{#35bf28}+2.13\%$
test_membership_nested_leaf_last 0.1726ms 3.0186μs 331.2763 KOps/s 332.1810 KOps/s $\color{#d91a1a}-0.27\%$
test_membership_stacked_nested_last 32.6220μs 3.3904μs 294.9532 KOps/s 331.3436 KOps/s $\textbf{\color{#d91a1a}-10.98\%}$
test_membership_stacked_nested_leaf_last 59.4330μs 3.3832μs 295.5791 KOps/s 335.0125 KOps/s $\textbf{\color{#d91a1a}-11.77\%}$
test_nested_getleaf 0.1946ms 6.1095μs 163.6805 KOps/s 163.1137 KOps/s $\color{#35bf28}+0.35\%$
test_nested_get 64.6030μs 5.7948μs 172.5672 KOps/s 170.2109 KOps/s $\color{#35bf28}+1.38\%$
test_stacked_getleaf 37.1620μs 6.0984μs 163.9772 KOps/s 162.5275 KOps/s $\color{#35bf28}+0.89\%$
test_stacked_get 42.1920μs 5.8013μs 172.3759 KOps/s 170.7037 KOps/s $\color{#35bf28}+0.98\%$
test_nested_getitemleaf 34.1320μs 6.1980μs 161.3422 KOps/s 160.2595 KOps/s $\color{#35bf28}+0.68\%$
test_nested_getitem 39.2910μs 5.8870μs 169.8669 KOps/s 168.2028 KOps/s $\color{#35bf28}+0.99\%$
test_stacked_getitemleaf 35.5410μs 6.2111μs 161.0026 KOps/s 160.5718 KOps/s $\color{#35bf28}+0.27\%$
test_stacked_getitem 40.1510μs 5.8576μs 170.7183 KOps/s 168.3641 KOps/s $\color{#35bf28}+1.40\%$
test_lock_nested 9.6581ms 0.3788ms 2.6396 KOps/s 2.6557 KOps/s $\color{#d91a1a}-0.60\%$
test_lock_stack_nested 0.4377ms 0.3388ms 2.9519 KOps/s 2.9235 KOps/s $\color{#35bf28}+0.97\%$
test_unlock_nested 0.6064ms 0.3076ms 3.2514 KOps/s 3.2138 KOps/s $\color{#35bf28}+1.17\%$
test_unlock_stack_nested 0.3636ms 0.2746ms 3.6411 KOps/s 3.5488 KOps/s $\color{#35bf28}+2.60\%$
test_flatten_speed 0.1091ms 74.4149μs 13.4382 KOps/s 13.3599 KOps/s $\color{#35bf28}+0.59\%$
test_unflatten_speed 0.3644ms 0.3055ms 3.2735 KOps/s 3.2302 KOps/s $\color{#35bf28}+1.34\%$
test_common_ops 1.6532ms 0.6056ms 1.6512 KOps/s 1.6592 KOps/s $\color{#d91a1a}-0.48\%$
test_creation 0.1822ms 1.4660μs 682.1240 KOps/s 677.2260 KOps/s $\color{#35bf28}+0.72\%$
test_creation_empty 34.1120μs 8.1474μs 122.7384 KOps/s 144.8755 KOps/s $\textbf{\color{#d91a1a}-15.28\%}$
test_creation_nested_1 41.0020μs 9.6861μs 103.2403 KOps/s 118.7688 KOps/s $\textbf{\color{#d91a1a}-13.07\%}$
test_creation_nested_2 92.5540μs 12.5548μs 79.6508 KOps/s 90.2239 KOps/s $\textbf{\color{#d91a1a}-11.72\%}$
test_clone 55.7220μs 10.3167μs 96.9306 KOps/s 87.0321 KOps/s $\textbf{\color{#35bf28}+11.37\%}$
test_getitem[int] 93.8656ms 15.7280μs 63.5808 KOps/s 89.7817 KOps/s $\textbf{\color{#d91a1a}-29.18\%}$
test_getitem[slice_int] 0.1596ms 20.5780μs 48.5956 KOps/s 44.2465 KOps/s $\textbf{\color{#35bf28}+9.83\%}$
test_getitem[range] 0.1370ms 38.2237μs 26.1618 KOps/s 25.5808 KOps/s $\color{#35bf28}+2.27\%$
test_getitem[tuple] 0.1228ms 18.0869μs 55.2886 KOps/s 52.2476 KOps/s $\textbf{\color{#35bf28}+5.82\%}$
test_getitem[list] 0.1853ms 33.0619μs 30.2463 KOps/s 28.4520 KOps/s $\textbf{\color{#35bf28}+6.31\%}$
test_setitem_dim[int] 45.4620μs 18.3719μs 54.4309 KOps/s 48.9508 KOps/s $\textbf{\color{#35bf28}+11.20\%}$
test_setitem_dim[slice_int] 61.8130μs 38.5161μs 25.9632 KOps/s 24.5915 KOps/s $\textbf{\color{#35bf28}+5.58\%}$
test_setitem_dim[range] 83.8340μs 53.2240μs 18.7885 KOps/s 17.9736 KOps/s $\color{#35bf28}+4.53\%$
test_setitem_dim[tuple] 55.0320μs 31.7572μs 31.4889 KOps/s 28.7458 KOps/s $\textbf{\color{#35bf28}+9.54\%}$
test_setitem 94.0140μs 14.9200μs 67.0243 KOps/s 63.3214 KOps/s $\textbf{\color{#35bf28}+5.85\%}$
test_set 88.8850μs 14.4709μs 69.1042 KOps/s 65.5536 KOps/s $\textbf{\color{#35bf28}+5.42\%}$
test_set_shared 1.7132ms 0.1467ms 6.8170 KOps/s 6.7324 KOps/s $\color{#35bf28}+1.26\%$
test_update 0.6290ms 17.4288μs 57.3763 KOps/s 56.9868 KOps/s $\color{#35bf28}+0.68\%$
test_update_nested 89.6740μs 22.4548μs 44.5338 KOps/s 43.4415 KOps/s $\color{#35bf28}+2.51\%$
test_update__nested 0.4133ms 24.2688μs 41.2051 KOps/s 39.0560 KOps/s $\textbf{\color{#35bf28}+5.50\%}$
test_set_nested 75.5140μs 15.6557μs 63.8745 KOps/s 60.1423 KOps/s $\textbf{\color{#35bf28}+6.21\%}$
test_set_nested_new 0.1137ms 17.8287μs 56.0892 KOps/s 52.4214 KOps/s $\textbf{\color{#35bf28}+7.00\%}$
test_select 96.0450μs 29.7427μs 33.6217 KOps/s 32.1513 KOps/s $\color{#35bf28}+4.57\%$
test_select_nested 73.2040μs 41.5992μs 24.0389 KOps/s 23.6123 KOps/s $\color{#35bf28}+1.81\%$
test_exclude_nested 99.2950μs 60.9454μs 16.4081 KOps/s 15.9898 KOps/s $\color{#35bf28}+2.62\%$
test_empty[True] 0.3265ms 0.2718ms 3.6787 KOps/s 3.6060 KOps/s $\color{#35bf28}+2.02\%$
test_empty[False] 3.4152μs 0.7433μs 1.3454 MOps/s 1.3488 MOps/s $\color{#d91a1a}-0.25\%$
test_to 86.8050μs 54.7441μs 18.2668 KOps/s 17.3173 KOps/s $\textbf{\color{#35bf28}+5.48\%}$
test_to_nonblocking 0.1967ms 46.0037μs 21.7374 KOps/s 20.9192 KOps/s $\color{#35bf28}+3.91\%$
test_unbind_speed 1.5972ms 0.2332ms 4.2883 KOps/s 4.2644 KOps/s $\color{#35bf28}+0.56\%$
test_unbind_speed_stack0 0.2891ms 0.2307ms 4.3345 KOps/s 4.2251 KOps/s $\color{#35bf28}+2.59\%$
test_unbind_speed_stack1 93.0565ms 0.6455ms 1.5492 KOps/s 1.5238 KOps/s $\color{#35bf28}+1.66\%$
test_split 94.9839ms 1.5806ms 632.6910 Ops/s 606.2921 Ops/s $\color{#35bf28}+4.35\%$
test_chunk 98.3910ms 1.7321ms 577.3426 Ops/s 597.5116 Ops/s $\color{#d91a1a}-3.38\%$
test_consolidate[False-None] 3.0173ms 2.6693ms 374.6313 Ops/s 341.9527 Ops/s $\textbf{\color{#35bf28}+9.56\%}$
test_consolidate[default-None] 1.8237ms 1.6847ms 593.5631 Ops/s 575.6733 Ops/s $\color{#35bf28}+3.11\%$
test_consolidate[reduce-overhead-None] 1.9314ms 1.7332ms 576.9681 Ops/s 557.2343 Ops/s $\color{#35bf28}+3.54\%$
test_consolidate_njt[False-None] 7.1479ms 6.7413ms 148.3393 Ops/s 146.7488 Ops/s $\color{#35bf28}+1.08\%$
test_to[False-False-None] 1.9469ms 1.7003ms 588.1257 Ops/s 574.6049 Ops/s $\color{#35bf28}+2.35\%$
test_to[True-False-None] 1.5457ms 1.3115ms 762.4796 Ops/s 727.5157 Ops/s $\color{#35bf28}+4.81\%$
test_to[within-False-None] 0.2991s 5.3349ms 187.4445 Ops/s 237.5039 Ops/s $\textbf{\color{#d91a1a}-21.08\%}$
test_to[True-default-None] 5.7557ms 5.4515ms 183.4355 Ops/s 176.7180 Ops/s $\color{#35bf28}+3.80\%$
test_to_njt[False-False-None] 7.6955ms 7.4426ms 134.3615 Ops/s 136.4130 Ops/s $\color{#d91a1a}-1.50\%$
test_to_njt[True-False-None] 6.1482ms 5.9390ms 168.3774 Ops/s 166.1982 Ops/s $\color{#35bf28}+1.31\%$
test_to_njt[within-False-None] 13.1993ms 12.8586ms 77.7692 Ops/s 75.6799 Ops/s $\color{#35bf28}+2.76\%$
test_creation[device0] 0.4531ms 83.5888μs 11.9633 KOps/s 11.6184 KOps/s $\color{#35bf28}+2.97\%$
test_creation_from_tensor 0.5446ms 84.2287μs 11.8724 KOps/s 11.4810 KOps/s $\color{#35bf28}+3.41\%$
test_add_one[memmap_tensor0] 0.4858ms 6.8670μs 145.6238 KOps/s 136.7177 KOps/s $\textbf{\color{#35bf28}+6.51\%}$
test_contiguous[memmap_tensor0] 2.1341μs 0.4085μs 2.4478 MOps/s 2.4051 MOps/s $\color{#35bf28}+1.78\%$
test_stack[memmap_tensor0] 0.1570ms 4.3594μs 229.3874 KOps/s 204.5980 KOps/s $\textbf{\color{#35bf28}+12.12\%}$
test_memmaptd_index 2.0293ms 0.2528ms 3.9549 KOps/s 3.8318 KOps/s $\color{#35bf28}+3.21\%$
test_memmaptd_index_astensor 0.8965ms 0.3091ms 3.2349 KOps/s 3.1401 KOps/s $\color{#35bf28}+3.02\%$
test_memmaptd_index_op 1.1504ms 0.5909ms 1.6924 KOps/s 1.6612 KOps/s $\color{#35bf28}+1.88\%$
test_serialize_model 0.1322s 0.1308s 7.6424 Ops/s 7.6258 Ops/s $\color{#35bf28}+0.22\%$
test_serialize_model_pickle 1.3482s 1.2117s 0.8253 Ops/s 0.8443 Ops/s $\color{#d91a1a}-2.25\%$
test_serialize_weights 0.1309s 0.1299s 7.6979 Ops/s 7.6699 Ops/s $\color{#35bf28}+0.37\%$
test_serialize_weights_returnearly 0.3449s 62.3590ms 16.0362 Ops/s 23.5183 Ops/s $\textbf{\color{#d91a1a}-31.81\%}$
test_serialize_weights_pickle 1.3721s 1.1970s 0.8355 Ops/s 0.8229 Ops/s $\color{#35bf28}+1.52\%$
test_reshape_pytree 88.9440μs 22.6676μs 44.1158 KOps/s 43.1121 KOps/s $\color{#35bf28}+2.33\%$
test_reshape_td 66.8730μs 27.1955μs 36.7708 KOps/s 35.5510 KOps/s $\color{#35bf28}+3.43\%$
test_view_pytree 58.8130μs 22.6553μs 44.1398 KOps/s 44.0288 KOps/s $\color{#35bf28}+0.25\%$
test_view_td 0.1383ms 30.1046μs 33.2175 KOps/s 32.8821 KOps/s $\color{#35bf28}+1.02\%$
test_unbind_pytree 0.1635ms 28.3217μs 35.3086 KOps/s 35.1960 KOps/s $\color{#35bf28}+0.32\%$
test_unbind_td 0.6194ms 35.6438μs 28.0554 KOps/s 27.2552 KOps/s $\color{#35bf28}+2.94\%$
test_split_pytree 0.1594ms 30.8309μs 32.4350 KOps/s 32.7273 KOps/s $\color{#d91a1a}-0.89\%$
test_split_td 0.7980ms 38.5885μs 25.9145 KOps/s 24.5576 KOps/s $\textbf{\color{#35bf28}+5.53\%}$
test_add_pytree 0.1865ms 35.4279μs 28.2263 KOps/s 27.8880 KOps/s $\color{#35bf28}+1.21\%$
test_add_td 0.2365ms 50.4254μs 19.8313 KOps/s 21.1526 KOps/s $\textbf{\color{#d91a1a}-6.25\%}$
test_compile_add_one_nested[tensordict-compile] 0.3004ms 0.1213ms 8.2422 KOps/s 7.9638 KOps/s $\color{#35bf28}+3.50\%$
test_compile_add_one_nested[tensordict-eager] 0.2744ms 0.1261ms 7.9285 KOps/s 7.8766 KOps/s $\color{#35bf28}+0.66\%$
test_compile_add_one_nested[pytree-compile] 0.2516ms 0.1002ms 9.9816 KOps/s 10.0404 KOps/s $\color{#d91a1a}-0.59\%$
test_compile_add_one_nested[pytree-eager] 1.1002ms 0.1525ms 6.5567 KOps/s 6.4538 KOps/s $\color{#35bf28}+1.60\%$
test_compile_copy_nested[tensordict-compile] 0.1472ms 23.1424μs 43.2107 KOps/s 43.7538 KOps/s $\color{#d91a1a}-1.24\%$
test_compile_copy_nested[tensordict-eager] 0.1285ms 27.2377μs 36.7138 KOps/s 36.7262 KOps/s $\color{#d91a1a}-0.03\%$
test_compile_copy_nested[pytree-compile] 0.4889ms 65.4097μs 15.2883 KOps/s 15.2864 KOps/s $\color{#35bf28}+0.01\%$
test_compile_copy_nested[pytree-eager] 0.1786ms 49.6132μs 20.1559 KOps/s 20.0054 KOps/s $\color{#35bf28}+0.75\%$
test_compile_add_one_flat[tensordict-compile] 0.2886ms 0.1422ms 7.0310 KOps/s 6.8651 KOps/s $\color{#35bf28}+2.42\%$
test_compile_add_one_flat[tensordict-eager] 0.3718ms 0.2107ms 4.7457 KOps/s 4.8033 KOps/s $\color{#d91a1a}-1.20\%$
test_compile_add_one_flat[tensorclass-compile] 0.2563ms 97.6336μs 10.2424 KOps/s 9.9713 KOps/s $\color{#35bf28}+2.72\%$
test_compile_add_one_flat[tensorclass-eager] 0.1986ms 52.2334μs 19.1448 KOps/s 19.0375 KOps/s $\color{#35bf28}+0.56\%$
test_compile_add_one_flat[pytree-compile] 0.5472ms 0.1372ms 7.2870 KOps/s 7.2212 KOps/s $\color{#35bf28}+0.91\%$
test_compile_add_one_flat[pytree-eager] 0.8933ms 0.4996ms 2.0015 KOps/s 1.9895 KOps/s $\color{#35bf28}+0.60\%$
test_compile_add_self_flat[tensordict-eager] 0.6475ms 0.2489ms 4.0183 KOps/s 4.0138 KOps/s $\color{#35bf28}+0.11\%$
test_compile_add_self_flat[tensordict-compile] 0.2932ms 0.1438ms 6.9539 KOps/s 6.8425 KOps/s $\color{#35bf28}+1.63\%$
test_compile_add_self_flat[tensorclass-eager] 0.2117ms 62.3525μs 16.0379 KOps/s 16.0410 KOps/s $\color{#d91a1a}-0.02\%$
test_compile_add_self_flat[tensorclass-compile] 0.2367ms 97.4396μs 10.2628 KOps/s 9.9487 KOps/s $\color{#35bf28}+3.16\%$
test_compile_add_self_flat[pytree-eager] 0.5662ms 0.4148ms 2.4107 KOps/s 2.4003 KOps/s $\color{#35bf28}+0.43\%$
test_compile_add_self_flat[pytree-compile] 0.2830ms 0.1360ms 7.3509 KOps/s 7.2736 KOps/s $\color{#35bf28}+1.06\%$
test_compile_copy_flat[tensordict-compile] 0.4140ms 18.7051μs 53.4615 KOps/s 48.2767 KOps/s $\textbf{\color{#35bf28}+10.74\%}$
test_compile_copy_flat[tensordict-eager] 0.4220ms 26.7441μs 37.3914 KOps/s 37.3967 KOps/s $\color{#d91a1a}-0.01\%$
test_compile_copy_flat[pytree-compile] 0.4605ms 70.0606μs 14.2734 KOps/s 14.1773 KOps/s $\color{#35bf28}+0.68\%$
test_compile_copy_flat[pytree-eager] 0.4364ms 51.9000μs 19.2678 KOps/s 19.2832 KOps/s $\color{#d91a1a}-0.08\%$
test_compile_assign_and_add[tensordict-compile] 1.6862ms 0.4033ms 2.4795 KOps/s 2.1673 KOps/s $\textbf{\color{#35bf28}+14.40\%}$
test_compile_assign_and_add[tensordict-eager] 2.8594ms 2.6402ms 378.7624 Ops/s 368.6711 Ops/s $\color{#35bf28}+2.74\%$
test_compile_assign_and_add[pytree-compile] 1.6412ms 0.4412ms 2.2666 KOps/s 1.9748 KOps/s $\textbf{\color{#35bf28}+14.78\%}$
test_compile_assign_and_add[pytree-eager] 2.9907ms 2.6964ms 370.8594 Ops/s 354.2619 Ops/s $\color{#35bf28}+4.69\%$
test_compile_indexing[tensor-tensordict-compile] 0.3569ms 0.1149ms 8.7039 KOps/s 8.1844 KOps/s $\textbf{\color{#35bf28}+6.35\%}$
test_compile_indexing[tensor-tensordict-eager] 0.5493ms 84.7217μs 11.8034 KOps/s 11.7154 KOps/s $\color{#35bf28}+0.75\%$
test_compile_indexing[tensor-tensorclass-compile] 0.2562ms 0.1077ms 9.2808 KOps/s 9.3040 KOps/s $\color{#d91a1a}-0.25\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2544ms 71.5333μs 13.9795 KOps/s 13.6346 KOps/s $\color{#35bf28}+2.53\%$
test_compile_indexing[tensor-pytree-compile] 0.2982ms 0.1134ms 8.8145 KOps/s 8.6172 KOps/s $\color{#35bf28}+2.29\%$
test_compile_indexing[tensor-pytree-eager] 0.2540ms 71.8307μs 13.9216 KOps/s 13.5674 KOps/s $\color{#35bf28}+2.61\%$
test_compile_indexing[slice-tensordict-compile] 0.2584ms 0.1017ms 9.8365 KOps/s 9.8088 KOps/s $\color{#35bf28}+0.28\%$
test_compile_indexing[slice-tensordict-eager] 0.1447ms 17.2756μs 57.8851 KOps/s 51.4173 KOps/s $\textbf{\color{#35bf28}+12.58\%}$
test_compile_indexing[slice-tensorclass-compile] 0.2711ms 96.5611μs 10.3561 KOps/s 9.7899 KOps/s $\textbf{\color{#35bf28}+5.78\%}$
test_compile_indexing[slice-tensorclass-eager] 0.1360ms 15.9293μs 62.7772 KOps/s 60.1918 KOps/s $\color{#35bf28}+4.30\%$
test_compile_indexing[slice-pytree-compile] 0.3195ms 98.3846μs 10.1642 KOps/s 9.6060 KOps/s $\textbf{\color{#35bf28}+5.81\%}$
test_compile_indexing[slice-pytree-eager] 0.1334ms 15.9583μs 62.6635 KOps/s 59.8049 KOps/s $\color{#35bf28}+4.78\%$
test_compile_indexing[int-tensordict-compile] 0.3109ms 0.1030ms 9.7119 KOps/s 9.2414 KOps/s $\textbf{\color{#35bf28}+5.09\%}$
test_compile_indexing[int-tensordict-eager] 0.5729ms 17.0747μs 58.5661 KOps/s 52.8249 KOps/s $\textbf{\color{#35bf28}+10.87\%}$
test_compile_indexing[int-tensorclass-compile] 0.2423ms 96.8265μs 10.3277 KOps/s 9.5356 KOps/s $\textbf{\color{#35bf28}+8.31\%}$
test_compile_indexing[int-tensorclass-eager] 0.1530ms 15.8601μs 63.0513 KOps/s 59.7168 KOps/s $\textbf{\color{#35bf28}+5.58\%}$
test_compile_indexing[int-pytree-compile] 0.2789ms 0.1011ms 9.8885 KOps/s 9.5854 KOps/s $\color{#35bf28}+3.16\%$
test_compile_indexing[int-pytree-eager] 47.0230μs 16.0373μs 62.3548 KOps/s 61.0759 KOps/s $\color{#35bf28}+2.09\%$
test_mod_add[eager] 0.2206ms 40.6772μs 24.5838 KOps/s 26.5043 KOps/s $\textbf{\color{#d91a1a}-7.25\%}$
test_mod_add[compile] 0.2980ms 81.8731μs 12.2140 KOps/s 11.8972 KOps/s $\color{#35bf28}+2.66\%$
test_mod_add[compile-overhead] 0.3278ms 0.1692ms 5.9099 KOps/s 5.5992 KOps/s $\textbf{\color{#35bf28}+5.55\%}$
test_mod_wrap[eager] 0.4576ms 0.2626ms 3.8086 KOps/s 3.7531 KOps/s $\color{#35bf28}+1.48\%$
test_mod_wrap[compile] 0.5043ms 0.2866ms 3.4888 KOps/s 3.2476 KOps/s $\textbf{\color{#35bf28}+7.43\%}$
test_mod_wrap[compile-overhead] 7.3046ms 3.7629ms 265.7542 Ops/s 265.6049 Ops/s $\color{#35bf28}+0.06\%$
test_mod_wrap_and_backward[eager] 1.5544ms 1.3850ms 722.0148 Ops/s 682.9112 Ops/s $\textbf{\color{#35bf28}+5.73\%}$
test_mod_wrap_and_backward[compile] 1.4606ms 1.2771ms 783.0044 Ops/s 711.7012 Ops/s $\textbf{\color{#35bf28}+10.02\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3928ms 0.9281ms 1.0775 KOps/s 949.8300 Ops/s $\textbf{\color{#35bf28}+13.44\%}$
test_seq_add[eager] 0.2937ms 0.1127ms 8.8704 KOps/s 8.7089 KOps/s $\color{#35bf28}+1.85\%$
test_seq_add[compile] 0.2614ms 91.1835μs 10.9669 KOps/s 11.0563 KOps/s $\color{#d91a1a}-0.81\%$
test_seq_add[compile-overhead] 0.2745ms 0.1319ms 7.5824 KOps/s 7.3667 KOps/s $\color{#35bf28}+2.93\%$
test_seq_wrap[eager] 0.6142ms 0.4376ms 2.2851 KOps/s 2.2252 KOps/s $\color{#35bf28}+2.69\%$
test_seq_wrap[compile] 0.4845ms 0.3093ms 3.2332 KOps/s 3.2400 KOps/s $\color{#d91a1a}-0.21\%$
test_seq_wrap[compile-overhead] 0.3936ms 0.2305ms 4.3379 KOps/s 4.3722 KOps/s $\color{#d91a1a}-0.78\%$
test_func_call_runtime[False-eager] 0.8900ms 0.7512ms 1.3313 KOps/s 1.2892 KOps/s $\color{#35bf28}+3.26\%$
test_func_call_runtime[False-compile] 0.9416ms 0.7541ms 1.3261 KOps/s 1.3058 KOps/s $\color{#35bf28}+1.56\%$
test_func_call_runtime[False-compile-overhead] 0.5100ms 0.3631ms 2.7542 KOps/s 2.7035 KOps/s $\color{#35bf28}+1.87\%$
test_func_call_runtime[True-eager] 1.1546ms 0.9033ms 1.1070 KOps/s 1.0830 KOps/s $\color{#35bf28}+2.22\%$
test_func_call_runtime[True-compile] 0.9389ms 0.7723ms 1.2948 KOps/s 1.2722 KOps/s $\color{#35bf28}+1.78\%$
test_func_call_runtime[True-compile-overhead] 0.4520ms 0.3832ms 2.6094 KOps/s 2.5730 KOps/s $\color{#35bf28}+1.41\%$
test_func_call_cm_runtime[False-eager] 0.9862ms 0.7821ms 1.2787 KOps/s 1.3288 KOps/s $\color{#d91a1a}-3.77\%$
test_func_call_cm_runtime[False-compile] 0.9384ms 0.7538ms 1.3266 KOps/s 1.3016 KOps/s $\color{#35bf28}+1.92\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4993ms 0.3652ms 2.7384 KOps/s 2.6998 KOps/s $\color{#35bf28}+1.43\%$
test_func_call_cm_runtime[True-eager] 1.1487ms 1.0027ms 997.3133 Ops/s 972.2138 Ops/s $\color{#35bf28}+2.58\%$
test_func_call_cm_runtime[True-compile] 0.9914ms 0.8021ms 1.2467 KOps/s 1.2278 KOps/s $\color{#35bf28}+1.54\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5587ms 0.4118ms 2.4281 KOps/s 2.4088 KOps/s $\color{#35bf28}+0.80\%$
test_vmap_func_call_cm_runtime[eager] 2.5558ms 2.0915ms 478.1248 Ops/s 474.0365 Ops/s $\color{#35bf28}+0.86\%$
test_vmap_func_call_cm_runtime[compile] 0.9627ms 0.8192ms 1.2207 KOps/s 1.1962 KOps/s $\color{#35bf28}+2.05\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5708ms 0.4129ms 2.4216 KOps/s 2.3829 KOps/s $\color{#35bf28}+1.63\%$
test_distributed 2.6152ms 0.1846ms 5.4176 KOps/s 8.5000 KOps/s $\textbf{\color{#d91a1a}-36.26\%}$
test_tdmodule 37.6320μs 19.0852μs 52.3965 KOps/s 51.4595 KOps/s $\color{#35bf28}+1.82\%$
test_tdmodule_dispatch 0.1773ms 34.8509μs 28.6937 KOps/s 29.3102 KOps/s $\color{#d91a1a}-2.10\%$
test_tdseq 44.3020μs 19.1469μs 52.2279 KOps/s 52.5008 KOps/s $\color{#d91a1a}-0.52\%$
test_tdseq_dispatch 78.3640μs 36.8556μs 27.1329 KOps/s 27.6006 KOps/s $\color{#d91a1a}-1.69\%$
test_instantiation_functorch 1.7379ms 1.5524ms 644.1843 Ops/s 619.0251 Ops/s $\color{#35bf28}+4.06\%$
test_exec_functorch 0.2884ms 0.1443ms 6.9314 KOps/s 6.5502 KOps/s $\textbf{\color{#35bf28}+5.82\%}$
test_exec_functional_call 0.2304ms 0.1405ms 7.1168 KOps/s 6.8208 KOps/s $\color{#35bf28}+4.34\%$
test_exec_td_decorator 0.3759ms 0.1857ms 5.3849 KOps/s 5.2195 KOps/s $\color{#35bf28}+3.17\%$
test_vmap_mlp_speed_decorator[True-True] 0.9806ms 0.6866ms 1.4564 KOps/s 1.4508 KOps/s $\color{#35bf28}+0.38\%$
test_vmap_mlp_speed_decorator[True-False] 0.8727ms 0.7017ms 1.4252 KOps/s 1.4492 KOps/s $\color{#d91a1a}-1.66\%$
test_vmap_mlp_speed_decorator[False-True] 0.8122ms 0.6237ms 1.6033 KOps/s 1.6619 KOps/s $\color{#d91a1a}-3.53\%$
test_vmap_mlp_speed_decorator[False-False] 0.8143ms 0.6228ms 1.6057 KOps/s 1.6613 KOps/s $\color{#d91a1a}-3.35\%$
test_vmap_transformer_speed_decorator[True-True] 20.2706ms 19.5565ms 51.1340 Ops/s 51.6412 Ops/s $\color{#d91a1a}-0.98\%$
test_vmap_transformer_speed_decorator[True-False] 20.2831ms 19.7874ms 50.5373 Ops/s 51.6299 Ops/s $\color{#d91a1a}-2.12\%$
test_vmap_transformer_speed_decorator[False-True] 20.1445ms 19.7645ms 50.5957 Ops/s 52.0982 Ops/s $\color{#d91a1a}-2.88\%$
test_vmap_transformer_speed_decorator[False-False] 20.2288ms 19.6816ms 50.8090 Ops/s 51.9889 Ops/s $\color{#d91a1a}-2.27\%$
test_to_module_speed[True] 1.0425ms 0.9394ms 1.0645 KOps/s 1.0577 KOps/s $\color{#35bf28}+0.64\%$
test_to_module_speed[False] 1.3954ms 0.9289ms 1.0766 KOps/s 1.0866 KOps/s $\color{#d91a1a}-0.93\%$
test_tc_init 0.1831ms 35.0170μs 28.5575 KOps/s 28.4985 KOps/s $\color{#35bf28}+0.21\%$
test_tc_init_nested 0.1133ms 71.9715μs 13.8944 KOps/s 14.2084 KOps/s $\color{#d91a1a}-2.21\%$
test_tc_first_layer_tensor 9.9276μs 0.7044μs 1.4195 MOps/s 1.4190 MOps/s $\color{#35bf28}+0.04\%$
test_tc_first_layer_nontensor 41.1520μs 2.3234μs 430.4029 KOps/s 427.4081 KOps/s $\color{#35bf28}+0.70\%$
test_tc_second_layer_tensor 20.5960μs 1.4030μs 712.7370 KOps/s 694.4006 KOps/s $\color{#35bf28}+2.64\%$
test_tc_second_layer_nontensor 0.2010ms 3.0540μs 327.4377 KOps/s 325.7707 KOps/s $\color{#35bf28}+0.51\%$
test_unbind 0.2228s 9.8873ms 101.1397 Ops/s 151.2450 Ops/s $\textbf{\color{#d91a1a}-33.13\%}$
test_full_like 10.3681ms 9.7384ms 102.6858 Ops/s 102.8504 Ops/s $\color{#d91a1a}-0.16\%$
test_zeros_like 4.9197ms 4.3785ms 228.3910 Ops/s 113.2223 Ops/s $\textbf{\color{#35bf28}+101.72\%}$
test_ones_like 5.0747ms 4.4213ms 226.1769 Ops/s 227.8516 Ops/s $\color{#d91a1a}-0.73\%$
test_clone 7.4652ms 6.7759ms 147.5822 Ops/s 147.3596 Ops/s $\color{#35bf28}+0.15\%$
test_squeeze 66.4430μs 9.5760μs 104.4275 KOps/s 97.0696 KOps/s $\textbf{\color{#35bf28}+7.58\%}$
test_unsqueeze 0.2182ms 74.3850μs 13.4436 KOps/s 13.2414 KOps/s $\color{#35bf28}+1.53\%$
test_split 0.3895ms 0.1640ms 6.0961 KOps/s 5.6181 KOps/s $\textbf{\color{#35bf28}+8.51\%}$
test_permute 0.3713ms 0.1875ms 5.3327 KOps/s 5.2096 KOps/s $\color{#35bf28}+2.36\%$
test_stack 52.5294ms 51.9125ms 19.2632 Ops/s 19.3261 Ops/s $\color{#d91a1a}-0.33\%$
test_cat 52.3965ms 51.7654ms 19.3179 Ops/s 23.0275 Ops/s $\textbf{\color{#d91a1a}-16.11\%}$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants