Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] CompositeDistribution.from_distributions #1113

Merged
merged 2 commits into from
Nov 27, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 26, 2024

[ghstack-poisoned]
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 26, 2024
Copy link

github-actions bot commented Nov 26, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}8$. Worsened: $\large\color{#d91a1a}33$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 64.9920μs 18.0009μs 55.5526 KOps/s 61.3180 KOps/s $\textbf{\color{#d91a1a}-9.40\%}$
test_plain_set_stack_nested 55.4030μs 18.2587μs 54.7683 KOps/s 59.7236 KOps/s $\textbf{\color{#d91a1a}-8.30\%}$
test_plain_set_nested_inplace 58.7300μs 19.7984μs 50.5090 KOps/s 54.8199 KOps/s $\textbf{\color{#d91a1a}-7.86\%}$
test_plain_set_stack_nested_inplace 58.0980μs 19.3404μs 51.7053 KOps/s 54.8377 KOps/s $\textbf{\color{#d91a1a}-5.71\%}$
test_items 39.7030μs 4.1655μs 240.0678 KOps/s 240.6913 KOps/s $\color{#d91a1a}-0.26\%$
test_items_nested 0.5905ms 0.4035ms 2.4781 KOps/s 2.4730 KOps/s $\color{#35bf28}+0.20\%$
test_items_nested_locked 0.5478ms 0.4031ms 2.4805 KOps/s 2.4634 KOps/s $\color{#35bf28}+0.70\%$
test_items_nested_leaf 0.1385ms 72.4334μs 13.8058 KOps/s 14.1339 KOps/s $\color{#d91a1a}-2.32\%$
test_items_stack_nested 0.5651ms 0.4066ms 2.4595 KOps/s 2.4613 KOps/s $\color{#d91a1a}-0.07\%$
test_items_stack_nested_leaf 0.1497ms 75.1640μs 13.3042 KOps/s 13.5989 KOps/s $\color{#d91a1a}-2.17\%$
test_items_stack_nested_locked 0.8522ms 0.4068ms 2.4580 KOps/s 2.4806 KOps/s $\color{#d91a1a}-0.91\%$
test_keys 32.1600μs 3.6079μs 277.1683 KOps/s 283.2637 KOps/s $\color{#d91a1a}-2.15\%$
test_keys_nested 0.2550ms 0.1377ms 7.2623 KOps/s 7.3735 KOps/s $\color{#d91a1a}-1.51\%$
test_keys_nested_locked 2.4412ms 0.1438ms 6.9541 KOps/s 6.9742 KOps/s $\color{#d91a1a}-0.29\%$
test_keys_nested_leaf 0.2361ms 0.1177ms 8.4996 KOps/s 8.5940 KOps/s $\color{#d91a1a}-1.10\%$
test_keys_stack_nested 0.2190ms 0.1358ms 7.3615 KOps/s 7.3409 KOps/s $\color{#35bf28}+0.28\%$
test_keys_stack_nested_leaf 0.2076ms 0.1153ms 8.6729 KOps/s 8.6504 KOps/s $\color{#35bf28}+0.26\%$
test_keys_stack_nested_locked 0.2716ms 0.1415ms 7.0677 KOps/s 7.0536 KOps/s $\color{#35bf28}+0.20\%$
test_values 20.8088μs 1.0396μs 961.9185 KOps/s 931.4558 KOps/s $\color{#35bf28}+3.27\%$
test_values_nested 0.1135ms 55.7596μs 17.9341 KOps/s 18.1563 KOps/s $\color{#d91a1a}-1.22\%$
test_values_nested_locked 0.1243ms 56.1952μs 17.7951 KOps/s 18.1915 KOps/s $\color{#d91a1a}-2.18\%$
test_values_nested_leaf 0.1266ms 59.7975μs 16.7231 KOps/s 15.6359 KOps/s $\textbf{\color{#35bf28}+6.95\%}$
test_values_stack_nested 0.1161ms 57.3337μs 17.4418 KOps/s 17.8184 KOps/s $\color{#d91a1a}-2.11\%$
test_values_stack_nested_leaf 0.1141ms 60.0618μs 16.6495 KOps/s 16.4555 KOps/s $\color{#35bf28}+1.18\%$
test_values_stack_nested_locked 0.1127ms 57.4674μs 17.4012 KOps/s 17.7569 KOps/s $\color{#d91a1a}-2.00\%$
test_membership 20.2270μs 0.8825μs 1.1331 MOps/s 1.0997 MOps/s $\color{#35bf28}+3.04\%$
test_membership_nested 46.8470μs 2.9890μs 334.5651 KOps/s 340.6057 KOps/s $\color{#d91a1a}-1.77\%$
test_membership_nested_leaf 45.3740μs 2.9721μs 336.4616 KOps/s 343.0323 KOps/s $\color{#d91a1a}-1.92\%$
test_membership_stacked_nested 37.9410μs 2.9451μs 339.5440 KOps/s 341.0271 KOps/s $\color{#d91a1a}-0.43\%$
test_membership_stacked_nested_leaf 26.2490μs 2.9801μs 335.5570 KOps/s 340.0950 KOps/s $\color{#d91a1a}-1.33\%$
test_membership_nested_last 34.5040μs 4.2707μs 234.1519 KOps/s 236.1188 KOps/s $\color{#d91a1a}-0.83\%$
test_membership_nested_leaf_last 46.4670μs 4.2524μs 235.1633 KOps/s 232.7971 KOps/s $\color{#35bf28}+1.02\%$
test_membership_stacked_nested_last 40.1050μs 6.8929μs 145.0761 KOps/s 238.2569 KOps/s $\textbf{\color{#d91a1a}-39.11\%}$
test_membership_stacked_nested_leaf_last 57.8480μs 6.8516μs 145.9522 KOps/s 240.8192 KOps/s $\textbf{\color{#d91a1a}-39.39\%}$
test_nested_getleaf 42.4590μs 10.9403μs 91.4054 KOps/s 92.9016 KOps/s $\color{#d91a1a}-1.61\%$
test_nested_get 51.3360μs 10.3138μs 96.9572 KOps/s 97.2819 KOps/s $\color{#d91a1a}-0.33\%$
test_stacked_getleaf 45.5750μs 10.8442μs 92.2155 KOps/s 91.7672 KOps/s $\color{#35bf28}+0.49\%$
test_stacked_get 33.3430μs 10.2582μs 97.4832 KOps/s 99.1507 KOps/s $\color{#d91a1a}-1.68\%$
test_nested_getitemleaf 73.5040μs 11.3446μs 88.1474 KOps/s 88.8592 KOps/s $\color{#d91a1a}-0.80\%$
test_nested_getitem 54.4420μs 10.5755μs 94.5585 KOps/s 95.4221 KOps/s $\color{#d91a1a}-0.91\%$
test_stacked_getitemleaf 37.8910μs 11.3710μs 87.9433 KOps/s 88.8069 KOps/s $\color{#d91a1a}-0.97\%$
test_stacked_getitem 58.4890μs 10.5827μs 94.4937 KOps/s 95.1435 KOps/s $\color{#d91a1a}-0.68\%$
test_lock_nested 3.4139ms 0.4595ms 2.1761 KOps/s 2.2543 KOps/s $\color{#d91a1a}-3.47\%$
test_lock_stack_nested 0.5820ms 0.4198ms 2.3821 KOps/s 2.3739 KOps/s $\color{#35bf28}+0.35\%$
test_unlock_nested 0.9500ms 0.3733ms 2.6788 KOps/s 2.7329 KOps/s $\color{#d91a1a}-1.98\%$
test_unlock_stack_nested 0.4803ms 0.3360ms 2.9762 KOps/s 2.9649 KOps/s $\color{#35bf28}+0.38\%$
test_flatten_speed 0.1871ms 95.6938μs 10.4500 KOps/s 10.4992 KOps/s $\color{#d91a1a}-0.47\%$
test_unflatten_speed 0.5875ms 0.4922ms 2.0316 KOps/s 2.0317 KOps/s $-0.01\%$
test_common_ops 4.1556ms 0.7901ms 1.2656 KOps/s 1.3442 KOps/s $\textbf{\color{#d91a1a}-5.85\%}$
test_creation 0.1053ms 2.0650μs 484.2629 KOps/s 481.1262 KOps/s $\color{#35bf28}+0.65\%$
test_creation_empty 0.1369ms 11.5122μs 86.8645 KOps/s 115.4227 KOps/s $\textbf{\color{#d91a1a}-24.74\%}$
test_creation_nested_1 50.3940μs 13.8053μs 72.4358 KOps/s 87.6091 KOps/s $\textbf{\color{#d91a1a}-17.32\%}$
test_creation_nested_2 64.7600μs 17.9598μs 55.6798 KOps/s 62.8666 KOps/s $\textbf{\color{#d91a1a}-11.43\%}$
test_clone 0.1742ms 13.2557μs 75.4393 KOps/s 75.7335 KOps/s $\color{#d91a1a}-0.39\%$
test_getitem[int] 1.3131ms 12.9057μs 77.4850 KOps/s 77.1727 KOps/s $\color{#35bf28}+0.40\%$
test_getitem[slice_int] 0.1800ms 25.7866μs 38.7798 KOps/s 39.7047 KOps/s $\color{#d91a1a}-2.33\%$
test_getitem[range] 0.2787ms 49.0188μs 20.4004 KOps/s 19.4883 KOps/s $\color{#35bf28}+4.68\%$
test_getitem[tuple] 0.1558ms 20.6598μs 48.4032 KOps/s 48.6689 KOps/s $\color{#d91a1a}-0.55\%$
test_getitem[list] 0.3835ms 49.7288μs 20.1091 KOps/s 21.3877 KOps/s $\textbf{\color{#d91a1a}-5.98\%}$
test_setitem_dim[int] 75.9720μs 25.8768μs 38.6446 KOps/s 39.3349 KOps/s $\color{#d91a1a}-1.75\%$
test_setitem_dim[slice_int] 0.1197ms 53.8233μs 18.5793 KOps/s 19.1720 KOps/s $\color{#d91a1a}-3.09\%$
test_setitem_dim[range] 0.1522ms 75.3247μs 13.2759 KOps/s 13.0586 KOps/s $\color{#35bf28}+1.66\%$
test_setitem_dim[tuple] 0.1248ms 43.2039μs 23.1461 KOps/s 24.2807 KOps/s $\color{#d91a1a}-4.67\%$
test_setitem 0.1944ms 20.3249μs 49.2007 KOps/s 53.7328 KOps/s $\textbf{\color{#d91a1a}-8.43\%}$
test_set 0.2112ms 19.9153μs 50.2125 KOps/s 55.0495 KOps/s $\textbf{\color{#d91a1a}-8.79\%}$
test_set_shared 2.5826ms 0.1746ms 5.7269 KOps/s 5.8522 KOps/s $\color{#d91a1a}-2.14\%$
test_update 0.1806ms 22.1958μs 45.0536 KOps/s 51.1579 KOps/s $\textbf{\color{#d91a1a}-11.93\%}$
test_update_nested 0.1754ms 33.1097μs 30.2027 KOps/s 32.9890 KOps/s $\textbf{\color{#d91a1a}-8.45\%}$
test_update__nested 0.8965ms 31.7712μs 31.4750 KOps/s 30.4658 KOps/s $\color{#35bf28}+3.31\%$
test_set_nested 0.1486ms 22.0913μs 45.2667 KOps/s 48.9047 KOps/s $\textbf{\color{#d91a1a}-7.44\%}$
test_set_nested_new 0.1704ms 26.4237μs 37.8448 KOps/s 39.9257 KOps/s $\textbf{\color{#d91a1a}-5.21\%}$
test_select 0.1887ms 42.7611μs 23.3857 KOps/s 24.3478 KOps/s $\color{#d91a1a}-3.95\%$
test_select_nested 0.1560ms 59.8574μs 16.7064 KOps/s 16.3199 KOps/s $\color{#35bf28}+2.37\%$
test_exclude_nested 0.1625ms 78.8947μs 12.6751 KOps/s 12.8613 KOps/s $\color{#d91a1a}-1.45\%$
test_empty[True] 0.7019ms 0.3852ms 2.5961 KOps/s 2.6649 KOps/s $\color{#d91a1a}-2.58\%$
test_empty[False] 10.3620μs 1.2330μs 811.0275 KOps/s 841.7366 KOps/s $\color{#d91a1a}-3.65\%$
test_unbind_speed 0.6026ms 0.2613ms 3.8265 KOps/s 3.8293 KOps/s $\color{#d91a1a}-0.07\%$
test_unbind_speed_stack0 0.4245ms 0.2529ms 3.9536 KOps/s 3.8759 KOps/s $\color{#35bf28}+2.00\%$
test_unbind_speed_stack1 0.1125s 0.7695ms 1.2995 KOps/s 1.4072 KOps/s $\textbf{\color{#d91a1a}-7.65\%}$
test_split 0.1176s 1.8292ms 546.6738 Ops/s 560.3362 Ops/s $\color{#d91a1a}-2.44\%$
test_chunk 1.8676ms 1.6322ms 612.6608 Ops/s 557.2152 Ops/s $\textbf{\color{#35bf28}+9.95\%}$
test_consolidate_njt[False-None] 0.1238s 9.6364ms 103.7729 Ops/s 117.3698 Ops/s $\textbf{\color{#d91a1a}-11.58\%}$
test_creation[device0] 0.3186ms 93.1646μs 10.7337 KOps/s 10.7028 KOps/s $\color{#35bf28}+0.29\%$
test_creation_from_tensor 4.4918ms 96.7253μs 10.3386 KOps/s 10.2517 KOps/s $\color{#35bf28}+0.85\%$
test_add_one[memmap_tensor0] 0.3426ms 4.9967μs 200.1321 KOps/s 192.2278 KOps/s $\color{#35bf28}+4.11\%$
test_contiguous[memmap_tensor0] 17.6430μs 0.5129μs 1.9495 MOps/s 1.9911 MOps/s $\color{#d91a1a}-2.09\%$
test_stack[memmap_tensor0] 35.5760μs 3.3745μs 296.3399 KOps/s 285.4565 KOps/s $\color{#35bf28}+3.81\%$
test_memmaptd_index 1.2283ms 0.2391ms 4.1825 KOps/s 4.1973 KOps/s $\color{#d91a1a}-0.35\%$
test_memmaptd_index_astensor 0.7615ms 0.3210ms 3.1148 KOps/s 3.1586 KOps/s $\color{#d91a1a}-1.39\%$
test_memmaptd_index_op 1.1149ms 0.5850ms 1.7094 KOps/s 1.8217 KOps/s $\textbf{\color{#d91a1a}-6.16\%}$
test_serialize_model 0.1292s 0.1220s 8.1995 Ops/s 8.4918 Ops/s $\color{#d91a1a}-3.44\%$
test_serialize_model_pickle 0.4721s 0.3893s 2.5685 Ops/s 2.4630 Ops/s $\color{#35bf28}+4.28\%$
test_serialize_weights 0.1413s 0.1216s 8.2240 Ops/s 8.3279 Ops/s $\color{#d91a1a}-1.25\%$
test_serialize_weights_returnearly 0.1714s 0.1586s 6.3036 Ops/s 6.4384 Ops/s $\color{#d91a1a}-2.09\%$
test_serialize_weights_pickle 0.5038s 0.4090s 2.4453 Ops/s 2.4838 Ops/s $\color{#d91a1a}-1.55\%$
test_serialize_weights_filesystem 0.1513s 0.1447s 6.9104 Ops/s 6.9210 Ops/s $\color{#d91a1a}-0.15\%$
test_serialize_model_filesystem 0.1575s 0.1518s 6.5890 Ops/s 5.8693 Ops/s $\textbf{\color{#35bf28}+12.26\%}$
test_reshape_pytree 71.9140μs 27.1294μs 36.8604 KOps/s 37.0980 KOps/s $\color{#d91a1a}-0.64\%$
test_reshape_td 77.4540μs 33.3981μs 29.9418 KOps/s 30.0000 KOps/s $\color{#d91a1a}-0.19\%$
test_view_pytree 0.1043ms 27.5257μs 36.3297 KOps/s 36.4560 KOps/s $\color{#d91a1a}-0.35\%$
test_view_td 96.4800μs 37.8072μs 26.4500 KOps/s 25.8228 KOps/s $\color{#35bf28}+2.43\%$
test_unbind_pytree 0.1096ms 30.6125μs 32.6664 KOps/s 32.6289 KOps/s $\color{#35bf28}+0.12\%$
test_unbind_td 0.3976ms 39.0858μs 25.5847 KOps/s 25.8919 KOps/s $\color{#d91a1a}-1.19\%$
test_split_pytree 0.1028ms 29.8732μs 33.4748 KOps/s 33.3871 KOps/s $\color{#35bf28}+0.26\%$
test_split_td 0.2062ms 45.0847μs 22.1805 KOps/s 22.4118 KOps/s $\color{#d91a1a}-1.03\%$
test_add_pytree 0.1093ms 36.3367μs 27.5204 KOps/s 27.8314 KOps/s $\color{#d91a1a}-1.12\%$
test_add_td 0.1507ms 54.1355μs 18.4722 KOps/s 20.0273 KOps/s $\textbf{\color{#d91a1a}-7.76\%}$
test_compile_add_one_nested[tensordict-compile] 0.1830ms 63.1657μs 15.8314 KOps/s 15.8004 KOps/s $\color{#35bf28}+0.20\%$
test_compile_add_one_nested[tensordict-eager] 1.8286ms 0.1639ms 6.1030 KOps/s 6.1467 KOps/s $\color{#d91a1a}-0.71\%$
test_compile_add_one_nested[pytree-compile] 0.2547ms 47.4641μs 21.0686 KOps/s 21.5977 KOps/s $\color{#d91a1a}-2.45\%$
test_compile_add_one_nested[pytree-eager] 0.2284ms 0.1201ms 8.3275 KOps/s 8.2194 KOps/s $\color{#35bf28}+1.31\%$
test_compile_copy_nested[tensordict-compile] 70.5820μs 26.6472μs 37.5274 KOps/s 38.0560 KOps/s $\color{#d91a1a}-1.39\%$
test_compile_copy_nested[tensordict-eager] 0.1039ms 53.9387μs 18.5396 KOps/s 18.5009 KOps/s $\color{#35bf28}+0.21\%$
test_compile_copy_nested[pytree-compile] 0.2021ms 79.7582μs 12.5379 KOps/s 12.4022 KOps/s $\color{#35bf28}+1.09\%$
test_compile_copy_nested[pytree-eager] 0.1208ms 68.4698μs 14.6050 KOps/s 14.5476 KOps/s $\color{#35bf28}+0.39\%$
test_compile_add_one_flat[tensordict-compile] 0.2308ms 0.1054ms 9.4896 KOps/s 9.2743 KOps/s $\color{#35bf28}+2.32\%$
test_compile_add_one_flat[tensordict-eager] 1.5744ms 0.2020ms 4.9508 KOps/s 4.9516 KOps/s $\color{#d91a1a}-0.02\%$
test_compile_add_one_flat[tensorclass-compile] 0.1054ms 45.3650μs 22.0434 KOps/s 21.7413 KOps/s $\color{#35bf28}+1.39\%$
test_compile_add_one_flat[tensorclass-eager] 1.4829ms 61.7726μs 16.1884 KOps/s 15.8651 KOps/s $\color{#35bf28}+2.04\%$
test_compile_add_one_flat[pytree-compile] 0.2376ms 0.1046ms 9.5617 KOps/s 9.5416 KOps/s $\color{#35bf28}+0.21\%$
test_compile_add_one_flat[pytree-eager] 0.2944ms 0.2042ms 4.8976 KOps/s 4.9008 KOps/s $\color{#d91a1a}-0.07\%$
test_compile_add_self_flat[tensordict-eager] 0.4020ms 0.2155ms 4.6413 KOps/s 4.7344 KOps/s $\color{#d91a1a}-1.97\%$
test_compile_add_self_flat[tensordict-compile] 0.2142ms 0.1066ms 9.3773 KOps/s 9.4266 KOps/s $\color{#d91a1a}-0.52\%$
test_compile_add_self_flat[tensorclass-eager] 0.2130ms 58.2379μs 17.1709 KOps/s 18.0970 KOps/s $\textbf{\color{#d91a1a}-5.12\%}$
test_compile_add_self_flat[tensorclass-compile] 0.1234ms 45.8483μs 21.8111 KOps/s 21.5092 KOps/s $\color{#35bf28}+1.40\%$
test_compile_add_self_flat[pytree-eager] 0.6628ms 0.1610ms 6.2114 KOps/s 6.2525 KOps/s $\color{#d91a1a}-0.66\%$
test_compile_add_self_flat[pytree-compile] 0.2038ms 0.1050ms 9.5271 KOps/s 9.5321 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_copy_flat[tensordict-compile] 75.1800μs 20.9799μs 47.6647 KOps/s 46.8412 KOps/s $\color{#35bf28}+1.76\%$
test_compile_copy_flat[tensordict-eager] 0.1255ms 60.0593μs 16.6502 KOps/s 17.4570 KOps/s $\color{#d91a1a}-4.62\%$
test_compile_copy_flat[pytree-compile] 0.1823ms 82.0431μs 12.1887 KOps/s 12.3893 KOps/s $\color{#d91a1a}-1.62\%$
test_compile_copy_flat[pytree-eager] 0.1311ms 69.4761μs 14.3934 KOps/s 14.5923 KOps/s $\color{#d91a1a}-1.36\%$
test_compile_assign_and_add[tensordict-compile] 0.3842ms 0.2114ms 4.7305 KOps/s 4.7395 KOps/s $\color{#d91a1a}-0.19\%$
test_compile_assign_and_add[tensordict-eager] 1.5095ms 1.2738ms 785.0600 Ops/s 764.5989 Ops/s $\color{#35bf28}+2.68\%$
test_compile_assign_and_add[pytree-compile] 0.3236ms 0.2089ms 4.7864 KOps/s 4.8686 KOps/s $\color{#d91a1a}-1.69\%$
test_compile_assign_and_add[pytree-eager] 1.0645ms 0.7766ms 1.2876 KOps/s 1.2747 KOps/s $\color{#35bf28}+1.02\%$
test_compile_assign_and_add_stack[compile] 0.6146ms 0.4729ms 2.1147 KOps/s 2.1710 KOps/s $\color{#d91a1a}-2.60\%$
test_compile_assign_and_add_stack[eager] 3.2495ms 2.7340ms 365.7640 Ops/s 397.6055 Ops/s $\textbf{\color{#d91a1a}-8.01\%}$
test_compile_indexing[tensor-tensordict-compile] 0.1067ms 36.2358μs 27.5970 KOps/s 27.0827 KOps/s $\color{#35bf28}+1.90\%$
test_compile_indexing[tensor-tensordict-eager] 0.7698ms 35.1135μs 28.4791 KOps/s 29.7063 KOps/s $\color{#d91a1a}-4.13\%$
test_compile_indexing[tensor-tensorclass-compile] 99.9860μs 30.1631μs 33.1531 KOps/s 34.3455 KOps/s $\color{#d91a1a}-3.47\%$
test_compile_indexing[tensor-tensorclass-eager] 87.3730μs 23.6201μs 42.3368 KOps/s 42.3550 KOps/s $\color{#d91a1a}-0.04\%$
test_compile_indexing[tensor-pytree-compile] 0.1268ms 31.1427μs 32.1102 KOps/s 34.0537 KOps/s $\textbf{\color{#d91a1a}-5.71\%}$
test_compile_indexing[tensor-pytree-eager] 73.0960μs 23.5264μs 42.5054 KOps/s 42.5987 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_indexing[slice-tensordict-compile] 0.1540ms 53.0690μs 18.8434 KOps/s 18.9935 KOps/s $\color{#d91a1a}-0.79\%$
test_compile_indexing[slice-tensordict-eager] 0.7146ms 20.7081μs 48.2903 KOps/s 50.7533 KOps/s $\color{#d91a1a}-4.85\%$
test_compile_indexing[slice-tensorclass-compile] 0.1292ms 45.1687μs 22.1392 KOps/s 22.6195 KOps/s $\color{#d91a1a}-2.12\%$
test_compile_indexing[slice-tensorclass-eager] 92.6920μs 19.1544μs 52.2072 KOps/s 52.8139 KOps/s $\color{#d91a1a}-1.15\%$
test_compile_indexing[slice-pytree-compile] 0.1229ms 45.3332μs 22.0589 KOps/s 22.0710 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_indexing[slice-pytree-eager] 59.4310μs 18.7547μs 53.3199 KOps/s 53.1358 KOps/s $\color{#35bf28}+0.35\%$
test_compile_indexing[int-tensordict-compile] 0.1927ms 54.2146μs 18.4452 KOps/s 18.6408 KOps/s $\color{#d91a1a}-1.05\%$
test_compile_indexing[int-tensordict-eager] 1.0472ms 20.9378μs 47.7605 KOps/s 49.3392 KOps/s $\color{#d91a1a}-3.20\%$
test_compile_indexing[int-tensorclass-compile] 0.2192ms 45.5713μs 21.9436 KOps/s 22.3402 KOps/s $\color{#d91a1a}-1.78\%$
test_compile_indexing[int-tensorclass-eager] 82.9850μs 19.0686μs 52.4422 KOps/s 52.8603 KOps/s $\color{#d91a1a}-0.79\%$
test_compile_indexing[int-pytree-compile] 0.1393ms 45.4097μs 22.0217 KOps/s 22.0211 KOps/s $+0.00\%$
test_compile_indexing[int-pytree-eager] 77.0430μs 18.6487μs 53.6231 KOps/s 53.9713 KOps/s $\color{#d91a1a}-0.65\%$
test_mod_add[eager] 90.6790μs 33.8733μs 29.5218 KOps/s 30.1005 KOps/s $\color{#d91a1a}-1.92\%$
test_mod_add[compile] 0.1200ms 48.2277μs 20.7350 KOps/s 20.6252 KOps/s $\color{#35bf28}+0.53\%$
test_mod_add[compile-overhead] 0.1192ms 48.2514μs 20.7248 KOps/s 20.3793 KOps/s $\color{#35bf28}+1.70\%$
test_mod_wrap[eager] 0.4529ms 0.2325ms 4.3002 KOps/s 4.3852 KOps/s $\color{#d91a1a}-1.94\%$
test_mod_wrap[compile] 0.3686ms 0.2101ms 4.7604 KOps/s 4.7286 KOps/s $\color{#35bf28}+0.67\%$
test_mod_wrap[compile-overhead] 0.4264ms 0.2074ms 4.8213 KOps/s 4.7863 KOps/s $\color{#35bf28}+0.73\%$
test_mod_wrap_and_backward[eager] 13.6008ms 11.5087ms 86.8911 Ops/s 81.7155 Ops/s $\textbf{\color{#35bf28}+6.33\%}$
test_mod_wrap_and_backward[compile] 16.3786ms 13.0440ms 76.6633 Ops/s 80.2115 Ops/s $\color{#d91a1a}-4.42\%$
test_mod_wrap_and_backward[compile-overhead] 13.6229ms 12.0948ms 82.6803 Ops/s 73.6196 Ops/s $\textbf{\color{#35bf28}+12.31\%}$
test_seq_add[eager] 0.2233ms 0.1092ms 9.1605 KOps/s 9.2936 KOps/s $\color{#d91a1a}-1.43\%$
test_seq_add[compile] 0.1511ms 61.4215μs 16.2809 KOps/s 15.8841 KOps/s $\color{#35bf28}+2.50\%$
test_seq_add[compile-overhead] 0.1540ms 59.3806μs 16.8405 KOps/s 16.1668 KOps/s $\color{#35bf28}+4.17\%$
test_seq_wrap[eager] 0.6981ms 0.4400ms 2.2729 KOps/s 2.3201 KOps/s $\color{#d91a1a}-2.03\%$
test_seq_wrap[compile] 0.4551ms 0.2275ms 4.3951 KOps/s 4.3169 KOps/s $\color{#35bf28}+1.81\%$
test_seq_wrap[compile-overhead] 0.3671ms 0.2260ms 4.4249 KOps/s 4.3208 KOps/s $\color{#35bf28}+2.41\%$
test_func_call_runtime[False-eager] 0.9269ms 0.5636ms 1.7744 KOps/s 1.7508 KOps/s $\color{#35bf28}+1.35\%$
test_func_call_runtime[False-compile] 0.8830ms 0.4324ms 2.3126 KOps/s 2.2661 KOps/s $\color{#35bf28}+2.05\%$
test_func_call_runtime[False-compile-overhead] 0.8424ms 0.4317ms 2.3166 KOps/s 2.2738 KOps/s $\color{#35bf28}+1.88\%$
test_func_call_runtime[True-eager] 1.0519ms 0.7782ms 1.2850 KOps/s 1.2657 KOps/s $\color{#35bf28}+1.52\%$
test_func_call_runtime[True-compile] 0.9086ms 0.4766ms 2.0980 KOps/s 2.0707 KOps/s $\color{#35bf28}+1.32\%$
test_func_call_runtime[True-compile-overhead] 0.7859ms 0.4769ms 2.0969 KOps/s 2.0899 KOps/s $\color{#35bf28}+0.34\%$
test_func_call_cm_runtime[False-eager] 1.1001ms 0.5668ms 1.7642 KOps/s 1.7297 KOps/s $\color{#35bf28}+2.00\%$
test_func_call_cm_runtime[False-compile] 0.6557ms 0.4298ms 2.3268 KOps/s 2.2750 KOps/s $\color{#35bf28}+2.28\%$
test_func_call_cm_runtime[False-compile-overhead] 0.7246ms 0.4300ms 2.3257 KOps/s 2.2872 KOps/s $\color{#35bf28}+1.68\%$
test_func_call_cm_runtime[True-eager] 1.5293ms 0.9185ms 1.0888 KOps/s 1.0792 KOps/s $\color{#35bf28}+0.89\%$
test_func_call_cm_runtime[True-compile] 0.8354ms 0.5021ms 1.9917 KOps/s 1.9908 KOps/s $\color{#35bf28}+0.04\%$
test_func_call_cm_runtime[True-compile-overhead] 0.8294ms 0.5003ms 1.9988 KOps/s 1.9913 KOps/s $\color{#35bf28}+0.38\%$
test_vmap_func_call_cm_runtime[eager] 2.8205ms 1.9598ms 510.2435 Ops/s 484.3326 Ops/s $\textbf{\color{#35bf28}+5.35\%}$
test_vmap_func_call_cm_runtime[compile] 0.7209ms 0.5211ms 1.9191 KOps/s 1.8566 KOps/s $\color{#35bf28}+3.37\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.9224ms 0.5239ms 1.9088 KOps/s 1.8517 KOps/s $\color{#35bf28}+3.08\%$
test_distributed 0.3036ms 0.1261ms 7.9313 KOps/s 7.6631 KOps/s $\color{#35bf28}+3.50\%$
test_tdmodule 0.1305ms 26.1188μs 38.2866 KOps/s 40.3174 KOps/s $\textbf{\color{#d91a1a}-5.04\%}$
test_tdmodule_dispatch 92.6130μs 47.0324μs 21.2619 KOps/s 21.3273 KOps/s $\color{#d91a1a}-0.31\%$
test_tdseq 58.6500μs 25.9513μs 38.5337 KOps/s 39.1465 KOps/s $\color{#d91a1a}-1.57\%$
test_tdseq_dispatch 0.1649ms 53.9121μs 18.5487 KOps/s 20.5544 KOps/s $\textbf{\color{#d91a1a}-9.76\%}$
test_instantiation_functorch 2.6623ms 1.5507ms 644.8820 Ops/s 635.4511 Ops/s $\color{#35bf28}+1.48\%$
test_exec_functorch 0.5911ms 0.1821ms 5.4918 KOps/s 5.3715 KOps/s $\color{#35bf28}+2.24\%$
test_exec_functional_call 0.3317ms 0.1725ms 5.7963 KOps/s 5.5227 KOps/s $\color{#35bf28}+4.95\%$
test_exec_td_decorator 0.5395ms 0.2293ms 4.3616 KOps/s 4.1772 KOps/s $\color{#35bf28}+4.41\%$
test_vmap_mlp_speed_decorator[True-True] 1.2364ms 0.6669ms 1.4994 KOps/s 1.4935 KOps/s $\color{#35bf28}+0.39\%$
test_vmap_mlp_speed_decorator[True-False] 0.9473ms 0.6582ms 1.5194 KOps/s 1.5152 KOps/s $\color{#35bf28}+0.28\%$
test_vmap_mlp_speed_decorator[False-True] 1.0265ms 0.5350ms 1.8692 KOps/s 1.8384 KOps/s $\color{#35bf28}+1.68\%$
test_vmap_mlp_speed_decorator[False-False] 0.9020ms 0.5356ms 1.8671 KOps/s 1.8379 KOps/s $\color{#35bf28}+1.59\%$
test_to_module_speed[True] 1.4139ms 1.2943ms 772.5891 Ops/s 768.9742 Ops/s $\color{#35bf28}+0.47\%$
test_to_module_speed[False] 1.7862ms 1.2811ms 780.5506 Ops/s 781.0165 Ops/s $\color{#d91a1a}-0.06\%$
test_tc_init 0.1125ms 48.2766μs 20.7140 KOps/s 23.6581 KOps/s $\textbf{\color{#d91a1a}-12.44\%}$
test_tc_init_nested 0.1992ms 96.6824μs 10.3431 KOps/s 12.1248 KOps/s $\textbf{\color{#d91a1a}-14.69\%}$
test_tc_first_layer_tensor 29.3850μs 1.5148μs 660.1461 KOps/s 666.4685 KOps/s $\color{#d91a1a}-0.95\%$
test_tc_first_layer_nontensor 31.1880μs 4.9173μs 203.3649 KOps/s 213.1210 KOps/s $\color{#d91a1a}-4.58\%$
test_tc_second_layer_tensor 26.4290μs 2.7604μs 362.2682 KOps/s 339.5757 KOps/s $\textbf{\color{#35bf28}+6.68\%}$
test_tc_second_layer_nontensor 36.8190μs 6.0649μs 164.8827 KOps/s 166.7637 KOps/s $\color{#d91a1a}-1.13\%$
test_unbind 0.2499s 14.3515ms 69.6791 Ops/s 77.6232 Ops/s $\textbf{\color{#d91a1a}-10.23\%}$
test_full_like 19.2976ms 13.6017ms 73.5205 Ops/s 106.1879 Ops/s $\textbf{\color{#d91a1a}-30.76\%}$
test_zeros_like 12.0090ms 8.1412ms 122.8321 Ops/s 278.3250 Ops/s $\textbf{\color{#d91a1a}-55.87\%}$
test_ones_like 11.9005ms 8.1150ms 123.2292 Ops/s 246.0855 Ops/s $\textbf{\color{#d91a1a}-49.92\%}$
test_clone 17.7482ms 10.2827ms 97.2508 Ops/s 156.6941 Ops/s $\textbf{\color{#d91a1a}-37.94\%}$
test_squeeze 76.6420μs 12.0047μs 83.3004 KOps/s 85.1787 KOps/s $\color{#d91a1a}-2.21\%$
test_unsqueeze 0.3602ms 90.5499μs 11.0436 KOps/s 11.0355 KOps/s $\color{#35bf28}+0.07\%$
test_split 0.4525ms 0.2002ms 4.9949 KOps/s 5.0353 KOps/s $\color{#d91a1a}-0.80\%$
test_permute 0.3387ms 0.2188ms 4.5703 KOps/s 4.5555 KOps/s $\color{#35bf28}+0.33\%$
test_stack 30.7477ms 26.4626ms 37.7892 Ops/s 34.9678 Ops/s $\textbf{\color{#35bf28}+8.07\%}$
test_cat 30.2165ms 27.7468ms 36.0402 Ops/s 35.0719 Ops/s $\color{#35bf28}+2.76\%$

[ghstack-poisoned]
@vmoens vmoens merged commit f02115c into gh/vmoens/36/base Nov 27, 2024
47 of 50 checks passed
vmoens added a commit that referenced this pull request Nov 27, 2024
ghstack-source-id: 04a62439b0fe60422fbc901172df46306e161cc5
Pull Request resolved: #1113
@vmoens vmoens deleted the gh/vmoens/36/head branch November 27, 2024 14:06
@vmoens vmoens added the enhancement New feature or request label Nov 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants