-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] CompositeDistribution.from_distributions #1113
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This was referenced Nov 26, 2024
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Nov 26, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 64.9920μs | 18.0009μs | 55.5526 KOps/s | 61.3180 KOps/s | |
test_plain_set_stack_nested | 55.4030μs | 18.2587μs | 54.7683 KOps/s | 59.7236 KOps/s | |
test_plain_set_nested_inplace | 58.7300μs | 19.7984μs | 50.5090 KOps/s | 54.8199 KOps/s | |
test_plain_set_stack_nested_inplace | 58.0980μs | 19.3404μs | 51.7053 KOps/s | 54.8377 KOps/s | |
test_items | 39.7030μs | 4.1655μs | 240.0678 KOps/s | 240.6913 KOps/s | |
test_items_nested | 0.5905ms | 0.4035ms | 2.4781 KOps/s | 2.4730 KOps/s | |
test_items_nested_locked | 0.5478ms | 0.4031ms | 2.4805 KOps/s | 2.4634 KOps/s | |
test_items_nested_leaf | 0.1385ms | 72.4334μs | 13.8058 KOps/s | 14.1339 KOps/s | |
test_items_stack_nested | 0.5651ms | 0.4066ms | 2.4595 KOps/s | 2.4613 KOps/s | |
test_items_stack_nested_leaf | 0.1497ms | 75.1640μs | 13.3042 KOps/s | 13.5989 KOps/s | |
test_items_stack_nested_locked | 0.8522ms | 0.4068ms | 2.4580 KOps/s | 2.4806 KOps/s | |
test_keys | 32.1600μs | 3.6079μs | 277.1683 KOps/s | 283.2637 KOps/s | |
test_keys_nested | 0.2550ms | 0.1377ms | 7.2623 KOps/s | 7.3735 KOps/s | |
test_keys_nested_locked | 2.4412ms | 0.1438ms | 6.9541 KOps/s | 6.9742 KOps/s | |
test_keys_nested_leaf | 0.2361ms | 0.1177ms | 8.4996 KOps/s | 8.5940 KOps/s | |
test_keys_stack_nested | 0.2190ms | 0.1358ms | 7.3615 KOps/s | 7.3409 KOps/s | |
test_keys_stack_nested_leaf | 0.2076ms | 0.1153ms | 8.6729 KOps/s | 8.6504 KOps/s | |
test_keys_stack_nested_locked | 0.2716ms | 0.1415ms | 7.0677 KOps/s | 7.0536 KOps/s | |
test_values | 20.8088μs | 1.0396μs | 961.9185 KOps/s | 931.4558 KOps/s | |
test_values_nested | 0.1135ms | 55.7596μs | 17.9341 KOps/s | 18.1563 KOps/s | |
test_values_nested_locked | 0.1243ms | 56.1952μs | 17.7951 KOps/s | 18.1915 KOps/s | |
test_values_nested_leaf | 0.1266ms | 59.7975μs | 16.7231 KOps/s | 15.6359 KOps/s | |
test_values_stack_nested | 0.1161ms | 57.3337μs | 17.4418 KOps/s | 17.8184 KOps/s | |
test_values_stack_nested_leaf | 0.1141ms | 60.0618μs | 16.6495 KOps/s | 16.4555 KOps/s | |
test_values_stack_nested_locked | 0.1127ms | 57.4674μs | 17.4012 KOps/s | 17.7569 KOps/s | |
test_membership | 20.2270μs | 0.8825μs | 1.1331 MOps/s | 1.0997 MOps/s | |
test_membership_nested | 46.8470μs | 2.9890μs | 334.5651 KOps/s | 340.6057 KOps/s | |
test_membership_nested_leaf | 45.3740μs | 2.9721μs | 336.4616 KOps/s | 343.0323 KOps/s | |
test_membership_stacked_nested | 37.9410μs | 2.9451μs | 339.5440 KOps/s | 341.0271 KOps/s | |
test_membership_stacked_nested_leaf | 26.2490μs | 2.9801μs | 335.5570 KOps/s | 340.0950 KOps/s | |
test_membership_nested_last | 34.5040μs | 4.2707μs | 234.1519 KOps/s | 236.1188 KOps/s | |
test_membership_nested_leaf_last | 46.4670μs | 4.2524μs | 235.1633 KOps/s | 232.7971 KOps/s | |
test_membership_stacked_nested_last | 40.1050μs | 6.8929μs | 145.0761 KOps/s | 238.2569 KOps/s | |
test_membership_stacked_nested_leaf_last | 57.8480μs | 6.8516μs | 145.9522 KOps/s | 240.8192 KOps/s | |
test_nested_getleaf | 42.4590μs | 10.9403μs | 91.4054 KOps/s | 92.9016 KOps/s | |
test_nested_get | 51.3360μs | 10.3138μs | 96.9572 KOps/s | 97.2819 KOps/s | |
test_stacked_getleaf | 45.5750μs | 10.8442μs | 92.2155 KOps/s | 91.7672 KOps/s | |
test_stacked_get | 33.3430μs | 10.2582μs | 97.4832 KOps/s | 99.1507 KOps/s | |
test_nested_getitemleaf | 73.5040μs | 11.3446μs | 88.1474 KOps/s | 88.8592 KOps/s | |
test_nested_getitem | 54.4420μs | 10.5755μs | 94.5585 KOps/s | 95.4221 KOps/s | |
test_stacked_getitemleaf | 37.8910μs | 11.3710μs | 87.9433 KOps/s | 88.8069 KOps/s | |
test_stacked_getitem | 58.4890μs | 10.5827μs | 94.4937 KOps/s | 95.1435 KOps/s | |
test_lock_nested | 3.4139ms | 0.4595ms | 2.1761 KOps/s | 2.2543 KOps/s | |
test_lock_stack_nested | 0.5820ms | 0.4198ms | 2.3821 KOps/s | 2.3739 KOps/s | |
test_unlock_nested | 0.9500ms | 0.3733ms | 2.6788 KOps/s | 2.7329 KOps/s | |
test_unlock_stack_nested | 0.4803ms | 0.3360ms | 2.9762 KOps/s | 2.9649 KOps/s | |
test_flatten_speed | 0.1871ms | 95.6938μs | 10.4500 KOps/s | 10.4992 KOps/s | |
test_unflatten_speed | 0.5875ms | 0.4922ms | 2.0316 KOps/s | 2.0317 KOps/s | |
test_common_ops | 4.1556ms | 0.7901ms | 1.2656 KOps/s | 1.3442 KOps/s | |
test_creation | 0.1053ms | 2.0650μs | 484.2629 KOps/s | 481.1262 KOps/s | |
test_creation_empty | 0.1369ms | 11.5122μs | 86.8645 KOps/s | 115.4227 KOps/s | |
test_creation_nested_1 | 50.3940μs | 13.8053μs | 72.4358 KOps/s | 87.6091 KOps/s | |
test_creation_nested_2 | 64.7600μs | 17.9598μs | 55.6798 KOps/s | 62.8666 KOps/s | |
test_clone | 0.1742ms | 13.2557μs | 75.4393 KOps/s | 75.7335 KOps/s | |
test_getitem[int] | 1.3131ms | 12.9057μs | 77.4850 KOps/s | 77.1727 KOps/s | |
test_getitem[slice_int] | 0.1800ms | 25.7866μs | 38.7798 KOps/s | 39.7047 KOps/s | |
test_getitem[range] | 0.2787ms | 49.0188μs | 20.4004 KOps/s | 19.4883 KOps/s | |
test_getitem[tuple] | 0.1558ms | 20.6598μs | 48.4032 KOps/s | 48.6689 KOps/s | |
test_getitem[list] | 0.3835ms | 49.7288μs | 20.1091 KOps/s | 21.3877 KOps/s | |
test_setitem_dim[int] | 75.9720μs | 25.8768μs | 38.6446 KOps/s | 39.3349 KOps/s | |
test_setitem_dim[slice_int] | 0.1197ms | 53.8233μs | 18.5793 KOps/s | 19.1720 KOps/s | |
test_setitem_dim[range] | 0.1522ms | 75.3247μs | 13.2759 KOps/s | 13.0586 KOps/s | |
test_setitem_dim[tuple] | 0.1248ms | 43.2039μs | 23.1461 KOps/s | 24.2807 KOps/s | |
test_setitem | 0.1944ms | 20.3249μs | 49.2007 KOps/s | 53.7328 KOps/s | |
test_set | 0.2112ms | 19.9153μs | 50.2125 KOps/s | 55.0495 KOps/s | |
test_set_shared | 2.5826ms | 0.1746ms | 5.7269 KOps/s | 5.8522 KOps/s | |
test_update | 0.1806ms | 22.1958μs | 45.0536 KOps/s | 51.1579 KOps/s | |
test_update_nested | 0.1754ms | 33.1097μs | 30.2027 KOps/s | 32.9890 KOps/s | |
test_update__nested | 0.8965ms | 31.7712μs | 31.4750 KOps/s | 30.4658 KOps/s | |
test_set_nested | 0.1486ms | 22.0913μs | 45.2667 KOps/s | 48.9047 KOps/s | |
test_set_nested_new | 0.1704ms | 26.4237μs | 37.8448 KOps/s | 39.9257 KOps/s | |
test_select | 0.1887ms | 42.7611μs | 23.3857 KOps/s | 24.3478 KOps/s | |
test_select_nested | 0.1560ms | 59.8574μs | 16.7064 KOps/s | 16.3199 KOps/s | |
test_exclude_nested | 0.1625ms | 78.8947μs | 12.6751 KOps/s | 12.8613 KOps/s | |
test_empty[True] | 0.7019ms | 0.3852ms | 2.5961 KOps/s | 2.6649 KOps/s | |
test_empty[False] | 10.3620μs | 1.2330μs | 811.0275 KOps/s | 841.7366 KOps/s | |
test_unbind_speed | 0.6026ms | 0.2613ms | 3.8265 KOps/s | 3.8293 KOps/s | |
test_unbind_speed_stack0 | 0.4245ms | 0.2529ms | 3.9536 KOps/s | 3.8759 KOps/s | |
test_unbind_speed_stack1 | 0.1125s | 0.7695ms | 1.2995 KOps/s | 1.4072 KOps/s | |
test_split | 0.1176s | 1.8292ms | 546.6738 Ops/s | 560.3362 Ops/s | |
test_chunk | 1.8676ms | 1.6322ms | 612.6608 Ops/s | 557.2152 Ops/s | |
test_consolidate_njt[False-None] | 0.1238s | 9.6364ms | 103.7729 Ops/s | 117.3698 Ops/s | |
test_creation[device0] | 0.3186ms | 93.1646μs | 10.7337 KOps/s | 10.7028 KOps/s | |
test_creation_from_tensor | 4.4918ms | 96.7253μs | 10.3386 KOps/s | 10.2517 KOps/s | |
test_add_one[memmap_tensor0] | 0.3426ms | 4.9967μs | 200.1321 KOps/s | 192.2278 KOps/s | |
test_contiguous[memmap_tensor0] | 17.6430μs | 0.5129μs | 1.9495 MOps/s | 1.9911 MOps/s | |
test_stack[memmap_tensor0] | 35.5760μs | 3.3745μs | 296.3399 KOps/s | 285.4565 KOps/s | |
test_memmaptd_index | 1.2283ms | 0.2391ms | 4.1825 KOps/s | 4.1973 KOps/s | |
test_memmaptd_index_astensor | 0.7615ms | 0.3210ms | 3.1148 KOps/s | 3.1586 KOps/s | |
test_memmaptd_index_op | 1.1149ms | 0.5850ms | 1.7094 KOps/s | 1.8217 KOps/s | |
test_serialize_model | 0.1292s | 0.1220s | 8.1995 Ops/s | 8.4918 Ops/s | |
test_serialize_model_pickle | 0.4721s | 0.3893s | 2.5685 Ops/s | 2.4630 Ops/s | |
test_serialize_weights | 0.1413s | 0.1216s | 8.2240 Ops/s | 8.3279 Ops/s | |
test_serialize_weights_returnearly | 0.1714s | 0.1586s | 6.3036 Ops/s | 6.4384 Ops/s | |
test_serialize_weights_pickle | 0.5038s | 0.4090s | 2.4453 Ops/s | 2.4838 Ops/s | |
test_serialize_weights_filesystem | 0.1513s | 0.1447s | 6.9104 Ops/s | 6.9210 Ops/s | |
test_serialize_model_filesystem | 0.1575s | 0.1518s | 6.5890 Ops/s | 5.8693 Ops/s | |
test_reshape_pytree | 71.9140μs | 27.1294μs | 36.8604 KOps/s | 37.0980 KOps/s | |
test_reshape_td | 77.4540μs | 33.3981μs | 29.9418 KOps/s | 30.0000 KOps/s | |
test_view_pytree | 0.1043ms | 27.5257μs | 36.3297 KOps/s | 36.4560 KOps/s | |
test_view_td | 96.4800μs | 37.8072μs | 26.4500 KOps/s | 25.8228 KOps/s | |
test_unbind_pytree | 0.1096ms | 30.6125μs | 32.6664 KOps/s | 32.6289 KOps/s | |
test_unbind_td | 0.3976ms | 39.0858μs | 25.5847 KOps/s | 25.8919 KOps/s | |
test_split_pytree | 0.1028ms | 29.8732μs | 33.4748 KOps/s | 33.3871 KOps/s | |
test_split_td | 0.2062ms | 45.0847μs | 22.1805 KOps/s | 22.4118 KOps/s | |
test_add_pytree | 0.1093ms | 36.3367μs | 27.5204 KOps/s | 27.8314 KOps/s | |
test_add_td | 0.1507ms | 54.1355μs | 18.4722 KOps/s | 20.0273 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1830ms | 63.1657μs | 15.8314 KOps/s | 15.8004 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 1.8286ms | 0.1639ms | 6.1030 KOps/s | 6.1467 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2547ms | 47.4641μs | 21.0686 KOps/s | 21.5977 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2284ms | 0.1201ms | 8.3275 KOps/s | 8.2194 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 70.5820μs | 26.6472μs | 37.5274 KOps/s | 38.0560 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1039ms | 53.9387μs | 18.5396 KOps/s | 18.5009 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2021ms | 79.7582μs | 12.5379 KOps/s | 12.4022 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1208ms | 68.4698μs | 14.6050 KOps/s | 14.5476 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2308ms | 0.1054ms | 9.4896 KOps/s | 9.2743 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 1.5744ms | 0.2020ms | 4.9508 KOps/s | 4.9516 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1054ms | 45.3650μs | 22.0434 KOps/s | 21.7413 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 1.4829ms | 61.7726μs | 16.1884 KOps/s | 15.8651 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2376ms | 0.1046ms | 9.5617 KOps/s | 9.5416 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.2944ms | 0.2042ms | 4.8976 KOps/s | 4.9008 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4020ms | 0.2155ms | 4.6413 KOps/s | 4.7344 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2142ms | 0.1066ms | 9.3773 KOps/s | 9.4266 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2130ms | 58.2379μs | 17.1709 KOps/s | 18.0970 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1234ms | 45.8483μs | 21.8111 KOps/s | 21.5092 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6628ms | 0.1610ms | 6.2114 KOps/s | 6.2525 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2038ms | 0.1050ms | 9.5271 KOps/s | 9.5321 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 75.1800μs | 20.9799μs | 47.6647 KOps/s | 46.8412 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1255ms | 60.0593μs | 16.6502 KOps/s | 17.4570 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1823ms | 82.0431μs | 12.1887 KOps/s | 12.3893 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1311ms | 69.4761μs | 14.3934 KOps/s | 14.5923 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3842ms | 0.2114ms | 4.7305 KOps/s | 4.7395 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5095ms | 1.2738ms | 785.0600 Ops/s | 764.5989 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3236ms | 0.2089ms | 4.7864 KOps/s | 4.8686 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.0645ms | 0.7766ms | 1.2876 KOps/s | 1.2747 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.6146ms | 0.4729ms | 2.1147 KOps/s | 2.1710 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.2495ms | 2.7340ms | 365.7640 Ops/s | 397.6055 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1067ms | 36.2358μs | 27.5970 KOps/s | 27.0827 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.7698ms | 35.1135μs | 28.4791 KOps/s | 29.7063 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 99.9860μs | 30.1631μs | 33.1531 KOps/s | 34.3455 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 87.3730μs | 23.6201μs | 42.3368 KOps/s | 42.3550 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1268ms | 31.1427μs | 32.1102 KOps/s | 34.0537 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 73.0960μs | 23.5264μs | 42.5054 KOps/s | 42.5987 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1540ms | 53.0690μs | 18.8434 KOps/s | 18.9935 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.7146ms | 20.7081μs | 48.2903 KOps/s | 50.7533 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1292ms | 45.1687μs | 22.1392 KOps/s | 22.6195 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 92.6920μs | 19.1544μs | 52.2072 KOps/s | 52.8139 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1229ms | 45.3332μs | 22.0589 KOps/s | 22.0710 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 59.4310μs | 18.7547μs | 53.3199 KOps/s | 53.1358 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1927ms | 54.2146μs | 18.4452 KOps/s | 18.6408 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0472ms | 20.9378μs | 47.7605 KOps/s | 49.3392 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2192ms | 45.5713μs | 21.9436 KOps/s | 22.3402 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 82.9850μs | 19.0686μs | 52.4422 KOps/s | 52.8603 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1393ms | 45.4097μs | 22.0217 KOps/s | 22.0211 KOps/s | |
test_compile_indexing[int-pytree-eager] | 77.0430μs | 18.6487μs | 53.6231 KOps/s | 53.9713 KOps/s | |
test_mod_add[eager] | 90.6790μs | 33.8733μs | 29.5218 KOps/s | 30.1005 KOps/s | |
test_mod_add[compile] | 0.1200ms | 48.2277μs | 20.7350 KOps/s | 20.6252 KOps/s | |
test_mod_add[compile-overhead] | 0.1192ms | 48.2514μs | 20.7248 KOps/s | 20.3793 KOps/s | |
test_mod_wrap[eager] | 0.4529ms | 0.2325ms | 4.3002 KOps/s | 4.3852 KOps/s | |
test_mod_wrap[compile] | 0.3686ms | 0.2101ms | 4.7604 KOps/s | 4.7286 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4264ms | 0.2074ms | 4.8213 KOps/s | 4.7863 KOps/s | |
test_mod_wrap_and_backward[eager] | 13.6008ms | 11.5087ms | 86.8911 Ops/s | 81.7155 Ops/s | |
test_mod_wrap_and_backward[compile] | 16.3786ms | 13.0440ms | 76.6633 Ops/s | 80.2115 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 13.6229ms | 12.0948ms | 82.6803 Ops/s | 73.6196 Ops/s | |
test_seq_add[eager] | 0.2233ms | 0.1092ms | 9.1605 KOps/s | 9.2936 KOps/s | |
test_seq_add[compile] | 0.1511ms | 61.4215μs | 16.2809 KOps/s | 15.8841 KOps/s | |
test_seq_add[compile-overhead] | 0.1540ms | 59.3806μs | 16.8405 KOps/s | 16.1668 KOps/s | |
test_seq_wrap[eager] | 0.6981ms | 0.4400ms | 2.2729 KOps/s | 2.3201 KOps/s | |
test_seq_wrap[compile] | 0.4551ms | 0.2275ms | 4.3951 KOps/s | 4.3169 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3671ms | 0.2260ms | 4.4249 KOps/s | 4.3208 KOps/s | |
test_func_call_runtime[False-eager] | 0.9269ms | 0.5636ms | 1.7744 KOps/s | 1.7508 KOps/s | |
test_func_call_runtime[False-compile] | 0.8830ms | 0.4324ms | 2.3126 KOps/s | 2.2661 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8424ms | 0.4317ms | 2.3166 KOps/s | 2.2738 KOps/s | |
test_func_call_runtime[True-eager] | 1.0519ms | 0.7782ms | 1.2850 KOps/s | 1.2657 KOps/s | |
test_func_call_runtime[True-compile] | 0.9086ms | 0.4766ms | 2.0980 KOps/s | 2.0707 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.7859ms | 0.4769ms | 2.0969 KOps/s | 2.0899 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.1001ms | 0.5668ms | 1.7642 KOps/s | 1.7297 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6557ms | 0.4298ms | 2.3268 KOps/s | 2.2750 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.7246ms | 0.4300ms | 2.3257 KOps/s | 2.2872 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.5293ms | 0.9185ms | 1.0888 KOps/s | 1.0792 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.8354ms | 0.5021ms | 1.9917 KOps/s | 1.9908 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.8294ms | 0.5003ms | 1.9988 KOps/s | 1.9913 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.8205ms | 1.9598ms | 510.2435 Ops/s | 484.3326 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.7209ms | 0.5211ms | 1.9191 KOps/s | 1.8566 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.9224ms | 0.5239ms | 1.9088 KOps/s | 1.8517 KOps/s | |
test_distributed | 0.3036ms | 0.1261ms | 7.9313 KOps/s | 7.6631 KOps/s | |
test_tdmodule | 0.1305ms | 26.1188μs | 38.2866 KOps/s | 40.3174 KOps/s | |
test_tdmodule_dispatch | 92.6130μs | 47.0324μs | 21.2619 KOps/s | 21.3273 KOps/s | |
test_tdseq | 58.6500μs | 25.9513μs | 38.5337 KOps/s | 39.1465 KOps/s | |
test_tdseq_dispatch | 0.1649ms | 53.9121μs | 18.5487 KOps/s | 20.5544 KOps/s | |
test_instantiation_functorch | 2.6623ms | 1.5507ms | 644.8820 Ops/s | 635.4511 Ops/s | |
test_exec_functorch | 0.5911ms | 0.1821ms | 5.4918 KOps/s | 5.3715 KOps/s | |
test_exec_functional_call | 0.3317ms | 0.1725ms | 5.7963 KOps/s | 5.5227 KOps/s | |
test_exec_td_decorator | 0.5395ms | 0.2293ms | 4.3616 KOps/s | 4.1772 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.2364ms | 0.6669ms | 1.4994 KOps/s | 1.4935 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9473ms | 0.6582ms | 1.5194 KOps/s | 1.5152 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 1.0265ms | 0.5350ms | 1.8692 KOps/s | 1.8384 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.9020ms | 0.5356ms | 1.8671 KOps/s | 1.8379 KOps/s | |
test_to_module_speed[True] | 1.4139ms | 1.2943ms | 772.5891 Ops/s | 768.9742 Ops/s | |
test_to_module_speed[False] | 1.7862ms | 1.2811ms | 780.5506 Ops/s | 781.0165 Ops/s | |
test_tc_init | 0.1125ms | 48.2766μs | 20.7140 KOps/s | 23.6581 KOps/s | |
test_tc_init_nested | 0.1992ms | 96.6824μs | 10.3431 KOps/s | 12.1248 KOps/s | |
test_tc_first_layer_tensor | 29.3850μs | 1.5148μs | 660.1461 KOps/s | 666.4685 KOps/s | |
test_tc_first_layer_nontensor | 31.1880μs | 4.9173μs | 203.3649 KOps/s | 213.1210 KOps/s | |
test_tc_second_layer_tensor | 26.4290μs | 2.7604μs | 362.2682 KOps/s | 339.5757 KOps/s | |
test_tc_second_layer_nontensor | 36.8190μs | 6.0649μs | 164.8827 KOps/s | 166.7637 KOps/s | |
test_unbind | 0.2499s | 14.3515ms | 69.6791 Ops/s | 77.6232 Ops/s | |
test_full_like | 19.2976ms | 13.6017ms | 73.5205 Ops/s | 106.1879 Ops/s | |
test_zeros_like | 12.0090ms | 8.1412ms | 122.8321 Ops/s | 278.3250 Ops/s | |
test_ones_like | 11.9005ms | 8.1150ms | 123.2292 Ops/s | 246.0855 Ops/s | |
test_clone | 17.7482ms | 10.2827ms | 97.2508 Ops/s | 156.6941 Ops/s | |
test_squeeze | 76.6420μs | 12.0047μs | 83.3004 KOps/s | 85.1787 KOps/s | |
test_unsqueeze | 0.3602ms | 90.5499μs | 11.0436 KOps/s | 11.0355 KOps/s | |
test_split | 0.4525ms | 0.2002ms | 4.9949 KOps/s | 5.0353 KOps/s | |
test_permute | 0.3387ms | 0.2188ms | 4.5703 KOps/s | 4.5555 KOps/s | |
test_stack | 30.7477ms | 26.4626ms | 37.7892 Ops/s | 34.9678 Ops/s | |
test_cat | 30.2165ms | 27.7468ms | 36.0402 Ops/s | 35.0719 Ops/s |
vmoens
added a commit
that referenced
this pull request
Nov 27, 2024
ghstack-source-id: 04a62439b0fe60422fbc901172df46306e161cc5 Pull Request resolved: #1113
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):