-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] Optional in_keys for WrapModule #1145
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 53.4200μs | 21.2080μs | 47.1521 KOps/s | 49.8487 KOps/s | |
test_plain_set_stack_nested | 62.1960μs | 21.0640μs | 47.4744 KOps/s | 49.0533 KOps/s | |
test_plain_set_nested_inplace | 58.3590μs | 22.9718μs | 43.5316 KOps/s | 45.8462 KOps/s | |
test_plain_set_stack_nested_inplace | 58.0890μs | 23.0160μs | 43.4480 KOps/s | 45.6165 KOps/s | |
test_items | 34.2740μs | 4.1728μs | 239.6463 KOps/s | 237.6867 KOps/s | |
test_items_nested | 0.6252ms | 0.4030ms | 2.4812 KOps/s | 2.4505 KOps/s | |
test_items_nested_locked | 0.5623ms | 0.4054ms | 2.4664 KOps/s | 2.4532 KOps/s | |
test_items_nested_leaf | 0.1462ms | 77.2844μs | 12.9392 KOps/s | 12.7552 KOps/s | |
test_items_stack_nested | 0.5856ms | 0.4134ms | 2.4188 KOps/s | 2.4149 KOps/s | |
test_items_stack_nested_leaf | 0.1740ms | 81.5213μs | 12.2667 KOps/s | 12.4101 KOps/s | |
test_items_stack_nested_locked | 0.7953ms | 0.4124ms | 2.4249 KOps/s | 2.4052 KOps/s | |
test_keys | 27.5620μs | 3.4968μs | 285.9737 KOps/s | 277.2323 KOps/s | |
test_keys_nested | 0.2290ms | 0.1659ms | 6.0289 KOps/s | 5.9484 KOps/s | |
test_keys_nested_locked | 1.9598ms | 0.1738ms | 5.7538 KOps/s | 5.6985 KOps/s | |
test_keys_nested_leaf | 0.2101ms | 0.1446ms | 6.9159 KOps/s | 6.7777 KOps/s | |
test_keys_stack_nested | 0.2290ms | 0.1639ms | 6.1006 KOps/s | 5.9592 KOps/s | |
test_keys_stack_nested_leaf | 0.1972ms | 0.1398ms | 7.1552 KOps/s | 6.8665 KOps/s | |
test_keys_stack_nested_locked | 0.2344ms | 0.1694ms | 5.9021 KOps/s | 5.8107 KOps/s | |
test_values | 11.1728μs | 1.0473μs | 954.8002 KOps/s | 964.7260 KOps/s | |
test_values_nested | 0.1356ms | 63.1094μs | 15.8455 KOps/s | 15.7405 KOps/s | |
test_values_nested_locked | 0.1175ms | 62.8360μs | 15.9144 KOps/s | 15.9428 KOps/s | |
test_values_nested_leaf | 0.1329ms | 72.7534μs | 13.7451 KOps/s | 13.7526 KOps/s | |
test_values_stack_nested | 0.1227ms | 64.1778μs | 15.5817 KOps/s | 15.8333 KOps/s | |
test_values_stack_nested_leaf | 0.1360ms | 71.1261μs | 14.0595 KOps/s | 13.8056 KOps/s | |
test_values_stack_nested_locked | 0.1358ms | 64.5571μs | 15.4902 KOps/s | 15.7785 KOps/s | |
test_membership | 20.1580μs | 0.8899μs | 1.1237 MOps/s | 1.1683 MOps/s | |
test_membership_nested | 48.1000μs | 2.9116μs | 343.4531 KOps/s | 343.7016 KOps/s | |
test_membership_nested_leaf | 65.8950μs | 2.9693μs | 336.7836 KOps/s | 338.4976 KOps/s | |
test_membership_stacked_nested | 44.5840μs | 2.9189μs | 342.5953 KOps/s | 345.8450 KOps/s | |
test_membership_stacked_nested_leaf | 25.3380μs | 2.9388μs | 340.2803 KOps/s | 349.6691 KOps/s | |
test_membership_nested_last | 68.2260μs | 4.4303μs | 225.7195 KOps/s | 230.6200 KOps/s | |
test_membership_nested_leaf_last | 42.6900μs | 4.3800μs | 228.3127 KOps/s | 223.0888 KOps/s | |
test_membership_stacked_nested_last | 69.7610μs | 13.5264μs | 73.9296 KOps/s | 142.2617 KOps/s | |
test_membership_stacked_nested_leaf_last | 42.8900μs | 13.7015μs | 72.9845 KOps/s | 142.4671 KOps/s | |
test_nested_getleaf | 61.5750μs | 10.9446μs | 91.3693 KOps/s | 91.3074 KOps/s | |
test_nested_get | 66.5550μs | 10.3330μs | 96.7774 KOps/s | 96.2999 KOps/s | |
test_stacked_getleaf | 39.6140μs | 10.5214μs | 95.0441 KOps/s | 91.7525 KOps/s | |
test_stacked_get | 69.7110μs | 10.1120μs | 98.8922 KOps/s | 96.7135 KOps/s | |
test_nested_getitemleaf | 62.4770μs | 11.3207μs | 88.3338 KOps/s | 84.9023 KOps/s | |
test_nested_getitem | 51.9870μs | 10.5773μs | 94.5421 KOps/s | 93.5466 KOps/s | |
test_stacked_getitemleaf | 62.4970μs | 11.2121μs | 89.1890 KOps/s | 88.1993 KOps/s | |
test_stacked_getitem | 83.5660μs | 10.7589μs | 92.9466 KOps/s | 93.6851 KOps/s | |
test_lock_nested | 4.2331ms | 0.4598ms | 2.1748 KOps/s | 2.1520 KOps/s | |
test_lock_stack_nested | 0.8224ms | 0.4136ms | 2.4176 KOps/s | 2.3572 KOps/s | |
test_unlock_nested | 1.1670ms | 0.3752ms | 2.6649 KOps/s | 2.6284 KOps/s | |
test_unlock_stack_nested | 0.4240ms | 0.3317ms | 3.0145 KOps/s | 2.8941 KOps/s | |
test_flatten_speed | 0.1809ms | 99.5365μs | 10.0466 KOps/s | 9.9624 KOps/s | |
test_unflatten_speed | 0.7485ms | 0.5222ms | 1.9150 KOps/s | 1.8627 KOps/s | |
test_common_ops | 1.8182ms | 0.8152ms | 1.2267 KOps/s | 1.3385 KOps/s | |
test_creation | 27.9020μs | 2.4800μs | 403.2288 KOps/s | 402.1582 KOps/s | |
test_creation_empty | 62.3770μs | 12.4490μs | 80.3279 KOps/s | 102.8608 KOps/s | |
test_creation_nested_1 | 48.1000μs | 15.6082μs | 64.0688 KOps/s | 79.6168 KOps/s | |
test_creation_nested_2 | 88.3760μs | 19.9924μs | 50.0190 KOps/s | 58.5564 KOps/s | |
test_clone | 0.1026ms | 13.1977μs | 75.7708 KOps/s | 75.5678 KOps/s | |
test_getitem[int] | 1.2997ms | 12.8127μs | 78.0477 KOps/s | 78.2177 KOps/s | |
test_getitem[slice_int] | 0.1593ms | 23.8181μs | 41.9848 KOps/s | 39.3122 KOps/s | |
test_getitem[range] | 0.1912ms | 47.2779μs | 21.1515 KOps/s | 20.4098 KOps/s | |
test_getitem[tuple] | 0.1566ms | 20.1929μs | 49.5223 KOps/s | 48.8507 KOps/s | |
test_getitem[list] | 0.3165ms | 42.3696μs | 23.6018 KOps/s | 22.4169 KOps/s | |
test_setitem_dim[int] | 47.1590μs | 24.3308μs | 41.1002 KOps/s | 40.0613 KOps/s | |
test_setitem_dim[slice_int] | 89.5880μs | 50.2568μs | 19.8978 KOps/s | 19.4487 KOps/s | |
test_setitem_dim[range] | 0.1209ms | 71.4498μs | 13.9958 KOps/s | 13.6354 KOps/s | |
test_setitem_dim[tuple] | 77.2150μs | 39.8297μs | 25.1069 KOps/s | 24.4687 KOps/s | |
test_setitem | 0.1189ms | 20.5987μs | 48.5467 KOps/s | 52.6495 KOps/s | |
test_set | 0.1358ms | 20.2074μs | 49.4868 KOps/s | 51.8705 KOps/s | |
test_set_shared | 2.5452ms | 0.1751ms | 5.7122 KOps/s | 5.9207 KOps/s | |
test_update | 0.2549ms | 23.8592μs | 41.9125 KOps/s | 49.3226 KOps/s | |
test_update_nested | 0.1436ms | 34.0149μs | 29.3989 KOps/s | 33.2086 KOps/s | |
test_update__nested | 0.4199ms | 33.7299μs | 29.6473 KOps/s | 30.1091 KOps/s | |
test_set_nested | 0.1358ms | 22.6482μs | 44.1535 KOps/s | 48.2192 KOps/s | |
test_set_nested_new | 0.1483ms | 26.8116μs | 37.2972 KOps/s | 40.2961 KOps/s | |
test_select | 0.2157ms | 44.0481μs | 22.7025 KOps/s | 24.1732 KOps/s | |
test_select_nested | 0.1313ms | 63.9699μs | 15.6324 KOps/s | 15.7195 KOps/s | |
test_exclude_nested | 0.2828ms | 85.0184μs | 11.7622 KOps/s | 12.0393 KOps/s | |
test_empty[True] | 1.0505ms | 0.4226ms | 2.3661 KOps/s | 2.4021 KOps/s | |
test_empty[False] | 42.8527μs | 1.3889μs | 719.9804 KOps/s | 699.4656 KOps/s | |
test_unbind_speed | 0.3876ms | 0.2695ms | 3.7112 KOps/s | 3.7441 KOps/s | |
test_unbind_speed_stack0 | 0.3662ms | 0.2571ms | 3.8889 KOps/s | 3.8129 KOps/s | |
test_unbind_speed_stack1 | 0.1167s | 0.7904ms | 1.2652 KOps/s | 1.3656 KOps/s | |
test_split | 0.1134s | 1.7580ms | 568.8325 Ops/s | 555.2450 Ops/s | |
test_chunk | 0.1156s | 1.7810ms | 561.4812 Ops/s | 559.0630 Ops/s | |
test_consolidate_njt[False-None] | 8.6018ms | 8.0850ms | 123.6853 Ops/s | 121.0131 Ops/s | |
test_creation[device0] | 0.2682ms | 91.1498μs | 10.9709 KOps/s | 10.8524 KOps/s | |
test_creation_from_tensor | 0.3200ms | 94.0119μs | 10.6370 KOps/s | 10.3472 KOps/s | |
test_add_one[memmap_tensor0] | 0.2997ms | 4.7073μs | 212.4341 KOps/s | 212.7025 KOps/s | |
test_contiguous[memmap_tensor0] | 39.9140μs | 0.5127μs | 1.9503 MOps/s | 1.9862 MOps/s | |
test_stack[memmap_tensor0] | 46.0970μs | 3.2904μs | 303.9150 KOps/s | 305.8107 KOps/s | |
test_memmaptd_index | 0.4904ms | 0.2347ms | 4.2604 KOps/s | 4.3193 KOps/s | |
test_memmaptd_index_astensor | 0.7669ms | 0.3221ms | 3.1042 KOps/s | 3.1202 KOps/s | |
test_memmaptd_index_op | 1.0248ms | 0.5972ms | 1.6744 KOps/s | 1.8288 KOps/s | |
test_serialize_model | 0.1358s | 0.1179s | 8.4830 Ops/s | 8.4269 Ops/s | |
test_serialize_model_pickle | 0.4988s | 0.4004s | 2.4977 Ops/s | 2.4571 Ops/s | |
test_serialize_weights | 0.2413s | 0.1349s | 7.4125 Ops/s | 7.5447 Ops/s | |
test_serialize_weights_returnearly | 0.2635s | 0.1774s | 5.6383 Ops/s | 6.4035 Ops/s | |
test_serialize_weights_pickle | 0.4951s | 0.4083s | 2.4493 Ops/s | 2.1850 Ops/s | |
test_serialize_weights_filesystem | 0.1523s | 0.1444s | 6.9251 Ops/s | 7.0915 Ops/s | |
test_serialize_model_filesystem | 0.2657s | 0.1729s | 5.7833 Ops/s | 5.9112 Ops/s | |
test_reshape_pytree | 66.0140μs | 26.4369μs | 37.8260 KOps/s | 37.7521 KOps/s | |
test_reshape_td | 82.1540μs | 32.4669μs | 30.8006 KOps/s | 29.6762 KOps/s | |
test_view_pytree | 58.4390μs | 26.8560μs | 37.2357 KOps/s | 38.0254 KOps/s | |
test_view_td | 81.5930μs | 37.7093μs | 26.5186 KOps/s | 26.4128 KOps/s | |
test_unbind_pytree | 72.1450μs | 29.7949μs | 33.5627 KOps/s | 34.0393 KOps/s | |
test_unbind_td | 0.3442ms | 39.3971μs | 25.3826 KOps/s | 25.3867 KOps/s | |
test_split_pytree | 78.6170μs | 29.2448μs | 34.1941 KOps/s | 34.3820 KOps/s | |
test_split_td | 0.5630ms | 44.5585μs | 22.4424 KOps/s | 22.2997 KOps/s | |
test_add_pytree | 79.4480μs | 35.3429μs | 28.2942 KOps/s | 28.9898 KOps/s | |
test_add_td | 0.1205ms | 56.6126μs | 17.6639 KOps/s | 17.9471 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1599ms | 61.7590μs | 16.1920 KOps/s | 16.5064 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4717ms | 0.1683ms | 5.9435 KOps/s | 5.9144 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1366ms | 45.2507μs | 22.0991 KOps/s | 22.2272 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2394ms | 0.1176ms | 8.5044 KOps/s | 8.6131 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 91.4400μs | 26.3004μs | 38.0222 KOps/s | 38.8517 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1298ms | 59.1190μs | 16.9150 KOps/s | 16.9982 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2035ms | 78.1504μs | 12.7958 KOps/s | 12.7145 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1515ms | 68.0467μs | 14.6958 KOps/s | 14.7625 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2311ms | 0.1041ms | 9.6039 KOps/s | 9.7020 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4297ms | 0.2137ms | 4.6785 KOps/s | 4.6819 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1344ms | 44.3086μs | 22.5690 KOps/s | 22.1905 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4955ms | 63.6951μs | 15.6998 KOps/s | 15.3937 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1915ms | 0.1030ms | 9.7048 KOps/s | 9.7944 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4351ms | 0.2022ms | 4.9453 KOps/s | 5.0299 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4624ms | 0.2297ms | 4.3541 KOps/s | 4.3353 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1996ms | 0.1043ms | 9.5867 KOps/s | 9.4623 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1207ms | 59.9178μs | 16.6895 KOps/s | 16.9525 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1296ms | 44.3126μs | 22.5669 KOps/s | 21.9457 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 1.4078ms | 0.1617ms | 6.1840 KOps/s | 6.3751 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2489ms | 0.1053ms | 9.4935 KOps/s | 9.8004 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 93.3050μs | 21.4703μs | 46.5759 KOps/s | 47.7177 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.3393ms | 67.8520μs | 14.7380 KOps/s | 15.2028 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1933ms | 81.7681μs | 12.2297 KOps/s | 12.4404 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1248ms | 69.1185μs | 14.4679 KOps/s | 14.7696 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4353ms | 0.2085ms | 4.7962 KOps/s | 4.8748 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.3799ms | 1.2835ms | 779.1273 Ops/s | 726.8870 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.4010ms | 0.2049ms | 4.8795 KOps/s | 4.9539 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.2315ms | 0.7676ms | 1.3028 KOps/s | 1.3150 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.9899ms | 0.4574ms | 2.1865 KOps/s | 2.1951 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.5690ms | 2.6607ms | 375.8456 Ops/s | 376.7318 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 98.0820μs | 36.1920μs | 27.6304 KOps/s | 27.9012 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.6123ms | 32.9214μs | 30.3754 KOps/s | 30.6141 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 73.5480μs | 28.5865μs | 34.9815 KOps/s | 34.4768 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 64.2000μs | 23.5319μs | 42.4955 KOps/s | 43.3073 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 76.8940μs | 28.9954μs | 34.4882 KOps/s | 33.3232 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1112ms | 23.1329μs | 43.2284 KOps/s | 43.8897 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1002ms | 50.4939μs | 19.8044 KOps/s | 19.7367 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5990ms | 20.2211μs | 49.4532 KOps/s | 50.2748 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 88.9160μs | 42.4064μs | 23.5814 KOps/s | 22.5720 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 60.0620μs | 18.9051μs | 52.8958 KOps/s | 54.4394 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 90.4300μs | 43.3736μs | 23.0555 KOps/s | 22.1510 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 46.4570μs | 18.9363μs | 52.8087 KOps/s | 54.4491 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1223ms | 51.6560μs | 19.3588 KOps/s | 19.1422 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9550ms | 19.7789μs | 50.5588 KOps/s | 50.2451 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 96.7710μs | 44.1712μs | 22.6392 KOps/s | 22.0850 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1759ms | 18.7777μs | 53.2547 KOps/s | 54.5953 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.5741ms | 44.3112μs | 22.5676 KOps/s | 22.1901 KOps/s | |
test_compile_indexing[int-pytree-eager] | 59.5620μs | 18.6954μs | 53.4890 KOps/s | 54.4149 KOps/s | |
test_mod_add[eager] | 90.2790μs | 35.3536μs | 28.2857 KOps/s | 29.4170 KOps/s | |
test_mod_add[compile] | 99.7470μs | 47.2762μs | 21.1523 KOps/s | 20.8499 KOps/s | |
test_mod_add[compile-overhead] | 0.1030ms | 47.3700μs | 21.1104 KOps/s | 20.4734 KOps/s | |
test_mod_wrap[eager] | 0.4289ms | 0.2251ms | 4.4431 KOps/s | 4.4733 KOps/s | |
test_mod_wrap[compile] | 0.2976ms | 0.2019ms | 4.9521 KOps/s | 4.8512 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4146ms | 0.2020ms | 4.9505 KOps/s | 4.8253 KOps/s | |
test_mod_wrap_and_backward[eager] | 18.7171ms | 12.1022ms | 82.6295 Ops/s | 86.5498 Ops/s | |
test_mod_wrap_and_backward[compile] | 19.6460ms | 12.8310ms | 77.9361 Ops/s | 72.7191 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 14.1073ms | 12.2432ms | 81.6782 Ops/s | 70.9170 Ops/s | |
test_seq_add[eager] | 0.2827ms | 0.1151ms | 8.6848 KOps/s | 8.7375 KOps/s | |
test_seq_add[compile] | 0.1128ms | 62.3187μs | 16.0465 KOps/s | 15.9751 KOps/s | |
test_seq_add[compile-overhead] | 0.1661ms | 60.5202μs | 16.5234 KOps/s | 16.2302 KOps/s | |
test_seq_wrap[eager] | 0.7255ms | 0.4427ms | 2.2589 KOps/s | 2.2342 KOps/s | |
test_seq_wrap[compile] | 0.3294ms | 0.2214ms | 4.5164 KOps/s | 4.2747 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4344ms | 0.2228ms | 4.4876 KOps/s | 4.3329 KOps/s | |
test_func_call_runtime[False-eager] | 0.8051ms | 0.5386ms | 1.8567 KOps/s | 1.8385 KOps/s | |
test_func_call_runtime[False-compile] | 0.5343ms | 0.4159ms | 2.4041 KOps/s | 2.3609 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7599ms | 0.4160ms | 2.4036 KOps/s | 2.3399 KOps/s | |
test_func_call_runtime[True-eager] | 1.2850ms | 0.7560ms | 1.3228 KOps/s | 1.3061 KOps/s | |
test_func_call_runtime[True-compile] | 0.6980ms | 0.4566ms | 2.1900 KOps/s | 2.1576 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6077ms | 0.4548ms | 2.1990 KOps/s | 2.1474 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.0017ms | 0.5455ms | 1.8333 KOps/s | 1.8447 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5768ms | 0.4182ms | 2.3915 KOps/s | 2.3463 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5451ms | 0.4186ms | 2.3889 KOps/s | 2.3591 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4533ms | 0.9192ms | 1.0879 KOps/s | 1.0944 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6273ms | 0.4833ms | 2.0689 KOps/s | 2.0365 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2017ms | 0.5041ms | 1.9838 KOps/s | 1.9903 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5878ms | 1.9352ms | 516.7475 Ops/s | 512.7829 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.5275ms | 0.5193ms | 1.9255 KOps/s | 1.9400 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 1.5188ms | 0.5313ms | 1.8822 KOps/s | 1.8848 KOps/s | |
test_distributed | 0.3776ms | 0.1257ms | 7.9542 KOps/s | 7.6744 KOps/s | |
test_tdmodule | 0.1203ms | 26.9669μs | 37.0825 KOps/s | 39.1526 KOps/s | |
test_tdmodule_dispatch | 88.4460μs | 49.3089μs | 20.2803 KOps/s | 21.4834 KOps/s | |
test_tdseq | 69.7210μs | 29.6396μs | 33.7386 KOps/s | 35.4333 KOps/s | |
test_tdseq_dispatch | 0.1168ms | 57.4108μs | 17.4183 KOps/s | 19.0826 KOps/s | |
test_instantiation_functorch | 1.6892ms | 1.5256ms | 655.4821 Ops/s | 641.0368 Ops/s | |
test_exec_functorch | 0.2757ms | 0.1784ms | 5.6057 KOps/s | 5.5373 KOps/s | |
test_exec_functional_call | 0.4116ms | 0.1754ms | 5.7009 KOps/s | 5.7787 KOps/s | |
test_exec_td_decorator | 0.5184ms | 0.2323ms | 4.3051 KOps/s | 4.2848 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9625ms | 0.6621ms | 1.5103 KOps/s | 1.5157 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.2526ms | 0.6572ms | 1.5215 KOps/s | 1.5315 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8782ms | 0.5277ms | 1.8950 KOps/s | 1.8740 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7400ms | 0.5250ms | 1.9049 KOps/s | 1.8883 KOps/s | |
test_to_module_speed[True] | 2.1531ms | 1.3692ms | 730.3585 Ops/s | 722.7623 Ops/s | |
test_to_module_speed[False] | 1.8102ms | 1.3251ms | 754.6636 Ops/s | 732.2177 Ops/s | |
test_tc_init | 96.4410μs | 48.7065μs | 20.5311 KOps/s | 23.3502 KOps/s | |
test_tc_init_nested | 0.2159ms | 99.4313μs | 10.0572 KOps/s | 11.0159 KOps/s | |
test_tc_first_layer_tensor | 18.9450μs | 1.5005μs | 666.4252 KOps/s | 654.8910 KOps/s | |
test_tc_first_layer_nontensor | 0.1271ms | 4.9962μs | 200.1510 KOps/s | 212.1105 KOps/s | |
test_tc_second_layer_tensor | 53.0890μs | 2.7755μs | 360.3007 KOps/s | 344.5175 KOps/s | |
test_tc_second_layer_nontensor | 45.5250μs | 5.9593μs | 167.8060 KOps/s | 164.2559 KOps/s | |
test_unbind | 0.2416s | 15.0722ms | 66.3472 Ops/s | 62.4524 Ops/s | |
test_full_like | 10.3570ms | 8.9001ms | 112.3583 Ops/s | 108.2412 Ops/s | |
test_zeros_like | 5.3324ms | 3.4826ms | 287.1378 Ops/s | 290.5799 Ops/s | |
test_ones_like | 4.7822ms | 4.1580ms | 240.4987 Ops/s | 248.2286 Ops/s | |
test_clone | 7.2043ms | 6.1457ms | 162.7142 Ops/s | 165.1478 Ops/s | |
test_squeeze | 71.9450μs | 12.4035μs | 80.6226 KOps/s | 82.0601 KOps/s | |
test_unsqueeze | 0.2444ms | 90.7924μs | 11.0141 KOps/s | 11.0525 KOps/s | |
test_split | 0.5109ms | 0.1931ms | 5.1797 KOps/s | 5.0325 KOps/s | |
test_permute | 0.3296ms | 0.2063ms | 4.8462 KOps/s | 4.8000 KOps/s | |
test_stack | 28.6262ms | 26.0624ms | 38.3695 Ops/s | 36.4605 Ops/s | |
test_cat | 30.9192ms | 25.9670ms | 38.5104 Ops/s | 36.4342 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 42.6000μs | 12.0328μs | 83.1062 KOps/s | 76.0947 KOps/s | |
test_plain_set_stack_nested | 37.0800μs | 12.2793μs | 81.4376 KOps/s | 75.3392 KOps/s | |
test_plain_set_nested_inplace | 54.9210μs | 13.1649μs | 75.9598 KOps/s | 70.0096 KOps/s | |
test_plain_set_stack_nested_inplace | 79.9120μs | 13.1903μs | 75.8131 KOps/s | 69.9717 KOps/s | |
test_items | 33.5000μs | 2.9058μs | 344.1433 KOps/s | 341.6565 KOps/s | |
test_items_nested | 0.4179ms | 0.3747ms | 2.6685 KOps/s | 2.6713 KOps/s | |
test_items_nested_locked | 0.4191ms | 0.3810ms | 2.6247 KOps/s | 2.6480 KOps/s | |
test_items_nested_leaf | 0.1074ms | 57.4288μs | 17.4129 KOps/s | 17.0185 KOps/s | |
test_items_stack_nested | 0.4185ms | 0.3782ms | 2.6444 KOps/s | 2.6162 KOps/s | |
test_items_stack_nested_leaf | 0.1050ms | 57.9859μs | 17.2456 KOps/s | 16.6847 KOps/s | |
test_items_stack_nested_locked | 0.4314ms | 0.3821ms | 2.6169 KOps/s | 2.6483 KOps/s | |
test_keys | 33.2010μs | 3.4930μs | 286.2841 KOps/s | 284.8428 KOps/s | |
test_keys_nested | 0.1151ms | 82.8487μs | 12.0702 KOps/s | 12.1643 KOps/s | |
test_keys_nested_locked | 0.7844ms | 88.7879μs | 11.2628 KOps/s | 11.3528 KOps/s | |
test_keys_nested_leaf | 0.1037ms | 73.5893μs | 13.5889 KOps/s | 13.6700 KOps/s | |
test_keys_stack_nested | 0.1249ms | 82.8320μs | 12.0726 KOps/s | 11.8538 KOps/s | |
test_keys_stack_nested_leaf | 0.1134ms | 74.1097μs | 13.4935 KOps/s | 13.3266 KOps/s | |
test_keys_stack_nested_locked | 0.1369ms | 88.9669μs | 11.2401 KOps/s | 11.0536 KOps/s | |
test_values | 5.5552μs | 0.8472μs | 1.1804 MOps/s | 1.1755 MOps/s | |
test_values_nested | 83.3720μs | 35.8695μs | 27.8788 KOps/s | 28.1362 KOps/s | |
test_values_nested_locked | 63.9010μs | 38.3686μs | 26.0630 KOps/s | 26.8423 KOps/s | |
test_values_nested_leaf | 74.5420μs | 40.5375μs | 24.6685 KOps/s | 24.9033 KOps/s | |
test_values_stack_nested | 71.7310μs | 35.8587μs | 27.8872 KOps/s | 27.6775 KOps/s | |
test_values_stack_nested_leaf | 67.3810μs | 40.8215μs | 24.4969 KOps/s | 24.4323 KOps/s | |
test_values_stack_nested_locked | 69.6710μs | 38.6295μs | 25.8869 KOps/s | 26.4178 KOps/s | |
test_membership | 1.6725μs | 0.5330μs | 1.8762 MOps/s | 1.8470 MOps/s | |
test_membership_nested | 15.7055μs | 2.0862μs | 479.3304 KOps/s | 455.8850 KOps/s | |
test_membership_nested_leaf | 36.9755μs | 2.1315μs | 469.1452 KOps/s | 474.7715 KOps/s | |
test_membership_stacked_nested | 43.9210μs | 2.1655μs | 461.7803 KOps/s | 465.5412 KOps/s | |
test_membership_stacked_nested_leaf | 26.5710μs | 2.2140μs | 451.6772 KOps/s | 464.8341 KOps/s | |
test_membership_nested_last | 36.1300μs | 3.1808μs | 314.3910 KOps/s | 310.6257 KOps/s | |
test_membership_nested_leaf_last | 27.4100μs | 3.2171μs | 310.8369 KOps/s | 308.9280 KOps/s | |
test_membership_stacked_nested_last | 19.1410μs | 3.2046μs | 312.0479 KOps/s | 270.7138 KOps/s | |
test_membership_stacked_nested_leaf_last | 32.0610μs | 3.2108μs | 311.4481 KOps/s | 270.9397 KOps/s | |
test_nested_getleaf | 37.3210μs | 6.2125μs | 160.9657 KOps/s | 162.5282 KOps/s | |
test_nested_get | 35.2510μs | 5.8983μs | 169.5393 KOps/s | 172.0359 KOps/s | |
test_stacked_getleaf | 34.6100μs | 6.3056μs | 158.5889 KOps/s | 161.3757 KOps/s | |
test_stacked_get | 35.0010μs | 5.8784μs | 170.1148 KOps/s | 169.7752 KOps/s | |
test_nested_getitemleaf | 36.4810μs | 6.2263μs | 160.6087 KOps/s | 159.3792 KOps/s | |
test_nested_getitem | 27.3910μs | 6.0023μs | 166.6031 KOps/s | 167.0447 KOps/s | |
test_stacked_getitemleaf | 42.2310μs | 6.2603μs | 159.7374 KOps/s | 157.2917 KOps/s | |
test_stacked_getitem | 27.6000μs | 6.0465μs | 165.3843 KOps/s | 166.3426 KOps/s | |
test_lock_nested | 9.4184ms | 0.3919ms | 2.5520 KOps/s | 2.5408 KOps/s | |
test_lock_stack_nested | 0.4047ms | 0.3526ms | 2.8364 KOps/s | 2.7991 KOps/s | |
test_unlock_nested | 0.6263ms | 0.3119ms | 3.2059 KOps/s | 3.1772 KOps/s | |
test_unlock_stack_nested | 0.3578ms | 0.2822ms | 3.5435 KOps/s | 3.4917 KOps/s | |
test_flatten_speed | 0.1284ms | 75.2303μs | 13.2925 KOps/s | 13.0152 KOps/s | |
test_unflatten_speed | 0.3783ms | 0.3266ms | 3.0616 KOps/s | 3.0835 KOps/s | |
test_common_ops | 1.6419ms | 0.5984ms | 1.6712 KOps/s | 1.5325 KOps/s | |
test_creation | 96.9020μs | 1.8243μs | 548.1469 KOps/s | 570.3690 KOps/s | |
test_creation_empty | 30.9110μs | 7.9108μs | 126.4091 KOps/s | 101.5735 KOps/s | |
test_creation_nested_1 | 39.8300μs | 9.5059μs | 105.1980 KOps/s | 86.8270 KOps/s | |
test_creation_nested_2 | 40.2510μs | 12.3010μs | 81.2944 KOps/s | 70.3414 KOps/s | |
test_clone | 48.8010μs | 10.5814μs | 94.5059 KOps/s | 94.2201 KOps/s | |
test_getitem[int] | 93.0000ms | 16.1461μs | 61.9346 KOps/s | 93.7571 KOps/s | |
test_getitem[slice_int] | 0.1103ms | 21.1528μs | 47.2750 KOps/s | 48.0388 KOps/s | |
test_getitem[range] | 0.1257ms | 37.7445μs | 26.4939 KOps/s | 26.2987 KOps/s | |
test_getitem[tuple] | 0.1025ms | 17.9890μs | 55.5895 KOps/s | 53.9146 KOps/s | |
test_getitem[list] | 0.2248ms | 33.4694μs | 29.8780 KOps/s | 29.4334 KOps/s | |
test_setitem_dim[int] | 39.0210μs | 18.1485μs | 55.1011 KOps/s | 52.2733 KOps/s | |
test_setitem_dim[slice_int] | 70.7310μs | 38.5597μs | 25.9338 KOps/s | 25.5863 KOps/s | |
test_setitem_dim[range] | 84.3320μs | 52.9694μs | 18.8788 KOps/s | 18.8947 KOps/s | |
test_setitem_dim[tuple] | 59.7210μs | 31.4503μs | 31.7962 KOps/s | 31.0823 KOps/s | |
test_setitem | 77.4820μs | 14.6652μs | 68.1885 KOps/s | 61.9954 KOps/s | |
test_set | 88.9320μs | 14.9665μs | 66.8159 KOps/s | 63.3103 KOps/s | |
test_set_shared | 1.4655ms | 0.1505ms | 6.6452 KOps/s | 6.6313 KOps/s | |
test_update | 0.5555ms | 17.3230μs | 57.7268 KOps/s | 51.8771 KOps/s | |
test_update_nested | 83.3410μs | 23.1920μs | 43.1183 KOps/s | 41.0542 KOps/s | |
test_update__nested | 0.3165ms | 26.8831μs | 37.1981 KOps/s | 39.7907 KOps/s | |
test_set_nested | 87.0320μs | 16.4041μs | 60.9604 KOps/s | 54.2908 KOps/s | |
test_set_nested_new | 77.0710μs | 18.9099μs | 52.8824 KOps/s | 48.1089 KOps/s | |
test_select | 0.2109ms | 30.5749μs | 32.7066 KOps/s | 32.7673 KOps/s | |
test_select_nested | 67.9210μs | 44.2299μs | 22.6091 KOps/s | 22.7558 KOps/s | |
test_exclude_nested | 0.1115ms | 63.1560μs | 15.8338 KOps/s | 15.5434 KOps/s | |
test_empty[True] | 0.3546ms | 0.2960ms | 3.3780 KOps/s | 3.3813 KOps/s | |
test_empty[False] | 2.9931μs | 0.8567μs | 1.1672 MOps/s | 1.1933 MOps/s | |
test_to | 0.1156ms | 58.6800μs | 17.0416 KOps/s | 17.5918 KOps/s | |
test_to_nonblocking | 91.6010μs | 48.9004μs | 20.4497 KOps/s | 19.9310 KOps/s | |
test_unbind_speed | 0.2855ms | 0.2362ms | 4.2340 KOps/s | 4.2791 KOps/s | |
test_unbind_speed_stack0 | 0.2882ms | 0.2337ms | 4.2787 KOps/s | 4.1867 KOps/s | |
test_unbind_speed_stack1 | 93.7397ms | 0.6713ms | 1.4896 KOps/s | 1.4795 KOps/s | |
test_split | 92.9169ms | 1.5770ms | 634.1096 Ops/s | 629.3105 Ops/s | |
test_chunk | 97.4347ms | 1.7254ms | 579.5662 Ops/s | 574.3179 Ops/s | |
test_consolidate[False-None] | 2.8168ms | 2.6861ms | 372.2897 Ops/s | 369.0313 Ops/s | |
test_consolidate[default-None] | 1.7824ms | 1.6880ms | 592.4071 Ops/s | 561.7200 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.7811ms | 1.7161ms | 582.7230 Ops/s | 572.2419 Ops/s | |
test_consolidate_njt[False-None] | 6.9614ms | 6.6364ms | 150.6836 Ops/s | 150.3001 Ops/s | |
test_to[False-False-None] | 1.7994ms | 1.6738ms | 597.4544 Ops/s | 567.6426 Ops/s | |
test_to[True-False-None] | 1.5478ms | 1.2975ms | 770.6971 Ops/s | 755.9525 Ops/s | |
test_to[within-False-None] | 4.3162ms | 4.0999ms | 243.9101 Ops/s | 240.8796 Ops/s | |
test_to[True-default-None] | 5.7231ms | 5.2948ms | 188.8630 Ops/s | 187.1163 Ops/s | |
test_to_njt[False-False-None] | 7.0548ms | 6.8961ms | 145.0103 Ops/s | 143.7741 Ops/s | |
test_to_njt[True-False-None] | 5.6947ms | 5.4687ms | 182.8594 Ops/s | 179.0001 Ops/s | |
test_to_njt[within-False-None] | 12.3182ms | 11.9992ms | 83.3387 Ops/s | 80.7501 Ops/s | |
test_creation[device0] | 0.4644ms | 79.2031μs | 12.6258 KOps/s | 12.4499 KOps/s | |
test_creation_from_tensor | 0.5195ms | 85.3115μs | 11.7217 KOps/s | 11.4987 KOps/s | |
test_add_one[memmap_tensor0] | 0.4212ms | 6.6082μs | 151.3264 KOps/s | 137.8660 KOps/s | |
test_contiguous[memmap_tensor0] | 1.9176μs | 0.4309μs | 2.3208 MOps/s | 2.2790 MOps/s | |
test_stack[memmap_tensor0] | 38.6110μs | 4.3664μs | 229.0232 KOps/s | 232.5074 KOps/s | |
test_memmaptd_index | 1.7621ms | 0.2471ms | 4.0466 KOps/s | 4.0621 KOps/s | |
test_memmaptd_index_astensor | 0.5804ms | 0.3082ms | 3.2449 KOps/s | 3.2538 KOps/s | |
test_memmaptd_index_op | 0.9959ms | 0.5641ms | 1.7729 KOps/s | 1.6660 KOps/s | |
test_serialize_model | 0.1321s | 0.1311s | 7.6281 Ops/s | 7.6996 Ops/s | |
test_serialize_model_pickle | 1.3498s | 1.2139s | 0.8238 Ops/s | 0.8216 Ops/s | |
test_serialize_weights | 0.1309s | 0.1294s | 7.7267 Ops/s | 7.6870 Ops/s | |
test_serialize_weights_returnearly | 0.3112s | 54.1192ms | 18.4777 Ops/s | 11.1624 Ops/s | |
test_serialize_weights_pickle | 1.3553s | 1.2178s | 0.8211 Ops/s | 0.8201 Ops/s | |
test_reshape_pytree | 53.5910μs | 21.8648μs | 45.7357 KOps/s | 43.9917 KOps/s | |
test_reshape_td | 68.1920μs | 26.2999μs | 38.0230 KOps/s | 36.6873 KOps/s | |
test_view_pytree | 60.2210μs | 22.1881μs | 45.0692 KOps/s | 44.4510 KOps/s | |
test_view_td | 59.2710μs | 30.7731μs | 32.4959 KOps/s | 31.4276 KOps/s | |
test_unbind_pytree | 51.7210μs | 27.9693μs | 35.7535 KOps/s | 35.6399 KOps/s | |
test_unbind_td | 0.8113ms | 36.8798μs | 27.1151 KOps/s | 27.6933 KOps/s | |
test_split_pytree | 63.5320μs | 29.7243μs | 33.6425 KOps/s | 32.7069 KOps/s | |
test_split_td | 0.9338ms | 38.0275μs | 26.2968 KOps/s | 26.1384 KOps/s | |
test_add_pytree | 66.1410μs | 33.9396μs | 29.4641 KOps/s | 29.3187 KOps/s | |
test_add_td | 75.3720μs | 47.1000μs | 21.2314 KOps/s | 20.5823 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1842ms | 0.1261ms | 7.9308 KOps/s | 7.8829 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2228ms | 0.1300ms | 7.6937 KOps/s | 7.6579 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1369ms | 98.0935μs | 10.1944 KOps/s | 10.2162 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.5546ms | 0.1483ms | 6.7419 KOps/s | 6.6226 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 82.7320μs | 21.9339μs | 45.5916 KOps/s | 46.0911 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 56.7710μs | 28.8897μs | 34.6145 KOps/s | 34.2075 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3419ms | 64.2505μs | 15.5641 KOps/s | 15.3149 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1414ms | 50.2615μs | 19.8959 KOps/s | 20.0466 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2173ms | 0.1435ms | 6.9706 KOps/s | 7.0650 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3034ms | 0.2130ms | 4.6958 KOps/s | 4.6838 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1359ms | 98.8872μs | 10.1125 KOps/s | 9.7759 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1653ms | 53.1424μs | 18.8174 KOps/s | 17.9520 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2052ms | 0.1372ms | 7.2863 KOps/s | 7.3450 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5975ms | 0.4709ms | 2.1234 KOps/s | 2.0688 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3732ms | 0.2546ms | 3.9277 KOps/s | 3.9084 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1903ms | 0.1452ms | 6.8855 KOps/s | 6.9887 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1628ms | 64.6075μs | 15.4781 KOps/s | 14.6692 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2030ms | 0.1003ms | 9.9697 KOps/s | 10.1492 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4666ms | 0.4025ms | 2.4846 KOps/s | 2.4074 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1941ms | 0.1376ms | 7.2655 KOps/s | 7.2450 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 50.4300μs | 18.3793μs | 54.4089 KOps/s | 58.5871 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 62.9720μs | 31.5562μs | 31.6895 KOps/s | 32.7719 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1925ms | 69.9758μs | 14.2907 KOps/s | 14.2629 KOps/s | |
test_compile_copy_flat[pytree-eager] | 89.1520μs | 51.0801μs | 19.5771 KOps/s | 19.7170 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6208ms | 0.3898ms | 2.5652 KOps/s | 2.1906 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.6041ms | 2.5015ms | 399.7665 Ops/s | 366.1800 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5783ms | 0.4284ms | 2.3342 KOps/s | 2.2797 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8927ms | 2.5380ms | 394.0059 Ops/s | 374.8968 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1689ms | 0.1139ms | 8.7793 KOps/s | 8.6341 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5962ms | 78.9599μs | 12.6647 KOps/s | 12.2422 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2059ms | 0.1029ms | 9.7198 KOps/s | 8.9858 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1129ms | 66.1290μs | 15.1219 KOps/s | 14.2734 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1599ms | 0.1035ms | 9.6655 KOps/s | 9.2013 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2172ms | 66.5419μs | 15.0281 KOps/s | 14.3896 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1459ms | 0.1003ms | 9.9656 KOps/s | 9.4995 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1411ms | 17.2473μs | 57.9801 KOps/s | 56.0351 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1275ms | 96.1108μs | 10.4047 KOps/s | 10.3863 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 47.1810μs | 15.6911μs | 63.7305 KOps/s | 63.5114 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1416ms | 96.3322μs | 10.3807 KOps/s | 10.2846 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 72.1310μs | 15.4813μs | 64.5941 KOps/s | 62.3645 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1650ms | 0.1017ms | 9.8363 KOps/s | 9.4940 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6028ms | 16.6892μs | 59.9188 KOps/s | 55.2788 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2260ms | 96.9857μs | 10.3108 KOps/s | 9.6736 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 54.0210μs | 15.5816μs | 64.1781 KOps/s | 63.3929 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2487ms | 97.8753μs | 10.2171 KOps/s | 9.7963 KOps/s | |
test_compile_indexing[int-pytree-eager] | 75.4010μs | 15.4833μs | 64.5855 KOps/s | 63.8775 KOps/s | |
test_mod_add[eager] | 0.1828ms | 36.9148μs | 27.0894 KOps/s | 25.2394 KOps/s | |
test_mod_add[compile] | 0.1332ms | 83.8831μs | 11.9214 KOps/s | 12.1045 KOps/s | |
test_mod_add[compile-overhead] | 0.3188ms | 0.1676ms | 5.9662 KOps/s | 5.6717 KOps/s | |
test_mod_wrap[eager] | 0.3876ms | 0.2446ms | 4.0886 KOps/s | 3.9370 KOps/s | |
test_mod_wrap[compile] | 0.3519ms | 0.2845ms | 3.5147 KOps/s | 3.3107 KOps/s | |
test_mod_wrap[compile-overhead] | 7.0764ms | 3.7479ms | 266.8128 Ops/s | 272.1873 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5335ms | 1.3554ms | 737.7906 Ops/s | 674.6063 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3602ms | 1.2672ms | 789.1132 Ops/s | 717.9024 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4113ms | 0.9287ms | 1.0768 KOps/s | 945.5372 Ops/s | |
test_seq_add[eager] | 0.2618ms | 0.1139ms | 8.7834 KOps/s | 8.3609 KOps/s | |
test_seq_add[compile] | 0.4821ms | 88.1949μs | 11.3385 KOps/s | 11.4216 KOps/s | |
test_seq_add[compile-overhead] | 0.2793ms | 0.1298ms | 7.7057 KOps/s | 7.4208 KOps/s | |
test_seq_wrap[eager] | 0.5565ms | 0.4119ms | 2.4276 KOps/s | 2.3215 KOps/s | |
test_seq_wrap[compile] | 0.3669ms | 0.2972ms | 3.3644 KOps/s | 3.2947 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3023ms | 0.2260ms | 4.4248 KOps/s | 4.3787 KOps/s | |
test_func_call_runtime[False-eager] | 0.8210ms | 0.7131ms | 1.4024 KOps/s | 1.3377 KOps/s | |
test_func_call_runtime[False-compile] | 0.8544ms | 0.7415ms | 1.3487 KOps/s | 1.3331 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4577ms | 0.3674ms | 2.7218 KOps/s | 2.7019 KOps/s | |
test_func_call_runtime[True-eager] | 0.9473ms | 0.8877ms | 1.1265 KOps/s | 1.0928 KOps/s | |
test_func_call_runtime[True-compile] | 0.8619ms | 0.7638ms | 1.3093 KOps/s | 1.2973 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5314ms | 0.3886ms | 2.5734 KOps/s | 2.5734 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8073ms | 0.7162ms | 1.3963 KOps/s | 1.3509 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.7970ms | 0.7440ms | 1.3441 KOps/s | 1.3253 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4143ms | 0.3685ms | 2.7137 KOps/s | 2.7007 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1033ms | 0.9937ms | 1.0063 KOps/s | 983.7119 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8452ms | 0.7872ms | 1.2703 KOps/s | 1.2515 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4693ms | 0.4135ms | 2.4184 KOps/s | 2.3838 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5467ms | 2.0921ms | 477.9864 Ops/s | 464.7711 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9473ms | 0.8054ms | 1.2416 KOps/s | 1.2240 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4671ms | 0.4177ms | 2.3939 KOps/s | 2.3792 KOps/s | |
test_distributed | 6.1231ms | 0.2145ms | 4.6613 KOps/s | 8.3477 KOps/s | |
test_tdmodule | 54.4220μs | 19.2860μs | 51.8512 KOps/s | 43.7563 KOps/s | |
test_tdmodule_dispatch | 0.1661ms | 35.7434μs | 27.9772 KOps/s | 25.4910 KOps/s | |
test_tdseq | 39.4910μs | 20.4598μs | 48.8764 KOps/s | 44.5270 KOps/s | |
test_tdseq_dispatch | 60.3810μs | 38.1327μs | 26.2242 KOps/s | 23.4311 KOps/s | |
test_instantiation_functorch | 1.6427ms | 1.5646ms | 639.1318 Ops/s | 623.4008 Ops/s | |
test_exec_functorch | 0.1975ms | 0.1419ms | 7.0476 KOps/s | 6.8227 KOps/s | |
test_exec_functional_call | 0.2137ms | 0.1334ms | 7.4962 KOps/s | 7.0058 KOps/s | |
test_exec_td_decorator | 0.3843ms | 0.1830ms | 5.4659 KOps/s | 5.3843 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8554ms | 0.6839ms | 1.4622 KOps/s | 1.4118 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8716ms | 0.6810ms | 1.4684 KOps/s | 1.3994 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7214ms | 0.5960ms | 1.6779 KOps/s | 1.5725 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7424ms | 0.5949ms | 1.6809 KOps/s | 1.5829 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.3669ms | 19.2375ms | 51.9819 Ops/s | 51.0031 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.9999ms | 19.2878ms | 51.8462 Ops/s | 50.7364 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.2398ms | 19.1530ms | 52.2112 Ops/s | 51.5272 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.2597ms | 19.1358ms | 52.2581 Ops/s | 51.4043 Ops/s | |
test_to_module_speed[True] | 1.0805ms | 0.9757ms | 1.0249 KOps/s | 1.0168 KOps/s | |
test_to_module_speed[False] | 1.3472ms | 0.9552ms | 1.0470 KOps/s | 1.0362 KOps/s | |
test_tc_init | 71.3720μs | 38.2802μs | 26.1232 KOps/s | 25.6036 KOps/s | |
test_tc_init_nested | 0.1743ms | 76.7088μs | 13.0363 KOps/s | 13.0757 KOps/s | |
test_tc_first_layer_tensor | 28.7400μs | 0.8108μs | 1.2334 MOps/s | 1.4458 MOps/s | |
test_tc_first_layer_nontensor | 37.6400μs | 2.3502μs | 425.4964 KOps/s | 427.2682 KOps/s | |
test_tc_second_layer_tensor | 23.9805μs | 1.4479μs | 690.6477 KOps/s | 717.1851 KOps/s | |
test_tc_second_layer_nontensor | 0.1003ms | 3.0828μs | 324.3801 KOps/s | 324.9702 KOps/s | |
test_unbind | 0.2270s | 10.2269ms | 97.7818 Ops/s | 140.0149 Ops/s | |
test_full_like | 11.6167ms | 9.5806ms | 104.3778 Ops/s | 105.3411 Ops/s | |
test_zeros_like | 6.0272ms | 4.3697ms | 228.8472 Ops/s | 233.1153 Ops/s | |
test_ones_like | 5.0353ms | 4.3653ms | 229.0785 Ops/s | 229.0816 Ops/s | |
test_clone | 7.0522ms | 6.6487ms | 150.4059 Ops/s | 106.6790 Ops/s | |
test_squeeze | 56.1710μs | 9.9395μs | 100.6084 KOps/s | 92.2083 KOps/s | |
test_unsqueeze | 0.1323ms | 75.0888μs | 13.3176 KOps/s | 13.0064 KOps/s | |
test_split | 0.4051ms | 0.1627ms | 6.1473 KOps/s | 5.9063 KOps/s | |
test_permute | 0.2408ms | 0.1877ms | 5.3269 KOps/s | 5.2130 KOps/s | |
test_stack | 51.7170ms | 51.3288ms | 19.4822 Ops/s | 19.4331 Ops/s | |
test_cat | 52.0423ms | 51.2501ms | 19.5122 Ops/s | 19.5030 Ops/s |
vmoens
added a commit
that referenced
this pull request
Dec 19, 2024
ghstack-source-id: a18dd5dff39937b027243fcebc6ef449b547e0b0 Pull Request resolved: #1145
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):