-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] Use 'spawn' mp context in all tests #1111
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Nov 26, 2024
vmoens
added a commit
that referenced
this pull request
Nov 26, 2024
ghstack-source-id: a7d786fe77c2c12d5c8c85579123a64ef5c87cf2 Pull Request resolved: #1111
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 34.4040μs | 17.5806μs | 56.8808 KOps/s | 56.5388 KOps/s | |
test_plain_set_stack_nested | 79.6390μs | 17.6364μs | 56.7010 KOps/s | 55.4510 KOps/s | |
test_plain_set_nested_inplace | 44.6330μs | 19.2760μs | 51.8779 KOps/s | 50.3339 KOps/s | |
test_plain_set_stack_nested_inplace | 75.8520μs | 19.2161μs | 52.0396 KOps/s | 50.3511 KOps/s | |
test_items | 22.5020μs | 4.1996μs | 238.1173 KOps/s | 239.8507 KOps/s | |
test_items_nested | 0.5023ms | 0.4035ms | 2.4780 KOps/s | 2.4754 KOps/s | |
test_items_nested_locked | 0.6058ms | 0.4052ms | 2.4678 KOps/s | 2.5003 KOps/s | |
test_items_nested_leaf | 0.1309ms | 72.3919μs | 13.8137 KOps/s | 13.8291 KOps/s | |
test_items_stack_nested | 0.5215ms | 0.4164ms | 2.4014 KOps/s | 2.5000 KOps/s | |
test_items_stack_nested_leaf | 0.1354ms | 72.5052μs | 13.7921 KOps/s | 13.5087 KOps/s | |
test_items_stack_nested_locked | 0.5650ms | 0.4077ms | 2.4525 KOps/s | 2.5175 KOps/s | |
test_keys | 24.3450μs | 3.6683μs | 272.6079 KOps/s | 282.5404 KOps/s | |
test_keys_nested | 0.2395ms | 0.1364ms | 7.3321 KOps/s | 7.2058 KOps/s | |
test_keys_nested_locked | 1.7050ms | 0.1434ms | 6.9731 KOps/s | 7.0997 KOps/s | |
test_keys_nested_leaf | 0.2396ms | 0.1171ms | 8.5377 KOps/s | 8.6573 KOps/s | |
test_keys_stack_nested | 0.2414ms | 0.1361ms | 7.3477 KOps/s | 7.3265 KOps/s | |
test_keys_stack_nested_leaf | 0.2022ms | 0.1170ms | 8.5445 KOps/s | 8.5223 KOps/s | |
test_keys_stack_nested_locked | 0.2384ms | 0.1425ms | 7.0169 KOps/s | 6.9810 KOps/s | |
test_values | 9.1668μs | 1.0292μs | 971.6351 KOps/s | 920.8436 KOps/s | |
test_values_nested | 99.1450μs | 55.2471μs | 18.1005 KOps/s | 17.9664 KOps/s | |
test_values_nested_locked | 0.1183ms | 55.1449μs | 18.1340 KOps/s | 18.2241 KOps/s | |
test_values_nested_leaf | 0.1106ms | 60.5879μs | 16.5050 KOps/s | 16.3979 KOps/s | |
test_values_stack_nested | 0.1128ms | 56.1054μs | 17.8236 KOps/s | 17.9090 KOps/s | |
test_values_stack_nested_leaf | 0.1102ms | 60.5931μs | 16.5035 KOps/s | 16.6050 KOps/s | |
test_values_stack_nested_locked | 0.1165ms | 55.6356μs | 17.9741 KOps/s | 17.7434 KOps/s | |
test_membership | 2.0704μs | 0.7398μs | 1.3518 MOps/s | 1.1430 MOps/s | |
test_membership_nested | 43.2730μs | 2.9627μs | 337.5317 KOps/s | 342.7316 KOps/s | |
test_membership_nested_leaf | 31.5790μs | 2.9650μs | 337.2700 KOps/s | 335.8827 KOps/s | |
test_membership_stacked_nested | 47.9000μs | 2.9669μs | 337.0536 KOps/s | 345.5982 KOps/s | |
test_membership_stacked_nested_leaf | 21.4400μs | 2.9478μs | 339.2405 KOps/s | 335.4989 KOps/s | |
test_membership_nested_last | 42.9500μs | 4.3217μs | 231.3905 KOps/s | 237.1208 KOps/s | |
test_membership_nested_leaf_last | 36.5590μs | 4.3524μs | 229.7608 KOps/s | 237.1721 KOps/s | |
test_membership_stacked_nested_last | 54.5720μs | 4.2734μs | 234.0077 KOps/s | 241.9535 KOps/s | |
test_membership_stacked_nested_leaf_last | 25.0070μs | 4.3177μs | 231.6065 KOps/s | 238.1017 KOps/s | |
test_nested_getleaf | 56.7860μs | 10.7662μs | 92.8832 KOps/s | 92.2595 KOps/s | |
test_nested_get | 47.8090μs | 10.2797μs | 97.2789 KOps/s | 97.9306 KOps/s | |
test_stacked_getleaf | 31.1380μs | 10.7301μs | 93.1962 KOps/s | 94.2494 KOps/s | |
test_stacked_get | 46.5170μs | 10.1753μs | 98.2771 KOps/s | 97.6859 KOps/s | |
test_nested_getitemleaf | 40.1650μs | 11.5258μs | 86.7617 KOps/s | 89.4319 KOps/s | |
test_nested_getitem | 55.4640μs | 10.5215μs | 95.0438 KOps/s | 95.1088 KOps/s | |
test_stacked_getitemleaf | 57.0260μs | 11.4404μs | 87.4093 KOps/s | 89.3769 KOps/s | |
test_stacked_getitem | 33.8230μs | 10.7146μs | 93.3307 KOps/s | 95.7687 KOps/s | |
test_lock_nested | 3.1533ms | 0.4508ms | 2.2185 KOps/s | 2.2576 KOps/s | |
test_lock_stack_nested | 0.5457ms | 0.4169ms | 2.3988 KOps/s | 2.4022 KOps/s | |
test_unlock_nested | 1.0902ms | 0.3619ms | 2.7635 KOps/s | 2.7435 KOps/s | |
test_unlock_stack_nested | 0.5077ms | 0.3335ms | 2.9985 KOps/s | 3.0209 KOps/s | |
test_flatten_speed | 0.1998ms | 97.8751μs | 10.2171 KOps/s | 10.4191 KOps/s | |
test_unflatten_speed | 0.8074ms | 0.5062ms | 1.9756 KOps/s | 1.9862 KOps/s | |
test_common_ops | 1.6030ms | 0.7913ms | 1.2638 KOps/s | 1.2992 KOps/s | |
test_creation | 19.3450μs | 2.0492μs | 487.9922 KOps/s | 481.0562 KOps/s | |
test_creation_empty | 34.1440μs | 10.0409μs | 99.5929 KOps/s | 94.7517 KOps/s | |
test_creation_nested_1 | 47.9190μs | 12.8775μs | 77.6546 KOps/s | 73.3356 KOps/s | |
test_creation_nested_2 | 44.4130μs | 17.7412μs | 56.3660 KOps/s | 57.3535 KOps/s | |
test_clone | 1.5589ms | 13.3441μs | 74.9396 KOps/s | 78.1417 KOps/s | |
test_getitem[int] | 1.0284ms | 13.0987μs | 76.3436 KOps/s | 79.5256 KOps/s | |
test_getitem[slice_int] | 0.1399ms | 25.3370μs | 39.4679 KOps/s | 40.7585 KOps/s | |
test_getitem[range] | 0.1752ms | 49.3840μs | 20.2495 KOps/s | 20.5155 KOps/s | |
test_getitem[tuple] | 0.1325ms | 21.0960μs | 47.4024 KOps/s | 50.2786 KOps/s | |
test_getitem[list] | 0.1693ms | 45.1416μs | 22.1525 KOps/s | 22.6998 KOps/s | |
test_setitem_dim[int] | 48.5500μs | 25.6210μs | 39.0304 KOps/s | 39.6335 KOps/s | |
test_setitem_dim[slice_int] | 88.0440μs | 53.9030μs | 18.5518 KOps/s | 18.9771 KOps/s | |
test_setitem_dim[range] | 0.1429ms | 74.6075μs | 13.4035 KOps/s | 13.3561 KOps/s | |
test_setitem_dim[tuple] | 79.1480μs | 42.0473μs | 23.7827 KOps/s | 24.5230 KOps/s | |
test_setitem | 88.0040μs | 20.5274μs | 48.7153 KOps/s | 51.2368 KOps/s | |
test_set | 81.4410μs | 19.7424μs | 50.6525 KOps/s | 52.3441 KOps/s | |
test_set_shared | 3.0874ms | 0.1721ms | 5.8090 KOps/s | 5.9049 KOps/s | |
test_update | 0.1349ms | 22.4905μs | 44.4632 KOps/s | 46.5208 KOps/s | |
test_update_nested | 98.1330μs | 32.5729μs | 30.7004 KOps/s | 31.8615 KOps/s | |
test_update__nested | 0.4291ms | 33.1733μs | 30.1448 KOps/s | 30.9532 KOps/s | |
test_set_nested | 76.0120μs | 21.8476μs | 45.7717 KOps/s | 46.2149 KOps/s | |
test_set_nested_new | 76.3530μs | 26.3595μs | 37.9370 KOps/s | 38.0418 KOps/s | |
test_select | 0.2165ms | 43.0204μs | 23.2448 KOps/s | 23.6868 KOps/s | |
test_select_nested | 0.1257ms | 60.4242μs | 16.5497 KOps/s | 16.7342 KOps/s | |
test_exclude_nested | 0.1677ms | 80.9235μs | 12.3573 KOps/s | 12.6308 KOps/s | |
test_empty[True] | 0.5123ms | 0.3888ms | 2.5719 KOps/s | 2.5943 KOps/s | |
test_empty[False] | 8.6338μs | 1.2388μs | 807.2426 KOps/s | 788.2483 KOps/s | |
test_unbind_speed | 0.3522ms | 0.2644ms | 3.7820 KOps/s | 3.8423 KOps/s | |
test_unbind_speed_stack0 | 0.6454ms | 0.2605ms | 3.8389 KOps/s | 3.9054 KOps/s | |
test_unbind_speed_stack1 | 0.1013s | 0.7662ms | 1.3052 KOps/s | 1.5857 KOps/s | |
test_split | 0.1009s | 1.7573ms | 569.0635 Ops/s | 573.0410 Ops/s | |
test_chunk | 0.1013s | 1.7621ms | 567.5198 Ops/s | 577.0731 Ops/s | |
test_consolidate_njt[False-None] | 10.3993ms | 8.1297ms | 123.0056 Ops/s | 121.9177 Ops/s | |
test_creation[device0] | 3.5141ms | 93.5232μs | 10.6925 KOps/s | 10.6897 KOps/s | |
test_creation_from_tensor | 0.2281ms | 94.0934μs | 10.6277 KOps/s | 10.3312 KOps/s | |
test_add_one[memmap_tensor0] | 0.1953ms | 5.3138μs | 188.1890 KOps/s | 207.5055 KOps/s | |
test_contiguous[memmap_tensor0] | 14.6280μs | 0.5257μs | 1.9021 MOps/s | 1.9308 MOps/s | |
test_stack[memmap_tensor0] | 36.8890μs | 3.5817μs | 279.1953 KOps/s | 296.9024 KOps/s | |
test_memmaptd_index | 1.0527ms | 0.2419ms | 4.1332 KOps/s | 4.3413 KOps/s | |
test_memmaptd_index_astensor | 0.6594ms | 0.3193ms | 3.1315 KOps/s | 3.2312 KOps/s | |
test_memmaptd_index_op | 1.0102ms | 0.5745ms | 1.7406 KOps/s | 1.7491 KOps/s | |
test_serialize_model | 0.1288s | 0.1153s | 8.6734 Ops/s | 7.3885 Ops/s | |
test_serialize_model_pickle | 0.4473s | 0.3924s | 2.5484 Ops/s | 2.5777 Ops/s | |
test_serialize_weights | 0.1235s | 0.1128s | 8.8676 Ops/s | 8.8405 Ops/s | |
test_serialize_weights_returnearly | 0.1675s | 0.1560s | 6.4097 Ops/s | 6.2944 Ops/s | |
test_serialize_weights_pickle | 0.5863s | 0.4241s | 2.3580 Ops/s | 2.3731 Ops/s | |
test_serialize_weights_filesystem | 0.1466s | 0.1417s | 7.0589 Ops/s | 6.4293 Ops/s | |
test_serialize_model_filesystem | 0.1626s | 0.1525s | 6.5573 Ops/s | 6.6994 Ops/s | |
test_reshape_pytree | 57.4070μs | 27.2208μs | 36.7366 KOps/s | 37.9657 KOps/s | |
test_reshape_td | 0.1080ms | 33.3635μs | 29.9728 KOps/s | 31.1942 KOps/s | |
test_view_pytree | 66.6340μs | 27.7228μs | 36.0714 KOps/s | 37.7117 KOps/s | |
test_view_td | 77.8650μs | 38.2032μs | 26.1758 KOps/s | 27.3928 KOps/s | |
test_unbind_pytree | 75.6810μs | 29.9733μs | 33.3630 KOps/s | 33.7364 KOps/s | |
test_unbind_td | 0.3398ms | 38.6483μs | 25.8744 KOps/s | 26.6399 KOps/s | |
test_split_pytree | 68.4170μs | 29.9417μs | 33.3982 KOps/s | 34.0918 KOps/s | |
test_split_td | 0.5247ms | 45.2427μs | 22.1030 KOps/s | 23.1399 KOps/s | |
test_add_pytree | 91.8310μs | 36.8763μs | 27.1177 KOps/s | 28.4212 KOps/s | |
test_add_td | 0.1306ms | 53.4645μs | 18.7040 KOps/s | 19.1164 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1291ms | 62.0599μs | 16.1135 KOps/s | 15.7052 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3509ms | 0.1628ms | 6.1410 KOps/s | 6.1986 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1095ms | 45.5813μs | 21.9388 KOps/s | 21.3738 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2356ms | 0.1213ms | 8.2461 KOps/s | 8.2534 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1439ms | 26.4001μs | 37.8786 KOps/s | 37.6419 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.2328ms | 55.1934μs | 18.1181 KOps/s | 18.5561 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1427ms | 81.8654μs | 12.2152 KOps/s | 12.5173 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1224ms | 69.2542μs | 14.4396 KOps/s | 14.5025 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1835ms | 0.1044ms | 9.5826 KOps/s | 9.3972 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3584ms | 0.2019ms | 4.9534 KOps/s | 5.0104 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 90.6870μs | 44.8485μs | 22.2973 KOps/s | 21.8086 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.5005ms | 61.7318μs | 16.1991 KOps/s | 16.1800 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2140ms | 0.1029ms | 9.7185 KOps/s | 9.6356 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3826ms | 0.2084ms | 4.7974 KOps/s | 4.8919 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3007ms | 0.2156ms | 4.6379 KOps/s | 4.7146 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2253ms | 0.1055ms | 9.4824 KOps/s | 9.4460 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1352ms | 54.6982μs | 18.2821 KOps/s | 18.1377 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1114ms | 46.4809μs | 21.5142 KOps/s | 20.8819 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6182ms | 0.1637ms | 6.1103 KOps/s | 6.0466 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1920ms | 0.1059ms | 9.4386 KOps/s | 9.6200 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 78.2560μs | 22.2982μs | 44.8467 KOps/s | 46.0662 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1108ms | 58.8060μs | 17.0051 KOps/s | 17.0419 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1769ms | 84.6557μs | 11.8126 KOps/s | 11.9522 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1476ms | 72.4227μs | 13.8078 KOps/s | 13.8395 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2927ms | 0.2026ms | 4.9348 KOps/s | 4.7858 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5663ms | 1.3056ms | 765.9243 Ops/s | 771.6167 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2974ms | 0.2004ms | 4.9907 KOps/s | 4.9052 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.9480ms | 0.7965ms | 1.2555 KOps/s | 1.2740 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.8130ms | 0.4530ms | 2.2077 KOps/s | 2.1794 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.9865ms | 2.6491ms | 377.4874 Ops/s | 391.1901 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 89.1660μs | 36.7825μs | 27.1868 KOps/s | 27.3379 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5230ms | 34.1949μs | 29.2442 KOps/s | 29.9677 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 98.4430μs | 29.8399μs | 33.5122 KOps/s | 34.3107 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 64.2600μs | 23.4431μs | 42.6565 KOps/s | 42.2109 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 90.5090μs | 30.7436μs | 32.5271 KOps/s | 32.8903 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1199ms | 23.8168μs | 41.9872 KOps/s | 42.6207 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1274ms | 52.3345μs | 19.1078 KOps/s | 19.2134 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.6105ms | 21.0768μs | 47.4456 KOps/s | 49.8675 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1064ms | 44.3922μs | 22.5265 KOps/s | 22.3784 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 81.8750μs | 18.9683μs | 52.7196 KOps/s | 51.9010 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 89.8480μs | 45.0545μs | 22.1954 KOps/s | 21.8940 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 81.4520μs | 19.4321μs | 51.4614 KOps/s | 52.5383 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1118ms | 52.2381μs | 19.1431 KOps/s | 18.8726 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9690ms | 20.5183μs | 48.7369 KOps/s | 50.7942 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1242ms | 45.0088μs | 22.2179 KOps/s | 22.3612 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 82.8340μs | 19.3401μs | 51.7061 KOps/s | 52.6596 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1108ms | 45.1499μs | 22.1485 KOps/s | 22.1824 KOps/s | |
test_compile_indexing[int-pytree-eager] | 59.9120μs | 19.0608μs | 52.4638 KOps/s | 53.3284 KOps/s | |
test_mod_add[eager] | 75.8910μs | 34.4284μs | 29.0458 KOps/s | 30.3733 KOps/s | |
test_mod_add[compile] | 0.1190ms | 46.9617μs | 21.2939 KOps/s | 20.1062 KOps/s | |
test_mod_add[compile-overhead] | 0.1249ms | 48.3697μs | 20.6741 KOps/s | 19.7800 KOps/s | |
test_mod_wrap[eager] | 0.4839ms | 0.2319ms | 4.3127 KOps/s | 4.4088 KOps/s | |
test_mod_wrap[compile] | 0.3224ms | 0.2089ms | 4.7874 KOps/s | 4.7028 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3974ms | 0.2054ms | 4.8681 KOps/s | 4.7472 KOps/s | |
test_mod_wrap_and_backward[eager] | 16.2540ms | 11.6403ms | 85.9088 Ops/s | 81.4684 Ops/s | |
test_mod_wrap_and_backward[compile] | 13.9757ms | 11.5073ms | 86.9010 Ops/s | 81.2963 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.9030ms | 11.9902ms | 83.4015 Ops/s | 80.5341 Ops/s | |
test_seq_add[eager] | 0.2176ms | 0.1123ms | 8.9050 KOps/s | 9.0911 KOps/s | |
test_seq_add[compile] | 0.1472ms | 62.6583μs | 15.9596 KOps/s | 15.9384 KOps/s | |
test_seq_add[compile-overhead] | 0.1530ms | 60.2860μs | 16.5876 KOps/s | 16.3331 KOps/s | |
test_seq_wrap[eager] | 0.8023ms | 0.4369ms | 2.2888 KOps/s | 2.2785 KOps/s | |
test_seq_wrap[compile] | 0.3567ms | 0.2293ms | 4.3612 KOps/s | 4.2418 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4246ms | 0.2306ms | 4.3374 KOps/s | 4.2659 KOps/s | |
test_func_call_runtime[False-eager] | 0.8888ms | 0.5653ms | 1.7690 KOps/s | 1.8040 KOps/s | |
test_func_call_runtime[False-compile] | 0.6158ms | 0.4322ms | 2.3137 KOps/s | 2.3233 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5567ms | 0.4363ms | 2.2918 KOps/s | 2.3290 KOps/s | |
test_func_call_runtime[True-eager] | 1.4969ms | 0.7861ms | 1.2722 KOps/s | 1.3076 KOps/s | |
test_func_call_runtime[True-compile] | 0.5877ms | 0.4748ms | 2.1061 KOps/s | 2.1112 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6931ms | 0.4739ms | 2.1103 KOps/s | 2.1245 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9020ms | 0.5648ms | 1.7705 KOps/s | 1.8417 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6350ms | 0.4346ms | 2.3011 KOps/s | 2.3274 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.9076ms | 0.4301ms | 2.3249 KOps/s | 2.3172 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.5009ms | 0.9307ms | 1.0744 KOps/s | 1.0988 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6275ms | 0.5000ms | 1.9999 KOps/s | 1.9969 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5987ms | 0.4994ms | 2.0023 KOps/s | 2.0120 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4328ms | 1.9163ms | 521.8477 Ops/s | 513.5480 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.1735ms | 0.5328ms | 1.8768 KOps/s | 1.8809 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8590ms | 0.5505ms | 1.8164 KOps/s | 1.8842 KOps/s | |
test_distributed | 0.3201ms | 0.1273ms | 7.8572 KOps/s | 7.7637 KOps/s | |
test_tdmodule | 87.8140μs | 25.8561μs | 38.6756 KOps/s | 39.9251 KOps/s | |
test_tdmodule_dispatch | 83.5760μs | 47.7495μs | 20.9426 KOps/s | 21.3109 KOps/s | |
test_tdseq | 43.7810μs | 25.2763μs | 39.5627 KOps/s | 38.9521 KOps/s | |
test_tdseq_dispatch | 75.0600μs | 49.7107μs | 20.1164 KOps/s | 20.1252 KOps/s | |
test_instantiation_functorch | 2.4143ms | 1.5589ms | 641.4637 Ops/s | 633.0954 Ops/s | |
test_exec_functorch | 0.4254ms | 0.1841ms | 5.4312 KOps/s | 5.4111 KOps/s | |
test_exec_functional_call | 0.3156ms | 0.1790ms | 5.5869 KOps/s | 5.6459 KOps/s | |
test_exec_td_decorator | 0.5344ms | 0.2360ms | 4.2372 KOps/s | 4.2568 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.1638ms | 0.6618ms | 1.5111 KOps/s | 1.4983 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9015ms | 0.6555ms | 1.5256 KOps/s | 1.5147 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8295ms | 0.5349ms | 1.8696 KOps/s | 1.8812 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7650ms | 0.5355ms | 1.8676 KOps/s | 1.8962 KOps/s | |
test_to_module_speed[True] | 1.6669ms | 1.3007ms | 768.7928 Ops/s | 768.1274 Ops/s | |
test_to_module_speed[False] | 2.0345ms | 1.2791ms | 781.8055 Ops/s | 800.4740 Ops/s | |
test_tc_init | 89.9980μs | 48.9173μs | 20.4427 KOps/s | 22.9565 KOps/s | |
test_tc_init_nested | 0.1933ms | 98.2263μs | 10.1806 KOps/s | 11.6690 KOps/s | |
test_tc_first_layer_tensor | 21.8510μs | 1.6103μs | 620.9898 KOps/s | 612.9044 KOps/s | |
test_tc_first_layer_nontensor | 41.9180μs | 4.9740μs | 201.0467 KOps/s | 204.4354 KOps/s | |
test_tc_second_layer_tensor | 24.3550μs | 3.0009μs | 333.2306 KOps/s | 330.3610 KOps/s | |
test_tc_second_layer_nontensor | 46.3060μs | 6.4579μs | 154.8503 KOps/s | 158.3126 KOps/s | |
test_unbind | 0.2279s | 13.7982ms | 72.4732 Ops/s | 78.6314 Ops/s | |
test_full_like | 9.5102ms | 8.5034ms | 117.6007 Ops/s | 131.0931 Ops/s | |
test_zeros_like | 3.7421ms | 3.2043ms | 312.0780 Ops/s | 351.5265 Ops/s | |
test_ones_like | 4.1349ms | 3.6554ms | 273.5659 Ops/s | 164.4256 Ops/s | |
test_clone | 6.3703ms | 5.6238ms | 177.8170 Ops/s | 127.9916 Ops/s | |
test_squeeze | 79.9990μs | 11.9912μs | 83.3945 KOps/s | 84.8835 KOps/s | |
test_unsqueeze | 0.1629ms | 90.5350μs | 11.0455 KOps/s | 11.0907 KOps/s | |
test_split | 0.4949ms | 0.1992ms | 5.0209 KOps/s | 5.2187 KOps/s | |
test_permute | 0.3195ms | 0.2199ms | 4.5482 KOps/s | 4.5376 KOps/s | |
test_stack | 30.1682ms | 25.2434ms | 39.6142 Ops/s | 39.0674 Ops/s | |
test_cat | 29.7636ms | 25.2266ms | 39.6407 Ops/s | 39.4385 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 32.9110μs | 10.5692μs | 94.6145 KOps/s | 98.8888 KOps/s | |
test_plain_set_stack_nested | 34.1700μs | 10.5183μs | 95.0728 KOps/s | 99.4130 KOps/s | |
test_plain_set_nested_inplace | 47.6800μs | 11.3195μs | 88.3432 KOps/s | 90.8469 KOps/s | |
test_plain_set_stack_nested_inplace | 39.6600μs | 11.3743μs | 87.9177 KOps/s | 91.2833 KOps/s | |
test_items | 40.9100μs | 2.8610μs | 349.5259 KOps/s | 348.1706 KOps/s | |
test_items_nested | 0.4483ms | 0.3545ms | 2.8209 KOps/s | 2.8612 KOps/s | |
test_items_nested_locked | 0.4066ms | 0.3568ms | 2.8023 KOps/s | 2.8420 KOps/s | |
test_items_nested_leaf | 86.8910μs | 59.0817μs | 16.9257 KOps/s | 17.2095 KOps/s | |
test_items_stack_nested | 0.4049ms | 0.3584ms | 2.7905 KOps/s | 2.8252 KOps/s | |
test_items_stack_nested_leaf | 97.9910μs | 60.7325μs | 16.4656 KOps/s | 16.8193 KOps/s | |
test_items_stack_nested_locked | 0.4175ms | 0.3633ms | 2.7528 KOps/s | 2.8373 KOps/s | |
test_keys | 31.1910μs | 3.4611μs | 288.9228 KOps/s | 291.6953 KOps/s | |
test_keys_nested | 0.1058ms | 70.1476μs | 14.2557 KOps/s | 14.2432 KOps/s | |
test_keys_nested_locked | 0.6476ms | 75.2305μs | 13.2925 KOps/s | 13.1809 KOps/s | |
test_keys_nested_leaf | 0.1012ms | 61.6084μs | 16.2316 KOps/s | 16.4263 KOps/s | |
test_keys_stack_nested | 96.1710μs | 71.1942μs | 14.0461 KOps/s | 14.2200 KOps/s | |
test_keys_stack_nested_leaf | 91.1200μs | 62.3908μs | 16.0280 KOps/s | 16.2501 KOps/s | |
test_keys_stack_nested_locked | 0.1045ms | 76.1173μs | 13.1376 KOps/s | 13.1273 KOps/s | |
test_values | 5.6102μs | 0.8420μs | 1.1876 MOps/s | 1.1868 MOps/s | |
test_values_nested | 59.6800μs | 31.8363μs | 31.4107 KOps/s | 32.1044 KOps/s | |
test_values_nested_locked | 60.4210μs | 33.5750μs | 29.7840 KOps/s | 30.6008 KOps/s | |
test_values_nested_leaf | 68.7700μs | 34.0874μs | 29.3363 KOps/s | 30.0422 KOps/s | |
test_values_stack_nested | 78.4400μs | 32.3996μs | 30.8646 KOps/s | 31.8415 KOps/s | |
test_values_stack_nested_leaf | 85.4210μs | 34.5246μs | 28.9649 KOps/s | 29.4861 KOps/s | |
test_values_stack_nested_locked | 75.0400μs | 33.9171μs | 29.4836 KOps/s | 30.1536 KOps/s | |
test_membership | 2.3605μs | 0.5120μs | 1.9532 MOps/s | 1.9793 MOps/s | |
test_membership_nested | 16.4405μs | 1.9298μs | 518.2017 KOps/s | 520.8107 KOps/s | |
test_membership_nested_leaf | 17.6800μs | 1.9388μs | 515.7895 KOps/s | 511.5195 KOps/s | |
test_membership_stacked_nested | 29.0300μs | 2.0272μs | 493.3017 KOps/s | 498.3835 KOps/s | |
test_membership_stacked_nested_leaf | 44.2800μs | 2.0271μs | 493.3147 KOps/s | 492.6622 KOps/s | |
test_membership_nested_last | 31.9100μs | 2.8671μs | 348.7894 KOps/s | 350.8533 KOps/s | |
test_membership_nested_leaf_last | 39.7710μs | 2.9236μs | 342.0494 KOps/s | 348.7489 KOps/s | |
test_membership_stacked_nested_last | 29.9800μs | 2.9529μs | 338.6458 KOps/s | 348.5700 KOps/s | |
test_membership_stacked_nested_leaf_last | 30.8800μs | 2.8843μs | 346.7056 KOps/s | 346.7190 KOps/s | |
test_nested_getleaf | 54.2800μs | 6.0914μs | 164.1665 KOps/s | 164.0045 KOps/s | |
test_nested_get | 29.1800μs | 5.8275μs | 171.5995 KOps/s | 171.8707 KOps/s | |
test_stacked_getleaf | 28.7400μs | 6.1126μs | 163.5962 KOps/s | 164.8808 KOps/s | |
test_stacked_get | 46.7200μs | 5.7950μs | 172.5640 KOps/s | 173.0097 KOps/s | |
test_nested_getitemleaf | 40.5010μs | 6.1870μs | 161.6288 KOps/s | 161.7343 KOps/s | |
test_nested_getitem | 35.8200μs | 5.9149μs | 169.0657 KOps/s | 170.0955 KOps/s | |
test_stacked_getitemleaf | 36.7500μs | 6.1588μs | 162.3703 KOps/s | 161.2010 KOps/s | |
test_stacked_getitem | 39.8710μs | 5.8773μs | 170.1463 KOps/s | 170.4638 KOps/s | |
test_lock_nested | 9.4038ms | 0.3780ms | 2.6456 KOps/s | 2.7023 KOps/s | |
test_lock_stack_nested | 0.4359ms | 0.3374ms | 2.9638 KOps/s | 2.9976 KOps/s | |
test_unlock_nested | 0.7030ms | 0.3097ms | 3.2288 KOps/s | 3.2637 KOps/s | |
test_unlock_stack_nested | 0.3077ms | 0.2768ms | 3.6132 KOps/s | 3.6552 KOps/s | |
test_flatten_speed | 0.1019ms | 74.4590μs | 13.4302 KOps/s | 13.0313 KOps/s | |
test_unflatten_speed | 0.3549ms | 0.3005ms | 3.3277 KOps/s | 3.3246 KOps/s | |
test_common_ops | 1.6448ms | 0.5723ms | 1.7473 KOps/s | 1.8126 KOps/s | |
test_creation | 95.4910μs | 1.4240μs | 702.2447 KOps/s | 704.5640 KOps/s | |
test_creation_empty | 43.8610μs | 7.2384μs | 138.1516 KOps/s | 158.1738 KOps/s | |
test_creation_nested_1 | 45.5210μs | 8.6675μs | 115.3738 KOps/s | 127.6870 KOps/s | |
test_creation_nested_2 | 41.3000μs | 11.0496μs | 90.5013 KOps/s | 97.8647 KOps/s | |
test_clone | 68.2600μs | 10.2163μs | 97.8830 KOps/s | 101.5947 KOps/s | |
test_getitem[int] | 1.2022ms | 11.3168μs | 88.3643 KOps/s | 94.4526 KOps/s | |
test_getitem[slice_int] | 0.1122ms | 21.0008μs | 47.6172 KOps/s | 48.9088 KOps/s | |
test_getitem[range] | 0.1356ms | 35.1809μs | 28.4245 KOps/s | 28.0036 KOps/s | |
test_getitem[tuple] | 0.1142ms | 18.5219μs | 53.9901 KOps/s | 55.1146 KOps/s | |
test_getitem[list] | 0.2251ms | 31.5660μs | 31.6797 KOps/s | 31.4314 KOps/s | |
test_setitem_dim[int] | 38.0400μs | 18.1252μs | 55.1717 KOps/s | 57.3470 KOps/s | |
test_setitem_dim[slice_int] | 67.2010μs | 36.8622μs | 27.1281 KOps/s | 27.5926 KOps/s | |
test_setitem_dim[range] | 73.4000μs | 50.3854μs | 19.8470 KOps/s | 19.5808 KOps/s | |
test_setitem_dim[tuple] | 51.7200μs | 31.5419μs | 31.7039 KOps/s | 31.5283 KOps/s | |
test_setitem | 89.1110μs | 14.1797μs | 70.5234 KOps/s | 75.0175 KOps/s | |
test_set | 85.2810μs | 13.5399μs | 73.8556 KOps/s | 77.1704 KOps/s | |
test_set_shared | 1.6017ms | 0.1459ms | 6.8528 KOps/s | 6.9455 KOps/s | |
test_update | 0.3445ms | 16.1513μs | 61.9147 KOps/s | 67.5526 KOps/s | |
test_update_nested | 81.6510μs | 21.2405μs | 47.0799 KOps/s | 49.1120 KOps/s | |
test_update__nested | 0.9420ms | 23.4347μs | 42.6717 KOps/s | 42.7909 KOps/s | |
test_set_nested | 71.6200μs | 14.7768μs | 67.6737 KOps/s | 72.4322 KOps/s | |
test_set_nested_new | 91.6300μs | 16.7152μs | 59.8259 KOps/s | 63.4025 KOps/s | |
test_select | 84.8710μs | 28.4110μs | 35.1976 KOps/s | 37.2293 KOps/s | |
test_select_nested | 73.0310μs | 40.7307μs | 24.5515 KOps/s | 24.2862 KOps/s | |
test_exclude_nested | 96.4710μs | 58.1817μs | 17.1875 KOps/s | 16.9420 KOps/s | |
test_empty[True] | 0.3183ms | 0.2701ms | 3.7019 KOps/s | 3.6846 KOps/s | |
test_empty[False] | 3.9551μs | 0.7375μs | 1.3559 MOps/s | 1.3580 MOps/s | |
test_to | 87.1210μs | 54.3706μs | 18.3923 KOps/s | 18.2399 KOps/s | |
test_to_nonblocking | 0.9787ms | 45.3473μs | 22.0520 KOps/s | 22.1990 KOps/s | |
test_unbind_speed | 1.6079ms | 0.2364ms | 4.2303 KOps/s | 4.3327 KOps/s | |
test_unbind_speed_stack0 | 0.2867ms | 0.2364ms | 4.2307 KOps/s | 4.2151 KOps/s | |
test_unbind_speed_stack1 | 93.7962ms | 0.6620ms | 1.5107 KOps/s | 1.5360 KOps/s | |
test_split | 94.4445ms | 1.6588ms | 602.8367 Ops/s | 579.9932 Ops/s | |
test_chunk | 96.9491ms | 1.6573ms | 603.3910 Ops/s | 691.4343 Ops/s | |
test_consolidate[False-None] | 97.1493ms | 2.9016ms | 344.6383 Ops/s | 357.1532 Ops/s | |
test_consolidate[default-None] | 1.7879ms | 1.7239ms | 580.0811 Ops/s | 588.9860 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8093ms | 1.7478ms | 572.1550 Ops/s | 591.2723 Ops/s | |
test_consolidate_njt[False-None] | 6.7876ms | 6.3225ms | 158.1643 Ops/s | 161.9896 Ops/s | |
test_to[False-False-None] | 1.8044ms | 1.6986ms | 588.7039 Ops/s | 616.1676 Ops/s | |
test_to[True-False-None] | 1.5204ms | 1.2710ms | 786.7701 Ops/s | 808.8923 Ops/s | |
test_to[within-False-None] | 4.2828ms | 4.0739ms | 245.4674 Ops/s | 257.7204 Ops/s | |
test_to[True-default-None] | 5.5503ms | 5.2590ms | 190.1497 Ops/s | 193.3854 Ops/s | |
test_to_njt[False-False-None] | 7.1652ms | 6.8415ms | 146.1677 Ops/s | 145.5265 Ops/s | |
test_to_njt[True-False-None] | 5.6325ms | 5.4261ms | 184.2929 Ops/s | 183.7922 Ops/s | |
test_to_njt[within-False-None] | 12.1838ms | 11.7439ms | 85.1508 Ops/s | 83.9037 Ops/s | |
test_creation[device0] | 0.4698ms | 77.7378μs | 12.8638 KOps/s | 12.7874 KOps/s | |
test_creation_from_tensor | 0.4970ms | 81.2598μs | 12.3062 KOps/s | 12.1012 KOps/s | |
test_add_one[memmap_tensor0] | 0.4110ms | 6.6453μs | 150.4814 KOps/s | 160.2328 KOps/s | |
test_contiguous[memmap_tensor0] | 1.8085μs | 0.3975μs | 2.5156 MOps/s | 2.4964 MOps/s | |
test_stack[memmap_tensor0] | 45.3200μs | 4.8327μs | 206.9219 KOps/s | 225.0931 KOps/s | |
test_memmaptd_index | 1.7904ms | 0.2638ms | 3.7906 KOps/s | 4.1312 KOps/s | |
test_memmaptd_index_astensor | 0.9356ms | 0.3179ms | 3.1461 KOps/s | 3.3598 KOps/s | |
test_memmaptd_index_op | 1.0048ms | 0.5785ms | 1.7285 KOps/s | 1.9080 KOps/s | |
test_serialize_model | 0.1297s | 0.1283s | 7.7972 Ops/s | 7.6890 Ops/s | |
test_serialize_model_pickle | 1.3777s | 1.1903s | 0.8401 Ops/s | 0.8180 Ops/s | |
test_serialize_weights | 0.1296s | 0.1282s | 7.8012 Ops/s | 5.5165 Ops/s | |
test_serialize_weights_returnearly | 0.7150s | 68.2325ms | 14.6558 Ops/s | 23.5226 Ops/s | |
test_serialize_weights_pickle | 1.3770s | 1.2219s | 0.8184 Ops/s | 0.8209 Ops/s | |
test_reshape_pytree | 51.6800μs | 22.3427μs | 44.7573 KOps/s | 46.1799 KOps/s | |
test_reshape_td | 49.1700μs | 26.1092μs | 38.3006 KOps/s | 39.9128 KOps/s | |
test_view_pytree | 53.4910μs | 21.9497μs | 45.5586 KOps/s | 47.1549 KOps/s | |
test_view_td | 59.6510μs | 28.7934μs | 34.7302 KOps/s | 34.2453 KOps/s | |
test_unbind_pytree | 52.9910μs | 28.2506μs | 35.3974 KOps/s | 36.2837 KOps/s | |
test_unbind_td | 0.7612ms | 35.9503μs | 27.8162 KOps/s | 28.6887 KOps/s | |
test_split_pytree | 63.5210μs | 30.2604μs | 33.0465 KOps/s | 33.8855 KOps/s | |
test_split_td | 0.1521s | 51.8927μs | 19.2705 KOps/s | 26.3670 KOps/s | |
test_add_pytree | 66.0600μs | 33.5234μs | 29.8299 KOps/s | 30.9993 KOps/s | |
test_add_td | 0.1632ms | 43.7360μs | 22.8644 KOps/s | 23.9812 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1699ms | 0.1194ms | 8.3757 KOps/s | 8.2754 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2130ms | 0.1209ms | 8.2690 KOps/s | 8.2121 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1461ms | 98.8576μs | 10.1156 KOps/s | 10.1828 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.3983ms | 0.1515ms | 6.5989 KOps/s | 6.8885 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1306ms | 23.6027μs | 42.3681 KOps/s | 48.1754 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 84.9710μs | 26.3719μs | 37.9192 KOps/s | 37.6370 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.4510ms | 64.9269μs | 15.4019 KOps/s | 15.3268 KOps/s | |
test_compile_copy_nested[pytree-eager] | 75.4300μs | 49.0323μs | 20.3947 KOps/s | 20.1403 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1854ms | 0.1418ms | 7.0515 KOps/s | 7.0137 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2956ms | 0.2048ms | 4.8835 KOps/s | 4.9198 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1453ms | 96.4836μs | 10.3645 KOps/s | 10.4249 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1044ms | 50.0052μs | 19.9979 KOps/s | 19.9137 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1849ms | 0.1383ms | 7.2331 KOps/s | 7.3561 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5642ms | 0.4977ms | 2.0093 KOps/s | 2.1103 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3896ms | 0.2464ms | 4.0583 KOps/s | 4.0602 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1959ms | 0.1438ms | 6.9563 KOps/s | 6.7968 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1565ms | 59.8213μs | 16.7164 KOps/s | 16.1181 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1423ms | 98.0769μs | 10.1961 KOps/s | 9.7906 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5507ms | 0.4234ms | 2.3620 KOps/s | 2.4704 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2369ms | 0.1346ms | 7.4320 KOps/s | 7.3588 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 45.4000μs | 18.3253μs | 54.5695 KOps/s | 42.1735 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 61.2600μs | 27.9455μs | 35.7839 KOps/s | 36.9458 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1164ms | 69.8172μs | 14.3231 KOps/s | 14.1957 KOps/s | |
test_compile_copy_flat[pytree-eager] | 77.6800μs | 51.4840μs | 19.4235 KOps/s | 19.4125 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6018ms | 0.3884ms | 2.5750 KOps/s | 2.2358 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7308ms | 2.5810ms | 387.4469 Ops/s | 392.4989 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6030ms | 0.4328ms | 2.3106 KOps/s | 2.3046 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7517ms | 2.6464ms | 377.8764 Ops/s | 382.3764 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1711ms | 0.1140ms | 8.7707 KOps/s | 8.7403 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5709ms | 80.3321μs | 12.4483 KOps/s | 12.8214 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1742ms | 0.1091ms | 9.1675 KOps/s | 9.7391 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1141ms | 69.4367μs | 14.4016 KOps/s | 15.0446 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1720ms | 0.1091ms | 9.1684 KOps/s | 9.4850 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1526ms | 68.8279μs | 14.5290 KOps/s | 14.9665 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2207ms | 0.1004ms | 9.9589 KOps/s | 9.9051 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1436ms | 17.2932μs | 57.8261 KOps/s | 57.7432 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1856ms | 93.9870μs | 10.6398 KOps/s | 10.3128 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 50.6510μs | 15.9389μs | 62.7397 KOps/s | 64.9799 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1458ms | 95.7671μs | 10.4420 KOps/s | 10.2733 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 49.0100μs | 16.0606μs | 62.2643 KOps/s | 61.0402 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2091ms | 0.1017ms | 9.8351 KOps/s | 9.8295 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5932ms | 17.5216μs | 57.0723 KOps/s | 59.2634 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1534ms | 97.2088μs | 10.2871 KOps/s | 10.1761 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1129ms | 15.8083μs | 63.2579 KOps/s | 64.7669 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1458ms | 98.0219μs | 10.2018 KOps/s | 10.4084 KOps/s | |
test_compile_indexing[int-pytree-eager] | 43.4110μs | 15.8549μs | 63.0720 KOps/s | 64.7880 KOps/s | |
test_mod_add[eager] | 85.2710μs | 35.3830μs | 28.2622 KOps/s | 28.5732 KOps/s | |
test_mod_add[compile] | 0.1225ms | 79.1031μs | 12.6417 KOps/s | 12.6259 KOps/s | |
test_mod_add[compile-overhead] | 0.3298ms | 0.1681ms | 5.9477 KOps/s | 5.7750 KOps/s | |
test_mod_wrap[eager] | 0.3516ms | 0.2390ms | 4.1842 KOps/s | 4.0902 KOps/s | |
test_mod_wrap[compile] | 0.3977ms | 0.2856ms | 3.5018 KOps/s | 3.5301 KOps/s | |
test_mod_wrap[compile-overhead] | 7.1199ms | 3.7499ms | 266.6740 Ops/s | 277.0118 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4670ms | 1.3390ms | 746.8523 Ops/s | 709.2335 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3769ms | 1.2364ms | 808.8114 Ops/s | 739.6925 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3642ms | 0.9054ms | 1.1045 KOps/s | 976.4067 Ops/s | |
test_seq_add[eager] | 0.1499ms | 0.1086ms | 9.2053 KOps/s | 9.2852 KOps/s | |
test_seq_add[compile] | 0.5410ms | 88.3609μs | 11.3172 KOps/s | 11.0886 KOps/s | |
test_seq_add[compile-overhead] | 0.1781ms | 0.1291ms | 7.7486 KOps/s | 7.8591 KOps/s | |
test_seq_wrap[eager] | 0.5152ms | 0.4028ms | 2.4825 KOps/s | 2.3742 KOps/s | |
test_seq_wrap[compile] | 0.4068ms | 0.2976ms | 3.3602 KOps/s | 3.3729 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2735ms | 0.2213ms | 4.5194 KOps/s | 4.5137 KOps/s | |
test_func_call_runtime[False-eager] | 0.7730ms | 0.7017ms | 1.4250 KOps/s | 1.3974 KOps/s | |
test_func_call_runtime[False-compile] | 1.0457ms | 0.7262ms | 1.3770 KOps/s | 1.3518 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4868ms | 0.3608ms | 2.7715 KOps/s | 2.8111 KOps/s | |
test_func_call_runtime[True-eager] | 1.0531ms | 0.8740ms | 1.1442 KOps/s | 1.1485 KOps/s | |
test_func_call_runtime[True-compile] | 0.9209ms | 0.7614ms | 1.3133 KOps/s | 1.3219 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5228ms | 0.3868ms | 2.5855 KOps/s | 2.6313 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8500ms | 0.7080ms | 1.4124 KOps/s | 1.3607 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.1891ms | 0.7467ms | 1.3392 KOps/s | 1.3526 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5266ms | 0.3668ms | 2.7263 KOps/s | 2.7763 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1515ms | 0.9788ms | 1.0217 KOps/s | 1.0244 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.2609ms | 0.7994ms | 1.2510 KOps/s | 1.2694 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5524ms | 0.4138ms | 2.4169 KOps/s | 2.4738 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4949ms | 2.0238ms | 494.1189 Ops/s | 498.4394 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8738ms | 0.7878ms | 1.2693 KOps/s | 1.2198 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4623ms | 0.4061ms | 2.4627 KOps/s | 2.4527 KOps/s | |
test_distributed | 2.4851ms | 0.1785ms | 5.6036 KOps/s | 8.7491 KOps/s | |
test_tdmodule | 0.2888ms | 18.4853μs | 54.0970 KOps/s | 55.8436 KOps/s | |
test_tdmodule_dispatch | 75.5000μs | 32.1799μs | 31.0753 KOps/s | 31.1447 KOps/s | |
test_tdseq | 37.9110μs | 18.3521μs | 54.4898 KOps/s | 56.7637 KOps/s | |
test_tdseq_dispatch | 54.4910μs | 34.9375μs | 28.6225 KOps/s | 29.5063 KOps/s | |
test_instantiation_functorch | 1.6986ms | 1.5360ms | 651.0372 Ops/s | 657.5932 Ops/s | |
test_exec_functorch | 0.1979ms | 0.1402ms | 7.1303 KOps/s | 7.2140 KOps/s | |
test_exec_functional_call | 0.1864ms | 0.1321ms | 7.5701 KOps/s | 7.7173 KOps/s | |
test_exec_td_decorator | 0.3644ms | 0.1809ms | 5.5293 KOps/s | 5.7335 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8387ms | 0.6630ms | 1.5084 KOps/s | 1.5247 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0513ms | 0.6662ms | 1.5012 KOps/s | 1.5209 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6793ms | 0.5653ms | 1.7689 KOps/s | 1.7452 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6759ms | 0.5692ms | 1.7569 KOps/s | 1.7443 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 18.5498ms | 18.4698ms | 54.1425 Ops/s | 53.8444 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 18.8101ms | 18.4767ms | 54.1222 Ops/s | 53.6802 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 18.9641ms | 18.6833ms | 53.5238 Ops/s | 54.2497 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 18.8456ms | 18.6727ms | 53.5541 Ops/s | 54.0729 Ops/s | |
test_to_module_speed[True] | 1.1797ms | 0.9284ms | 1.0772 KOps/s | 1.0737 KOps/s | |
test_to_module_speed[False] | 1.5262ms | 0.9174ms | 1.0900 KOps/s | 1.0894 KOps/s | |
test_tc_init | 72.2110μs | 33.8629μs | 29.5309 KOps/s | 30.1633 KOps/s | |
test_tc_init_nested | 0.1044ms | 68.6034μs | 14.5765 KOps/s | 14.8108 KOps/s | |
test_tc_first_layer_tensor | 4.8044μs | 0.6881μs | 1.4532 MOps/s | 1.4487 MOps/s | |
test_tc_first_layer_nontensor | 32.5600μs | 2.3275μs | 429.6453 KOps/s | 411.9113 KOps/s | |
test_tc_second_layer_tensor | 9.6550μs | 1.4168μs | 705.7961 KOps/s | 703.1309 KOps/s | |
test_tc_second_layer_nontensor | 0.1380ms | 3.0853μs | 324.1206 KOps/s | 315.2526 KOps/s | |
test_unbind | 0.2471s | 10.4020ms | 96.1356 Ops/s | 152.9994 Ops/s | |
test_full_like | 11.6367ms | 9.3924ms | 106.4690 Ops/s | 107.3651 Ops/s | |
test_zeros_like | 5.4117ms | 4.3509ms | 229.8388 Ops/s | 230.0868 Ops/s | |
test_ones_like | 4.9605ms | 4.3551ms | 229.6148 Ops/s | 230.7390 Ops/s | |
test_clone | 6.6970ms | 6.4485ms | 155.0755 Ops/s | 156.0118 Ops/s | |
test_squeeze | 58.2410μs | 9.0156μs | 110.9186 KOps/s | 108.8202 KOps/s | |
test_unsqueeze | 0.2340ms | 70.4755μs | 14.1893 KOps/s | 14.4183 KOps/s | |
test_split | 0.4541ms | 0.1569ms | 6.3718 KOps/s | 6.3635 KOps/s | |
test_permute | 0.5952ms | 0.1798ms | 5.5605 KOps/s | 5.7219 KOps/s | |
test_stack | 51.5898ms | 51.0262ms | 19.5978 Ops/s | 19.6007 Ops/s | |
test_cat | 51.2064ms | 50.8400ms | 19.6696 Ops/s | 19.7803 Ops/s |
vmoens
added a commit
that referenced
this pull request
Nov 26, 2024
ghstack-source-id: a7d786fe77c2c12d5c8c85579123a64ef5c87cf2 Pull Request resolved: #1111
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):