-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Test] fix inline TDParams kwargs for nontensordata #1095
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Nov 20, 2024
ghstack-source-id: da8b7f40d05715170a3e9f0b47763efe356afe5e Pull Request resolved: #1095
vmoens
added a commit
that referenced
this pull request
Nov 20, 2024
ghstack-source-id: da8b7f40d05715170a3e9f0b47763efe356afe5e Pull Request resolved: #1095
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 43.8620μs | 17.9340μs | 55.7601 KOps/s | 55.2984 KOps/s | |
test_plain_set_stack_nested | 63.7190μs | 18.1485μs | 55.1011 KOps/s | 54.9763 KOps/s | |
test_plain_set_nested_inplace | 53.5810μs | 20.1140μs | 49.7165 KOps/s | 50.6457 KOps/s | |
test_plain_set_stack_nested_inplace | 72.0950μs | 20.0894μs | 49.7776 KOps/s | 50.3473 KOps/s | |
test_items | 49.4560μs | 4.1177μs | 242.8545 KOps/s | 237.7996 KOps/s | |
test_items_nested | 0.5039ms | 0.3368ms | 2.9692 KOps/s | 2.9365 KOps/s | |
test_items_nested_locked | 0.4851ms | 0.3381ms | 2.9576 KOps/s | 2.9314 KOps/s | |
test_items_nested_leaf | 0.1309ms | 72.3516μs | 13.8214 KOps/s | 14.0073 KOps/s | |
test_items_stack_nested | 0.5138ms | 0.3389ms | 2.9507 KOps/s | 2.9117 KOps/s | |
test_items_stack_nested_leaf | 0.1429ms | 73.9790μs | 13.5173 KOps/s | 13.5366 KOps/s | |
test_items_stack_nested_locked | 0.5030ms | 0.3414ms | 2.9294 KOps/s | 2.7899 KOps/s | |
test_keys | 34.3040μs | 3.5067μs | 285.1702 KOps/s | 284.4103 KOps/s | |
test_keys_nested | 0.2085ms | 0.1379ms | 7.2529 KOps/s | 7.2375 KOps/s | |
test_keys_nested_locked | 1.8755ms | 0.1434ms | 6.9722 KOps/s | 7.0479 KOps/s | |
test_keys_nested_leaf | 0.2265ms | 0.1184ms | 8.4441 KOps/s | 8.6752 KOps/s | |
test_keys_stack_nested | 0.2320ms | 0.1390ms | 7.1919 KOps/s | 7.3400 KOps/s | |
test_keys_stack_nested_leaf | 0.2281ms | 0.1194ms | 8.3748 KOps/s | 8.4624 KOps/s | |
test_keys_stack_nested_locked | 0.2468ms | 0.1431ms | 6.9903 KOps/s | 7.1312 KOps/s | |
test_values | 9.4742μs | 1.0478μs | 954.3499 KOps/s | 952.1755 KOps/s | |
test_values_nested | 0.1112ms | 55.6048μs | 17.9841 KOps/s | 18.1102 KOps/s | |
test_values_nested_locked | 0.1084ms | 55.3474μs | 18.0677 KOps/s | 18.3053 KOps/s | |
test_values_nested_leaf | 0.1281ms | 60.8281μs | 16.4398 KOps/s | 16.7748 KOps/s | |
test_values_stack_nested | 0.1208ms | 56.7602μs | 17.6180 KOps/s | 16.8679 KOps/s | |
test_values_stack_nested_leaf | 0.1247ms | 61.4458μs | 16.2745 KOps/s | 16.7327 KOps/s | |
test_values_stack_nested_locked | 0.1248ms | 56.6523μs | 17.6515 KOps/s | 18.0221 KOps/s | |
test_membership | 21.5500μs | 0.8946μs | 1.1178 MOps/s | 1.3526 MOps/s | |
test_membership_nested | 57.5380μs | 2.7860μs | 358.9361 KOps/s | 362.4034 KOps/s | |
test_membership_nested_leaf | 46.8370μs | 2.7744μs | 360.4407 KOps/s | 367.5318 KOps/s | |
test_membership_stacked_nested | 50.9940μs | 2.7644μs | 361.7399 KOps/s | 366.3861 KOps/s | |
test_membership_stacked_nested_leaf | 40.8170μs | 2.7645μs | 361.7342 KOps/s | 365.1289 KOps/s | |
test_membership_nested_last | 58.7000μs | 4.0498μs | 246.9285 KOps/s | 249.0123 KOps/s | |
test_membership_nested_leaf_last | 50.8750μs | 4.0747μs | 245.4139 KOps/s | 251.3527 KOps/s | |
test_membership_stacked_nested_last | 27.7720μs | 4.0863μs | 244.7217 KOps/s | 253.0952 KOps/s | |
test_membership_stacked_nested_leaf_last | 59.1610μs | 4.0508μs | 246.8649 KOps/s | 251.3192 KOps/s | |
test_nested_getleaf | 46.2370μs | 10.6146μs | 94.2097 KOps/s | 95.1646 KOps/s | |
test_nested_get | 53.8310μs | 10.1834μs | 98.1994 KOps/s | 100.0835 KOps/s | |
test_stacked_getleaf | 58.0890μs | 10.4620μs | 95.5840 KOps/s | 95.4275 KOps/s | |
test_stacked_get | 37.0090μs | 10.1301μs | 98.7155 KOps/s | 100.3956 KOps/s | |
test_nested_getitemleaf | 55.7740μs | 11.1347μs | 89.8093 KOps/s | 91.4605 KOps/s | |
test_nested_getitem | 54.0510μs | 10.4961μs | 95.2739 KOps/s | 97.0390 KOps/s | |
test_stacked_getitemleaf | 61.8550μs | 11.2202μs | 89.1246 KOps/s | 91.4283 KOps/s | |
test_stacked_getitem | 61.9360μs | 10.3903μs | 96.2437 KOps/s | 98.4734 KOps/s | |
test_lock_nested | 2.7775ms | 0.4559ms | 2.1936 KOps/s | 1.8339 KOps/s | |
test_lock_stack_nested | 0.7728ms | 0.4200ms | 2.3808 KOps/s | 2.4341 KOps/s | |
test_unlock_nested | 0.8515ms | 0.3658ms | 2.7335 KOps/s | 2.7351 KOps/s | |
test_unlock_stack_nested | 0.5084ms | 0.3381ms | 2.9575 KOps/s | 3.0404 KOps/s | |
test_flatten_speed | 0.1841ms | 94.3219μs | 10.6020 KOps/s | 11.1309 KOps/s | |
test_unflatten_speed | 0.6641ms | 0.4792ms | 2.0866 KOps/s | 2.1240 KOps/s | |
test_common_ops | 1.6416ms | 0.7789ms | 1.2839 KOps/s | 1.2945 KOps/s | |
test_creation | 0.1523ms | 2.0922μs | 477.9758 KOps/s | 497.5719 KOps/s | |
test_creation_empty | 39.0130μs | 11.1554μs | 89.6430 KOps/s | 86.9170 KOps/s | |
test_creation_nested_1 | 54.0310μs | 13.9624μs | 71.6211 KOps/s | 69.9810 KOps/s | |
test_creation_nested_2 | 54.3720μs | 18.0418μs | 55.4268 KOps/s | 53.6659 KOps/s | |
test_clone | 0.1197ms | 13.7276μs | 72.8459 KOps/s | 77.9354 KOps/s | |
test_getitem[int] | 0.8629ms | 12.6013μs | 79.3568 KOps/s | 79.5633 KOps/s | |
test_getitem[slice_int] | 0.1494ms | 23.9134μs | 41.8175 KOps/s | 42.3578 KOps/s | |
test_getitem[range] | 0.3861ms | 50.2638μs | 19.8950 KOps/s | 20.4316 KOps/s | |
test_getitem[tuple] | 0.1546ms | 19.7837μs | 50.5468 KOps/s | 50.6822 KOps/s | |
test_getitem[list] | 0.3218ms | 44.8784μs | 22.2824 KOps/s | 22.4687 KOps/s | |
test_setitem_dim[int] | 59.6610μs | 26.2656μs | 38.0726 KOps/s | 39.3874 KOps/s | |
test_setitem_dim[slice_int] | 95.1380μs | 52.8002μs | 18.9393 KOps/s | 19.9898 KOps/s | |
test_setitem_dim[range] | 0.1877ms | 77.2554μs | 12.9441 KOps/s | 13.1556 KOps/s | |
test_setitem_dim[tuple] | 76.0420μs | 41.3729μs | 24.1704 KOps/s | 24.9648 KOps/s | |
test_setitem | 0.1090ms | 20.8538μs | 47.9528 KOps/s | 48.4318 KOps/s | |
test_set | 0.1950ms | 20.2174μs | 49.4624 KOps/s | 49.8733 KOps/s | |
test_set_shared | 4.3688ms | 0.1706ms | 5.8625 KOps/s | 5.8632 KOps/s | |
test_update | 0.1531ms | 23.5093μs | 42.5363 KOps/s | 42.8801 KOps/s | |
test_update_nested | 0.1470ms | 33.4768μs | 29.8714 KOps/s | 30.9563 KOps/s | |
test_update__nested | 0.1619ms | 33.0247μs | 30.2804 KOps/s | 31.0708 KOps/s | |
test_set_nested | 0.1562ms | 22.6118μs | 44.2248 KOps/s | 45.6606 KOps/s | |
test_set_nested_new | 0.2541ms | 27.4687μs | 36.4051 KOps/s | 37.2415 KOps/s | |
test_select | 0.1879ms | 42.1140μs | 23.7451 KOps/s | 23.3913 KOps/s | |
test_select_nested | 0.1350ms | 60.5989μs | 16.5019 KOps/s | 16.6599 KOps/s | |
test_exclude_nested | 0.1398ms | 74.8315μs | 13.3634 KOps/s | 13.5508 KOps/s | |
test_empty[True] | 0.5743ms | 0.3504ms | 2.8542 KOps/s | 2.9003 KOps/s | |
test_empty[False] | 12.3458μs | 1.2498μs | 800.1555 KOps/s | 803.0305 KOps/s | |
test_unbind_speed | 0.4690ms | 0.2656ms | 3.7651 KOps/s | 3.8350 KOps/s | |
test_unbind_speed_stack0 | 0.5317ms | 0.2636ms | 3.7933 KOps/s | 3.9597 KOps/s | |
test_unbind_speed_stack1 | 0.1099s | 0.7887ms | 1.2679 KOps/s | 1.4297 KOps/s | |
test_split | 0.1202s | 1.7349ms | 576.3941 Ops/s | 568.4599 Ops/s | |
test_chunk | 0.1118s | 1.7361ms | 575.9931 Ops/s | 573.4761 Ops/s | |
test_consolidate_njt[False-None] | 11.2029ms | 8.2072ms | 121.8441 Ops/s | 124.6948 Ops/s | |
test_creation[device0] | 3.5816ms | 93.9037μs | 10.6492 KOps/s | 10.7889 KOps/s | |
test_creation_from_tensor | 0.2863ms | 95.1506μs | 10.5097 KOps/s | 10.4875 KOps/s | |
test_add_one[memmap_tensor0] | 0.1611ms | 4.9635μs | 201.4717 KOps/s | 207.2031 KOps/s | |
test_contiguous[memmap_tensor0] | 40.1850μs | 0.5235μs | 1.9102 MOps/s | 1.8354 MOps/s | |
test_stack[memmap_tensor0] | 28.1020μs | 3.3857μs | 295.3557 KOps/s | 297.1902 KOps/s | |
test_memmaptd_index | 0.9588ms | 0.2416ms | 4.1388 KOps/s | 4.3122 KOps/s | |
test_memmaptd_index_astensor | 0.7183ms | 0.3241ms | 3.0856 KOps/s | 3.2498 KOps/s | |
test_memmaptd_index_op | 0.9952ms | 0.6010ms | 1.6639 KOps/s | 1.7078 KOps/s | |
test_serialize_model | 0.1252s | 0.1177s | 8.4997 Ops/s | 7.4606 Ops/s | |
test_serialize_model_pickle | 0.4450s | 0.3965s | 2.5222 Ops/s | 2.5214 Ops/s | |
test_serialize_weights | 0.2592s | 0.1334s | 7.4974 Ops/s | 8.6703 Ops/s | |
test_serialize_weights_returnearly | 0.1787s | 0.1605s | 6.2287 Ops/s | 6.2847 Ops/s | |
test_serialize_weights_pickle | 0.4598s | 0.3939s | 2.5389 Ops/s | 2.4511 Ops/s | |
test_serialize_weights_filesystem | 0.1566s | 0.1463s | 6.8363 Ops/s | 6.9446 Ops/s | |
test_serialize_model_filesystem | 0.1704s | 0.1511s | 6.6202 Ops/s | 6.5421 Ops/s | |
test_reshape_pytree | 0.1032ms | 27.8804μs | 35.8674 KOps/s | 37.6214 KOps/s | |
test_reshape_td | 0.1172ms | 32.6201μs | 30.6559 KOps/s | 30.7217 KOps/s | |
test_view_pytree | 72.9670μs | 27.4434μs | 36.4386 KOps/s | 37.3674 KOps/s | |
test_view_td | 97.3930μs | 38.5720μs | 25.9256 KOps/s | 27.1835 KOps/s | |
test_unbind_pytree | 79.8490μs | 30.5438μs | 32.7399 KOps/s | 34.3216 KOps/s | |
test_unbind_td | 0.3331ms | 39.4451μs | 25.3517 KOps/s | 26.4647 KOps/s | |
test_split_pytree | 75.6420μs | 30.1751μs | 33.1399 KOps/s | 34.1195 KOps/s | |
test_split_td | 0.5476ms | 43.7646μs | 22.8495 KOps/s | 22.7812 KOps/s | |
test_add_pytree | 93.2050μs | 37.1238μs | 26.9369 KOps/s | 28.4705 KOps/s | |
test_add_td | 0.1397ms | 60.5533μs | 16.5144 KOps/s | 18.0152 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1283ms | 63.2462μs | 15.8112 KOps/s | 16.1884 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4102ms | 0.1630ms | 6.1349 KOps/s | 6.1442 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1353ms | 47.0607μs | 21.2491 KOps/s | 22.0737 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2714ms | 0.1207ms | 8.2848 KOps/s | 8.4742 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 81.2820μs | 26.5069μs | 37.7260 KOps/s | 39.2819 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1458ms | 53.7538μs | 18.6033 KOps/s | 18.6513 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2182ms | 82.2032μs | 12.1650 KOps/s | 12.6211 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1300ms | 69.3000μs | 14.4300 KOps/s | 14.5654 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2263ms | 0.1073ms | 9.3229 KOps/s | 9.4908 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3778ms | 0.2063ms | 4.8471 KOps/s | 5.1044 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1129ms | 46.0416μs | 21.7195 KOps/s | 20.9039 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4976ms | 63.0450μs | 15.8617 KOps/s | 16.4363 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1878ms | 0.1040ms | 9.6114 KOps/s | 9.7547 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3678ms | 0.2015ms | 4.9625 KOps/s | 4.9519 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4625ms | 0.2180ms | 4.5877 KOps/s | 4.8195 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1860ms | 0.1083ms | 9.2350 KOps/s | 9.4625 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1886ms | 55.1376μs | 18.1365 KOps/s | 18.6782 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1179ms | 48.2648μs | 20.7190 KOps/s | 22.0454 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6568ms | 0.1595ms | 6.2700 KOps/s | 6.2871 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1827ms | 0.1054ms | 9.4905 KOps/s | 9.7655 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 58.8700μs | 21.3237μs | 46.8962 KOps/s | 47.3159 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1599ms | 59.5415μs | 16.7950 KOps/s | 17.5390 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1579ms | 82.9121μs | 12.0610 KOps/s | 12.2524 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1311ms | 70.2728μs | 14.2303 KOps/s | 14.5802 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3087ms | 0.2116ms | 4.7263 KOps/s | 4.8213 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5868ms | 1.3034ms | 767.2083 Ops/s | 794.4541 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3027ms | 0.2039ms | 4.9033 KOps/s | 4.9213 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.9473ms | 0.7736ms | 1.2926 KOps/s | 1.2959 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5696ms | 0.4621ms | 2.1640 KOps/s | 2.1991 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.9487ms | 2.6541ms | 376.7686 Ops/s | 379.3485 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1094ms | 37.9441μs | 26.3546 KOps/s | 28.1154 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5684ms | 33.3265μs | 30.0061 KOps/s | 30.8751 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 82.1530μs | 29.7435μs | 33.6208 KOps/s | 34.8490 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1155ms | 24.3844μs | 41.0098 KOps/s | 43.8273 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1033ms | 30.7950μs | 32.4728 KOps/s | 32.3024 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 64.9610μs | 24.1856μs | 41.3470 KOps/s | 43.8769 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1189ms | 53.0657μs | 18.8445 KOps/s | 19.2627 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5410ms | 19.1616μs | 52.1877 KOps/s | 51.0555 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 98.5340μs | 45.2870μs | 22.0814 KOps/s | 22.4599 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 71.6650μs | 18.9818μs | 52.6821 KOps/s | 53.4058 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 99.7670μs | 46.1445μs | 21.6711 KOps/s | 22.3275 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 71.3630μs | 19.1612μs | 52.1889 KOps/s | 53.1344 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1520ms | 54.3585μs | 18.3964 KOps/s | 18.8213 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8831ms | 19.4089μs | 51.5228 KOps/s | 51.5137 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1076ms | 45.8195μs | 21.8248 KOps/s | 21.7014 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.3051ms | 19.1767μs | 52.1465 KOps/s | 53.7397 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1081ms | 46.0496μs | 21.7157 KOps/s | 22.1139 KOps/s | |
test_compile_indexing[int-pytree-eager] | 62.7880μs | 19.0787μs | 52.4146 KOps/s | 53.6118 KOps/s | |
test_mod_add[eager] | 75.5520μs | 26.6887μs | 37.4690 KOps/s | 37.4828 KOps/s | |
test_mod_add[compile] | 0.1488ms | 45.9723μs | 21.7522 KOps/s | 22.1880 KOps/s | |
test_mod_add[compile-overhead] | 94.8270μs | 45.7028μs | 21.8805 KOps/s | 21.8757 KOps/s | |
test_mod_wrap[eager] | 0.4625ms | 0.2146ms | 4.6608 KOps/s | 4.6552 KOps/s | |
test_mod_wrap[compile] | 1.8766ms | 0.2060ms | 4.8535 KOps/s | 4.8925 KOps/s | |
test_mod_wrap[compile-overhead] | 1.8639ms | 0.2038ms | 4.9076 KOps/s | 4.9248 KOps/s | |
test_mod_wrap_and_backward[eager] | 18.2272ms | 12.0994ms | 82.6485 Ops/s | 79.1816 Ops/s | |
test_mod_wrap_and_backward[compile] | 17.3069ms | 12.6939ms | 78.7779 Ops/s | 76.0960 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 22.3787ms | 12.5948ms | 79.3981 Ops/s | 73.1973 Ops/s | |
test_seq_add[eager] | 0.1807ms | 90.7294μs | 11.0218 KOps/s | 10.8686 KOps/s | |
test_seq_add[compile] | 0.1447ms | 61.6854μs | 16.2113 KOps/s | 16.1335 KOps/s | |
test_seq_add[compile-overhead] | 0.1284ms | 59.6996μs | 16.7505 KOps/s | 16.4261 KOps/s | |
test_seq_wrap[eager] | 0.5343ms | 0.3894ms | 2.5681 KOps/s | 2.4990 KOps/s | |
test_seq_wrap[compile] | 0.3790ms | 0.2284ms | 4.3785 KOps/s | 4.3195 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4615ms | 0.2290ms | 4.3677 KOps/s | 4.3548 KOps/s | |
test_func_call_runtime[False-eager] | 0.9551ms | 0.5670ms | 1.7637 KOps/s | 1.8288 KOps/s | |
test_func_call_runtime[False-compile] | 0.8065ms | 0.4270ms | 2.3421 KOps/s | 2.3144 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5453ms | 0.4291ms | 2.3304 KOps/s | 2.3199 KOps/s | |
test_func_call_runtime[True-eager] | 1.4707ms | 0.7702ms | 1.2983 KOps/s | 1.3266 KOps/s | |
test_func_call_runtime[True-compile] | 0.6894ms | 0.4696ms | 2.1296 KOps/s | 2.1183 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.9957ms | 0.4694ms | 2.1302 KOps/s | 2.1204 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8823ms | 0.5493ms | 1.8206 KOps/s | 1.8617 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6557ms | 0.4266ms | 2.3444 KOps/s | 2.3365 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5684ms | 0.4258ms | 2.3484 KOps/s | 2.3167 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1177ms | 0.9001ms | 1.1110 KOps/s | 1.1362 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6466ms | 0.4927ms | 2.0297 KOps/s | 2.0068 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6395ms | 0.4949ms | 2.0204 KOps/s | 2.0252 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.3872ms | 1.8950ms | 527.7053 Ops/s | 525.7537 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8736ms | 0.5213ms | 1.9182 KOps/s | 1.9279 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7128ms | 0.5182ms | 1.9298 KOps/s | 1.9197 KOps/s | |
test_distributed | 0.3696ms | 0.1288ms | 7.7658 KOps/s | 7.7139 KOps/s | |
test_tdmodule | 84.0170μs | 18.5201μs | 53.9954 KOps/s | 50.9784 KOps/s | |
test_tdmodule_dispatch | 58.5090μs | 37.2233μs | 26.8649 KOps/s | 26.6808 KOps/s | |
test_tdseq | 46.5580μs | 21.1325μs | 47.3204 KOps/s | 45.3442 KOps/s | |
test_tdseq_dispatch | 70.9230μs | 42.5881μs | 23.4808 KOps/s | 23.2448 KOps/s | |
test_instantiation_functorch | 2.5090ms | 1.5557ms | 642.8087 Ops/s | 660.1514 Ops/s | |
test_exec_functorch | 0.3669ms | 0.1799ms | 5.5600 KOps/s | 5.6494 KOps/s | |
test_exec_functional_call | 0.3409ms | 0.1758ms | 5.6867 KOps/s | 5.8365 KOps/s | |
test_exec_td_decorator | 0.5763ms | 0.2339ms | 4.2762 KOps/s | 4.4893 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.1868ms | 0.6457ms | 1.5486 KOps/s | 1.5533 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9064ms | 0.6441ms | 1.5525 KOps/s | 1.5797 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7548ms | 0.5239ms | 1.9088 KOps/s | 1.9163 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8183ms | 0.5227ms | 1.9130 KOps/s | 1.9293 KOps/s | |
test_to_module_speed[True] | 1.5795ms | 1.3008ms | 768.7465 Ops/s | 781.8737 Ops/s | |
test_to_module_speed[False] | 1.6952ms | 1.2665ms | 789.5959 Ops/s | 805.6185 Ops/s | |
test_tc_init | 76.1820μs | 44.6762μs | 22.3833 KOps/s | 21.9063 KOps/s | |
test_tc_init_nested | 0.1636ms | 86.0076μs | 11.6269 KOps/s | 11.0431 KOps/s | |
test_tc_first_layer_tensor | 49.7630μs | 1.5039μs | 664.9528 KOps/s | 655.3203 KOps/s | |
test_tc_first_layer_nontensor | 42.0490μs | 4.6881μs | 213.3054 KOps/s | 210.7062 KOps/s | |
test_tc_second_layer_tensor | 48.3910μs | 2.7622μs | 362.0341 KOps/s | 355.4126 KOps/s | |
test_tc_second_layer_nontensor | 48.8820μs | 5.9709μs | 167.4794 KOps/s | 165.5854 KOps/s | |
test_unbind | 0.2405s | 13.6172ms | 73.4363 Ops/s | 81.2529 Ops/s | |
test_full_like | 9.1469ms | 8.0832ms | 123.7127 Ops/s | 120.6627 Ops/s | |
test_zeros_like | 4.7666ms | 3.2053ms | 311.9855 Ops/s | 128.2216 Ops/s | |
test_ones_like | 4.5179ms | 3.5896ms | 278.5789 Ops/s | 124.4009 Ops/s | |
test_clone | 7.3750ms | 5.8459ms | 171.0593 Ops/s | 102.2872 Ops/s | |
test_squeeze | 61.6950μs | 11.8120μs | 84.6595 KOps/s | 81.2878 KOps/s | |
test_unsqueeze | 0.3777ms | 89.3514μs | 11.1918 KOps/s | 11.4862 KOps/s | |
test_split | 0.3142ms | 0.1875ms | 5.3322 KOps/s | 5.3882 KOps/s | |
test_permute | 0.3364ms | 0.2170ms | 4.6092 KOps/s | 4.5729 KOps/s | |
test_stack | 33.3708ms | 27.0305ms | 36.9952 Ops/s | 38.3619 Ops/s | |
test_cat | 29.6924ms | 26.6155ms | 37.5721 Ops/s | 38.3826 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 33.7700μs | 10.1066μs | 98.9451 KOps/s | 101.0562 KOps/s | |
test_plain_set_stack_nested | 41.1710μs | 10.1560μs | 98.4638 KOps/s | 100.6353 KOps/s | |
test_plain_set_nested_inplace | 53.2510μs | 10.8981μs | 91.7593 KOps/s | 93.0856 KOps/s | |
test_plain_set_stack_nested_inplace | 48.4510μs | 10.9557μs | 91.2766 KOps/s | 92.3535 KOps/s | |
test_items | 44.5010μs | 2.8592μs | 349.7440 KOps/s | 347.4538 KOps/s | |
test_items_nested | 0.3787ms | 0.3206ms | 3.1191 KOps/s | 3.1513 KOps/s | |
test_items_nested_locked | 0.4177ms | 0.3228ms | 3.0980 KOps/s | 3.1230 KOps/s | |
test_items_nested_leaf | 89.9810μs | 58.1363μs | 17.2010 KOps/s | 17.3623 KOps/s | |
test_items_stack_nested | 0.3686ms | 0.3236ms | 3.0901 KOps/s | 3.1432 KOps/s | |
test_items_stack_nested_leaf | 94.3920μs | 58.8688μs | 16.9869 KOps/s | 16.8466 KOps/s | |
test_items_stack_nested_locked | 0.4624ms | 0.3251ms | 3.0763 KOps/s | 3.1125 KOps/s | |
test_keys | 42.7410μs | 3.7452μs | 267.0094 KOps/s | 289.5231 KOps/s | |
test_keys_nested | 0.1163ms | 70.0365μs | 14.2783 KOps/s | 14.4842 KOps/s | |
test_keys_nested_locked | 0.7232ms | 74.9271μs | 13.3463 KOps/s | 13.3140 KOps/s | |
test_keys_nested_leaf | 0.1046ms | 60.7757μs | 16.4540 KOps/s | 16.4606 KOps/s | |
test_keys_stack_nested | 0.1377ms | 70.2078μs | 14.2434 KOps/s | 14.2143 KOps/s | |
test_keys_stack_nested_leaf | 0.1081ms | 61.5763μs | 16.2400 KOps/s | 16.2337 KOps/s | |
test_keys_stack_nested_locked | 0.1138ms | 75.1316μs | 13.3100 KOps/s | 13.2908 KOps/s | |
test_values | 5.5035μs | 0.8453μs | 1.1831 MOps/s | 1.1904 MOps/s | |
test_values_nested | 63.7620μs | 31.2351μs | 32.0153 KOps/s | 32.4039 KOps/s | |
test_values_nested_locked | 67.1710μs | 32.6379μs | 30.6392 KOps/s | 30.9784 KOps/s | |
test_values_nested_leaf | 83.5810μs | 33.6773μs | 29.6936 KOps/s | 30.0512 KOps/s | |
test_values_stack_nested | 67.0410μs | 31.8077μs | 31.4390 KOps/s | 31.7513 KOps/s | |
test_values_stack_nested_leaf | 63.0810μs | 35.3078μs | 28.3224 KOps/s | 29.4322 KOps/s | |
test_values_stack_nested_locked | 80.4710μs | 33.1708μs | 30.1470 KOps/s | 30.3064 KOps/s | |
test_membership | 1.8780μs | 0.5073μs | 1.9712 MOps/s | 1.9897 MOps/s | |
test_membership_nested | 11.5703μs | 1.7779μs | 562.4669 KOps/s | 540.1189 KOps/s | |
test_membership_nested_leaf | 11.7500μs | 1.8053μs | 553.9223 KOps/s | 550.2419 KOps/s | |
test_membership_stacked_nested | 30.6200μs | 1.9014μs | 525.9375 KOps/s | 516.4041 KOps/s | |
test_membership_stacked_nested_leaf | 30.0010μs | 1.9067μs | 524.4798 KOps/s | 520.8034 KOps/s | |
test_membership_nested_last | 41.2400μs | 2.7150μs | 368.3245 KOps/s | 365.5284 KOps/s | |
test_membership_nested_leaf_last | 27.8510μs | 2.7227μs | 367.2857 KOps/s | 365.7732 KOps/s | |
test_membership_stacked_nested_last | 39.4810μs | 2.7240μs | 367.1038 KOps/s | 266.6801 KOps/s | |
test_membership_stacked_nested_leaf_last | 41.4210μs | 2.7625μs | 361.9968 KOps/s | 271.7149 KOps/s | |
test_nested_getleaf | 38.4410μs | 5.9518μs | 168.0175 KOps/s | 167.5508 KOps/s | |
test_nested_get | 45.8610μs | 5.6856μs | 175.8818 KOps/s | 175.6776 KOps/s | |
test_stacked_getleaf | 34.9400μs | 5.9824μs | 167.1567 KOps/s | 168.6540 KOps/s | |
test_stacked_get | 49.8710μs | 5.6635μs | 176.5702 KOps/s | 176.7370 KOps/s | |
test_nested_getitemleaf | 32.9110μs | 6.0170μs | 166.1963 KOps/s | 164.4237 KOps/s | |
test_nested_getitem | 46.5810μs | 5.7446μs | 174.0769 KOps/s | 174.9893 KOps/s | |
test_stacked_getitemleaf | 25.5310μs | 6.0491μs | 165.3145 KOps/s | 166.3183 KOps/s | |
test_stacked_getitem | 38.8410μs | 5.7674μs | 173.3892 KOps/s | 175.6400 KOps/s | |
test_lock_nested | 9.1854ms | 0.3688ms | 2.7115 KOps/s | 2.7840 KOps/s | |
test_lock_stack_nested | 0.3737ms | 0.3287ms | 3.0426 KOps/s | 3.0875 KOps/s | |
test_unlock_nested | 0.6788ms | 0.2994ms | 3.3399 KOps/s | 3.3753 KOps/s | |
test_unlock_stack_nested | 0.3119ms | 0.2695ms | 3.7107 KOps/s | 3.8077 KOps/s | |
test_flatten_speed | 0.1112ms | 73.9062μs | 13.5307 KOps/s | 13.8540 KOps/s | |
test_unflatten_speed | 0.3456ms | 0.2911ms | 3.4351 KOps/s | 3.5065 KOps/s | |
test_common_ops | 1.7800ms | 0.5606ms | 1.7837 KOps/s | 1.8316 KOps/s | |
test_creation | 0.1666ms | 1.4158μs | 706.2920 KOps/s | 717.2876 KOps/s | |
test_creation_empty | 41.6310μs | 6.3866μs | 156.5779 KOps/s | 166.5433 KOps/s | |
test_creation_nested_1 | 34.5700μs | 7.7211μs | 129.5150 KOps/s | 136.5119 KOps/s | |
test_creation_nested_2 | 36.0910μs | 10.2626μs | 97.4412 KOps/s | 102.3517 KOps/s | |
test_clone | 72.5710μs | 10.2203μs | 97.8446 KOps/s | 96.0197 KOps/s | |
test_getitem[int] | 92.1308ms | 15.4318μs | 64.8013 KOps/s | 99.6048 KOps/s | |
test_getitem[slice_int] | 0.1090ms | 19.6300μs | 50.9424 KOps/s | 52.3854 KOps/s | |
test_getitem[range] | 0.1315ms | 35.1260μs | 28.4689 KOps/s | 28.0798 KOps/s | |
test_getitem[tuple] | 0.1041ms | 17.0893μs | 58.5160 KOps/s | 59.0331 KOps/s | |
test_getitem[list] | 0.2648ms | 31.9446μs | 31.3042 KOps/s | 31.1617 KOps/s | |
test_setitem_dim[int] | 38.9310μs | 18.3564μs | 54.4770 KOps/s | 56.3097 KOps/s | |
test_setitem_dim[slice_int] | 65.5120μs | 36.1571μs | 27.6571 KOps/s | 27.9980 KOps/s | |
test_setitem_dim[range] | 72.5110μs | 51.4655μs | 19.4305 KOps/s | 20.1583 KOps/s | |
test_setitem_dim[tuple] | 71.4120μs | 31.0135μs | 32.2440 KOps/s | 34.0109 KOps/s | |
test_setitem | 92.6620μs | 13.9533μs | 71.6677 KOps/s | 74.5719 KOps/s | |
test_set | 94.7320μs | 13.2699μs | 75.3583 KOps/s | 76.0947 KOps/s | |
test_set_shared | 1.6675ms | 0.1439ms | 6.9503 KOps/s | 6.9527 KOps/s | |
test_update | 1.0601ms | 15.4911μs | 64.5532 KOps/s | 67.0900 KOps/s | |
test_update_nested | 0.1074ms | 20.4441μs | 48.9139 KOps/s | 51.6661 KOps/s | |
test_update__nested | 0.2426ms | 23.5628μs | 42.4398 KOps/s | 42.1216 KOps/s | |
test_set_nested | 94.8820μs | 14.4033μs | 69.4284 KOps/s | 70.2609 KOps/s | |
test_set_nested_new | 73.1720μs | 16.6177μs | 60.1767 KOps/s | 61.0746 KOps/s | |
test_select | 95.9720μs | 27.1065μs | 36.8916 KOps/s | 35.6940 KOps/s | |
test_select_nested | 0.1024ms | 40.8520μs | 24.4786 KOps/s | 24.6299 KOps/s | |
test_exclude_nested | 89.4820μs | 57.1335μs | 17.5029 KOps/s | 17.7630 KOps/s | |
test_empty[True] | 0.2888ms | 0.2532ms | 3.9488 KOps/s | 4.0174 KOps/s | |
test_empty[False] | 3.3971μs | 0.7448μs | 1.3426 MOps/s | 1.3581 MOps/s | |
test_to | 83.2620μs | 53.5840μs | 18.6623 KOps/s | 18.6747 KOps/s | |
test_to_nonblocking | 92.0620μs | 43.3175μs | 23.0854 KOps/s | 22.2782 KOps/s | |
test_unbind_speed | 0.2539ms | 0.2259ms | 4.4271 KOps/s | 4.4603 KOps/s | |
test_unbind_speed_stack0 | 0.2826ms | 0.2281ms | 4.3849 KOps/s | 4.4107 KOps/s | |
test_unbind_speed_stack1 | 92.6684ms | 0.6476ms | 1.5442 KOps/s | 1.7189 KOps/s | |
test_split | 93.1191ms | 1.5380ms | 650.1805 Ops/s | 643.6402 Ops/s | |
test_chunk | 97.0212ms | 1.7226ms | 580.5201 Ops/s | 645.9870 Ops/s | |
test_consolidate[False-None] | 2.6107ms | 2.5385ms | 393.9388 Ops/s | 357.9407 Ops/s | |
test_consolidate[default-None] | 1.7440ms | 1.6610ms | 602.0390 Ops/s | 609.9128 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.7598ms | 1.6936ms | 590.4553 Ops/s | 606.4998 Ops/s | |
test_consolidate_njt[False-None] | 6.4892ms | 6.4046ms | 156.1378 Ops/s | 156.1502 Ops/s | |
test_to[False-False-None] | 1.7672ms | 1.7005ms | 588.0726 Ops/s | 584.4730 Ops/s | |
test_to[True-False-None] | 1.4416ms | 1.2424ms | 804.8917 Ops/s | 776.1304 Ops/s | |
test_to[within-False-None] | 0.2957s | 5.0348ms | 198.6165 Ops/s | 253.3865 Ops/s | |
test_to[True-default-None] | 5.1825ms | 4.9154ms | 203.4426 Ops/s | 192.3786 Ops/s | |
test_to_njt[False-False-None] | 7.1969ms | 6.8704ms | 145.5515 Ops/s | 147.0774 Ops/s | |
test_to_njt[True-False-None] | 5.5948ms | 5.2732ms | 189.6384 Ops/s | 183.6468 Ops/s | |
test_to_njt[within-False-None] | 12.0097ms | 11.8786ms | 84.1851 Ops/s | 85.1511 Ops/s | |
test_creation[device0] | 0.5632ms | 80.0428μs | 12.4933 KOps/s | 12.8936 KOps/s | |
test_creation_from_tensor | 0.6142ms | 83.9002μs | 11.9189 KOps/s | 12.1108 KOps/s | |
test_add_one[memmap_tensor0] | 0.4360ms | 6.6246μs | 150.9530 KOps/s | 149.4076 KOps/s | |
test_contiguous[memmap_tensor0] | 1.6720μs | 0.3898μs | 2.5653 MOps/s | 2.5177 MOps/s | |
test_stack[memmap_tensor0] | 26.5410μs | 4.3274μs | 231.0873 KOps/s | 236.6312 KOps/s | |
test_memmaptd_index | 2.0513ms | 0.2397ms | 4.1719 KOps/s | 4.1937 KOps/s | |
test_memmaptd_index_astensor | 0.7901ms | 0.2962ms | 3.3764 KOps/s | 3.4533 KOps/s | |
test_memmaptd_index_op | 0.9592ms | 0.5463ms | 1.8304 KOps/s | 1.8653 KOps/s | |
test_serialize_model | 0.1310s | 0.1296s | 7.7136 Ops/s | 7.6845 Ops/s | |
test_serialize_model_pickle | 1.3494s | 1.1904s | 0.8401 Ops/s | 0.8255 Ops/s | |
test_serialize_weights | 0.1310s | 0.1298s | 7.7015 Ops/s | 7.7440 Ops/s | |
test_serialize_weights_returnearly | 49.6778ms | 40.2118ms | 24.8683 Ops/s | 14.9409 Ops/s | |
test_serialize_weights_pickle | 1.3480s | 1.1837s | 0.8448 Ops/s | 0.8212 Ops/s | |
test_reshape_pytree | 71.1320μs | 21.2007μs | 47.1683 KOps/s | 44.9407 KOps/s | |
test_reshape_td | 49.2310μs | 25.8565μs | 38.6750 KOps/s | 38.3621 KOps/s | |
test_view_pytree | 47.9510μs | 21.3969μs | 46.7357 KOps/s | 45.7724 KOps/s | |
test_view_td | 59.4010μs | 28.2227μs | 35.4325 KOps/s | 31.9766 KOps/s | |
test_unbind_pytree | 56.0810μs | 27.1252μs | 36.8661 KOps/s | 35.9667 KOps/s | |
test_unbind_td | 0.5322ms | 34.2025μs | 29.2376 KOps/s | 29.0629 KOps/s | |
test_split_pytree | 57.7810μs | 29.9670μs | 33.3700 KOps/s | 33.9585 KOps/s | |
test_split_td | 0.1642ms | 36.6405μs | 27.2922 KOps/s | 26.6352 KOps/s | |
test_add_pytree | 69.1710μs | 34.3185μs | 29.1388 KOps/s | 28.8915 KOps/s | |
test_add_td | 0.1348ms | 44.5208μs | 22.4614 KOps/s | 24.0436 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1710ms | 0.1177ms | 8.4928 KOps/s | 8.3041 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2142ms | 0.1214ms | 8.2349 KOps/s | 8.0449 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1430ms | 94.6001μs | 10.5708 KOps/s | 10.3929 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.1621ms | 0.1489ms | 6.7172 KOps/s | 6.6704 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.2298ms | 24.4313μs | 40.9311 KOps/s | 44.5347 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1032ms | 26.3268μs | 37.9842 KOps/s | 37.9864 KOps/s | |
test_compile_copy_nested[pytree-compile] | 1.3677ms | 63.7287μs | 15.6915 KOps/s | 15.4374 KOps/s | |
test_compile_copy_nested[pytree-eager] | 81.1110μs | 49.1636μs | 20.3402 KOps/s | 19.9244 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2068ms | 0.1405ms | 7.1176 KOps/s | 7.1096 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2957ms | 0.2055ms | 4.8671 KOps/s | 4.8373 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1337ms | 96.6202μs | 10.3498 KOps/s | 10.4108 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1028ms | 49.6590μs | 20.1373 KOps/s | 20.3735 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1814ms | 0.1338ms | 7.4761 KOps/s | 7.4102 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5297ms | 0.4821ms | 2.0743 KOps/s | 2.0434 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3687ms | 0.2463ms | 4.0595 KOps/s | 4.0642 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2899ms | 0.1516ms | 6.5980 KOps/s | 7.1468 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1810ms | 61.6208μs | 16.2283 KOps/s | 16.8397 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.5153ms | 0.1029ms | 9.7140 KOps/s | 10.4691 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5745ms | 0.4240ms | 2.3585 KOps/s | 2.4100 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2030ms | 0.1392ms | 7.1860 KOps/s | 7.4602 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 67.5120μs | 18.0107μs | 55.5225 KOps/s | 54.4784 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 80.2710μs | 26.6597μs | 37.5098 KOps/s | 37.0631 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1041ms | 69.7072μs | 14.3457 KOps/s | 14.4362 KOps/s | |
test_compile_copy_flat[pytree-eager] | 94.0210μs | 52.0582μs | 19.2093 KOps/s | 19.2692 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6084ms | 0.3876ms | 2.5802 KOps/s | 2.2006 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.6655ms | 2.5921ms | 385.7814 Ops/s | 381.5329 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5827ms | 0.3775ms | 2.6487 KOps/s | 2.2948 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7870ms | 2.6528ms | 376.9564 Ops/s | 373.6686 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1730ms | 0.1124ms | 8.8953 KOps/s | 8.6837 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5560ms | 76.8015μs | 13.0206 KOps/s | 12.8608 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1962ms | 0.1059ms | 9.4425 KOps/s | 9.4586 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1122ms | 66.9019μs | 14.9473 KOps/s | 14.0386 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1655ms | 0.1057ms | 9.4595 KOps/s | 8.9776 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1071ms | 67.1725μs | 14.8870 KOps/s | 14.2445 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1476ms | 99.4319μs | 10.0571 KOps/s | 10.1378 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1448ms | 16.3101μs | 61.3115 KOps/s | 59.2490 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1396ms | 94.3261μs | 10.6015 KOps/s | 10.6285 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 51.9510μs | 15.2662μs | 65.5042 KOps/s | 64.0622 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1414ms | 94.4488μs | 10.5877 KOps/s | 10.4439 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 43.4910μs | 15.1334μs | 66.0791 KOps/s | 56.8154 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1503ms | 99.1855μs | 10.0821 KOps/s | 10.0427 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6007ms | 16.1443μs | 61.9414 KOps/s | 60.3703 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2003ms | 94.6336μs | 10.5671 KOps/s | 10.1301 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 43.2610μs | 15.2659μs | 65.5054 KOps/s | 64.5419 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1403ms | 94.4642μs | 10.5860 KOps/s | 10.5056 KOps/s | |
test_compile_indexing[int-pytree-eager] | 44.4300μs | 15.2955μs | 65.3789 KOps/s | 64.3759 KOps/s | |
test_mod_add[eager] | 60.2420μs | 29.4072μs | 34.0052 KOps/s | 33.9218 KOps/s | |
test_mod_add[compile] | 0.1258ms | 77.6735μs | 12.8744 KOps/s | 12.8150 KOps/s | |
test_mod_add[compile-overhead] | 0.3139ms | 0.1621ms | 6.1689 KOps/s | 5.7009 KOps/s | |
test_mod_wrap[eager] | 0.3237ms | 0.2481ms | 4.0310 KOps/s | 4.1713 KOps/s | |
test_mod_wrap[compile] | 1.5909ms | 0.2793ms | 3.5805 KOps/s | 3.5750 KOps/s | |
test_mod_wrap[compile-overhead] | 7.2562ms | 3.7909ms | 263.7881 Ops/s | 261.3155 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4743ms | 1.3482ms | 741.7352 Ops/s | 687.9240 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3789ms | 1.2443ms | 803.6745 Ops/s | 735.6997 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.5100ms | 1.0160ms | 984.2035 Ops/s | 978.0523 Ops/s | |
test_seq_add[eager] | 0.1375ms | 92.7043μs | 10.7870 KOps/s | 10.5315 KOps/s | |
test_seq_add[compile] | 0.2097ms | 84.9248μs | 11.7751 KOps/s | 11.3308 KOps/s | |
test_seq_add[compile-overhead] | 0.1641ms | 0.1261ms | 7.9275 KOps/s | 7.9660 KOps/s | |
test_seq_wrap[eager] | 0.4306ms | 0.3674ms | 2.7218 KOps/s | 2.7028 KOps/s | |
test_seq_wrap[compile] | 0.3920ms | 0.2940ms | 3.4017 KOps/s | 3.2604 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2617ms | 0.2190ms | 4.5671 KOps/s | 4.6144 KOps/s | |
test_func_call_runtime[False-eager] | 0.8388ms | 0.7382ms | 1.3547 KOps/s | 1.3514 KOps/s | |
test_func_call_runtime[False-compile] | 0.9248ms | 0.7274ms | 1.3747 KOps/s | 1.3595 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.3960ms | 0.3524ms | 2.8378 KOps/s | 2.7802 KOps/s | |
test_func_call_runtime[True-eager] | 0.9521ms | 0.8938ms | 1.1188 KOps/s | 1.0958 KOps/s | |
test_func_call_runtime[True-compile] | 0.8714ms | 0.7482ms | 1.3366 KOps/s | 1.3147 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4446ms | 0.3741ms | 2.6733 KOps/s | 2.6772 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8392ms | 0.7552ms | 1.3242 KOps/s | 1.3611 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9595ms | 0.7260ms | 1.3774 KOps/s | 1.3372 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4012ms | 0.3540ms | 2.8248 KOps/s | 2.8155 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1269ms | 0.9920ms | 1.0080 KOps/s | 992.9092 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8244ms | 0.7741ms | 1.2918 KOps/s | 1.2904 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4688ms | 0.3981ms | 2.5117 KOps/s | 2.5039 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5266ms | 2.0734ms | 482.2958 Ops/s | 483.5957 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9173ms | 0.8110ms | 1.2331 KOps/s | 1.2452 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4493ms | 0.4033ms | 2.4797 KOps/s | 2.4285 KOps/s | |
test_distributed | 2.0639ms | 0.2034ms | 4.9164 KOps/s | 8.7577 KOps/s | |
test_tdmodule | 49.1010μs | 12.6857μs | 78.8290 KOps/s | 73.2409 KOps/s | |
test_tdmodule_dispatch | 45.5610μs | 25.0117μs | 39.9812 KOps/s | 39.2085 KOps/s | |
test_tdseq | 33.6200μs | 14.2553μs | 70.1494 KOps/s | 65.4772 KOps/s | |
test_tdseq_dispatch | 47.6610μs | 28.3326μs | 35.2951 KOps/s | 34.3043 KOps/s | |
test_instantiation_functorch | 1.6153ms | 1.5176ms | 658.9449 Ops/s | 665.1934 Ops/s | |
test_exec_functorch | 0.2237ms | 0.1444ms | 6.9257 KOps/s | 6.8552 KOps/s | |
test_exec_functional_call | 0.1711ms | 0.1360ms | 7.3504 KOps/s | 7.0054 KOps/s | |
test_exec_td_decorator | 0.3619ms | 0.1782ms | 5.6111 KOps/s | 5.2502 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7266ms | 0.6646ms | 1.5046 KOps/s | 1.4721 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7470ms | 0.6619ms | 1.5107 KOps/s | 1.4453 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6989ms | 0.5891ms | 1.6974 KOps/s | 1.6294 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6937ms | 0.5878ms | 1.7014 KOps/s | 1.6213 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.2046ms | 19.1193ms | 52.3031 Ops/s | 52.6008 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.3081ms | 19.1906ms | 52.1087 Ops/s | 52.1896 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.1225ms | 19.0170ms | 52.5845 Ops/s | 53.1443 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.1176ms | 19.0190ms | 52.5791 Ops/s | 52.7687 Ops/s | |
test_to_module_speed[True] | 1.0169ms | 0.9176ms | 1.0899 KOps/s | 1.0861 KOps/s | |
test_to_module_speed[False] | 1.3745ms | 0.9000ms | 1.1111 KOps/s | 1.1067 KOps/s | |
test_tc_init | 67.7510μs | 32.9399μs | 30.3583 KOps/s | 31.4709 KOps/s | |
test_tc_init_nested | 0.1052ms | 68.0844μs | 14.6876 KOps/s | 15.6343 KOps/s | |
test_tc_first_layer_tensor | 3.8559μs | 0.6905μs | 1.4482 MOps/s | 1.4505 MOps/s | |
test_tc_first_layer_nontensor | 29.4200μs | 2.2705μs | 440.4394 KOps/s | 440.0301 KOps/s | |
test_tc_second_layer_tensor | 11.4502μs | 1.3760μs | 726.7645 KOps/s | 722.1855 KOps/s | |
test_tc_second_layer_nontensor | 22.8610μs | 2.9772μs | 335.8883 KOps/s | 337.4197 KOps/s | |
test_unbind | 0.2212s | 9.6290ms | 103.8528 Ops/s | 153.0871 Ops/s | |
test_full_like | 10.2782ms | 9.0929ms | 109.9755 Ops/s | 109.0385 Ops/s | |
test_zeros_like | 9.2212ms | 7.1422ms | 140.0127 Ops/s | 114.6833 Ops/s | |
test_ones_like | 9.1546ms | 7.1761ms | 139.3516 Ops/s | 232.0332 Ops/s | |
test_clone | 6.9204ms | 6.3248ms | 158.1087 Ops/s | 158.7270 Ops/s | |
test_squeeze | 58.7010μs | 9.1772μs | 108.9657 KOps/s | 106.8381 KOps/s | |
test_unsqueeze | 0.1200ms | 67.2405μs | 14.8720 KOps/s | 14.4790 KOps/s | |
test_split | 0.3793ms | 0.1512ms | 6.6150 KOps/s | 6.2786 KOps/s | |
test_permute | 0.2062ms | 0.1673ms | 5.9757 KOps/s | 5.7698 KOps/s | |
test_stack | 50.7186ms | 50.3964ms | 19.8427 Ops/s | 19.5520 Ops/s | |
test_cat | 50.4059ms | 50.2208ms | 19.9121 Ops/s | 19.8482 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):