-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] Fix type assertion in Seq build #1143
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Dec 17, 2024
vmoens
added a commit
that referenced
this pull request
Dec 17, 2024
ghstack-source-id: b2b2678dcda0607ca8daf036e698dfedcdf8bb79 Pull Request resolved: #1143
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 47.7890μs | 21.5059μs | 46.4989 KOps/s | 49.9190 KOps/s | |
test_plain_set_stack_nested | 59.9620μs | 21.3078μs | 46.9312 KOps/s | 49.2152 KOps/s | |
test_plain_set_nested_inplace | 91.0100μs | 23.0281μs | 43.4252 KOps/s | 45.5892 KOps/s | |
test_plain_set_stack_nested_inplace | 86.9930μs | 23.2109μs | 43.0833 KOps/s | 45.4356 KOps/s | |
test_items | 34.4340μs | 4.1455μs | 241.2255 KOps/s | 244.4773 KOps/s | |
test_items_nested | 0.5993ms | 0.3981ms | 2.5121 KOps/s | 2.4822 KOps/s | |
test_items_nested_locked | 0.5423ms | 0.3994ms | 2.5040 KOps/s | 2.5064 KOps/s | |
test_items_nested_leaf | 0.1551ms | 76.1973μs | 13.1238 KOps/s | 12.9066 KOps/s | |
test_items_stack_nested | 0.5303ms | 0.4043ms | 2.4737 KOps/s | 2.4997 KOps/s | |
test_items_stack_nested_leaf | 0.1661ms | 78.9406μs | 12.6678 KOps/s | 12.6418 KOps/s | |
test_items_stack_nested_locked | 0.6864ms | 0.4026ms | 2.4841 KOps/s | 2.4706 KOps/s | |
test_keys | 25.0870μs | 3.5337μs | 282.9897 KOps/s | 279.0355 KOps/s | |
test_keys_nested | 0.2540ms | 0.1647ms | 6.0728 KOps/s | 5.9782 KOps/s | |
test_keys_nested_locked | 1.9563ms | 0.1712ms | 5.8412 KOps/s | 5.7699 KOps/s | |
test_keys_nested_leaf | 0.2719ms | 0.1430ms | 6.9910 KOps/s | 6.9221 KOps/s | |
test_keys_stack_nested | 0.2722ms | 0.1650ms | 6.0589 KOps/s | 6.0468 KOps/s | |
test_keys_stack_nested_leaf | 0.2647ms | 0.1434ms | 6.9758 KOps/s | 6.9421 KOps/s | |
test_keys_stack_nested_locked | 0.3078ms | 0.1700ms | 5.8824 KOps/s | 5.7730 KOps/s | |
test_values | 8.9306μs | 1.0413μs | 960.3059 KOps/s | 958.1990 KOps/s | |
test_values_nested | 0.1329ms | 62.2398μs | 16.0669 KOps/s | 16.2037 KOps/s | |
test_values_nested_locked | 0.1149ms | 62.9402μs | 15.8881 KOps/s | 15.7028 KOps/s | |
test_values_nested_leaf | 0.1279ms | 71.2166μs | 14.0417 KOps/s | 13.9574 KOps/s | |
test_values_stack_nested | 0.1259ms | 63.3705μs | 15.7802 KOps/s | 15.7517 KOps/s | |
test_values_stack_nested_leaf | 0.1285ms | 71.4722μs | 13.9915 KOps/s | 13.8151 KOps/s | |
test_values_stack_nested_locked | 0.1196ms | 63.4657μs | 15.7565 KOps/s | 15.8452 KOps/s | |
test_membership | 15.8700μs | 0.8496μs | 1.1770 MOps/s | 1.1183 MOps/s | |
test_membership_nested | 44.7430μs | 2.9807μs | 335.4879 KOps/s | 328.5789 KOps/s | |
test_membership_nested_leaf | 28.1920μs | 3.0069μs | 332.5690 KOps/s | 320.5255 KOps/s | |
test_membership_stacked_nested | 45.2540μs | 2.9654μs | 337.2244 KOps/s | 326.9707 KOps/s | |
test_membership_stacked_nested_leaf | 29.8860μs | 2.9741μs | 336.2364 KOps/s | 325.3161 KOps/s | |
test_membership_nested_last | 46.6670μs | 4.4230μs | 226.0902 KOps/s | 220.9525 KOps/s | |
test_membership_nested_leaf_last | 30.5570μs | 4.4117μs | 226.6705 KOps/s | 222.2556 KOps/s | |
test_membership_stacked_nested_last | 50.3630μs | 4.3757μs | 228.5341 KOps/s | 190.1681 KOps/s | |
test_membership_stacked_nested_leaf_last | 23.2230μs | 4.3922μs | 227.6786 KOps/s | 187.3546 KOps/s | |
test_nested_getleaf | 54.9320μs | 10.9252μs | 91.5314 KOps/s | 93.4145 KOps/s | |
test_nested_get | 39.2430μs | 10.3789μs | 96.3495 KOps/s | 99.4204 KOps/s | |
test_stacked_getleaf | 36.0670μs | 10.8904μs | 91.8236 KOps/s | 94.5899 KOps/s | |
test_stacked_get | 55.5930μs | 10.4602μs | 95.6001 KOps/s | 98.8746 KOps/s | |
test_nested_getitemleaf | 38.1610μs | 11.3378μs | 88.2007 KOps/s | 90.1691 KOps/s | |
test_nested_getitem | 58.5160μs | 10.5107μs | 95.1409 KOps/s | 95.3423 KOps/s | |
test_stacked_getitemleaf | 35.4460μs | 11.2744μs | 88.6968 KOps/s | 89.4900 KOps/s | |
test_stacked_getitem | 32.1990μs | 10.4513μs | 95.6819 KOps/s | 95.2316 KOps/s | |
test_lock_nested | 4.4843ms | 0.4588ms | 2.1794 KOps/s | 2.2076 KOps/s | |
test_lock_stack_nested | 0.6691ms | 0.4299ms | 2.3261 KOps/s | 2.3398 KOps/s | |
test_unlock_nested | 0.8235ms | 0.3762ms | 2.6584 KOps/s | 2.6533 KOps/s | |
test_unlock_stack_nested | 0.5315ms | 0.3479ms | 2.8741 KOps/s | 2.8731 KOps/s | |
test_flatten_speed | 0.1825ms | 99.1774μs | 10.0829 KOps/s | 9.8254 KOps/s | |
test_unflatten_speed | 0.9217ms | 0.5232ms | 1.9113 KOps/s | 1.9045 KOps/s | |
test_common_ops | 1.6566ms | 0.8194ms | 1.2203 KOps/s | 1.2979 KOps/s | |
test_creation | 44.1820μs | 2.5526μs | 391.7600 KOps/s | 378.9265 KOps/s | |
test_creation_empty | 36.0570μs | 13.3152μs | 75.1021 KOps/s | 96.8051 KOps/s | |
test_creation_nested_1 | 53.8800μs | 16.3376μs | 61.2086 KOps/s | 74.8284 KOps/s | |
test_creation_nested_2 | 66.6170μs | 21.1296μs | 47.3269 KOps/s | 54.9547 KOps/s | |
test_clone | 67.7160μs | 13.5903μs | 73.5819 KOps/s | 71.4493 KOps/s | |
test_getitem[int] | 1.1724ms | 12.6273μs | 79.1937 KOps/s | 78.2290 KOps/s | |
test_getitem[slice_int] | 0.1405ms | 24.8053μs | 40.3139 KOps/s | 41.2706 KOps/s | |
test_getitem[range] | 0.2249ms | 49.2033μs | 20.3238 KOps/s | 20.8211 KOps/s | |
test_getitem[tuple] | 0.1319ms | 20.1897μs | 49.5303 KOps/s | 49.7207 KOps/s | |
test_getitem[list] | 0.1717ms | 44.5250μs | 22.4593 KOps/s | 23.1135 KOps/s | |
test_setitem_dim[int] | 59.8110μs | 25.3824μs | 39.3975 KOps/s | 41.5765 KOps/s | |
test_setitem_dim[slice_int] | 0.1193ms | 51.1324μs | 19.5571 KOps/s | 20.1424 KOps/s | |
test_setitem_dim[range] | 0.1281ms | 73.0532μs | 13.6887 KOps/s | 13.9109 KOps/s | |
test_setitem_dim[tuple] | 60.9830μs | 40.3506μs | 24.7828 KOps/s | 25.5191 KOps/s | |
test_setitem | 81.2710μs | 22.1077μs | 45.2331 KOps/s | 50.2658 KOps/s | |
test_set | 78.7670μs | 21.7071μs | 46.0680 KOps/s | 51.7980 KOps/s | |
test_set_shared | 4.6228ms | 0.1688ms | 5.9254 KOps/s | 5.9156 KOps/s | |
test_update | 0.1290ms | 25.8077μs | 38.7481 KOps/s | 47.0906 KOps/s | |
test_update_nested | 85.1690μs | 35.5604μs | 28.1212 KOps/s | 31.1650 KOps/s | |
test_update__nested | 0.7440ms | 34.7050μs | 28.8143 KOps/s | 29.0831 KOps/s | |
test_set_nested | 70.4410μs | 23.7919μs | 42.0311 KOps/s | 45.8072 KOps/s | |
test_set_nested_new | 93.2530μs | 28.6632μs | 34.8880 KOps/s | 38.2204 KOps/s | |
test_select | 89.5770μs | 45.2765μs | 22.0865 KOps/s | 23.3993 KOps/s | |
test_select_nested | 0.1275ms | 62.9811μs | 15.8778 KOps/s | 15.7089 KOps/s | |
test_exclude_nested | 0.1561ms | 81.1791μs | 12.3184 KOps/s | 12.0578 KOps/s | |
test_empty[True] | 0.7056ms | 0.4168ms | 2.3995 KOps/s | 2.3753 KOps/s | |
test_empty[False] | 11.6768μs | 1.3956μs | 716.5451 KOps/s | 695.7539 KOps/s | |
test_unbind_speed | 0.3958ms | 0.2724ms | 3.6706 KOps/s | 3.6446 KOps/s | |
test_unbind_speed_stack0 | 0.4063ms | 0.2701ms | 3.7027 KOps/s | 3.7517 KOps/s | |
test_unbind_speed_stack1 | 0.1082s | 0.8290ms | 1.2063 KOps/s | 1.3713 KOps/s | |
test_split | 2.4459ms | 1.5792ms | 633.2316 Ops/s | 559.8547 Ops/s | |
test_chunk | 0.1055s | 1.8944ms | 527.8672 Ops/s | 560.0845 Ops/s | |
test_consolidate_njt[False-None] | 9.2102ms | 8.1245ms | 123.0850 Ops/s | 122.4022 Ops/s | |
test_creation[device0] | 4.6937ms | 93.5674μs | 10.6875 KOps/s | 10.9892 KOps/s | |
test_creation_from_tensor | 0.3014ms | 95.3325μs | 10.4896 KOps/s | 10.6564 KOps/s | |
test_add_one[memmap_tensor0] | 0.1172ms | 4.5853μs | 218.0870 KOps/s | 200.3032 KOps/s | |
test_contiguous[memmap_tensor0] | 19.0150μs | 0.5117μs | 1.9542 MOps/s | 1.9156 MOps/s | |
test_stack[memmap_tensor0] | 28.5630μs | 3.3646μs | 297.2109 KOps/s | 287.2690 KOps/s | |
test_memmaptd_index | 1.0515ms | 0.2397ms | 4.1711 KOps/s | 4.1692 KOps/s | |
test_memmaptd_index_astensor | 0.5687ms | 0.3275ms | 3.0533 KOps/s | 3.0282 KOps/s | |
test_memmaptd_index_op | 0.9735ms | 0.6348ms | 1.5754 KOps/s | 1.7479 KOps/s | |
test_serialize_model | 0.1256s | 0.1157s | 8.6453 Ops/s | 8.5987 Ops/s | |
test_serialize_model_pickle | 0.4311s | 0.3923s | 2.5494 Ops/s | 2.5804 Ops/s | |
test_serialize_weights | 0.1233s | 0.1155s | 8.6613 Ops/s | 8.6100 Ops/s | |
test_serialize_weights_returnearly | 0.2536s | 0.1728s | 5.7886 Ops/s | 6.4250 Ops/s | |
test_serialize_weights_pickle | 0.5546s | 0.4559s | 2.1935 Ops/s | 2.4034 Ops/s | |
test_serialize_weights_filesystem | 0.1481s | 0.1410s | 7.0901 Ops/s | 7.0569 Ops/s | |
test_serialize_model_filesystem | 0.1554s | 0.1471s | 6.7965 Ops/s | 5.8084 Ops/s | |
test_reshape_pytree | 52.7380μs | 26.3366μs | 37.9700 KOps/s | 36.0292 KOps/s | |
test_reshape_td | 88.0240μs | 33.6125μs | 29.7509 KOps/s | 30.5052 KOps/s | |
test_view_pytree | 82.2000μs | 26.4274μs | 37.8395 KOps/s | 37.1514 KOps/s | |
test_view_td | 89.4060μs | 38.3475μs | 26.0773 KOps/s | 26.0564 KOps/s | |
test_unbind_pytree | 75.7510μs | 29.2872μs | 34.1446 KOps/s | 33.6318 KOps/s | |
test_unbind_td | 0.3714ms | 40.2090μs | 24.8700 KOps/s | 24.8312 KOps/s | |
test_split_pytree | 74.6390μs | 29.1414μs | 34.3154 KOps/s | 33.9005 KOps/s | |
test_split_td | 0.1014s | 53.4701μs | 18.7020 KOps/s | 21.6343 KOps/s | |
test_add_pytree | 86.2410μs | 35.2093μs | 28.4016 KOps/s | 27.7907 KOps/s | |
test_add_td | 0.1610ms | 60.5379μs | 16.5186 KOps/s | 18.4674 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1383ms | 62.6310μs | 15.9665 KOps/s | 15.8526 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3653ms | 0.1704ms | 5.8671 KOps/s | 5.8494 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1119ms | 45.8005μs | 21.8338 KOps/s | 21.9699 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2896ms | 0.1201ms | 8.3254 KOps/s | 8.3660 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 88.3340μs | 26.4191μs | 37.8513 KOps/s | 38.5142 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1160ms | 59.3971μs | 16.8358 KOps/s | 16.7194 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1709ms | 79.1660μs | 12.6317 KOps/s | 12.5742 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1392ms | 67.8960μs | 14.7284 KOps/s | 14.6668 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2148ms | 0.1058ms | 9.4531 KOps/s | 9.5422 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4816ms | 0.2168ms | 4.6123 KOps/s | 4.5907 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1089ms | 44.7255μs | 22.3586 KOps/s | 22.2206 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4984ms | 66.0459μs | 15.1410 KOps/s | 15.4997 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2096ms | 0.1026ms | 9.7434 KOps/s | 9.6773 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3720ms | 0.1995ms | 5.0115 KOps/s | 4.9287 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4064ms | 0.2318ms | 4.3137 KOps/s | 4.2692 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1969ms | 0.1055ms | 9.4810 KOps/s | 9.6511 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1382ms | 61.6237μs | 16.2275 KOps/s | 17.2521 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1175ms | 47.0265μs | 21.2646 KOps/s | 21.9601 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6442ms | 0.1580ms | 6.3290 KOps/s | 6.3111 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2080ms | 0.1050ms | 9.5251 KOps/s | 9.8528 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 52.8490μs | 20.9648μs | 47.6990 KOps/s | 44.7735 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1291ms | 64.1630μs | 15.5853 KOps/s | 14.6573 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1515ms | 80.0668μs | 12.4896 KOps/s | 12.2976 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1414ms | 68.1198μs | 14.6800 KOps/s | 14.4562 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3020ms | 0.2115ms | 4.7281 KOps/s | 4.7484 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5059ms | 1.2982ms | 770.2902 Ops/s | 765.7462 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.4038ms | 0.2061ms | 4.8513 KOps/s | 4.7734 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.9674ms | 0.7693ms | 1.2999 KOps/s | 1.2870 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5665ms | 0.4666ms | 2.1431 KOps/s | 2.1746 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.3036ms | 2.7804ms | 359.6620 Ops/s | 381.3326 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 98.8750μs | 36.3749μs | 27.4915 KOps/s | 27.5813 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.4790ms | 32.2581μs | 30.9999 KOps/s | 30.0607 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1057ms | 28.9664μs | 34.5228 KOps/s | 33.1941 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 74.3790μs | 22.7093μs | 44.0348 KOps/s | 42.2762 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1241ms | 30.4727μs | 32.8163 KOps/s | 32.8437 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 86.4440μs | 22.6782μs | 44.0951 KOps/s | 42.0892 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1236ms | 51.9324μs | 19.2558 KOps/s | 19.0181 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5166ms | 19.2624μs | 51.9145 KOps/s | 48.4481 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.3173ms | 45.2493μs | 22.0998 KOps/s | 22.2123 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.2316ms | 18.8879μs | 52.9440 KOps/s | 53.0635 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1215ms | 45.5337μs | 21.9617 KOps/s | 21.9450 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 65.8250μs | 18.5398μs | 53.9380 KOps/s | 53.2883 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1114ms | 54.2556μs | 18.4313 KOps/s | 18.6548 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0088ms | 19.2278μs | 52.0080 KOps/s | 48.4590 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1037ms | 45.6886μs | 21.8873 KOps/s | 22.2625 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 64.6500μs | 18.0115μs | 55.5201 KOps/s | 52.8842 KOps/s | |
test_compile_indexing[int-pytree-compile] | 96.4200μs | 45.3798μs | 22.0362 KOps/s | 22.0650 KOps/s | |
test_compile_indexing[int-pytree-eager] | 55.1930μs | 18.1506μs | 55.0945 KOps/s | 53.0878 KOps/s | |
test_mod_add[eager] | 0.1278ms | 35.3404μs | 28.2962 KOps/s | 28.7959 KOps/s | |
test_mod_add[compile] | 0.1323ms | 49.1472μs | 20.3471 KOps/s | 20.1541 KOps/s | |
test_mod_add[compile-overhead] | 0.1752ms | 50.5899μs | 19.7668 KOps/s | 20.2411 KOps/s | |
test_mod_wrap[eager] | 0.3559ms | 0.2258ms | 4.4295 KOps/s | 4.5351 KOps/s | |
test_mod_wrap[compile] | 0.3868ms | 0.2105ms | 4.7501 KOps/s | 4.8005 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3947ms | 0.2041ms | 4.8988 KOps/s | 4.8296 KOps/s | |
test_mod_wrap_and_backward[eager] | 14.7641ms | 12.9667ms | 77.1206 Ops/s | 88.0142 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.9159ms | 13.5458ms | 73.8234 Ops/s | 83.6756 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 16.0968ms | 14.0173ms | 71.3402 Ops/s | 74.5056 Ops/s | |
test_seq_add[eager] | 0.2288ms | 0.1175ms | 8.5101 KOps/s | 8.5478 KOps/s | |
test_seq_add[compile] | 0.1306ms | 63.1559μs | 15.8338 KOps/s | 15.6267 KOps/s | |
test_seq_add[compile-overhead] | 0.1188ms | 61.3649μs | 16.2960 KOps/s | 16.1583 KOps/s | |
test_seq_wrap[eager] | 0.5456ms | 0.4486ms | 2.2291 KOps/s | 2.2392 KOps/s | |
test_seq_wrap[compile] | 0.3600ms | 0.2292ms | 4.3634 KOps/s | 4.2815 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3334ms | 0.2272ms | 4.4010 KOps/s | 4.2892 KOps/s | |
test_func_call_runtime[False-eager] | 0.7962ms | 0.5448ms | 1.8356 KOps/s | 1.8368 KOps/s | |
test_func_call_runtime[False-compile] | 0.8105ms | 0.4350ms | 2.2988 KOps/s | 2.3237 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8169ms | 0.4331ms | 2.3088 KOps/s | 2.3345 KOps/s | |
test_func_call_runtime[True-eager] | 0.9494ms | 0.7573ms | 1.3205 KOps/s | 1.3132 KOps/s | |
test_func_call_runtime[True-compile] | 0.9219ms | 0.4745ms | 2.1076 KOps/s | 2.1092 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6534ms | 0.4736ms | 2.1116 KOps/s | 2.1202 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7568ms | 0.5433ms | 1.8407 KOps/s | 1.8555 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5515ms | 0.4314ms | 2.3181 KOps/s | 2.3420 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5217ms | 0.4302ms | 2.3247 KOps/s | 2.3383 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0354ms | 0.9095ms | 1.0995 KOps/s | 1.1126 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.7055ms | 0.4979ms | 2.0085 KOps/s | 2.0022 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5971ms | 0.4986ms | 2.0055 KOps/s | 2.0266 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.8172ms | 1.8910ms | 528.8067 Ops/s | 523.9026 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.7640ms | 0.5176ms | 1.9320 KOps/s | 1.9110 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8822ms | 0.5286ms | 1.8917 KOps/s | 1.8857 KOps/s | |
test_distributed | 0.3506ms | 0.1249ms | 8.0043 KOps/s | 7.6983 KOps/s | |
test_tdmodule | 87.6840μs | 27.7480μs | 36.0386 KOps/s | 39.3394 KOps/s | |
test_tdmodule_dispatch | 85.5690μs | 51.0441μs | 19.5909 KOps/s | 21.2136 KOps/s | |
test_tdseq | 54.2320μs | 31.0672μs | 32.1883 KOps/s | 35.4732 KOps/s | |
test_tdseq_dispatch | 92.5230μs | 57.3849μs | 17.4262 KOps/s | 19.0649 KOps/s | |
test_instantiation_functorch | 1.6766ms | 1.5430ms | 648.1007 Ops/s | 647.3676 Ops/s | |
test_exec_functorch | 0.4002ms | 0.1798ms | 5.5614 KOps/s | 5.5576 KOps/s | |
test_exec_functional_call | 0.2993ms | 0.1698ms | 5.8891 KOps/s | 5.7901 KOps/s | |
test_exec_td_decorator | 0.4604ms | 0.2309ms | 4.3306 KOps/s | 4.2687 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9588ms | 0.6558ms | 1.5248 KOps/s | 1.5298 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0232ms | 0.6650ms | 1.5037 KOps/s | 1.5258 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7644ms | 0.5276ms | 1.8953 KOps/s | 1.8822 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.9908ms | 0.5300ms | 1.8869 KOps/s | 1.8937 KOps/s | |
test_to_module_speed[True] | 2.1367ms | 1.3552ms | 737.8772 Ops/s | 735.6328 Ops/s | |
test_to_module_speed[False] | 1.8680ms | 1.3050ms | 766.2951 Ops/s | 755.3887 Ops/s | |
test_tc_init | 78.4260μs | 48.1931μs | 20.7499 KOps/s | 21.0080 KOps/s | |
test_tc_init_nested | 0.2062ms | 96.9936μs | 10.3100 KOps/s | 10.5852 KOps/s | |
test_tc_first_layer_tensor | 19.5170μs | 1.4960μs | 668.4379 KOps/s | 664.7539 KOps/s | |
test_tc_first_layer_nontensor | 56.2280μs | 4.5991μs | 217.4335 KOps/s | 216.2035 KOps/s | |
test_tc_second_layer_tensor | 26.6490μs | 2.8256μs | 353.9100 KOps/s | 348.2270 KOps/s | |
test_tc_second_layer_nontensor | 33.2110μs | 6.0382μs | 165.6115 KOps/s | 166.7328 KOps/s | |
test_unbind | 0.2256s | 13.5186ms | 73.9723 Ops/s | 74.4842 Ops/s | |
test_full_like | 8.7752ms | 7.4975ms | 133.3775 Ops/s | 77.5345 Ops/s | |
test_zeros_like | 3.3098ms | 2.8127ms | 355.5352 Ops/s | 133.1422 Ops/s | |
test_ones_like | 3.9439ms | 3.3192ms | 301.2732 Ops/s | 129.1386 Ops/s | |
test_clone | 5.5596ms | 5.1178ms | 195.3980 Ops/s | 108.2378 Ops/s | |
test_squeeze | 69.5900μs | 12.3341μs | 81.0758 KOps/s | 82.3877 KOps/s | |
test_unsqueeze | 0.4779ms | 91.2210μs | 10.9624 KOps/s | 10.6166 KOps/s | |
test_split | 0.3646ms | 0.1933ms | 5.1738 KOps/s | 5.0829 KOps/s | |
test_permute | 0.3279ms | 0.2094ms | 4.7754 KOps/s | 4.9231 KOps/s | |
test_stack | 29.5587ms | 25.6816ms | 38.9385 Ops/s | 39.8134 Ops/s | |
test_cat | 31.1958ms | 25.5176ms | 39.1887 Ops/s | 40.5116 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 55.9230μs | 11.8442μs | 84.4295 KOps/s | 77.2659 KOps/s | |
test_plain_set_stack_nested | 37.9820μs | 11.5507μs | 86.5751 KOps/s | 76.1515 KOps/s | |
test_plain_set_nested_inplace | 38.2620μs | 12.5478μs | 79.6955 KOps/s | 70.4397 KOps/s | |
test_plain_set_stack_nested_inplace | 42.9720μs | 12.5677μs | 79.5693 KOps/s | 71.3065 KOps/s | |
test_items | 32.8920μs | 2.9458μs | 339.4609 KOps/s | 329.6168 KOps/s | |
test_items_nested | 0.4114ms | 0.3574ms | 2.7979 KOps/s | 2.7810 KOps/s | |
test_items_nested_locked | 0.4081ms | 0.3566ms | 2.8044 KOps/s | 2.7769 KOps/s | |
test_items_nested_leaf | 0.1003ms | 58.6790μs | 17.0419 KOps/s | 17.1030 KOps/s | |
test_items_stack_nested | 0.4136ms | 0.3575ms | 2.7972 KOps/s | 2.7494 KOps/s | |
test_items_stack_nested_leaf | 95.3540μs | 59.6739μs | 16.7577 KOps/s | 16.3244 KOps/s | |
test_items_stack_nested_locked | 0.4049ms | 0.3594ms | 2.7827 KOps/s | 2.7282 KOps/s | |
test_keys | 25.3810μs | 3.4542μs | 289.4991 KOps/s | 288.2230 KOps/s | |
test_keys_nested | 0.1151ms | 81.2530μs | 12.3072 KOps/s | 12.1248 KOps/s | |
test_keys_nested_locked | 0.7953ms | 87.1378μs | 11.4761 KOps/s | 11.3288 KOps/s | |
test_keys_nested_leaf | 0.1010ms | 72.2783μs | 13.8354 KOps/s | 13.6624 KOps/s | |
test_keys_stack_nested | 0.1177ms | 82.8867μs | 12.0647 KOps/s | 11.9673 KOps/s | |
test_keys_stack_nested_leaf | 0.1060ms | 73.5792μs | 13.5908 KOps/s | 13.2239 KOps/s | |
test_keys_stack_nested_locked | 0.1261ms | 88.5177μs | 11.2972 KOps/s | 11.2337 KOps/s | |
test_values | 6.3185μs | 0.8487μs | 1.1783 MOps/s | 1.1778 MOps/s | |
test_values_nested | 68.2630μs | 34.8715μs | 28.6767 KOps/s | 28.8075 KOps/s | |
test_values_nested_locked | 68.1230μs | 36.7324μs | 27.2239 KOps/s | 27.4359 KOps/s | |
test_values_nested_leaf | 71.1340μs | 39.2087μs | 25.5046 KOps/s | 25.5398 KOps/s | |
test_values_stack_nested | 64.7630μs | 35.2740μs | 28.3495 KOps/s | 28.5713 KOps/s | |
test_values_stack_nested_leaf | 73.1630μs | 39.6612μs | 25.2136 KOps/s | 24.9783 KOps/s | |
test_values_stack_nested_locked | 66.1030μs | 36.9813μs | 27.0407 KOps/s | 27.1037 KOps/s | |
test_membership | 1.9136μs | 0.5053μs | 1.9790 MOps/s | 1.9613 MOps/s | |
test_membership_nested | 33.2220μs | 2.0651μs | 484.2284 KOps/s | 489.3407 KOps/s | |
test_membership_nested_leaf | 14.4005μs | 1.9882μs | 502.9732 KOps/s | 487.5556 KOps/s | |
test_membership_stacked_nested | 45.1520μs | 2.0924μs | 477.9253 KOps/s | 473.2531 KOps/s | |
test_membership_stacked_nested_leaf | 43.6720μs | 2.1009μs | 475.9976 KOps/s | 477.7572 KOps/s | |
test_membership_nested_last | 36.2410μs | 3.1227μs | 320.2366 KOps/s | 321.6279 KOps/s | |
test_membership_nested_leaf_last | 26.1710μs | 3.1600μs | 316.4545 KOps/s | 323.6410 KOps/s | |
test_membership_stacked_nested_last | 42.0720μs | 3.1224μs | 320.2658 KOps/s | 192.2122 KOps/s | |
test_membership_stacked_nested_leaf_last | 36.0610μs | 3.1395μs | 318.5203 KOps/s | 190.9872 KOps/s | |
test_nested_getleaf | 42.2720μs | 6.2603μs | 159.7358 KOps/s | 161.5620 KOps/s | |
test_nested_get | 34.1920μs | 5.8377μs | 171.2992 KOps/s | 168.2032 KOps/s | |
test_stacked_getleaf | 29.8610μs | 6.2021μs | 161.2345 KOps/s | 161.5852 KOps/s | |
test_stacked_get | 33.2420μs | 5.8930μs | 169.6939 KOps/s | 172.0031 KOps/s | |
test_nested_getitemleaf | 33.8110μs | 6.2418μs | 160.2105 KOps/s | 160.3220 KOps/s | |
test_nested_getitem | 29.6720μs | 5.9548μs | 167.9316 KOps/s | 165.9038 KOps/s | |
test_stacked_getitemleaf | 36.1110μs | 6.2708μs | 159.4700 KOps/s | 158.6620 KOps/s | |
test_stacked_getitem | 34.2720μs | 6.0200μs | 166.1124 KOps/s | 168.3457 KOps/s | |
test_lock_nested | 9.5225ms | 0.3900ms | 2.5638 KOps/s | 2.5535 KOps/s | |
test_lock_stack_nested | 0.3943ms | 0.3496ms | 2.8605 KOps/s | 2.8571 KOps/s | |
test_unlock_nested | 0.6125ms | 0.3176ms | 3.1484 KOps/s | 3.0756 KOps/s | |
test_unlock_stack_nested | 0.3284ms | 0.2873ms | 3.4805 KOps/s | 3.4677 KOps/s | |
test_flatten_speed | 0.1410ms | 76.5846μs | 13.0575 KOps/s | 13.0628 KOps/s | |
test_unflatten_speed | 0.3884ms | 0.3255ms | 3.0719 KOps/s | 3.0605 KOps/s | |
test_common_ops | 1.6943ms | 0.5794ms | 1.7260 KOps/s | 1.5538 KOps/s | |
test_creation | 0.1658ms | 1.7772μs | 562.6815 KOps/s | 558.9314 KOps/s | |
test_creation_empty | 32.7920μs | 6.5473μs | 152.7357 KOps/s | 109.4430 KOps/s | |
test_creation_nested_1 | 52.4330μs | 8.1656μs | 122.4653 KOps/s | 92.5464 KOps/s | |
test_creation_nested_2 | 41.4820μs | 10.9482μs | 91.3396 KOps/s | 73.5950 KOps/s | |
test_clone | 65.5530μs | 10.6907μs | 93.5394 KOps/s | 87.6295 KOps/s | |
test_getitem[int] | 1.8596ms | 10.9417μs | 91.3935 KOps/s | 88.0540 KOps/s | |
test_getitem[slice_int] | 92.5196ms | 29.6577μs | 33.7181 KOps/s | 45.5562 KOps/s | |
test_getitem[range] | 0.1270ms | 38.3356μs | 26.0854 KOps/s | 25.4177 KOps/s | |
test_getitem[tuple] | 0.1053ms | 18.2743μs | 54.7215 KOps/s | 53.0820 KOps/s | |
test_getitem[list] | 0.2406ms | 33.3399μs | 29.9941 KOps/s | 28.7578 KOps/s | |
test_setitem_dim[int] | 38.8710μs | 19.1595μs | 52.1935 KOps/s | 48.7041 KOps/s | |
test_setitem_dim[slice_int] | 66.2630μs | 38.8652μs | 25.7299 KOps/s | 25.2685 KOps/s | |
test_setitem_dim[range] | 85.2740μs | 54.1017μs | 18.4837 KOps/s | 18.1798 KOps/s | |
test_setitem_dim[tuple] | 59.8130μs | 32.9895μs | 30.3127 KOps/s | 30.6657 KOps/s | |
test_setitem | 90.3640μs | 14.2126μs | 70.3600 KOps/s | 61.0037 KOps/s | |
test_set | 98.1350μs | 13.6290μs | 73.3728 KOps/s | 62.8471 KOps/s | |
test_set_shared | 1.7114ms | 0.1503ms | 6.6546 KOps/s | 6.5752 KOps/s | |
test_update | 0.5075ms | 15.6683μs | 63.8233 KOps/s | 52.7659 KOps/s | |
test_update_nested | 89.8150μs | 20.9284μs | 47.7820 KOps/s | 40.5328 KOps/s | |
test_update__nested | 1.1373ms | 25.8751μs | 38.6472 KOps/s | 37.4408 KOps/s | |
test_set_nested | 90.2750μs | 15.0953μs | 66.2458 KOps/s | 57.6678 KOps/s | |
test_set_nested_new | 96.3750μs | 17.3937μs | 57.4921 KOps/s | 46.9451 KOps/s | |
test_select | 98.1640μs | 29.0698μs | 34.4000 KOps/s | 31.8353 KOps/s | |
test_select_nested | 0.1011ms | 43.9360μs | 22.7604 KOps/s | 22.4969 KOps/s | |
test_exclude_nested | 0.3654ms | 63.2214μs | 15.8174 KOps/s | 15.6764 KOps/s | |
test_empty[True] | 0.3372ms | 0.2915ms | 3.4310 KOps/s | 3.3428 KOps/s | |
test_empty[False] | 3.2931μs | 0.8390μs | 1.1919 MOps/s | 1.1848 MOps/s | |
test_to | 89.1840μs | 57.3614μs | 17.4333 KOps/s | 17.3594 KOps/s | |
test_to_nonblocking | 94.4150μs | 48.4185μs | 20.6533 KOps/s | 20.5283 KOps/s | |
test_unbind_speed | 1.5752ms | 0.2402ms | 4.1631 KOps/s | 4.0831 KOps/s | |
test_unbind_speed_stack0 | 0.3462ms | 0.2426ms | 4.1213 KOps/s | 4.1486 KOps/s | |
test_unbind_speed_stack1 | 92.2622ms | 0.6774ms | 1.4762 KOps/s | 1.4637 KOps/s | |
test_split | 93.4879ms | 1.6000ms | 624.9839 Ops/s | 618.0519 Ops/s | |
test_chunk | 95.5987ms | 1.5969ms | 626.2197 Ops/s | 611.1953 Ops/s | |
test_consolidate[False-None] | 3.0467ms | 2.7442ms | 364.4053 Ops/s | 333.6729 Ops/s | |
test_consolidate[default-None] | 1.8341ms | 1.7421ms | 574.0251 Ops/s | 576.2903 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.9073ms | 1.7978ms | 556.2240 Ops/s | 562.6862 Ops/s | |
test_consolidate_njt[False-None] | 6.8759ms | 6.7720ms | 147.6659 Ops/s | 150.4629 Ops/s | |
test_to[False-False-None] | 1.8189ms | 1.7106ms | 584.5871 Ops/s | 578.7292 Ops/s | |
test_to[True-False-None] | 1.5712ms | 1.3599ms | 735.3275 Ops/s | 714.6530 Ops/s | |
test_to[within-False-None] | 4.4217ms | 4.2022ms | 237.9702 Ops/s | 234.1992 Ops/s | |
test_to[True-default-None] | 5.6363ms | 5.4457ms | 183.6302 Ops/s | 190.0264 Ops/s | |
test_to_njt[False-False-None] | 7.4890ms | 7.2104ms | 138.6894 Ops/s | 144.5584 Ops/s | |
test_to_njt[True-False-None] | 5.9117ms | 5.7293ms | 174.5418 Ops/s | 183.3496 Ops/s | |
test_to_njt[within-False-None] | 0.3237s | 15.7937ms | 63.3164 Ops/s | 83.4418 Ops/s | |
test_creation[device0] | 0.4658ms | 79.3763μs | 12.5982 KOps/s | 12.2363 KOps/s | |
test_creation_from_tensor | 0.5141ms | 84.2750μs | 11.8659 KOps/s | 11.6878 KOps/s | |
test_add_one[memmap_tensor0] | 0.3995ms | 6.6909μs | 149.4565 KOps/s | 140.3904 KOps/s | |
test_contiguous[memmap_tensor0] | 5.1658μs | 0.4303μs | 2.3240 MOps/s | 2.2669 MOps/s | |
test_stack[memmap_tensor0] | 33.0510μs | 4.4639μs | 224.0173 KOps/s | 218.1435 KOps/s | |
test_memmaptd_index | 1.4826ms | 0.2505ms | 3.9915 KOps/s | 3.8544 KOps/s | |
test_memmaptd_index_astensor | 0.5760ms | 0.3114ms | 3.2108 KOps/s | 3.1100 KOps/s | |
test_memmaptd_index_op | 0.9614ms | 0.5523ms | 1.8107 KOps/s | 1.6069 KOps/s | |
test_serialize_model | 0.1307s | 0.1299s | 7.6975 Ops/s | 7.6992 Ops/s | |
test_serialize_model_pickle | 1.3470s | 1.1881s | 0.8417 Ops/s | 0.8237 Ops/s | |
test_serialize_weights | 0.1304s | 0.1292s | 7.7394 Ops/s | 7.7513 Ops/s | |
test_serialize_weights_returnearly | 0.3258s | 53.7160ms | 18.6164 Ops/s | 15.7836 Ops/s | |
test_serialize_weights_pickle | 1.4138s | 1.2299s | 0.8131 Ops/s | 0.8226 Ops/s | |
test_reshape_pytree | 63.1730μs | 22.2997μs | 44.8437 KOps/s | 45.1602 KOps/s | |
test_reshape_td | 60.5130μs | 26.9976μs | 37.0403 KOps/s | 36.7535 KOps/s | |
test_view_pytree | 53.8930μs | 22.0725μs | 45.3052 KOps/s | 45.6051 KOps/s | |
test_view_td | 70.4230μs | 29.7542μs | 33.6087 KOps/s | 32.1572 KOps/s | |
test_unbind_pytree | 66.1430μs | 28.0429μs | 35.6596 KOps/s | 35.3960 KOps/s | |
test_unbind_td | 0.5751ms | 36.7750μs | 27.1924 KOps/s | 26.6095 KOps/s | |
test_split_pytree | 83.6640μs | 30.0914μs | 33.2321 KOps/s | 33.1820 KOps/s | |
test_split_td | 0.7366ms | 37.8797μs | 26.3994 KOps/s | 25.0764 KOps/s | |
test_add_pytree | 73.9740μs | 35.3267μs | 28.3072 KOps/s | 27.9619 KOps/s | |
test_add_td | 0.1027ms | 50.8393μs | 19.6698 KOps/s | 19.8017 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1824ms | 0.1233ms | 8.1098 KOps/s | 8.0937 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3028ms | 0.1351ms | 7.4035 KOps/s | 7.8318 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1483ms | 0.1007ms | 9.9293 KOps/s | 10.2824 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.2130ms | 0.1476ms | 6.7759 KOps/s | 6.5289 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 62.1630μs | 23.1849μs | 43.1316 KOps/s | 47.4178 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 60.0830μs | 29.5709μs | 33.8171 KOps/s | 33.5207 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2896ms | 64.8950μs | 15.4095 KOps/s | 15.2462 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1055ms | 48.6141μs | 20.5702 KOps/s | 20.3431 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2015ms | 0.1429ms | 6.9973 KOps/s | 6.9685 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3097ms | 0.2170ms | 4.6078 KOps/s | 4.6956 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1485ms | 98.3950μs | 10.1631 KOps/s | 9.9657 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1092ms | 54.4517μs | 18.3649 KOps/s | 17.9188 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1972ms | 0.1357ms | 7.3704 KOps/s | 7.2966 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5453ms | 0.4792ms | 2.0870 KOps/s | 1.9895 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3742ms | 0.2586ms | 3.8666 KOps/s | 3.8262 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1926ms | 0.1418ms | 7.0500 KOps/s | 6.9626 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1682ms | 63.7524μs | 15.6857 KOps/s | 15.4105 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1653ms | 98.2164μs | 10.1816 KOps/s | 10.0443 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5269ms | 0.4076ms | 2.4535 KOps/s | 2.4102 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2039ms | 0.1385ms | 7.2214 KOps/s | 7.2814 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1132ms | 17.4226μs | 57.3967 KOps/s | 57.6422 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 62.0730μs | 31.5014μs | 31.7446 KOps/s | 32.0507 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1719ms | 69.8470μs | 14.3170 KOps/s | 14.4036 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1271ms | 51.3155μs | 19.4873 KOps/s | 19.6169 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6276ms | 0.3913ms | 2.5556 KOps/s | 2.2091 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7630ms | 2.6330ms | 379.7929 Ops/s | 385.5393 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6388ms | 0.4404ms | 2.2707 KOps/s | 2.2516 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.9337ms | 2.7210ms | 367.5149 Ops/s | 373.0145 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.6065ms | 0.1199ms | 8.3419 KOps/s | 8.8135 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5734ms | 85.4993μs | 11.6960 KOps/s | 12.4874 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.3818ms | 0.1114ms | 8.9771 KOps/s | 9.1347 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1277ms | 74.5897μs | 13.4067 KOps/s | 14.5489 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2181ms | 0.1139ms | 8.7806 KOps/s | 8.9724 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1397ms | 74.4564μs | 13.4307 KOps/s | 14.5710 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1546ms | 0.1058ms | 9.4481 KOps/s | 9.7926 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1431ms | 17.0948μs | 58.4972 KOps/s | 55.0804 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1395ms | 97.3578μs | 10.2714 KOps/s | 10.1966 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 75.7730μs | 15.7296μs | 63.5744 KOps/s | 61.2080 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1598ms | 98.2687μs | 10.1762 KOps/s | 10.1289 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 68.8330μs | 15.7182μs | 63.6205 KOps/s | 61.9115 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1511ms | 0.1026ms | 9.7425 KOps/s | 9.5507 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5733ms | 17.0261μs | 58.7335 KOps/s | 56.0998 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1505ms | 0.1026ms | 9.7462 KOps/s | 10.1145 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 60.5930μs | 15.6772μs | 63.7870 KOps/s | 61.8620 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2700ms | 97.7010μs | 10.2353 KOps/s | 10.1818 KOps/s | |
test_compile_indexing[int-pytree-eager] | 49.8530μs | 15.7919μs | 63.3235 KOps/s | 61.8952 KOps/s | |
test_mod_add[eager] | 0.1018ms | 36.3415μs | 27.5167 KOps/s | 26.2757 KOps/s | |
test_mod_add[compile] | 0.3627ms | 81.2125μs | 12.3134 KOps/s | 12.4467 KOps/s | |
test_mod_add[compile-overhead] | 0.3272ms | 0.1678ms | 5.9580 KOps/s | 5.6337 KOps/s | |
test_mod_wrap[eager] | 0.3476ms | 0.2528ms | 3.9551 KOps/s | 3.8850 KOps/s | |
test_mod_wrap[compile] | 0.5309ms | 0.2921ms | 3.4234 KOps/s | 3.4477 KOps/s | |
test_mod_wrap[compile-overhead] | 7.1891ms | 3.8077ms | 262.6269 Ops/s | 267.1338 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5201ms | 1.3707ms | 729.5595 Ops/s | 677.4140 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3941ms | 1.2721ms | 786.1002 Ops/s | 717.2833 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3818ms | 0.9215ms | 1.0852 KOps/s | 935.5978 Ops/s | |
test_seq_add[eager] | 0.1647ms | 0.1094ms | 9.1413 KOps/s | 8.2003 KOps/s | |
test_seq_add[compile] | 0.2732ms | 87.8220μs | 11.3867 KOps/s | 10.7761 KOps/s | |
test_seq_add[compile-overhead] | 0.1720ms | 0.1290ms | 7.7519 KOps/s | 7.2732 KOps/s | |
test_seq_wrap[eager] | 0.5668ms | 0.4088ms | 2.4462 KOps/s | 2.2127 KOps/s | |
test_seq_wrap[compile] | 0.3624ms | 0.3005ms | 3.3276 KOps/s | 3.1709 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3275ms | 0.2270ms | 4.4051 KOps/s | 4.2309 KOps/s | |
test_func_call_runtime[False-eager] | 0.9364ms | 0.7661ms | 1.3053 KOps/s | 1.3340 KOps/s | |
test_func_call_runtime[False-compile] | 0.8786ms | 0.7704ms | 1.2980 KOps/s | 1.3329 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4167ms | 0.3681ms | 2.7164 KOps/s | 2.7372 KOps/s | |
test_func_call_runtime[True-eager] | 1.0120ms | 0.9041ms | 1.1061 KOps/s | 1.0835 KOps/s | |
test_func_call_runtime[True-compile] | 0.8568ms | 0.7696ms | 1.2993 KOps/s | 1.2964 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4468ms | 0.3912ms | 2.5563 KOps/s | 2.5807 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8434ms | 0.7357ms | 1.3592 KOps/s | 1.2818 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8316ms | 0.7449ms | 1.3424 KOps/s | 1.3288 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4220ms | 0.3713ms | 2.6931 KOps/s | 2.7176 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1301ms | 1.0078ms | 992.2369 Ops/s | 981.2217 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8482ms | 0.7937ms | 1.2600 KOps/s | 1.2176 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4661ms | 0.4192ms | 2.3857 KOps/s | 2.4043 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5500ms | 2.1064ms | 474.7507 Ops/s | 468.7180 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9308ms | 0.8108ms | 1.2334 KOps/s | 1.2162 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4969ms | 0.4181ms | 2.3919 KOps/s | 2.3740 KOps/s | |
test_distributed | 0.8664ms | 0.1214ms | 8.2392 KOps/s | 8.1789 KOps/s | |
test_tdmodule | 0.1095ms | 17.7923μs | 56.2042 KOps/s | 47.6218 KOps/s | |
test_tdmodule_dispatch | 65.7040μs | 31.5826μs | 31.6630 KOps/s | 27.8621 KOps/s | |
test_tdseq | 45.3520μs | 18.9280μs | 52.8317 KOps/s | 47.8748 KOps/s | |
test_tdseq_dispatch | 70.6330μs | 34.9979μs | 28.5732 KOps/s | 25.3988 KOps/s | |
test_instantiation_functorch | 1.6407ms | 1.5564ms | 642.5027 Ops/s | 618.0826 Ops/s | |
test_exec_functorch | 0.1963ms | 0.1434ms | 6.9758 KOps/s | 6.6586 KOps/s | |
test_exec_functional_call | 0.1829ms | 0.1363ms | 7.3381 KOps/s | 7.0577 KOps/s | |
test_exec_td_decorator | 0.3827ms | 0.1856ms | 5.3877 KOps/s | 5.2150 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8315ms | 0.6825ms | 1.4652 KOps/s | 1.4366 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8281ms | 0.6802ms | 1.4702 KOps/s | 1.4380 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7064ms | 0.6001ms | 1.6663 KOps/s | 1.6581 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7196ms | 0.6003ms | 1.6659 KOps/s | 1.6404 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.5277ms | 19.3939ms | 51.5626 Ops/s | 51.2238 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.1527ms | 19.3798ms | 51.6001 Ops/s | 51.3030 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.4047ms | 19.2701ms | 51.8938 Ops/s | 51.8895 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.4053ms | 19.2478ms | 51.9539 Ops/s | 51.7999 Ops/s | |
test_to_module_speed[True] | 1.0678ms | 0.9586ms | 1.0432 KOps/s | 1.0288 KOps/s | |
test_to_module_speed[False] | 1.0403ms | 0.9482ms | 1.0547 KOps/s | 1.0455 KOps/s | |
test_tc_init | 59.9930μs | 33.6802μs | 29.6910 KOps/s | 26.5861 KOps/s | |
test_tc_init_nested | 0.1060ms | 67.1630μs | 14.8892 KOps/s | 13.1680 KOps/s | |
test_tc_first_layer_tensor | 5.2831μs | 0.7161μs | 1.3964 MOps/s | 1.4071 MOps/s | |
test_tc_first_layer_nontensor | 24.4210μs | 2.3796μs | 420.2358 KOps/s | 429.7391 KOps/s | |
test_tc_second_layer_tensor | 10.8203μs | 1.4413μs | 693.8258 KOps/s | 693.2755 KOps/s | |
test_tc_second_layer_nontensor | 31.0420μs | 3.1340μs | 319.0785 KOps/s | 324.4051 KOps/s | |
test_unbind | 0.2229s | 10.4147ms | 96.0179 Ops/s | 140.3513 Ops/s | |
test_full_like | 9.7556ms | 9.3379ms | 107.0910 Ops/s | 106.0877 Ops/s | |
test_zeros_like | 5.2407ms | 4.3514ms | 229.8105 Ops/s | 233.9259 Ops/s | |
test_ones_like | 4.9760ms | 4.3521ms | 229.7730 Ops/s | 230.3114 Ops/s | |
test_clone | 6.9929ms | 6.5732ms | 152.1328 Ops/s | 107.8015 Ops/s | |
test_squeeze | 57.1630μs | 9.4551μs | 105.7634 KOps/s | 108.3551 KOps/s | |
test_unsqueeze | 0.1220ms | 70.5002μs | 14.1844 KOps/s | 14.0558 KOps/s | |
test_split | 0.4082ms | 0.1543ms | 6.4819 KOps/s | 6.3465 KOps/s | |
test_permute | 0.2256ms | 0.1742ms | 5.7399 KOps/s | 5.7100 KOps/s | |
test_stack | 53.2960ms | 51.0154ms | 19.6019 Ops/s | 19.5023 Ops/s | |
test_cat | 51.2929ms | 50.8109ms | 19.6808 Ops/s | 19.4443 Ops/s |
vmoens
added a commit
that referenced
this pull request
Dec 18, 2024
ghstack-source-id: 83d3dcafe45568c366207395a22b22fb35f61de1 Pull Request resolved: #1143
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):