[BugFix] Better comparison of tensorclasses #1137
Merged
vmoens added a commit that referenced this pull request Dec 12, 2024
ghstack-source-id: 8def6f01f2b6d09714319a56f96b166ac1fd49d5 Pull Request resolved: #1137
facebook-github-bot added the CLA Signed label Dec 12, 2024
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 44.4930μs | 17.8770μs | 55.9378 KOps/s | 59.1072 KOps/s | |
test_plain_set_stack_nested | 69.0590μs | 18.1843μs | 54.9926 KOps/s | 57.8404 KOps/s | |
test_plain_set_nested_inplace | 0.1027ms | 19.7520μs | 50.6278 KOps/s | 53.6332 KOps/s | |
test_plain_set_stack_nested_inplace | 73.6970μs | 19.9690μs | 50.0776 KOps/s | 53.3846 KOps/s | |
test_items | 43.8310μs | 4.2455μs | 235.5427 KOps/s | 238.9171 KOps/s | |
test_items_nested | 0.5643ms | 0.4114ms | 2.4310 KOps/s | 2.5291 KOps/s | |
test_items_nested_locked | 0.5088ms | 0.4058ms | 2.4645 KOps/s | 2.5114 KOps/s | |
test_items_nested_leaf | 0.1300ms | 71.1990μs | 14.0451 KOps/s | 14.0357 KOps/s | |
test_items_stack_nested | 0.7349ms | 0.4161ms | 2.4030 KOps/s | 2.5016 KOps/s | |
test_items_stack_nested_leaf | 0.1431ms | 73.3390μs | 13.6353 KOps/s | 13.9635 KOps/s | |
test_items_stack_nested_locked | 0.6120ms | 0.4090ms | 2.4450 KOps/s | 2.5135 KOps/s | |
test_keys | 42.9100μs | 3.6446μs | 274.3798 KOps/s | 284.0541 KOps/s | |
test_keys_nested | 0.2459ms | 0.1398ms | 7.1538 KOps/s | 7.3618 KOps/s | |
test_keys_nested_locked | 1.8638ms | 0.1462ms | 6.8412 KOps/s | 6.9495 KOps/s | |
test_keys_nested_leaf | 0.2224ms | 0.1216ms | 8.2206 KOps/s | 8.4559 KOps/s | |
test_keys_stack_nested | 0.2453ms | 0.1411ms | 7.0851 KOps/s | 7.1183 KOps/s | |
test_keys_stack_nested_leaf | 0.1935ms | 0.1201ms | 8.3298 KOps/s | 8.2516 KOps/s | |
test_keys_stack_nested_locked | 0.2670ms | 0.1461ms | 6.8433 KOps/s | 7.0011 KOps/s | |
test_values | 8.2034μs | 1.0481μs | 954.1489 KOps/s | 940.8448 KOps/s | |
test_values_nested | 0.1090ms | 56.5359μs | 17.6879 KOps/s | 17.4122 KOps/s | |
test_values_nested_locked | 0.1155ms | 56.5265μs | 17.6908 KOps/s | 17.4572 KOps/s | |
test_values_nested_leaf | 0.1209ms | 60.6143μs | 16.4978 KOps/s | 16.0899 KOps/s | |
test_values_stack_nested | 98.9740μs | 57.6188μs | 17.3554 KOps/s | 17.7001 KOps/s | |
test_values_stack_nested_leaf | 0.1220ms | 61.4513μs | 16.2730 KOps/s | 16.3074 KOps/s | |
test_values_stack_nested_locked | 0.1055ms | 57.2608μs | 17.4639 KOps/s | 17.3874 KOps/s | |
test_membership | 23.2330μs | 0.9401μs | 1.0638 MOps/s | 1.0955 MOps/s | |
test_membership_nested | 52.0870μs | 3.0542μs | 327.4147 KOps/s | 341.7034 KOps/s | |
test_membership_nested_leaf | 55.9150μs | 3.0291μs | 330.1295 KOps/s | 339.2267 KOps/s | |
test_membership_stacked_nested | 20.8990μs | 2.9909μs | 334.3514 KOps/s | 342.8317 KOps/s | |
test_membership_stacked_nested_leaf | 27.8010μs | 3.0223μs | 330.8783 KOps/s | 340.5140 KOps/s | |
test_membership_nested_last | 54.1710μs | 4.2645μs | 234.4956 KOps/s | 225.6845 KOps/s | |
test_membership_nested_leaf_last | 28.1430μs | 4.2998μs | 232.5685 KOps/s | 227.6255 KOps/s | |
test_membership_stacked_nested_last | 53.2390μs | 4.2779μs | 233.7589 KOps/s | 243.6463 KOps/s | |
test_membership_stacked_nested_leaf_last | 31.6990μs | 4.3398μs | 230.4268 KOps/s | 242.9223 KOps/s | |
test_nested_getleaf | 58.1980μs | 10.7368μs | 93.1377 KOps/s | 96.0261 KOps/s | |
test_nested_get | 54.0710μs | 10.2056μs | 97.9851 KOps/s | 99.9877 KOps/s | |
test_stacked_getleaf | 41.3270μs | 10.6487μs | 93.9079 KOps/s | 94.9396 KOps/s | |
test_stacked_get | 51.7160μs | 10.1033μs | 98.9773 KOps/s | 99.2145 KOps/s | |
test_nested_getitemleaf | 57.3670μs | 11.3336μs | 88.2330 KOps/s | 90.1961 KOps/s | |
test_nested_getitem | 32.9220μs | 10.3536μs | 96.5849 KOps/s | 98.1268 KOps/s | |
test_stacked_getitemleaf | 61.5270μs | 11.1960μs | 89.3176 KOps/s | 90.3725 KOps/s | |
test_stacked_getitem | 38.5600μs | 10.2934μs | 97.1494 KOps/s | 96.2582 KOps/s | |
test_lock_nested | 5.2927ms | 0.4507ms | 2.2189 KOps/s | 2.2763 KOps/s | |
test_lock_stack_nested | 0.5129ms | 0.4178ms | 2.3934 KOps/s | 2.4421 KOps/s | |
test_unlock_nested | 0.8256ms | 0.3645ms | 2.7437 KOps/s | 2.7309 KOps/s | |
test_unlock_stack_nested | 0.4299ms | 0.3381ms | 2.9577 KOps/s | 3.0568 KOps/s | |
test_flatten_speed | 0.1743ms | 94.6575μs | 10.5644 KOps/s | 10.5421 KOps/s | |
test_unflatten_speed | 0.6283ms | 0.4880ms | 2.0493 KOps/s | 2.0452 KOps/s | |
test_common_ops | 4.9784ms | 0.7924ms | 1.2620 KOps/s | 1.3576 KOps/s | |
test_creation | 76.0420μs | 2.1023μs | 475.6778 KOps/s | 478.4002 KOps/s | |
test_creation_empty | 56.9160μs | 11.4013μs | 87.7094 KOps/s | 104.3418 KOps/s | |
test_creation_nested_1 | 63.2600μs | 14.2857μs | 70.0002 KOps/s | 81.4639 KOps/s | |
test_creation_nested_2 | 40.3060μs | 18.7961μs | 53.2024 KOps/s | 60.5255 KOps/s | |
test_clone | 99.1450μs | 14.6526μs | 68.2475 KOps/s | 76.0939 KOps/s | |
test_getitem[int] | 1.1706ms | 12.8600μs | 77.7604 KOps/s | 78.2689 KOps/s | |
test_getitem[slice_int] | 0.1494ms | 24.9228μs | 40.1239 KOps/s | 41.3616 KOps/s | |
test_getitem[range] | 0.1945ms | 49.1203μs | 20.3582 KOps/s | 21.6921 KOps/s | |
test_getitem[tuple] | 0.1302ms | 20.7140μs | 48.2765 KOps/s | 49.0500 KOps/s | |
test_getitem[list] | 0.2052ms | 44.1296μs | 22.6605 KOps/s | 23.7618 KOps/s | |
test_setitem_dim[int] | 46.0660μs | 26.2335μs | 38.1191 KOps/s | 39.7659 KOps/s | |
test_setitem_dim[slice_int] | 88.4950μs | 53.8981μs | 18.5535 KOps/s | 19.1697 KOps/s | |
test_setitem_dim[range] | 0.1363ms | 73.5884μs | 13.5891 KOps/s | 13.8917 KOps/s | |
test_setitem_dim[tuple] | 78.5870μs | 42.4457μs | 23.5595 KOps/s | 24.4330 KOps/s | |
test_setitem | 82.5040μs | 21.0630μs | 47.4766 KOps/s | 51.9602 KOps/s | |
test_set | 84.0970μs | 20.4598μs | 48.8764 KOps/s | 53.1357 KOps/s | |
test_set_shared | 3.8560ms | 0.1654ms | 6.0442 KOps/s | 5.9510 KOps/s | |
test_update | 0.1495ms | 23.2550μs | 43.0014 KOps/s | 48.7126 KOps/s | |
test_update_nested | 0.1198ms | 34.3690μs | 29.0959 KOps/s | 33.0971 KOps/s | |
test_update__nested | 0.9216ms | 33.0287μs | 30.2767 KOps/s | 31.5369 KOps/s | |
test_set_nested | 87.0320μs | 22.7160μs | 44.0218 KOps/s | 48.2648 KOps/s | |
test_set_nested_new | 0.1009ms | 27.2835μs | 36.6522 KOps/s | 39.5251 KOps/s | |
test_select | 0.1389ms | 43.9150μs | 22.7713 KOps/s | 23.7762 KOps/s | |
test_select_nested | 0.1212ms | 59.9687μs | 16.6754 KOps/s | 16.5324 KOps/s | |
test_exclude_nested | 0.1729ms | 80.4490μs | 12.4302 KOps/s | 12.7444 KOps/s | |
test_empty[True] | 0.5763ms | 0.3844ms | 2.6014 KOps/s | 2.6234 KOps/s | |
test_empty[False] | 10.5323μs | 1.3041μs | 766.8288 KOps/s | 811.0548 KOps/s | |
test_unbind_speed | 0.3668ms | 0.2696ms | 3.7095 KOps/s | 3.8472 KOps/s | |
test_unbind_speed_stack0 | 0.5902ms | 0.2622ms | 3.8135 KOps/s | 3.9238 KOps/s | |
test_unbind_speed_stack1 | 98.5437ms | 0.7640ms | 1.3089 KOps/s | 1.4567 KOps/s | |
test_split | 1.7432ms | 1.5646ms | 639.1363 Ops/s | 585.7770 Ops/s | |
test_chunk | 98.6856ms | 1.8799ms | 531.9564 Ops/s | 583.3770 Ops/s | |
test_consolidate_njt[False-None] | 8.2262ms | 8.0359ms | 124.4423 Ops/s | 124.1124 Ops/s | |
test_creation[device0] | 0.2755ms | 88.8653μs | 11.2530 KOps/s | 10.9394 KOps/s | |
test_creation_from_tensor | 4.1975ms | 93.2938μs | 10.7188 KOps/s | 10.4469 KOps/s | |
test_add_one[memmap_tensor0] | 0.1419ms | 4.9938μs | 200.2473 KOps/s | 215.0742 KOps/s | |
test_contiguous[memmap_tensor0] | 12.1330μs | 0.5200μs | 1.9232 MOps/s | 1.9593 MOps/s | |
test_stack[memmap_tensor0] | 35.0750μs | 3.3489μs | 298.6063 KOps/s | 301.0359 KOps/s | |
test_memmaptd_index | 1.0074ms | 0.2372ms | 4.2158 KOps/s | 4.2371 KOps/s | |
test_memmaptd_index_astensor | 0.6150ms | 0.3155ms | 3.1700 KOps/s | 3.1999 KOps/s | |
test_memmaptd_index_op | 0.9850ms | 0.5817ms | 1.7190 KOps/s | 1.8224 KOps/s | |
test_serialize_model | 0.1221s | 0.1129s | 8.8535 Ops/s | 7.8052 Ops/s | |
test_serialize_model_pickle | 0.4671s | 0.3926s | 2.5472 Ops/s | 2.5262 Ops/s | |
test_serialize_weights | 0.2246s | 0.1271s | 7.8701 Ops/s | 8.8461 Ops/s | |
test_serialize_weights_returnearly | 0.1655s | 0.1566s | 6.3846 Ops/s | 6.4112 Ops/s | |
test_serialize_weights_pickle | 0.5756s | 0.4319s | 2.3155 Ops/s | 2.5385 Ops/s | |
test_serialize_weights_filesystem | 0.1533s | 0.1452s | 6.8890 Ops/s | 6.4996 Ops/s | |
test_serialize_model_filesystem | 0.2348s | 0.1550s | 6.4522 Ops/s | 6.8173 Ops/s | |
test_reshape_pytree | 59.7710μs | 26.9006μs | 37.1739 KOps/s | 36.9823 KOps/s | |
test_reshape_td | 93.3850μs | 32.6107μs | 30.6648 KOps/s | 30.4394 KOps/s | |
test_view_pytree | 64.0300μs | 27.0364μs | 36.9872 KOps/s | 36.6019 KOps/s | |
test_view_td | 93.3540μs | 37.4172μs | 26.7257 KOps/s | 26.2795 KOps/s | |
test_unbind_pytree | 82.4040μs | 30.5796μs | 32.7015 KOps/s | 32.9509 KOps/s | |
test_unbind_td | 0.3236ms | 38.5965μs | 25.9091 KOps/s | 26.3184 KOps/s | |
test_split_pytree | 66.3950μs | 30.1337μs | 33.1855 KOps/s | 33.1825 KOps/s | |
test_split_td | 0.4849ms | 44.8743μs | 22.2845 KOps/s | 22.7126 KOps/s | |
test_add_pytree | 98.0930μs | 36.0233μs | 27.7598 KOps/s | 27.6428 KOps/s | |
test_add_td | 0.1488ms | 57.6692μs | 17.3403 KOps/s | 18.6575 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1157ms | 61.6092μs | 16.2314 KOps/s | 16.2816 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3562ms | 0.1606ms | 6.2283 KOps/s | 6.1214 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1068ms | 45.5007μs | 21.9777 KOps/s | 21.8353 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2212ms | 0.1208ms | 8.2812 KOps/s | 8.4235 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 66.9550μs | 26.4222μs | 37.8470 KOps/s | 38.5679 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1188ms | 53.7802μs | 18.5942 KOps/s | 18.6968 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1791ms | 79.0502μs | 12.6502 KOps/s | 12.4455 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1377ms | 68.3117μs | 14.6388 KOps/s | 14.4662 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1821ms | 0.1029ms | 9.7178 KOps/s | 9.6014 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4453ms | 0.2012ms | 4.9689 KOps/s | 4.9016 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 96.2400μs | 44.3299μs | 22.5581 KOps/s | 22.4856 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4589ms | 62.0274μs | 16.1219 KOps/s | 16.2810 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1752ms | 0.1008ms | 9.9218 KOps/s | 9.7484 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.2746ms | 0.2005ms | 4.9864 KOps/s | 4.8431 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.5226ms | 0.2181ms | 4.5850 KOps/s | 4.6035 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1746ms | 0.1054ms | 9.4898 KOps/s | 9.5365 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1828ms | 53.3205μs | 18.7545 KOps/s | 17.8027 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 91.0000μs | 45.6027μs | 21.9285 KOps/s | 21.4025 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6381ms | 0.1584ms | 6.3147 KOps/s | 6.2600 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2179ms | 0.1026ms | 9.7513 KOps/s | 9.7692 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 54.1610μs | 21.3862μs | 46.7590 KOps/s | 47.6258 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1488ms | 59.3178μs | 16.8583 KOps/s | 16.8565 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1492ms | 82.6226μs | 12.1032 KOps/s | 12.1148 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1431ms | 69.9820μs | 14.2894 KOps/s | 14.1974 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3857ms | 0.2098ms | 4.7668 KOps/s | 4.6561 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5247ms | 1.2826ms | 779.6806 Ops/s | 760.1641 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2688ms | 0.2002ms | 4.9962 KOps/s | 4.9213 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.0066ms | 0.7731ms | 1.2935 KOps/s | 1.2730 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.9009ms | 0.4533ms | 2.2058 KOps/s | 2.1663 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.1241ms | 2.6651ms | 375.2185 Ops/s | 392.7567 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 81.5420μs | 36.5119μs | 27.3883 KOps/s | 28.1768 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.6567ms | 31.9263μs | 31.3221 KOps/s | 30.5929 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 70.3620μs | 28.9957μs | 34.4879 KOps/s | 34.0307 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 66.5040μs | 23.0052μs | 43.4685 KOps/s | 42.8864 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 73.0770μs | 29.8523μs | 33.4983 KOps/s | 32.9846 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 63.1780μs | 23.0831μs | 43.3218 KOps/s | 42.5165 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1011ms | 52.1142μs | 19.1886 KOps/s | 19.6007 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4994ms | 19.5228μs | 51.2220 KOps/s | 48.8790 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 95.8290μs | 44.4752μs | 22.4844 KOps/s | 22.8682 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 80.3230μs | 19.0712μs | 52.4351 KOps/s | 51.8512 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1001ms | 45.0881μs | 22.1788 KOps/s | 22.6462 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 57.9680μs | 18.9151μs | 52.8678 KOps/s | 51.2540 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1037ms | 53.2021μs | 18.7963 KOps/s | 19.4271 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8865ms | 19.5807μs | 51.0706 KOps/s | 49.0439 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 90.2480μs | 45.0480μs | 22.1985 KOps/s | 22.7802 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 57.4370μs | 19.0695μs | 52.4397 KOps/s | 52.0514 KOps/s | |
test_compile_indexing[int-pytree-compile] | 92.6430μs | 45.0594μs | 22.1929 KOps/s | 22.7331 KOps/s | |
test_compile_indexing[int-pytree-eager] | 75.4610μs | 18.8631μs | 53.0135 KOps/s | 51.9061 KOps/s | |
test_mod_add[eager] | 71.6740μs | 33.8369μs | 29.5535 KOps/s | 30.3264 KOps/s | |
test_mod_add[compile] | 94.5270μs | 48.4369μs | 20.6454 KOps/s | 20.6314 KOps/s | |
test_mod_add[compile-overhead] | 0.1215ms | 47.5972μs | 21.0096 KOps/s | 20.4555 KOps/s | |
test_mod_wrap[eager] | 0.4320ms | 0.2276ms | 4.3938 KOps/s | 4.3910 KOps/s | |
test_mod_wrap[compile] | 0.3434ms | 0.2032ms | 4.9210 KOps/s | 4.8255 KOps/s | |
test_mod_wrap[compile-overhead] | 0.2620ms | 0.2020ms | 4.9513 KOps/s | 4.8614 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.3074ms | 10.6820ms | 93.6158 Ops/s | 92.6525 Ops/s | |
test_mod_wrap_and_backward[compile] | 22.1962ms | 13.7475ms | 72.7405 Ops/s | 93.5862 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 11.7539ms | 10.5209ms | 95.0492 Ops/s | 93.8181 Ops/s | |
test_seq_add[eager] | 0.2366ms | 0.1112ms | 8.9918 KOps/s | 9.1603 KOps/s | |
test_seq_add[compile] | 0.1490ms | 62.1046μs | 16.1019 KOps/s | 16.4069 KOps/s | |
test_seq_add[compile-overhead] | 0.1194ms | 59.3887μs | 16.8382 KOps/s | 17.0330 KOps/s | |
test_seq_wrap[eager] | 0.6022ms | 0.4456ms | 2.2440 KOps/s | 2.3090 KOps/s | |
test_seq_wrap[compile] | 0.3424ms | 0.2251ms | 4.4433 KOps/s | 4.4144 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3497ms | 0.2229ms | 4.4873 KOps/s | 4.4187 KOps/s | |
test_func_call_runtime[False-eager] | 0.8668ms | 0.5594ms | 1.7878 KOps/s | 1.8313 KOps/s | |
test_func_call_runtime[False-compile] | 0.6245ms | 0.4214ms | 2.3731 KOps/s | 2.3517 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.9345ms | 0.4224ms | 2.3673 KOps/s | 2.3452 KOps/s | |
test_func_call_runtime[True-eager] | 1.2961ms | 0.7728ms | 1.2941 KOps/s | 1.3182 KOps/s | |
test_func_call_runtime[True-compile] | 0.5589ms | 0.4593ms | 2.1771 KOps/s | 2.1365 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8684ms | 0.4648ms | 2.1514 KOps/s | 2.0942 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7791ms | 0.5527ms | 1.8093 KOps/s | 1.8690 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8770ms | 0.4258ms | 2.3483 KOps/s | 2.3326 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5227ms | 0.4224ms | 2.3675 KOps/s | 2.3463 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1090ms | 0.8939ms | 1.1187 KOps/s | 1.1039 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.7915ms | 0.4870ms | 2.0534 KOps/s | 2.0120 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6263ms | 0.4846ms | 2.0637 KOps/s | 2.0234 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 3.0957ms | 1.8808ms | 531.6922 Ops/s | 523.8237 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.6380ms | 0.5188ms | 1.9275 KOps/s | 1.9251 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.6824ms | 0.5179ms | 1.9308 KOps/s | 1.9232 KOps/s | |
test_distributed | 0.3268ms | 0.1249ms | 8.0038 KOps/s | 7.8966 KOps/s | |
test_tdmodule | 42.9410μs | 26.1851μs | 38.1896 KOps/s | 41.2972 KOps/s | |
test_tdmodule_dispatch | 81.7730μs | 47.9163μs | 20.8697 KOps/s | 21.8835 KOps/s | |
test_tdseq | 42.9910μs | 25.9981μs | 38.4643 KOps/s | 40.7548 KOps/s | |
test_tdseq_dispatch | 84.2670μs | 50.6926μs | 19.7267 KOps/s | 21.1760 KOps/s | |
test_instantiation_functorch | 2.5486ms | 1.5672ms | 638.0634 Ops/s | 658.9787 Ops/s | |
test_exec_functorch | 0.3039ms | 0.1798ms | 5.5622 KOps/s | 5.5393 KOps/s | |
test_exec_functional_call | 0.3570ms | 0.1743ms | 5.7376 KOps/s | 5.7571 KOps/s | |
test_exec_td_decorator | 0.4850ms | 0.2309ms | 4.3299 KOps/s | 4.3017 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8664ms | 0.6449ms | 1.5506 KOps/s | 1.5629 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8942ms | 0.6435ms | 1.5539 KOps/s | 1.5519 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8156ms | 0.5215ms | 1.9177 KOps/s | 1.9062 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7119ms | 0.5212ms | 1.9186 KOps/s | 1.9068 KOps/s | |
test_to_module_speed[True] | 2.0810ms | 1.3146ms | 760.6735 Ops/s | 770.3718 Ops/s | |
test_to_module_speed[False] | 2.0668ms | 1.2728ms | 785.6501 Ops/s | 799.8780 Ops/s | |
test_tc_init | 85.1800μs | 44.9949μs | 22.2247 KOps/s | 23.1743 KOps/s | |
test_tc_init_nested | 0.1759ms | 92.8327μs | 10.7721 KOps/s | 11.3990 KOps/s | |
test_tc_first_layer_tensor | 27.8520μs | 1.5213μs | 657.3257 KOps/s | 638.9879 KOps/s | |
test_tc_first_layer_nontensor | 20.9590μs | 4.7066μs | 212.4659 KOps/s | 210.7458 KOps/s | |
test_tc_second_layer_tensor | 32.9010μs | 2.8195μs | 354.6734 KOps/s | 351.7971 KOps/s | |
test_tc_second_layer_nontensor | 28.7940μs | 6.0394μs | 165.5796 KOps/s | 162.3833 KOps/s | |
test_unbind | 0.2194s | 12.6188ms | 79.2470 Ops/s | 79.4686 Ops/s | |
test_full_like | 17.6927ms | 11.6699ms | 85.6904 Ops/s | 141.7829 Ops/s | |
test_zeros_like | 13.8497ms | 6.8155ms | 146.7242 Ops/s | 370.5477 Ops/s | |
test_ones_like | 12.3430ms | 7.3509ms | 136.0375 Ops/s | 320.7381 Ops/s | |
test_clone | 14.2047ms | 8.9644ms | 111.5523 Ops/s | 204.6325 Ops/s | |
test_squeeze | 63.4280μs | 11.7991μs | 84.7521 KOps/s | 85.4930 KOps/s | |
test_unsqueeze | 0.1873ms | 88.9157μs | 11.2466 KOps/s | 11.3049 KOps/s | |
test_split | 0.5831ms | 0.1961ms | 5.1005 KOps/s | 5.2136 KOps/s | |
test_permute | 0.3369ms | 0.1982ms | 5.0450 KOps/s | 4.8899 KOps/s | |
test_stack | 28.1564ms | 24.0355ms | 41.6051 Ops/s | 41.3602 Ops/s | |
test_cat | 28.9340ms | 23.9189ms | 41.8080 Ops/s | 40.5601 Ops/s |
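
The Change column is empty in the rows above, so the difference between the two throughput columns has to be worked out by hand. A minimal sketch of that calculation follows, assuming `Ops` is the measurement for this PR's branch and `Ops on Repo HEAD` is the baseline; the helper names `parse_ops` and `relative_change` are hypothetical and not part of the benchmark tooling.

```python
from typing import Tuple

# Multipliers for the throughput units that appear in the tables.
_UNITS = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '55.9378 KOps/s' to operations per second."""
    value, unit = cell.strip().split()
    return float(value) * _UNITS[unit]


def relative_change(row: str) -> Tuple[str, float]:
    """Return (test name, percent change of Ops relative to Ops on Repo HEAD)."""
    cells = [c.strip() for c in row.split("|")]
    name = cells[0]
    ops_pr = parse_ops(cells[3])    # "Ops" column (assumed: this PR)
    ops_head = parse_ops(cells[4])  # "Ops on Repo HEAD" column (baseline)
    return name, 100.0 * (ops_pr - ops_head) / ops_head


# One row copied verbatim from the table above.
row = "test_zeros_like | 13.8497ms | 6.8155ms | 146.7242 Ops/s | 370.5477 Ops/s | |"
name, change = relative_change(row)
print(f"{name}: {change:+.1f}%")
```

A negative value means the PR run was slower than the baseline. These are single CI runs, and the large gaps between the Max and Mean columns suggest noticeable noise, so small differences are probably not meaningful on their own.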

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 36.2920μs | 10.7023μs | 93.4381 KOps/s | 98.7263 KOps/s | |
test_plain_set_stack_nested | 34.3520μs | 10.8950μs | 91.7852 KOps/s | 98.1602 KOps/s | |
test_plain_set_nested_inplace | 45.0530μs | 11.6566μs | 85.7882 KOps/s | 90.6958 KOps/s | |
test_plain_set_stack_nested_inplace | 46.7930μs | 11.6353μs | 85.9450 KOps/s | 90.6613 KOps/s | |
test_items | 25.1810μs | 2.9552μs | 338.3866 KOps/s | 338.9254 KOps/s | |
test_items_nested | 0.4914ms | 0.3660ms | 2.7323 KOps/s | 2.8140 KOps/s | |
test_items_nested_locked | 0.5381ms | 0.3678ms | 2.7189 KOps/s | 2.8158 KOps/s | |
test_items_nested_leaf | 95.6050μs | 58.4537μs | 17.1076 KOps/s | 17.3205 KOps/s | |
test_items_stack_nested | 0.6389ms | 0.3692ms | 2.7083 KOps/s | 2.8303 KOps/s | |
test_items_stack_nested_leaf | 89.3450μs | 59.3665μs | 16.8445 KOps/s | 16.9808 KOps/s | |
test_items_stack_nested_locked | 0.4266ms | 0.3690ms | 2.7098 KOps/s | 2.8258 KOps/s | |
test_keys | 27.0120μs | 3.6886μs | 271.1047 KOps/s | 289.3891 KOps/s | |
test_keys_nested | 0.1178ms | 69.5321μs | 14.3818 KOps/s | 14.1145 KOps/s | |
test_keys_nested_locked | 0.7206ms | 75.4690μs | 13.2505 KOps/s | 12.9618 KOps/s | |
test_keys_nested_leaf | 99.2250μs | 61.4306μs | 16.2785 KOps/s | 16.3756 KOps/s | |
test_keys_stack_nested | 0.1104ms | 70.2021μs | 14.2446 KOps/s | 13.9714 KOps/s | |
test_keys_stack_nested_leaf | 95.0750μs | 60.9395μs | 16.4097 KOps/s | 16.0412 KOps/s | |
test_keys_stack_nested_locked | 0.1253ms | 75.9818μs | 13.1611 KOps/s | 12.9936 KOps/s | |
test_values | 4.8487μs | 0.8666μs | 1.1540 MOps/s | 1.1763 MOps/s | |
test_values_nested | 60.9330μs | 31.2094μs | 32.0416 KOps/s | 32.1268 KOps/s | |
test_values_nested_locked | 65.9940μs | 32.8265μs | 30.4632 KOps/s | 30.4974 KOps/s | |
test_values_nested_leaf | 70.2340μs | 33.7182μs | 29.6575 KOps/s | 29.8888 KOps/s | |
test_values_stack_nested | 59.7340μs | 31.8212μs | 31.4256 KOps/s | 31.5586 KOps/s | |
test_values_stack_nested_leaf | 62.7930μs | 34.1809μs | 29.2561 KOps/s | 29.5763 KOps/s | |
test_values_stack_nested_locked | 60.2530μs | 33.3917μs | 29.9476 KOps/s | 30.1082 KOps/s | |
test_membership | 1.7201μs | 0.5035μs | 1.9863 MOps/s | 1.9748 MOps/s | |
test_membership_nested | 20.4960μs | 2.0398μs | 490.2521 KOps/s | 496.3917 KOps/s | |
test_membership_nested_leaf | 15.6555μs | 2.0456μs | 488.8477 KOps/s | 491.1596 KOps/s | |
test_membership_stacked_nested | 29.5220μs | 2.1485μs | 465.4366 KOps/s | 486.2488 KOps/s | |
test_membership_stacked_nested_leaf | 23.0910μs | 2.1038μs | 475.3246 KOps/s | 481.1275 KOps/s | |
test_membership_nested_last | 33.2210μs | 3.0715μs | 325.5786 KOps/s | 336.3705 KOps/s | |
test_membership_nested_leaf_last | 32.2220μs | 3.0734μs | 325.3700 KOps/s | 336.1607 KOps/s | |
test_membership_stacked_nested_last | 31.8810μs | 5.0285μs | 198.8650 KOps/s | 332.3328 KOps/s | |
test_membership_stacked_nested_leaf_last | 28.0520μs | 4.9570μs | 201.7332 KOps/s | 337.3157 KOps/s | |
test_nested_getleaf | 34.4120μs | 6.1877μs | 161.6102 KOps/s | 163.2163 KOps/s | |
test_nested_get | 62.5730μs | 5.8490μs | 170.9701 KOps/s | 172.2132 KOps/s | |
test_stacked_getleaf | 34.1320μs | 6.1182μs | 163.4458 KOps/s | 163.1108 KOps/s | |
test_stacked_get | 30.3920μs | 5.8108μs | 172.0945 KOps/s | 172.4290 KOps/s | |
test_nested_getitemleaf | 25.6710μs | 6.3392μs | 157.7475 KOps/s | 160.7833 KOps/s | |
test_nested_getitem | 32.4110μs | 6.0647μs | 164.8874 KOps/s | 168.6592 KOps/s | |
test_stacked_getitemleaf | 31.3020μs | 6.3836μs | 156.6514 KOps/s | 160.2105 KOps/s | |
test_stacked_getitem | 33.2020μs | 6.0060μs | 166.4995 KOps/s | 168.6383 KOps/s | |
test_lock_nested | 9.2258ms | 0.3882ms | 2.5762 KOps/s | 2.6721 KOps/s | |
test_lock_stack_nested | 0.4262ms | 0.3392ms | 2.9481 KOps/s | 2.9351 KOps/s | |
test_unlock_nested | 0.8010ms | 0.3158ms | 3.1662 KOps/s | 3.2464 KOps/s | |
test_unlock_stack_nested | 0.3859ms | 0.2784ms | 3.5919 KOps/s | 3.5837 KOps/s | |
test_flatten_speed | 0.1235ms | 76.3193μs | 13.1028 KOps/s | 13.1883 KOps/s | |
test_unflatten_speed | 0.4442ms | 0.3129ms | 3.1963 KOps/s | 3.2679 KOps/s | |
test_common_ops | 1.7815ms | 0.6181ms | 1.6179 KOps/s | 1.7182 KOps/s | |
test_creation | 0.1091ms | 1.5050μs | 664.4609 KOps/s | 671.6914 KOps/s | |
test_creation_empty | 28.6710μs | 7.7654μs | 128.7766 KOps/s | 152.6303 KOps/s | |
test_creation_nested_1 | 45.3230μs | 9.3943μs | 106.4478 KOps/s | 124.0134 KOps/s | |
test_creation_nested_2 | 43.0820μs | 12.1646μs | 82.2058 KOps/s | 93.9368 KOps/s | |
test_clone | 57.4530μs | 11.1113μs | 89.9987 KOps/s | 94.9443 KOps/s | |
test_getitem[int] | 92.4187ms | 16.1004μs | 62.1102 KOps/s | 93.1734 KOps/s | |
test_getitem[slice_int] | 0.1060ms | 22.2588μs | 44.9260 KOps/s | 47.0895 KOps/s | |
test_getitem[range] | 0.1274ms | 39.3926μs | 25.3855 KOps/s | 25.8211 KOps/s | |
test_getitem[tuple] | 0.1103ms | 19.1771μs | 52.1456 KOps/s | 53.4112 KOps/s | |
test_getitem[list] | 0.2053ms | 34.6471μs | 28.8625 KOps/s | 29.2993 KOps/s | |
test_setitem_dim[int] | 40.6120μs | 19.8966μs | 50.2598 KOps/s | 53.0516 KOps/s | |
test_setitem_dim[slice_int] | 65.7340μs | 40.3853μs | 24.7615 KOps/s | 25.6610 KOps/s | |
test_setitem_dim[range] | 82.3040μs | 53.4868μs | 18.6962 KOps/s | 18.3806 KOps/s | |
test_setitem_dim[tuple] | 65.3740μs | 34.2438μs | 29.2024 KOps/s | 30.5291 KOps/s | |
test_setitem | 0.3865ms | 15.7546μs | 63.4736 KOps/s | 69.2779 KOps/s | |
test_set | 0.1006ms | 15.3994μs | 64.9377 KOps/s | 70.8409 KOps/s | |
test_set_shared | 1.5925ms | 0.1480ms | 6.7555 KOps/s | 6.8028 KOps/s | |
test_update | 0.3576ms | 19.0370μs | 52.5294 KOps/s | 61.8159 KOps/s | |
test_update_nested | 0.2609ms | 24.3255μs | 41.1091 KOps/s | 46.5011 KOps/s | |
test_update__nested | 0.1340ms | 25.3509μs | 39.4463 KOps/s | 40.4086 KOps/s | |
test_set_nested | 84.5950μs | 16.8159μs | 59.4674 KOps/s | 65.3212 KOps/s | |
test_set_nested_new | 0.1052ms | 19.3395μs | 51.7077 KOps/s | 56.7209 KOps/s | |
test_select | 71.4240μs | 31.6603μs | 31.5853 KOps/s | 32.9646 KOps/s | |
test_select_nested | 83.4440μs | 42.1741μs | 23.7113 KOps/s | 23.7967 KOps/s | |
test_exclude_nested | 92.3850μs | 63.5062μs | 15.7465 KOps/s | 16.0016 KOps/s | |
test_empty[True] | 0.4232ms | 0.2835ms | 3.5273 KOps/s | 3.5789 KOps/s | |
test_empty[False] | 3.0852μs | 0.7401μs | 1.3511 MOps/s | 1.3346 MOps/s | |
test_to | 87.9250μs | 55.6806μs | 17.9596 KOps/s | 17.6668 KOps/s | |
test_to_nonblocking | 90.5050μs | 46.6052μs | 21.4568 KOps/s | 21.8434 KOps/s | |
test_unbind_speed | 0.2986ms | 0.2383ms | 4.1973 KOps/s | 4.2814 KOps/s | |
test_unbind_speed_stack0 | 0.3352ms | 0.2342ms | 4.2705 KOps/s | 4.2291 KOps/s | |
test_unbind_speed_stack1 | 0.6410ms | 0.5949ms | 1.6809 KOps/s | 1.5187 KOps/s | |
test_split | 94.3287ms | 1.6258ms | 615.0931 Ops/s | 636.1440 Ops/s | |
test_chunk | 93.6998ms | 1.6409ms | 609.4049 Ops/s | 583.6896 Ops/s | |
test_consolidate[False-None] | 95.9819ms | 2.8765ms | 347.6403 Ops/s | 373.0824 Ops/s | |
test_consolidate[default-None] | 1.7609ms | 1.6838ms | 593.8950 Ops/s | 588.4710 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8374ms | 1.7106ms | 584.5984 Ops/s | 574.9906 Ops/s | |
test_consolidate_njt[False-None] | 6.7380ms | 6.6269ms | 150.9005 Ops/s | 153.3127 Ops/s | |
test_to[False-False-None] | 1.8446ms | 1.7328ms | 577.1133 Ops/s | 588.5886 Ops/s | |
test_to[True-False-None] | 1.5840ms | 1.3300ms | 751.8805 Ops/s | 759.8281 Ops/s | |
test_to[within-False-None] | 4.1615ms | 4.0538ms | 246.6827 Ops/s | 177.7377 Ops/s | |
test_to[True-default-None] | 5.4945ms | 5.2927ms | 188.9400 Ops/s | 187.7985 Ops/s | |
test_to_njt[False-False-None] | 7.2148ms | 7.0968ms | 140.9084 Ops/s | 138.9320 Ops/s | |
test_to_njt[True-False-None] | 5.8387ms | 5.6103ms | 178.2441 Ops/s | 175.7405 Ops/s | |
test_to_njt[within-False-None] | 12.5597ms | 12.3311ms | 81.0957 Ops/s | 80.0966 Ops/s | |
test_creation[device0] | 0.6739ms | 79.8837μs | 12.5182 KOps/s | 11.9782 KOps/s | |
test_creation_from_tensor | 0.4456ms | 83.7637μs | 11.9383 KOps/s | 11.6978 KOps/s | |
test_add_one[memmap_tensor0] | 0.4764ms | 7.1306μs | 140.2397 KOps/s | 141.7947 KOps/s | |
test_contiguous[memmap_tensor0] | 2.2917μs | 0.4113μs | 2.4311 MOps/s | 2.4307 MOps/s | |
test_stack[memmap_tensor0] | 26.4420μs | 4.6706μs | 214.1045 KOps/s | 219.7422 KOps/s | |
test_memmaptd_index | 1.8134ms | 0.2525ms | 3.9598 KOps/s | 3.9687 KOps/s | |
test_memmaptd_index_astensor | 0.5941ms | 0.3077ms | 3.2501 KOps/s | 3.2292 KOps/s | |
test_memmaptd_index_op | 1.0370ms | 0.5949ms | 1.6810 KOps/s | 1.7377 KOps/s | |
test_serialize_model | 0.1322s | 0.1311s | 7.6299 Ops/s | 7.6589 Ops/s | |
test_serialize_model_pickle | 1.3688s | 1.2228s | 0.8178 Ops/s | 0.8252 Ops/s | |
test_serialize_weights | 0.1315s | 0.1305s | 7.6647 Ops/s | 7.6265 Ops/s | |
test_serialize_weights_returnearly | 0.3465s | 64.2160ms | 15.5724 Ops/s | 14.5441 Ops/s | |
test_serialize_weights_pickle | 1.3785s | 1.1912s | 0.8395 Ops/s | 0.8444 Ops/s | |
test_reshape_pytree | 54.0630μs | 22.8180μs | 43.8250 KOps/s | 43.8496 KOps/s | |
test_reshape_td | 52.1930μs | 27.1406μs | 36.8451 KOps/s | 37.3995 KOps/s | |
test_view_pytree | 51.8620μs | 22.4273μs | 44.5885 KOps/s | 44.2711 KOps/s | |
test_view_td | 69.5940μs | 28.7181μs | 34.8213 KOps/s | 33.7960 KOps/s | |
test_unbind_pytree | 54.0930μs | 28.1115μs | 35.5727 KOps/s | 34.7230 KOps/s | |
test_unbind_td | 0.5459ms | 36.0857μs | 27.7118 KOps/s | 27.1597 KOps/s | |
test_split_pytree | 58.7840μs | 30.4302μs | 32.8620 KOps/s | 32.2766 KOps/s | |
test_split_td | 0.8229ms | 39.3507μs | 25.4125 KOps/s | 25.5710 KOps/s | |
test_add_pytree | 86.3450μs | 35.5365μs | 28.1401 KOps/s | 27.9218 KOps/s | |
test_add_td | 96.9050μs | 46.9849μs | 21.2835 KOps/s | 21.7068 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1780ms | 0.1253ms | 7.9790 KOps/s | 8.0073 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2315ms | 0.1242ms | 8.0534 KOps/s | 7.8261 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1465ms | 98.4132μs | 10.1612 KOps/s | 10.1102 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.1553ms | 0.1532ms | 6.5274 KOps/s | 6.7183 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 69.5240μs | 23.8119μs | 41.9958 KOps/s | 42.1699 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 53.1820μs | 26.3479μs | 37.9538 KOps/s | 36.2159 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1120ms | 63.6274μs | 15.7165 KOps/s | 14.9670 KOps/s | |
test_compile_copy_nested[pytree-eager] | 90.1450μs | 49.6111μs | 20.1568 KOps/s | 19.7232 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2042ms | 0.1432ms | 6.9828 KOps/s | 7.0267 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2955ms | 0.2076ms | 4.8162 KOps/s | 4.7914 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1625ms | 99.5542μs | 10.0448 KOps/s | 10.1374 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1150ms | 52.2383μs | 19.1430 KOps/s | 19.1036 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1950ms | 0.1373ms | 7.2811 KOps/s | 7.2913 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5751ms | 0.5030ms | 1.9881 KOps/s | 2.1249 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3846ms | 0.2471ms | 4.0466 KOps/s | 4.0272 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1848ms | 0.1446ms | 6.9170 KOps/s | 6.8728 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1719ms | 63.0264μs | 15.8664 KOps/s | 15.8305 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1409ms | 99.2958μs | 10.0709 KOps/s | 10.0753 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5022ms | 0.4256ms | 2.3494 KOps/s | 2.4539 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1890ms | 0.1386ms | 7.2132 KOps/s | 7.3150 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 55.6330μs | 19.1315μs | 52.2699 KOps/s | 44.9782 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 62.9530μs | 26.5590μs | 37.6520 KOps/s | 37.4191 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1103ms | 70.7766μs | 14.1290 KOps/s | 14.0805 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1335ms | 52.6526μs | 18.9924 KOps/s | 18.6736 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.7103ms | 0.4050ms | 2.4689 KOps/s | 2.1536 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.2085ms | 2.7847ms | 359.1098 Ops/s | 379.6635 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6041ms | 0.4344ms | 2.3020 KOps/s | 2.2136 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.9066ms | 2.7828ms | 359.3464 Ops/s | 376.7644 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1898ms | 0.1166ms | 8.5765 KOps/s | 8.5911 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5550ms | 84.5929μs | 11.8213 KOps/s | 11.9508 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1720ms | 0.1100ms | 9.0904 KOps/s | 9.1626 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1384ms | 68.4470μs | 14.6098 KOps/s | 14.1323 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1657ms | 0.1105ms | 9.0496 KOps/s | 9.0866 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1276ms | 70.3373μs | 14.2172 KOps/s | 14.4212 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1490ms | 0.1022ms | 9.7874 KOps/s | 9.8270 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1405ms | 18.2020μs | 54.9391 KOps/s | 53.2948 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1763ms | 99.1887μs | 10.0818 KOps/s | 10.2502 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 73.0540μs | 16.2486μs | 61.5438 KOps/s | 62.1299 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1511ms | 98.9207μs | 10.1091 KOps/s | 10.1611 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 73.4140μs | 16.2499μs | 61.5387 KOps/s | 62.2900 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1740ms | 0.1033ms | 9.6788 KOps/s | 9.7569 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5661ms | 18.5357μs | 53.9499 KOps/s | 55.2466 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1584ms | 0.1015ms | 9.8569 KOps/s | 9.8489 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 62.7040μs | 16.3695μs | 61.0893 KOps/s | 62.2642 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1531ms | 0.1002ms | 9.9829 KOps/s | 10.2153 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.3895ms | 16.2717μs | 61.4564 KOps/s | 61.9440 KOps/s | |
test_mod_add[eager] | 92.0150μs | 39.9579μs | 25.0263 KOps/s | 27.4026 KOps/s | |
test_mod_add[compile] | 0.1353ms | 81.0163μs | 12.3432 KOps/s | 12.3801 KOps/s | |
test_mod_add[compile-overhead] | 0.3270ms | 0.1696ms | 5.8976 KOps/s | 5.6379 KOps/s | |
test_mod_wrap[eager] | 0.3487ms | 0.2642ms | 3.7857 KOps/s | 3.9397 KOps/s | |
test_mod_wrap[compile] | 0.3339ms | 0.2895ms | 3.4538 KOps/s | 3.4629 KOps/s | |
test_mod_wrap[compile-overhead] | 7.1320ms | 3.7838ms | 264.2826 Ops/s | 271.4624 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4972ms | 1.3841ms | 722.4925 Ops/s | 684.0040 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3752ms | 1.2849ms | 778.3009 Ops/s | 779.3076 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3810ms | 0.9303ms | 1.0749 KOps/s | 1.0688 KOps/s | |
test_seq_add[eager] | 0.2541ms | 0.1161ms | 8.6116 KOps/s | 8.8821 KOps/s | |
test_seq_add[compile] | 0.1387ms | 90.3190μs | 11.0719 KOps/s | 11.1568 KOps/s | |
test_seq_add[compile-overhead] | 0.2458ms | 0.1299ms | 7.6953 KOps/s | 7.7585 KOps/s | |
test_seq_wrap[eager] | 0.5046ms | 0.4189ms | 2.3872 KOps/s | 2.3910 KOps/s | |
test_seq_wrap[compile] | 0.3854ms | 0.3055ms | 3.2736 KOps/s | 3.2692 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2851ms | 0.2255ms | 4.4339 KOps/s | 4.4180 KOps/s | |
test_func_call_runtime[False-eager] | 0.8477ms | 0.7611ms | 1.3138 KOps/s | 1.3154 KOps/s | |
test_func_call_runtime[False-compile] | 0.8525ms | 0.7593ms | 1.3170 KOps/s | 1.3331 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4188ms | 0.3638ms | 2.7484 KOps/s | 2.7439 KOps/s | |
test_func_call_runtime[True-eager] | 0.9988ms | 0.9324ms | 1.0725 KOps/s | 1.0817 KOps/s | |
test_func_call_runtime[True-compile] | 0.8840ms | 0.8135ms | 1.2292 KOps/s | 1.2941 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4472ms | 0.3865ms | 2.5873 KOps/s | 2.6019 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8440ms | 0.7572ms | 1.3207 KOps/s | 1.3378 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8430ms | 0.7607ms | 1.3146 KOps/s | 1.3272 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4653ms | 0.3649ms | 2.7402 KOps/s | 2.7170 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1309ms | 1.0205ms | 979.9316 Ops/s | 973.9781 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9007ms | 0.8068ms | 1.2394 KOps/s | 1.2509 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4621ms | 0.4144ms | 2.4134 KOps/s | 2.4118 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5765ms | 2.0980ms | 476.6488 Ops/s | 470.0593 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8823ms | 0.8271ms | 1.2090 KOps/s | 1.2185 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4680ms | 0.4137ms | 2.4173 KOps/s | 2.4117 KOps/s | |
test_distributed | 0.6794ms | 0.1616ms | 6.1893 KOps/s | 8.5854 KOps/s | |
test_tdmodule | 56.4740μs | 19.1954μs | 52.0957 KOps/s | 52.8807 KOps/s | |
test_tdmodule_dispatch | 79.1040μs | 34.5348μs | 28.9563 KOps/s | 30.2094 KOps/s | |
test_tdseq | 39.6120μs | 18.8888μs | 52.9415 KOps/s | 54.5982 KOps/s | |
test_tdseq_dispatch | 64.6140μs | 36.5121μs | 27.3882 KOps/s | 28.5108 KOps/s | |
test_instantiation_functorch | 1.6559ms | 1.5766ms | 634.2695 Ops/s | 632.7563 Ops/s | |
test_exec_functorch | 0.1919ms | 0.1499ms | 6.6726 KOps/s | 6.9050 KOps/s | |
test_exec_functional_call | 0.1835ms | 0.1435ms | 6.9666 KOps/s | 7.0240 KOps/s | |
test_exec_td_decorator | 0.3716ms | 0.1903ms | 5.2552 KOps/s | 5.3262 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7535ms | 0.6877ms | 1.4541 KOps/s | 1.4651 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8793ms | 0.6889ms | 1.4515 KOps/s | 1.4577 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7239ms | 0.5971ms | 1.6747 KOps/s | 1.6679 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7663ms | 0.5997ms | 1.6676 KOps/s | 1.6697 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.5027ms | 19.4554ms | 51.3995 Ops/s | 51.9704 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.6865ms | 19.5422ms | 51.1713 Ops/s | 51.7725 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.5136ms | 19.4013ms | 51.5429 Ops/s | 52.2925 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.4401ms | 19.3600ms | 51.6530 Ops/s | 52.1389 Ops/s | |
test_to_module_speed[True] | 1.0225ms | 0.9380ms | 1.0661 KOps/s | 1.0506 KOps/s | |
test_to_module_speed[False] | 1.3697ms | 0.9264ms | 1.0795 KOps/s | 1.0807 KOps/s | |
test_tc_init | 77.6050μs | 36.4012μs | 27.4716 KOps/s | 28.7906 KOps/s | |
test_tc_init_nested | 0.1676ms | 73.4685μs | 13.6113 KOps/s | 14.7457 KOps/s | |
test_tc_first_layer_tensor | 4.6374μs | 0.7054μs | 1.4176 MOps/s | 1.4540 MOps/s | |
test_tc_first_layer_nontensor | 25.7410μs | 2.3561μs | 424.4268 KOps/s | 429.4530 KOps/s | |
test_tc_second_layer_tensor | 21.9263μs | 1.4049μs | 711.7952 KOps/s | 695.6890 KOps/s | |
test_tc_second_layer_nontensor | 42.3420μs | 3.0551μs | 327.3198 KOps/s | 327.2140 KOps/s | |
test_unbind | 0.2231s | 9.8266ms | 101.7647 Ops/s | 150.9935 Ops/s | |
test_full_like | 10.1091ms | 9.3841ms | 106.5628 Ops/s | 99.7438 Ops/s | |
test_zeros_like | 4.9008ms | 4.3525ms | 229.7504 Ops/s | 230.0126 Ops/s | |
test_ones_like | 4.6441ms | 4.3849ms | 228.0568 Ops/s | 226.9086 Ops/s | |
test_clone | 7.3753ms | 6.7802ms | 147.4874 Ops/s | 145.0582 Ops/s | |
test_squeeze | 56.5140μs | 9.5221μs | 105.0187 KOps/s | 107.2510 KOps/s | |
test_unsqueeze | 0.1275ms | 71.7409μs | 13.9391 KOps/s | 13.2676 KOps/s | |
test_split | 0.3740ms | 0.1651ms | 6.0563 KOps/s | 6.1429 KOps/s | |
test_permute | 0.2316ms | 0.1788ms | 5.5913 KOps/s | 5.6027 KOps/s | |
test_stack | 51.8542ms | 51.5196ms | 19.4101 Ops/s | 19.0091 Ops/s | |
test_cat | 51.8796ms | 51.3111ms | 19.4889 Ops/s | 18.6443 Ops/s |
Stack from ghstack (oldest at bottom):