-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] Do not unlock td if it's not locked in TDParams (for compile compat) #1125
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Dec 3, 2024
… compat) ghstack-source-id: 9b6923f9c219e12af5560c97c1c6c58ed7870a8a Pull Request resolved: #1125
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Dec 3, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 36.9590μs | 17.8971μs | 55.8750 KOps/s | 51.2743 KOps/s | |
test_plain_set_stack_nested | 42.2790μs | 18.3172μs | 54.5934 KOps/s | 50.7668 KOps/s | |
test_plain_set_nested_inplace | 46.5280μs | 19.9735μs | 50.0663 KOps/s | 47.0619 KOps/s | |
test_plain_set_stack_nested_inplace | 73.4180μs | 19.7642μs | 50.5965 KOps/s | 47.4179 KOps/s | |
test_items | 16.7320μs | 4.2435μs | 235.6559 KOps/s | 238.2722 KOps/s | |
test_items_nested | 0.7107ms | 0.4042ms | 2.4740 KOps/s | 2.4613 KOps/s | |
test_items_nested_locked | 0.8414ms | 0.4039ms | 2.4758 KOps/s | 2.4430 KOps/s | |
test_items_nested_leaf | 0.1278ms | 71.2121μs | 14.0425 KOps/s | 13.9226 KOps/s | |
test_items_stack_nested | 0.7276ms | 0.4091ms | 2.4442 KOps/s | 2.4282 KOps/s | |
test_items_stack_nested_leaf | 0.1387ms | 73.5699μs | 13.5925 KOps/s | 13.6515 KOps/s | |
test_items_stack_nested_locked | 0.5122ms | 0.4082ms | 2.4498 KOps/s | 2.4474 KOps/s | |
test_keys | 35.7470μs | 3.7569μs | 266.1742 KOps/s | 285.0655 KOps/s | |
test_keys_nested | 0.2285ms | 0.1359ms | 7.3606 KOps/s | 7.4519 KOps/s | |
test_keys_nested_locked | 1.7829ms | 0.1424ms | 7.0243 KOps/s | 7.0035 KOps/s | |
test_keys_nested_leaf | 0.2180ms | 0.1156ms | 8.6515 KOps/s | 8.4954 KOps/s | |
test_keys_stack_nested | 0.2341ms | 0.1353ms | 7.3895 KOps/s | 7.3209 KOps/s | |
test_keys_stack_nested_leaf | 0.2039ms | 0.1155ms | 8.6578 KOps/s | 8.5326 KOps/s | |
test_keys_stack_nested_locked | 0.2357ms | 0.1404ms | 7.1226 KOps/s | 7.0100 KOps/s | |
test_values | 5.0996μs | 1.0409μs | 960.7440 KOps/s | 934.7764 KOps/s | |
test_values_nested | 97.0520μs | 54.7870μs | 18.2525 KOps/s | 18.3091 KOps/s | |
test_values_nested_locked | 0.1088ms | 54.9583μs | 18.1956 KOps/s | 18.1470 KOps/s | |
test_values_nested_leaf | 0.1080ms | 59.3426μs | 16.8513 KOps/s | 16.9725 KOps/s | |
test_values_stack_nested | 0.1035ms | 56.2108μs | 17.7902 KOps/s | 17.2884 KOps/s | |
test_values_stack_nested_leaf | 0.1072ms | 59.6684μs | 16.7593 KOps/s | 16.5222 KOps/s | |
test_values_stack_nested_locked | 0.1006ms | 56.5957μs | 17.6692 KOps/s | 17.8830 KOps/s | |
test_membership | 17.5430μs | 0.8713μs | 1.1477 MOps/s | 1.0919 MOps/s | |
test_membership_nested | 28.7640μs | 2.9761μs | 336.0128 KOps/s | 336.0028 KOps/s | |
test_membership_nested_leaf | 25.9580μs | 2.9890μs | 334.5614 KOps/s | 327.4718 KOps/s | |
test_membership_stacked_nested | 37.6000μs | 2.9235μs | 342.0579 KOps/s | 339.5888 KOps/s | |
test_membership_stacked_nested_leaf | 26.8710μs | 2.9298μs | 341.3181 KOps/s | 336.5030 KOps/s | |
test_membership_nested_last | 28.9040μs | 4.1754μs | 239.4972 KOps/s | 233.3659 KOps/s | |
test_membership_nested_leaf_last | 32.9620μs | 4.2084μs | 237.6209 KOps/s | 226.0014 KOps/s | |
test_membership_stacked_nested_last | 25.8380μs | 6.7857μs | 147.3695 KOps/s | 235.5626 KOps/s | |
test_membership_stacked_nested_leaf_last | 30.8080μs | 6.9143μs | 144.6285 KOps/s | 233.7929 KOps/s | |
test_nested_getleaf | 45.9960μs | 10.7500μs | 93.0231 KOps/s | 90.4860 KOps/s | |
test_nested_get | 33.3120μs | 10.2840μs | 97.2386 KOps/s | 95.3273 KOps/s | |
test_stacked_getleaf | 38.2820μs | 10.6563μs | 93.8412 KOps/s | 93.8100 KOps/s | |
test_stacked_get | 33.3920μs | 10.1648μs | 98.3791 KOps/s | 93.5739 KOps/s | |
test_nested_getitemleaf | 39.4740μs | 11.1137μs | 89.9792 KOps/s | 86.1600 KOps/s | |
test_nested_getitem | 34.7350μs | 10.4146μs | 96.0191 KOps/s | 92.8552 KOps/s | |
test_stacked_getitemleaf | 39.1930μs | 11.1275μs | 89.8673 KOps/s | 87.4239 KOps/s | |
test_stacked_getitem | 36.2280μs | 10.4203μs | 95.9668 KOps/s | 92.1939 KOps/s | |
test_lock_nested | 4.3624ms | 0.4473ms | 2.2358 KOps/s | 2.2333 KOps/s | |
test_lock_stack_nested | 0.6418ms | 0.4134ms | 2.4191 KOps/s | 2.3968 KOps/s | |
test_unlock_nested | 0.8000ms | 0.3649ms | 2.7404 KOps/s | 2.7310 KOps/s | |
test_unlock_stack_nested | 0.6013ms | 0.3333ms | 3.0003 KOps/s | 2.9823 KOps/s | |
test_flatten_speed | 0.2075ms | 95.8269μs | 10.4355 KOps/s | 10.6504 KOps/s | |
test_unflatten_speed | 0.6268ms | 0.4962ms | 2.0152 KOps/s | 1.9790 KOps/s | |
test_common_ops | 4.9016ms | 0.7925ms | 1.2618 KOps/s | 1.2115 KOps/s | |
test_creation | 21.0900μs | 2.0811μs | 480.5252 KOps/s | 476.0072 KOps/s | |
test_creation_empty | 36.2580μs | 11.6071μs | 86.1543 KOps/s | 75.9211 KOps/s | |
test_creation_nested_1 | 40.0760μs | 14.5256μs | 68.8438 KOps/s | 61.8194 KOps/s | |
test_creation_nested_2 | 58.0090μs | 18.6396μs | 53.6492 KOps/s | 48.9421 KOps/s | |
test_clone | 99.8670μs | 13.8774μs | 72.0595 KOps/s | 75.1946 KOps/s | |
test_getitem[int] | 1.1568ms | 12.5146μs | 79.9065 KOps/s | 81.0308 KOps/s | |
test_getitem[slice_int] | 0.1760ms | 24.4122μs | 40.9631 KOps/s | 41.5725 KOps/s | |
test_getitem[range] | 0.1708ms | 47.8416μs | 20.9023 KOps/s | 21.6778 KOps/s | |
test_getitem[tuple] | 0.1340ms | 20.7147μs | 48.2749 KOps/s | 49.5560 KOps/s | |
test_getitem[list] | 0.2007ms | 43.9880μs | 22.7335 KOps/s | 23.7884 KOps/s | |
test_setitem_dim[int] | 50.8950μs | 26.1142μs | 38.2933 KOps/s | 40.1998 KOps/s | |
test_setitem_dim[slice_int] | 91.4720μs | 51.9541μs | 19.2477 KOps/s | 19.5756 KOps/s | |
test_setitem_dim[range] | 99.4560μs | 73.6257μs | 13.5822 KOps/s | 14.2111 KOps/s | |
test_setitem_dim[tuple] | 88.7260μs | 41.4241μs | 24.1405 KOps/s | 24.2472 KOps/s | |
test_setitem | 76.1530μs | 20.9564μs | 47.7182 KOps/s | 46.7075 KOps/s | |
test_set | 81.2530μs | 20.3888μs | 49.0464 KOps/s | 46.8730 KOps/s | |
test_set_shared | 4.4097ms | 0.1691ms | 5.9142 KOps/s | 5.9632 KOps/s | |
test_update | 0.1286ms | 23.5929μs | 42.3857 KOps/s | 40.0529 KOps/s | |
test_update_nested | 0.1032ms | 33.8551μs | 29.5377 KOps/s | 28.2230 KOps/s | |
test_update__nested | 0.8264ms | 32.7737μs | 30.5123 KOps/s | 31.3144 KOps/s | |
test_set_nested | 78.0660μs | 23.0289μs | 43.4237 KOps/s | 43.3717 KOps/s | |
test_set_nested_new | 95.2890μs | 27.4834μs | 36.3856 KOps/s | 36.0470 KOps/s | |
test_select | 0.2711ms | 44.2581μs | 22.5948 KOps/s | 22.5319 KOps/s | |
test_select_nested | 0.1453ms | 58.8664μs | 16.9876 KOps/s | 16.7523 KOps/s | |
test_exclude_nested | 0.1667ms | 78.3535μs | 12.7627 KOps/s | 12.7145 KOps/s | |
test_empty[True] | 0.5904ms | 0.3873ms | 2.5817 KOps/s | 2.5958 KOps/s | |
test_empty[False] | 12.8490μs | 1.2430μs | 804.5075 KOps/s | 837.7783 KOps/s | |
test_unbind_speed | 0.5986ms | 0.2621ms | 3.8152 KOps/s | 3.8309 KOps/s | |
test_unbind_speed_stack0 | 0.5122ms | 0.2568ms | 3.8944 KOps/s | 3.8580 KOps/s | |
test_unbind_speed_stack1 | 0.1092s | 0.7642ms | 1.3085 KOps/s | 1.4210 KOps/s | |
test_split | 1.6386ms | 1.5481ms | 645.9541 Ops/s | 574.8659 Ops/s | |
test_chunk | 0.1096s | 1.8941ms | 527.9537 Ops/s | 572.3047 Ops/s | |
test_consolidate_njt[False-None] | 8.6357ms | 8.1298ms | 123.0044 Ops/s | 121.8489 Ops/s | |
test_creation[device0] | 0.2371ms | 91.4886μs | 10.9303 KOps/s | 10.9658 KOps/s | |
test_creation_from_tensor | 4.4997ms | 97.3505μs | 10.2722 KOps/s | 10.3882 KOps/s | |
test_add_one[memmap_tensor0] | 0.1405ms | 5.0669μs | 197.3576 KOps/s | 205.5693 KOps/s | |
test_contiguous[memmap_tensor0] | 14.9180μs | 0.5235μs | 1.9104 MOps/s | 1.9362 MOps/s | |
test_stack[memmap_tensor0] | 44.0730μs | 3.4084μs | 293.3967 KOps/s | 284.3373 KOps/s | |
test_memmaptd_index | 1.0386ms | 0.2373ms | 4.2142 KOps/s | 4.1794 KOps/s | |
test_memmaptd_index_astensor | 0.6787ms | 0.3173ms | 3.1520 KOps/s | 3.1569 KOps/s | |
test_memmaptd_index_op | 0.9982ms | 0.5965ms | 1.6765 KOps/s | 1.6302 KOps/s | |
test_serialize_model | 0.1207s | 0.1153s | 8.6748 Ops/s | 7.5010 Ops/s | |
test_serialize_model_pickle | 0.4871s | 0.4013s | 2.4916 Ops/s | 2.4864 Ops/s | |
test_serialize_weights | 0.2256s | 0.1308s | 7.6452 Ops/s | 8.7468 Ops/s | |
test_serialize_weights_returnearly | 0.1909s | 0.1632s | 6.1288 Ops/s | 6.3627 Ops/s | |
test_serialize_weights_pickle | 0.4481s | 0.3933s | 2.5425 Ops/s | 2.2686 Ops/s | |
test_serialize_weights_filesystem | 0.1515s | 0.1420s | 7.0421 Ops/s | 6.2110 Ops/s | |
test_serialize_model_filesystem | 0.1611s | 0.1538s | 6.5011 Ops/s | 6.5208 Ops/s | |
test_reshape_pytree | 72.0850μs | 26.9611μs | 37.0904 KOps/s | 36.5179 KOps/s | |
test_reshape_td | 75.5020μs | 32.4394μs | 30.8267 KOps/s | 30.7617 KOps/s | |
test_view_pytree | 56.1150μs | 26.6688μs | 37.4970 KOps/s | 36.9710 KOps/s | |
test_view_td | 77.1340μs | 38.1032μs | 26.2445 KOps/s | 25.6076 KOps/s | |
test_unbind_pytree | 62.6270μs | 29.6476μs | 33.7295 KOps/s | 33.3026 KOps/s | |
test_unbind_td | 0.3538ms | 38.9610μs | 25.6667 KOps/s | 26.5377 KOps/s | |
test_split_pytree | 91.1310μs | 30.0959μs | 33.2271 KOps/s | 33.0591 KOps/s | |
test_split_td | 0.2128ms | 43.7852μs | 22.8388 KOps/s | 22.6969 KOps/s | |
test_add_pytree | 89.6280μs | 36.7092μs | 27.2412 KOps/s | 27.6737 KOps/s | |
test_add_td | 0.1185ms | 56.1654μs | 17.8046 KOps/s | 17.3861 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1245ms | 62.1849μs | 16.0811 KOps/s | 16.1000 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4195ms | 0.1612ms | 6.2040 KOps/s | 6.1892 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 92.7940μs | 44.3620μs | 22.5418 KOps/s | 22.1088 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.1952ms | 0.1193ms | 8.3792 KOps/s | 8.3808 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 94.7950μs | 26.4795μs | 37.7650 KOps/s | 39.1374 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1137ms | 54.4612μs | 18.3617 KOps/s | 18.2116 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1799ms | 77.9489μs | 12.8289 KOps/s | 12.6373 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1273ms | 67.1146μs | 14.8999 KOps/s | 14.6264 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1863ms | 0.1035ms | 9.6645 KOps/s | 9.4474 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3660ms | 0.1989ms | 5.0287 KOps/s | 5.0505 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1354ms | 44.3201μs | 22.5631 KOps/s | 22.4469 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.5326ms | 61.8628μs | 16.1648 KOps/s | 16.3234 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1987ms | 0.1020ms | 9.8085 KOps/s | 9.6638 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3770ms | 0.2044ms | 4.8928 KOps/s | 4.9616 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3728ms | 0.2097ms | 4.7697 KOps/s | 4.7357 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3418ms | 0.1051ms | 9.5104 KOps/s | 9.3759 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1654ms | 57.5962μs | 17.3623 KOps/s | 18.2886 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1045ms | 44.3251μs | 22.5606 KOps/s | 21.6808 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6388ms | 0.1591ms | 6.2860 KOps/s | 6.2855 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2706ms | 0.1025ms | 9.7564 KOps/s | 9.6401 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2288ms | 21.0799μs | 47.4386 KOps/s | 46.8727 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1562ms | 58.8894μs | 16.9810 KOps/s | 16.7309 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1864ms | 80.5628μs | 12.4127 KOps/s | 12.1999 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1252ms | 68.5558μs | 14.5867 KOps/s | 14.1724 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3095ms | 0.2057ms | 4.8614 KOps/s | 4.7123 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5751ms | 1.2949ms | 772.2555 Ops/s | 758.7459 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3647ms | 0.2041ms | 4.8996 KOps/s | 4.8533 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.9689ms | 0.7974ms | 1.2541 KOps/s | 1.2371 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.8249ms | 0.4527ms | 2.2089 KOps/s | 2.1335 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.9708ms | 2.6901ms | 371.7346 Ops/s | 360.2048 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1080ms | 34.7184μs | 28.8031 KOps/s | 27.9468 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5908ms | 33.1439μs | 30.1715 KOps/s | 31.5407 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 99.6570μs | 29.0521μs | 34.4209 KOps/s | 34.2954 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 63.3780μs | 23.4834μs | 42.5833 KOps/s | 41.9079 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1012ms | 29.7082μs | 33.6607 KOps/s | 33.3214 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1056ms | 23.5160μs | 42.5243 KOps/s | 42.6001 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1146ms | 50.2657μs | 19.8943 KOps/s | 19.4095 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.6194ms | 20.2284μs | 49.4355 KOps/s | 49.1582 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1167ms | 43.0568μs | 23.2251 KOps/s | 22.8380 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 55.9850μs | 18.7329μs | 53.3821 KOps/s | 51.9839 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1360ms | 43.7141μs | 22.8759 KOps/s | 22.5788 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 61.6460μs | 19.0133μs | 52.5947 KOps/s | 51.9315 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1238ms | 51.5194μs | 19.4102 KOps/s | 19.3145 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0254ms | 20.0495μs | 49.8765 KOps/s | 49.7810 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1417ms | 43.8522μs | 22.8039 KOps/s | 22.4324 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 78.4570μs | 18.9449μs | 52.7847 KOps/s | 52.9021 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1406ms | 44.0744μs | 22.6889 KOps/s | 22.4790 KOps/s | |
test_compile_indexing[int-pytree-eager] | 62.5970μs | 18.7880μs | 53.2254 KOps/s | 52.2878 KOps/s | |
test_mod_add[eager] | 97.5930μs | 34.4855μs | 28.9977 KOps/s | 27.2345 KOps/s | |
test_mod_add[compile] | 0.1724ms | 46.8483μs | 21.3455 KOps/s | 21.1219 KOps/s | |
test_mod_add[compile-overhead] | 0.1299ms | 46.4167μs | 21.5440 KOps/s | 21.0480 KOps/s | |
test_mod_wrap[eager] | 0.4359ms | 0.2260ms | 4.4255 KOps/s | 4.3594 KOps/s | |
test_mod_wrap[compile] | 0.3061ms | 0.2040ms | 4.9026 KOps/s | 4.7362 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3794ms | 0.2034ms | 4.9161 KOps/s | 4.6939 KOps/s | |
test_mod_wrap_and_backward[eager] | 16.2453ms | 11.6811ms | 85.6081 Ops/s | 86.8469 Ops/s | |
test_mod_wrap_and_backward[compile] | 27.4175ms | 13.7730ms | 72.6060 Ops/s | 90.0263 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 13.5952ms | 12.3054ms | 81.2652 Ops/s | 89.4400 Ops/s | |
test_seq_add[eager] | 0.2003ms | 0.1119ms | 8.9343 KOps/s | 8.4117 KOps/s | |
test_seq_add[compile] | 0.1285ms | 59.6120μs | 16.7751 KOps/s | 16.3163 KOps/s | |
test_seq_add[compile-overhead] | 0.1410ms | 58.4990μs | 17.0943 KOps/s | 16.5111 KOps/s | |
test_seq_wrap[eager] | 0.5621ms | 0.4337ms | 2.3060 KOps/s | 2.0659 KOps/s | |
test_seq_wrap[compile] | 0.3367ms | 0.2230ms | 4.4837 KOps/s | 4.3232 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3312ms | 0.2228ms | 4.4887 KOps/s | 4.2872 KOps/s | |
test_func_call_runtime[False-eager] | 0.6943ms | 0.5313ms | 1.8821 KOps/s | 1.7546 KOps/s | |
test_func_call_runtime[False-compile] | 0.7864ms | 0.4214ms | 2.3729 KOps/s | 2.3227 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7290ms | 0.4197ms | 2.3828 KOps/s | 2.3027 KOps/s | |
test_func_call_runtime[True-eager] | 1.0235ms | 0.7448ms | 1.3426 KOps/s | 1.2862 KOps/s | |
test_func_call_runtime[True-compile] | 0.5506ms | 0.4578ms | 2.1842 KOps/s | 2.1272 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8543ms | 0.4657ms | 2.1475 KOps/s | 2.1335 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8989ms | 0.5396ms | 1.8534 KOps/s | 1.7712 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.0970ms | 0.4308ms | 2.3214 KOps/s | 2.3331 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6263ms | 0.4200ms | 2.3811 KOps/s | 2.3219 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0917ms | 0.8721ms | 1.1467 KOps/s | 1.0928 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.0529ms | 0.4995ms | 2.0021 KOps/s | 2.0209 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5863ms | 0.4840ms | 2.0663 KOps/s | 2.0283 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 3.0564ms | 1.8608ms | 537.4017 Ops/s | 528.1735 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.4023ms | 0.6966ms | 1.4356 KOps/s | 1.9071 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.9878ms | 0.5177ms | 1.9315 KOps/s | 1.9069 KOps/s | |
test_distributed | 0.2453ms | 0.1254ms | 7.9759 KOps/s | 7.8273 KOps/s | |
test_tdmodule | 94.0360μs | 25.9080μs | 38.5982 KOps/s | 36.8010 KOps/s | |
test_tdmodule_dispatch | 83.9470μs | 47.5835μs | 21.0157 KOps/s | 19.6240 KOps/s | |
test_tdseq | 66.3250μs | 26.3243μs | 37.9878 KOps/s | 36.0445 KOps/s | |
test_tdseq_dispatch | 83.2260μs | 50.5048μs | 19.8001 KOps/s | 18.1386 KOps/s | |
test_instantiation_functorch | 1.7272ms | 1.5321ms | 652.6888 Ops/s | 635.6138 Ops/s | |
test_exec_functorch | 0.4167ms | 0.1824ms | 5.4836 KOps/s | 5.4825 KOps/s | |
test_exec_functional_call | 0.2701ms | 0.1685ms | 5.9350 KOps/s | 5.7463 KOps/s | |
test_exec_td_decorator | 0.4825ms | 0.2279ms | 4.3871 KOps/s | 4.3983 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9331ms | 0.6452ms | 1.5500 KOps/s | 1.4734 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.1571ms | 0.6464ms | 1.5470 KOps/s | 1.5029 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 1.1514ms | 0.5325ms | 1.8779 KOps/s | 1.8801 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7908ms | 0.5212ms | 1.9187 KOps/s | 1.8706 KOps/s | |
test_to_module_speed[True] | 1.4353ms | 1.2611ms | 792.9462 Ops/s | 775.8576 Ops/s | |
test_to_module_speed[False] | 2.0984ms | 1.2556ms | 796.4088 Ops/s | 784.6562 Ops/s | |
test_tc_init | 86.9330μs | 47.3574μs | 21.1160 KOps/s | 19.5691 KOps/s | |
test_tc_init_nested | 0.3994ms | 92.6959μs | 10.7880 KOps/s | 10.0850 KOps/s | |
test_tc_first_layer_tensor | 19.8970μs | 1.5233μs | 656.4779 KOps/s | 658.2538 KOps/s | |
test_tc_first_layer_nontensor | 50.7250μs | 4.7304μs | 211.4003 KOps/s | 217.8521 KOps/s | |
test_tc_second_layer_tensor | 33.3430μs | 2.8292μs | 353.4571 KOps/s | 359.6773 KOps/s | |
test_tc_second_layer_nontensor | 49.3530μs | 6.0789μs | 164.5044 KOps/s | 168.5055 KOps/s | |
test_unbind | 0.2386s | 13.6232ms | 73.4040 Ops/s | 77.6447 Ops/s | |
test_full_like | 17.5165ms | 12.9892ms | 76.9871 Ops/s | 79.2393 Ops/s | |
test_zeros_like | 14.4381ms | 7.6269ms | 131.1150 Ops/s | 135.6396 Ops/s | |
test_ones_like | 13.1655ms | 8.5864ms | 116.4631 Ops/s | 123.3355 Ops/s | |
test_clone | 16.0085ms | 10.3341ms | 96.7670 Ops/s | 98.4681 Ops/s | |
test_squeeze | 70.3720μs | 12.0977μs | 82.6601 KOps/s | 83.6696 KOps/s | |
test_unsqueeze | 0.1873ms | 88.8992μs | 11.2487 KOps/s | 11.1233 KOps/s | |
test_split | 0.3781ms | 0.1942ms | 5.1486 KOps/s | 5.2346 KOps/s | |
test_permute | 0.3609ms | 0.2184ms | 4.5798 KOps/s | 4.5058 KOps/s | |
test_stack | 31.4993ms | 25.4088ms | 39.3565 Ops/s | 39.8716 Ops/s | |
test_cat | 31.1507ms | 25.0807ms | 39.8713 Ops/s | 40.5947 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 43.8010μs | 10.8803μs | 91.9091 KOps/s | 97.3820 KOps/s | |
test_plain_set_stack_nested | 31.0910μs | 10.8148μs | 92.4663 KOps/s | 97.0621 KOps/s | |
test_plain_set_nested_inplace | 36.1910μs | 11.6867μs | 85.5673 KOps/s | 89.8763 KOps/s | |
test_plain_set_stack_nested_inplace | 33.9000μs | 11.6549μs | 85.8010 KOps/s | 90.5466 KOps/s | |
test_items | 44.3410μs | 2.8932μs | 345.6345 KOps/s | 346.6423 KOps/s | |
test_items_nested | 0.3927ms | 0.3544ms | 2.8219 KOps/s | 2.8146 KOps/s | |
test_items_nested_locked | 0.3958ms | 0.3564ms | 2.8055 KOps/s | 2.8117 KOps/s | |
test_items_nested_leaf | 0.1089ms | 61.2737μs | 16.3202 KOps/s | 16.4471 KOps/s | |
test_items_stack_nested | 0.3999ms | 0.3588ms | 2.7873 KOps/s | 2.8058 KOps/s | |
test_items_stack_nested_leaf | 92.4110μs | 62.4529μs | 16.0121 KOps/s | 16.0484 KOps/s | |
test_items_stack_nested_locked | 0.4060ms | 0.3567ms | 2.8031 KOps/s | 2.8111 KOps/s | |
test_keys | 27.3100μs | 3.4736μs | 287.8863 KOps/s | 288.1896 KOps/s | |
test_keys_nested | 0.1292ms | 71.3803μs | 14.0095 KOps/s | 14.0584 KOps/s | |
test_keys_nested_locked | 0.8121ms | 76.5667μs | 13.0605 KOps/s | 13.0135 KOps/s | |
test_keys_nested_leaf | 0.1007ms | 62.1496μs | 16.0902 KOps/s | 16.1299 KOps/s | |
test_keys_stack_nested | 0.1046ms | 70.4246μs | 14.1996 KOps/s | 14.1432 KOps/s | |
test_keys_stack_nested_leaf | 90.7710μs | 62.2615μs | 16.0613 KOps/s | 16.1766 KOps/s | |
test_keys_stack_nested_locked | 0.1085ms | 76.4461μs | 13.0811 KOps/s | 12.9595 KOps/s | |
test_values | 9.4583μs | 0.8590μs | 1.1641 MOps/s | 1.1259 MOps/s | |
test_values_nested | 57.0010μs | 31.7898μs | 31.4566 KOps/s | 31.8364 KOps/s | |
test_values_nested_locked | 63.4110μs | 33.1552μs | 30.1612 KOps/s | 30.2845 KOps/s | |
test_values_nested_leaf | 63.9310μs | 34.0183μs | 29.3959 KOps/s | 29.6014 KOps/s | |
test_values_stack_nested | 62.0110μs | 32.1445μs | 31.1095 KOps/s | 31.2960 KOps/s | |
test_values_stack_nested_leaf | 68.0510μs | 34.6186μs | 28.8862 KOps/s | 29.1686 KOps/s | |
test_values_stack_nested_locked | 68.3410μs | 33.7027μs | 29.6713 KOps/s | 29.7607 KOps/s | |
test_membership | 1.6420μs | 0.5098μs | 1.9617 MOps/s | 1.9500 MOps/s | |
test_membership_nested | 23.6200μs | 2.0010μs | 499.7600 KOps/s | 486.5670 KOps/s | |
test_membership_nested_leaf | 15.9950μs | 1.9948μs | 501.3122 KOps/s | 487.4238 KOps/s | |
test_membership_stacked_nested | 33.6310μs | 2.0858μs | 479.4381 KOps/s | 474.8514 KOps/s | |
test_membership_stacked_nested_leaf | 32.9210μs | 2.0809μs | 480.5645 KOps/s | 479.0693 KOps/s | |
test_membership_nested_last | 30.9000μs | 2.9437μs | 339.7037 KOps/s | 336.0648 KOps/s | |
test_membership_nested_leaf_last | 32.5700μs | 2.9582μs | 338.0387 KOps/s | 331.6938 KOps/s | |
test_membership_stacked_nested_last | 39.5310μs | 3.5843μs | 278.9960 KOps/s | 126.9636 KOps/s | |
test_membership_stacked_nested_leaf_last | 25.9800μs | 3.6109μs | 276.9359 KOps/s | 125.6250 KOps/s | |
test_nested_getleaf | 37.2300μs | 6.1115μs | 163.6252 KOps/s | 162.7256 KOps/s | |
test_nested_get | 26.8500μs | 5.8033μs | 172.3170 KOps/s | 170.4617 KOps/s | |
test_stacked_getleaf | 62.3610μs | 6.1261μs | 163.2350 KOps/s | 163.4221 KOps/s | |
test_stacked_get | 26.6110μs | 5.8301μs | 171.5249 KOps/s | 171.4546 KOps/s | |
test_nested_getitemleaf | 37.4210μs | 6.2434μs | 160.1701 KOps/s | 164.3473 KOps/s | |
test_nested_getitem | 25.7800μs | 5.8875μs | 169.8528 KOps/s | 170.2617 KOps/s | |
test_stacked_getitemleaf | 32.8510μs | 6.1929μs | 161.4755 KOps/s | 160.5391 KOps/s | |
test_stacked_getitem | 27.2110μs | 5.8724μs | 170.2889 KOps/s | 170.4852 KOps/s | |
test_lock_nested | 9.9921ms | 0.3821ms | 2.6171 KOps/s | 2.5864 KOps/s | |
test_lock_stack_nested | 0.3702ms | 0.3452ms | 2.8970 KOps/s | 2.8922 KOps/s | |
test_unlock_nested | 0.6452ms | 0.3051ms | 3.2778 KOps/s | 3.2159 KOps/s | |
test_unlock_stack_nested | 0.3114ms | 0.2736ms | 3.6552 KOps/s | 3.6529 KOps/s | |
test_flatten_speed | 0.1113ms | 78.2994μs | 12.7715 KOps/s | 12.7808 KOps/s | |
test_unflatten_speed | 0.3734ms | 0.3081ms | 3.2461 KOps/s | 3.2433 KOps/s | |
test_common_ops | 1.7694ms | 0.5896ms | 1.6962 KOps/s | 1.7432 KOps/s | |
test_creation | 0.1719ms | 1.4967μs | 668.1266 KOps/s | 670.3203 KOps/s | |
test_creation_empty | 31.5900μs | 7.9836μs | 125.2575 KOps/s | 149.4506 KOps/s | |
test_creation_nested_1 | 31.8410μs | 9.6430μs | 103.7017 KOps/s | 120.8563 KOps/s | |
test_creation_nested_2 | 47.5910μs | 12.1825μs | 82.0851 KOps/s | 92.2831 KOps/s | |
test_clone | 71.7910μs | 9.7973μs | 102.0692 KOps/s | 97.0152 KOps/s | |
test_getitem[int] | 1.6508ms | 10.7609μs | 92.9287 KOps/s | 89.0781 KOps/s | |
test_getitem[slice_int] | 0.1130ms | 20.7598μs | 48.1701 KOps/s | 43.8751 KOps/s | |
test_getitem[range] | 0.1322ms | 35.6211μs | 28.0733 KOps/s | 25.8394 KOps/s | |
test_getitem[tuple] | 0.1087ms | 18.2429μs | 54.8159 KOps/s | 52.8762 KOps/s | |
test_getitem[list] | 0.2631ms | 31.8500μs | 31.3971 KOps/s | 31.4773 KOps/s | |
test_setitem_dim[int] | 37.2600μs | 17.0978μs | 58.4869 KOps/s | 54.1124 KOps/s | |
test_setitem_dim[slice_int] | 58.3110μs | 37.0601μs | 26.9832 KOps/s | 26.2916 KOps/s | |
test_setitem_dim[range] | 93.8710μs | 52.1673μs | 19.1691 KOps/s | 19.1303 KOps/s | |
test_setitem_dim[tuple] | 59.9110μs | 31.9178μs | 31.3305 KOps/s | 31.0454 KOps/s | |
test_setitem | 83.8910μs | 14.2077μs | 70.3843 KOps/s | 71.3248 KOps/s | |
test_set | 91.1910μs | 14.3033μs | 69.9138 KOps/s | 75.0236 KOps/s | |
test_set_shared | 1.6226ms | 0.1441ms | 6.9382 KOps/s | 6.8018 KOps/s | |
test_update | 0.3188ms | 16.9747μs | 58.9113 KOps/s | 63.9741 KOps/s | |
test_update_nested | 92.0720μs | 22.1285μs | 45.1906 KOps/s | 50.8973 KOps/s | |
test_update__nested | 0.5206ms | 22.8282μs | 43.8054 KOps/s | 42.1285 KOps/s | |
test_set_nested | 79.6010μs | 14.6033μs | 68.4775 KOps/s | 70.0645 KOps/s | |
test_set_nested_new | 0.1256ms | 16.9170μs | 59.1123 KOps/s | 60.5676 KOps/s | |
test_select | 0.1041ms | 29.4187μs | 33.9920 KOps/s | 35.2319 KOps/s | |
test_select_nested | 81.1310μs | 44.1285μs | 22.6611 KOps/s | 22.6478 KOps/s | |
test_exclude_nested | 0.1039ms | 63.2825μs | 15.8022 KOps/s | 15.7793 KOps/s | |
test_empty[True] | 0.3310ms | 0.2804ms | 3.5658 KOps/s | 3.5435 KOps/s | |
test_empty[False] | 3.8270μs | 0.7540μs | 1.3263 MOps/s | 1.3367 MOps/s | |
test_to | 92.3810μs | 54.7280μs | 18.2722 KOps/s | 16.8105 KOps/s | |
test_to_nonblocking | 86.8210μs | 45.1703μs | 22.1385 KOps/s | 21.1796 KOps/s | |
test_unbind_speed | 0.2773ms | 0.2286ms | 4.3751 KOps/s | 4.3003 KOps/s | |
test_unbind_speed_stack0 | 0.2829ms | 0.2295ms | 4.3566 KOps/s | 4.3268 KOps/s | |
test_unbind_speed_stack1 | 0.1018s | 0.6394ms | 1.5640 KOps/s | 1.5388 KOps/s | |
test_split | 0.1062s | 1.6019ms | 624.2628 Ops/s | 565.8253 Ops/s | |
test_chunk | 99.4837ms | 1.5974ms | 626.0079 Ops/s | 676.5325 Ops/s | |
test_consolidate[False-None] | 0.1011s | 2.9064ms | 344.0671 Ops/s | 343.0686 Ops/s | |
test_consolidate[default-None] | 1.7657ms | 1.6736ms | 597.5267 Ops/s | 588.1606 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8440ms | 1.7166ms | 582.5544 Ops/s | 577.3076 Ops/s | |
test_consolidate_njt[False-None] | 7.1904ms | 6.7937ms | 147.1961 Ops/s | 148.7703 Ops/s | |
test_to[False-False-None] | 1.7869ms | 1.7001ms | 588.2138 Ops/s | 590.7492 Ops/s | |
test_to[True-False-None] | 0.3208s | 1.7780ms | 562.4210 Ops/s | 758.3630 Ops/s | |
test_to[within-False-None] | 4.1353ms | 4.0111ms | 249.3098 Ops/s | 244.0777 Ops/s | |
test_to[True-default-None] | 5.4435ms | 5.2071ms | 192.0470 Ops/s | 191.3753 Ops/s | |
test_to_njt[False-False-None] | 7.1907ms | 7.0615ms | 141.6127 Ops/s | 141.5102 Ops/s | |
test_to_njt[True-False-None] | 6.0905ms | 5.6548ms | 176.8419 Ops/s | 181.9529 Ops/s | |
test_to_njt[within-False-None] | 12.5378ms | 12.4128ms | 80.5623 Ops/s | 80.4840 Ops/s | |
test_creation[device0] | 0.5383ms | 79.8472μs | 12.5239 KOps/s | 12.6119 KOps/s | |
test_creation_from_tensor | 0.6138ms | 83.4106μs | 11.9889 KOps/s | 12.1050 KOps/s | |
test_add_one[memmap_tensor0] | 0.2251ms | 6.1176μs | 163.4632 KOps/s | 154.1545 KOps/s | |
test_contiguous[memmap_tensor0] | 2.0810μs | 0.4089μs | 2.4453 MOps/s | 2.4238 MOps/s | |
test_stack[memmap_tensor0] | 23.9900μs | 4.4367μs | 225.3939 KOps/s | 214.2760 KOps/s | |
test_memmaptd_index | 2.1919ms | 0.2428ms | 4.1194 KOps/s | 3.9456 KOps/s | |
test_memmaptd_index_astensor | 0.5642ms | 0.3010ms | 3.3226 KOps/s | 3.1457 KOps/s | |
test_memmaptd_index_op | 0.9559ms | 0.5520ms | 1.8114 KOps/s | 1.7853 KOps/s | |
test_serialize_model | 0.1325s | 0.1315s | 7.6048 Ops/s | 7.6127 Ops/s | |
test_serialize_model_pickle | 1.3514s | 1.2121s | 0.8250 Ops/s | 0.8418 Ops/s | |
test_serialize_weights | 0.2971s | 0.1541s | 6.4873 Ops/s | 5.3509 Ops/s | |
test_serialize_weights_returnearly | 0.3693s | 55.5439ms | 18.0038 Ops/s | 23.2590 Ops/s | |
test_serialize_weights_pickle | 1.9406s | 1.4565s | 0.6866 Ops/s | 0.8246 Ops/s | |
test_reshape_pytree | 84.1310μs | 22.4470μs | 44.5493 KOps/s | 42.4750 KOps/s | |
test_reshape_td | 64.1710μs | 26.1558μs | 38.2324 KOps/s | 36.5482 KOps/s | |
test_view_pytree | 54.5110μs | 22.0676μs | 45.3153 KOps/s | 44.3262 KOps/s | |
test_view_td | 71.7510μs | 30.9387μs | 32.3220 KOps/s | 34.0278 KOps/s | |
test_unbind_pytree | 60.4610μs | 27.6648μs | 36.1470 KOps/s | 35.3034 KOps/s | |
test_unbind_td | 0.6449ms | 35.1848μs | 28.4214 KOps/s | 27.7745 KOps/s | |
test_split_pytree | 67.7820μs | 29.8962μs | 33.4491 KOps/s | 32.7976 KOps/s | |
test_split_td | 0.7576ms | 38.8214μs | 25.7590 KOps/s | 25.0897 KOps/s | |
test_add_pytree | 64.7010μs | 32.5385μs | 30.7328 KOps/s | 29.7318 KOps/s | |
test_add_td | 88.6520μs | 43.6917μs | 22.8877 KOps/s | 23.5658 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1722ms | 0.1216ms | 8.2258 KOps/s | 7.8903 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2418ms | 0.1268ms | 7.8884 KOps/s | 7.8579 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1435ms | 98.9938μs | 10.1016 KOps/s | 9.7193 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.0954ms | 0.1501ms | 6.6616 KOps/s | 6.6688 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 71.7020μs | 23.1023μs | 43.2857 KOps/s | 44.8304 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 66.2010μs | 28.0932μs | 35.5958 KOps/s | 35.4414 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.4949ms | 65.8060μs | 15.1962 KOps/s | 14.9448 KOps/s | |
test_compile_copy_nested[pytree-eager] | 85.1720μs | 50.3372μs | 19.8660 KOps/s | 19.8717 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2860ms | 0.1463ms | 6.8339 KOps/s | 6.7786 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2964ms | 0.2085ms | 4.7956 KOps/s | 4.7746 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1433ms | 99.9028μs | 10.0097 KOps/s | 9.8276 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1083ms | 52.0849μs | 19.1994 KOps/s | 18.9203 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1814ms | 0.1395ms | 7.1696 KOps/s | 7.0984 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5398ms | 0.4864ms | 2.0560 KOps/s | 2.0805 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3666ms | 0.2490ms | 4.0165 KOps/s | 4.0010 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1923ms | 0.1469ms | 6.8079 KOps/s | 6.7368 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1499ms | 64.8459μs | 15.4212 KOps/s | 15.8700 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1525ms | 0.1025ms | 9.7530 KOps/s | 9.8028 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4837ms | 0.4131ms | 2.4210 KOps/s | 2.4518 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1892ms | 0.1383ms | 7.2324 KOps/s | 7.1336 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1537ms | 19.2213μs | 52.0256 KOps/s | 54.4503 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 61.2610μs | 29.0510μs | 34.4222 KOps/s | 34.8405 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1103ms | 70.7118μs | 14.1419 KOps/s | 13.9887 KOps/s | |
test_compile_copy_flat[pytree-eager] | 81.3310μs | 52.1425μs | 19.1782 KOps/s | 19.0644 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6588ms | 0.4002ms | 2.4987 KOps/s | 2.1732 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7491ms | 2.5498ms | 392.1922 Ops/s | 380.3744 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6150ms | 0.4389ms | 2.2784 KOps/s | 2.1603 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7100ms | 2.6601ms | 375.9307 Ops/s | 380.7869 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1563ms | 0.1130ms | 8.8471 KOps/s | 8.4463 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5763ms | 77.1586μs | 12.9603 KOps/s | 12.2974 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1460ms | 0.1053ms | 9.4944 KOps/s | 9.1382 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1864ms | 66.2858μs | 15.0862 KOps/s | 14.5264 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1764ms | 0.1063ms | 9.4041 KOps/s | 9.4329 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1587ms | 65.7386μs | 15.2118 KOps/s | 14.8844 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1518ms | 0.1025ms | 9.7538 KOps/s | 9.7191 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1605ms | 17.2156μs | 58.0867 KOps/s | 48.0237 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1351ms | 97.2105μs | 10.2870 KOps/s | 9.7722 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 45.8510μs | 16.0230μs | 62.4102 KOps/s | 62.7001 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1518ms | 98.5548μs | 10.1466 KOps/s | 10.1548 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 44.7710μs | 16.0040μs | 62.4844 KOps/s | 62.0126 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1473ms | 0.1032ms | 9.6867 KOps/s | 9.2708 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6290ms | 17.0181μs | 58.7608 KOps/s | 56.2915 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1717ms | 98.3171μs | 10.1712 KOps/s | 9.8650 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1675ms | 16.0330μs | 62.3715 KOps/s | 62.5700 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1383ms | 97.4575μs | 10.2609 KOps/s | 9.7069 KOps/s | |
test_compile_indexing[int-pytree-eager] | 58.0700μs | 16.1298μs | 61.9972 KOps/s | 62.2814 KOps/s | |
test_mod_add[eager] | 75.1010μs | 38.4083μs | 26.0360 KOps/s | 26.8146 KOps/s | |
test_mod_add[compile] | 0.1541ms | 85.5020μs | 11.6956 KOps/s | 11.7950 KOps/s | |
test_mod_add[compile-overhead] | 0.3485ms | 0.1795ms | 5.5696 KOps/s | 5.2586 KOps/s | |
test_mod_wrap[eager] | 0.3297ms | 0.2477ms | 4.0366 KOps/s | 3.8940 KOps/s | |
test_mod_wrap[compile] | 0.4204ms | 0.3008ms | 3.3241 KOps/s | 3.4353 KOps/s | |
test_mod_wrap[compile-overhead] | 7.1952ms | 3.7621ms | 265.8065 Ops/s | 265.9550 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4688ms | 1.3293ms | 752.2875 Ops/s | 700.2714 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4351ms | 1.2646ms | 790.7690 Ops/s | 723.0840 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3982ms | 0.9203ms | 1.0866 KOps/s | 938.0537 Ops/s | |
test_seq_add[eager] | 0.1953ms | 0.1127ms | 8.8739 KOps/s | 8.9825 KOps/s | |
test_seq_add[compile] | 0.1595ms | 89.5966μs | 11.1611 KOps/s | 11.2696 KOps/s | |
test_seq_add[compile-overhead] | 0.1847ms | 0.1310ms | 7.6343 KOps/s | 7.2947 KOps/s | |
test_seq_wrap[eager] | 0.5013ms | 0.4160ms | 2.4038 KOps/s | 2.3724 KOps/s | |
test_seq_wrap[compile] | 0.3732ms | 0.3036ms | 3.2943 KOps/s | 3.0924 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2788ms | 0.2247ms | 4.4508 KOps/s | 4.1839 KOps/s | |
test_func_call_runtime[False-eager] | 0.8050ms | 0.7256ms | 1.3781 KOps/s | 1.2644 KOps/s | |
test_func_call_runtime[False-compile] | 0.8578ms | 0.7512ms | 1.3312 KOps/s | 1.2580 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4319ms | 0.3638ms | 2.7489 KOps/s | 2.7080 KOps/s | |
test_func_call_runtime[True-eager] | 0.9655ms | 0.8922ms | 1.1208 KOps/s | 1.1101 KOps/s | |
test_func_call_runtime[True-compile] | 0.8617ms | 0.7789ms | 1.2839 KOps/s | 1.2886 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4448ms | 0.3860ms | 2.5904 KOps/s | 2.5606 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7980ms | 0.7234ms | 1.3823 KOps/s | 1.3842 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8496ms | 0.7575ms | 1.3201 KOps/s | 1.2490 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4240ms | 0.3674ms | 2.7216 KOps/s | 2.7072 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1178ms | 0.9946ms | 1.0054 KOps/s | 1.0005 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.8989ms | 0.8098ms | 1.2348 KOps/s | 1.2398 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4628ms | 0.4098ms | 2.4402 KOps/s | 2.3913 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4819ms | 2.0039ms | 499.0225 Ops/s | 488.8809 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9719ms | 0.8124ms | 1.2309 KOps/s | 1.2047 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5065ms | 0.4150ms | 2.4095 KOps/s | 2.3786 KOps/s | |
test_distributed | 2.5903ms | 0.1703ms | 5.8727 KOps/s | 8.5017 KOps/s | |
test_tdmodule | 56.9710μs | 18.9409μs | 52.7957 KOps/s | 52.8972 KOps/s | |
test_tdmodule_dispatch | 51.7610μs | 32.9704μs | 30.3302 KOps/s | 31.0649 KOps/s | |
test_tdseq | 56.7110μs | 18.9658μs | 52.7266 KOps/s | 54.6999 KOps/s | |
test_tdseq_dispatch | 67.2510μs | 35.5444μs | 28.1338 KOps/s | 28.5129 KOps/s | |
test_instantiation_functorch | 1.6321ms | 1.5259ms | 655.3434 Ops/s | 641.8670 Ops/s | |
test_exec_functorch | 0.2162ms | 0.1410ms | 7.0930 KOps/s | 6.9898 KOps/s | |
test_exec_functional_call | 0.1878ms | 0.1327ms | 7.5352 KOps/s | 7.4283 KOps/s | |
test_exec_td_decorator | 0.3648ms | 0.1810ms | 5.5248 KOps/s | 5.4727 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8074ms | 0.6692ms | 1.4944 KOps/s | 1.4839 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8157ms | 0.6677ms | 1.4976 KOps/s | 1.4905 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6981ms | 0.5755ms | 1.7377 KOps/s | 1.7081 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6943ms | 0.5782ms | 1.7294 KOps/s | 1.7196 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 18.8116ms | 18.7417ms | 53.3570 Ops/s | 52.7266 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 18.8399ms | 18.7830ms | 53.2397 Ops/s | 52.7926 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 18.8353ms | 18.6401ms | 53.6478 Ops/s | 53.3654 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.3632ms | 18.7389ms | 53.3649 Ops/s | 53.4428 Ops/s | |
test_to_module_speed[True] | 1.0824ms | 0.9846ms | 1.0156 KOps/s | 1.0227 KOps/s | |
test_to_module_speed[False] | 1.4275ms | 0.9672ms | 1.0339 KOps/s | 1.0426 KOps/s | |
test_tc_init | 72.1020μs | 34.9824μs | 28.5858 KOps/s | 29.2603 KOps/s | |
test_tc_init_nested | 0.1132ms | 71.8818μs | 13.9117 KOps/s | 14.6602 KOps/s | |
test_tc_first_layer_tensor | 5.7059μs | 0.7378μs | 1.3554 MOps/s | 1.3505 MOps/s | |
test_tc_first_layer_nontensor | 36.3000μs | 2.4875μs | 402.0125 KOps/s | 409.4965 KOps/s | |
test_tc_second_layer_tensor | 10.1303μs | 1.5389μs | 649.8173 KOps/s | 661.8417 KOps/s | |
test_tc_second_layer_nontensor | 31.0800μs | 3.2769μs | 305.1645 KOps/s | 309.6473 KOps/s | |
test_unbind | 0.2375s | 10.2095ms | 97.9476 Ops/s | 149.8714 Ops/s | |
test_full_like | 11.2616ms | 10.4798ms | 95.4219 Ops/s | 92.5658 Ops/s | |
test_zeros_like | 5.2558ms | 4.5938ms | 217.6830 Ops/s | 132.9454 Ops/s | |
test_ones_like | 5.9163ms | 4.6583ms | 214.6721 Ops/s | 215.6067 Ops/s | |
test_clone | 12.2694ms | 7.4291ms | 134.6053 Ops/s | 132.5738 Ops/s | |
test_squeeze | 60.5510μs | 9.7215μs | 102.8646 KOps/s | 106.0973 KOps/s | |
test_unsqueeze | 0.1204ms | 75.6752μs | 13.2144 KOps/s | 14.1869 KOps/s | |
test_split | 0.4151ms | 0.1593ms | 6.2783 KOps/s | 6.1937 KOps/s | |
test_permute | 0.2524ms | 0.1858ms | 5.3814 KOps/s | 5.7622 KOps/s | |
test_stack | 54.2975ms | 53.0058ms | 18.8658 Ops/s | 18.1714 Ops/s | |
test_cat | 53.4074ms | 52.6048ms | 19.0097 Ops/s | 18.3033 Ops/s |
vmoens
added a commit
that referenced
this pull request
Dec 3, 2024
… compat) ghstack-source-id: 9b6923f9c219e12af5560c97c1c6c58ed7870a8a Pull Request resolved: #1125
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):