-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Minor] print_directory_tree returns a string #1086
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Nov 13, 2024
ghstack-source-id: d57f19dd8efcef06676fca40a4d6f95367ff1d55 Pull Request resolved: #1086
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Nov 13, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 36.1370μs | 17.3648μs | 57.5879 KOps/s | 54.6726 KOps/s | |
test_plain_set_stack_nested | 40.3550μs | 17.7733μs | 56.2641 KOps/s | 53.4664 KOps/s | |
test_plain_set_nested_inplace | 46.9270μs | 19.6593μs | 50.8666 KOps/s | 48.5301 KOps/s | |
test_plain_set_stack_nested_inplace | 49.4920μs | 19.5414μs | 51.1734 KOps/s | 49.4484 KOps/s | |
test_items | 24.8760μs | 4.1623μs | 240.2512 KOps/s | 242.2614 KOps/s | |
test_items_nested | 0.4244ms | 0.3435ms | 2.9110 KOps/s | 2.9508 KOps/s | |
test_items_nested_locked | 0.3960ms | 0.3428ms | 2.9175 KOps/s | 2.9079 KOps/s | |
test_items_nested_leaf | 0.1290ms | 72.0425μs | 13.8807 KOps/s | 13.5698 KOps/s | |
test_items_stack_nested | 0.7246ms | 0.3447ms | 2.9010 KOps/s | 2.9019 KOps/s | |
test_items_stack_nested_leaf | 0.1424ms | 73.6736μs | 13.5734 KOps/s | 12.8167 KOps/s | |
test_items_stack_nested_locked | 0.5118ms | 0.3423ms | 2.9213 KOps/s | 2.8916 KOps/s | |
test_keys | 20.6290μs | 3.6206μs | 276.1959 KOps/s | 284.9894 KOps/s | |
test_keys_nested | 0.1880ms | 0.1388ms | 7.2037 KOps/s | 7.2377 KOps/s | |
test_keys_nested_locked | 1.8281ms | 0.1442ms | 6.9353 KOps/s | 7.1842 KOps/s | |
test_keys_nested_leaf | 0.2125ms | 0.1173ms | 8.5270 KOps/s | 8.5668 KOps/s | |
test_keys_stack_nested | 0.2601ms | 0.1397ms | 7.1580 KOps/s | 7.4489 KOps/s | |
test_keys_stack_nested_leaf | 0.2179ms | 0.1195ms | 8.3714 KOps/s | 8.7486 KOps/s | |
test_keys_stack_nested_locked | 0.2698ms | 0.1446ms | 6.9171 KOps/s | 7.1519 KOps/s | |
test_values | 9.8564μs | 1.0387μs | 962.6984 KOps/s | 976.7942 KOps/s | |
test_values_nested | 0.1459ms | 55.2964μs | 18.0844 KOps/s | 18.4015 KOps/s | |
test_values_nested_locked | 0.1130ms | 55.2944μs | 18.0850 KOps/s | 18.3347 KOps/s | |
test_values_nested_leaf | 0.1211ms | 60.3851μs | 16.5604 KOps/s | 16.4719 KOps/s | |
test_values_stack_nested | 0.1126ms | 56.0302μs | 17.8475 KOps/s | 17.9144 KOps/s | |
test_values_stack_nested_leaf | 0.1114ms | 60.6018μs | 16.5012 KOps/s | 16.5188 KOps/s | |
test_values_stack_nested_locked | 0.1175ms | 56.0634μs | 17.8369 KOps/s | 17.7535 KOps/s | |
test_membership | 3.9216μs | 0.7499μs | 1.3335 MOps/s | 1.1223 MOps/s | |
test_membership_nested | 22.5420μs | 2.7821μs | 359.4438 KOps/s | 361.8403 KOps/s | |
test_membership_nested_leaf | 27.8320μs | 2.8054μs | 356.4559 KOps/s | 357.2666 KOps/s | |
test_membership_stacked_nested | 26.4290μs | 2.7808μs | 359.6143 KOps/s | 362.9351 KOps/s | |
test_membership_stacked_nested_leaf | 42.2090μs | 2.7627μs | 361.9667 KOps/s | 358.2172 KOps/s | |
test_membership_nested_last | 33.9330μs | 4.1480μs | 241.0798 KOps/s | 241.3450 KOps/s | |
test_membership_nested_leaf_last | 29.3550μs | 4.1086μs | 243.3904 KOps/s | 244.8880 KOps/s | |
test_membership_stacked_nested_last | 26.2590μs | 4.1316μs | 242.0376 KOps/s | 157.0703 KOps/s | |
test_membership_stacked_nested_leaf_last | 30.6670μs | 4.0832μs | 244.9067 KOps/s | 162.9435 KOps/s | |
test_nested_getleaf | 35.0250μs | 10.5554μs | 94.7382 KOps/s | 92.7092 KOps/s | |
test_nested_get | 36.1470μs | 9.9677μs | 100.3244 KOps/s | 98.2160 KOps/s | |
test_stacked_getleaf | 33.3820μs | 10.3994μs | 96.1598 KOps/s | 94.7132 KOps/s | |
test_stacked_get | 43.5010μs | 9.8536μs | 101.4860 KOps/s | 99.0012 KOps/s | |
test_nested_getitemleaf | 34.4440μs | 10.7788μs | 92.7743 KOps/s | 89.2999 KOps/s | |
test_nested_getitem | 33.4020μs | 10.0612μs | 99.3919 KOps/s | 96.8246 KOps/s | |
test_stacked_getitemleaf | 37.3990μs | 10.8036μs | 92.5621 KOps/s | 89.1198 KOps/s | |
test_stacked_getitem | 39.9540μs | 9.9734μs | 100.2666 KOps/s | 96.9968 KOps/s | |
test_lock_nested | 3.4923ms | 0.4515ms | 2.2147 KOps/s | 1.8176 KOps/s | |
test_lock_stack_nested | 0.8272ms | 0.4189ms | 2.3872 KOps/s | 2.4754 KOps/s | |
test_unlock_nested | 0.7908ms | 0.3669ms | 2.7253 KOps/s | 2.7509 KOps/s | |
test_unlock_stack_nested | 0.5977ms | 0.3378ms | 2.9602 KOps/s | 3.0921 KOps/s | |
test_flatten_speed | 0.1483ms | 91.8330μs | 10.8893 KOps/s | 10.9215 KOps/s | |
test_unflatten_speed | 0.8428ms | 0.4963ms | 2.0148 KOps/s | 2.1589 KOps/s | |
test_common_ops | 6.0750ms | 0.7686ms | 1.3011 KOps/s | 1.2621 KOps/s | |
test_creation | 17.6530μs | 2.0675μs | 483.6803 KOps/s | 480.7257 KOps/s | |
test_creation_empty | 37.0890μs | 10.6867μs | 93.5744 KOps/s | 83.2450 KOps/s | |
test_creation_nested_1 | 44.8840μs | 13.6671μs | 73.1684 KOps/s | 67.8633 KOps/s | |
test_creation_nested_2 | 50.0530μs | 17.7635μs | 56.2954 KOps/s | 52.7713 KOps/s | |
test_clone | 89.2660μs | 13.3534μs | 74.8871 KOps/s | 76.2384 KOps/s | |
test_getitem[int] | 1.1747ms | 12.4513μs | 80.3131 KOps/s | 79.4120 KOps/s | |
test_getitem[slice_int] | 0.1388ms | 24.1531μs | 41.4025 KOps/s | 41.3430 KOps/s | |
test_getitem[range] | 0.4367ms | 50.0294μs | 19.9882 KOps/s | 20.1987 KOps/s | |
test_getitem[tuple] | 0.1392ms | 20.0147μs | 49.9634 KOps/s | 50.2106 KOps/s | |
test_getitem[list] | 0.2818ms | 43.8827μs | 22.7880 KOps/s | 22.1804 KOps/s | |
test_setitem_dim[int] | 54.3410μs | 25.1898μs | 39.6986 KOps/s | 39.2815 KOps/s | |
test_setitem_dim[slice_int] | 0.1131ms | 51.4041μs | 19.4537 KOps/s | 19.1782 KOps/s | |
test_setitem_dim[range] | 0.1160ms | 73.4031μs | 13.6234 KOps/s | 13.5343 KOps/s | |
test_setitem_dim[tuple] | 62.9770μs | 40.2198μs | 24.8634 KOps/s | 24.9423 KOps/s | |
test_setitem | 0.1724ms | 20.1514μs | 49.6244 KOps/s | 47.7048 KOps/s | |
test_set | 0.1290ms | 19.5830μs | 51.0647 KOps/s | 49.1616 KOps/s | |
test_set_shared | 1.2328ms | 0.1671ms | 5.9848 KOps/s | 5.9437 KOps/s | |
test_update | 0.1642ms | 22.2304μs | 44.9834 KOps/s | 42.4280 KOps/s | |
test_update_nested | 0.1633ms | 31.6405μs | 31.6051 KOps/s | 29.9751 KOps/s | |
test_update__nested | 0.5405ms | 32.3879μs | 30.8757 KOps/s | 30.7493 KOps/s | |
test_set_nested | 0.1308ms | 21.4917μs | 46.5295 KOps/s | 45.0372 KOps/s | |
test_set_nested_new | 0.1585ms | 26.2315μs | 38.1221 KOps/s | 37.0851 KOps/s | |
test_select | 0.1624ms | 42.0589μs | 23.7762 KOps/s | 23.3569 KOps/s | |
test_select_nested | 0.1181ms | 60.9740μs | 16.4004 KOps/s | 16.8273 KOps/s | |
test_exclude_nested | 0.1607ms | 75.6884μs | 13.2121 KOps/s | 13.2772 KOps/s | |
test_empty[True] | 0.5565ms | 0.3502ms | 2.8556 KOps/s | 2.8908 KOps/s | |
test_empty[False] | 10.0513μs | 1.2277μs | 814.5476 KOps/s | 779.1032 KOps/s | |
test_unbind_speed | 0.4750ms | 0.2593ms | 3.8568 KOps/s | 3.8174 KOps/s | |
test_unbind_speed_stack0 | 0.6021ms | 0.2571ms | 3.8898 KOps/s | 3.9728 KOps/s | |
test_unbind_speed_stack1 | 0.1011s | 0.7862ms | 1.2719 KOps/s | 1.4483 KOps/s | |
test_split | 97.9506ms | 1.7163ms | 582.6583 Ops/s | 566.3169 Ops/s | |
test_chunk | 0.1018s | 1.7246ms | 579.8505 Ops/s | 560.0361 Ops/s | |
test_consolidate_njt[False-None] | 8.5788ms | 8.1337ms | 122.9447 Ops/s | 120.7663 Ops/s | |
test_creation[device0] | 0.2232ms | 91.0126μs | 10.9875 KOps/s | 10.7533 KOps/s | |
test_creation_from_tensor | 3.8319ms | 95.5667μs | 10.4639 KOps/s | 10.4193 KOps/s | |
test_add_one[memmap_tensor0] | 0.2547ms | 4.8385μs | 206.6773 KOps/s | 202.0899 KOps/s | |
test_contiguous[memmap_tensor0] | 7.8640μs | 0.5318μs | 1.8806 MOps/s | 1.9280 MOps/s | |
test_stack[memmap_tensor0] | 48.8510μs | 3.3766μs | 296.1561 KOps/s | 291.4253 KOps/s | |
test_memmaptd_index | 0.9953ms | 0.2384ms | 4.1940 KOps/s | 4.3114 KOps/s | |
test_memmaptd_index_astensor | 0.5458ms | 0.3143ms | 3.1814 KOps/s | 3.2279 KOps/s | |
test_memmaptd_index_op | 1.3257ms | 0.5840ms | 1.7122 KOps/s | 1.6644 KOps/s | |
test_serialize_model | 0.1378s | 0.1183s | 8.4525 Ops/s | 8.4746 Ops/s | |
test_serialize_model_pickle | 0.4754s | 0.3951s | 2.5307 Ops/s | 2.5185 Ops/s | |
test_serialize_weights | 0.2132s | 0.1300s | 7.6909 Ops/s | 8.6701 Ops/s | |
test_serialize_weights_returnearly | 0.1800s | 0.1584s | 6.3138 Ops/s | 6.3220 Ops/s | |
test_serialize_weights_pickle | 0.5497s | 0.4150s | 2.4095 Ops/s | 2.5100 Ops/s | |
test_serialize_weights_filesystem | 0.1510s | 0.1411s | 7.0884 Ops/s | 7.1894 Ops/s | |
test_serialize_model_filesystem | 0.2543s | 0.1608s | 6.2179 Ops/s | 5.9465 Ops/s | |
test_reshape_pytree | 72.9950μs | 27.1690μs | 36.8066 KOps/s | 36.8068 KOps/s | |
test_reshape_td | 77.5550μs | 33.3250μs | 30.0075 KOps/s | 30.6460 KOps/s | |
test_view_pytree | 69.9800μs | 27.1799μs | 36.7919 KOps/s | 36.8293 KOps/s | |
test_view_td | 0.1121ms | 38.3105μs | 26.1025 KOps/s | 26.6028 KOps/s | |
test_unbind_pytree | 72.5150μs | 30.1814μs | 33.1329 KOps/s | 33.0428 KOps/s | |
test_unbind_td | 0.3571ms | 38.6773μs | 25.8550 KOps/s | 25.9180 KOps/s | |
test_split_pytree | 82.0450μs | 29.5053μs | 33.8922 KOps/s | 33.4586 KOps/s | |
test_split_td | 0.2005ms | 44.6939μs | 22.3744 KOps/s | 22.2036 KOps/s | |
test_add_pytree | 98.5120μs | 35.8972μs | 27.8573 KOps/s | 26.8807 KOps/s | |
test_add_td | 0.1350ms | 56.2000μs | 17.7936 KOps/s | 17.4267 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1373ms | 61.4305μs | 16.2786 KOps/s | 16.0656 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3791ms | 0.1602ms | 6.2421 KOps/s | 6.2653 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1179ms | 44.2796μs | 22.5838 KOps/s | 22.0777 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2676ms | 0.1210ms | 8.2670 KOps/s | 8.4024 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 70.0000μs | 25.4916μs | 39.2287 KOps/s | 38.8337 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1288ms | 53.2925μs | 18.7644 KOps/s | 18.1018 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1350ms | 78.3955μs | 12.7558 KOps/s | 12.3433 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1542ms | 68.0149μs | 14.7027 KOps/s | 14.2693 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1757ms | 0.1030ms | 9.7043 KOps/s | 9.6335 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3727ms | 0.1995ms | 5.0121 KOps/s | 5.0009 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1083ms | 43.6051μs | 22.9331 KOps/s | 22.2345 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4989ms | 61.1694μs | 16.3480 KOps/s | 16.2437 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2218ms | 0.1014ms | 9.8643 KOps/s | 9.7050 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3573ms | 0.2028ms | 4.9308 KOps/s | 4.8555 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4315ms | 0.2124ms | 4.7086 KOps/s | 4.8110 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1980ms | 0.1045ms | 9.5662 KOps/s | 9.5042 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1903ms | 55.3519μs | 18.0662 KOps/s | 18.3285 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1842ms | 45.0080μs | 22.2183 KOps/s | 22.2032 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6586ms | 0.1585ms | 6.3088 KOps/s | 6.1629 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1826ms | 0.1007ms | 9.9301 KOps/s | 9.9022 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 63.1280μs | 21.2900μs | 46.9705 KOps/s | 48.0396 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1203ms | 58.6610μs | 17.0471 KOps/s | 17.2172 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1751ms | 82.0942μs | 12.1811 KOps/s | 11.9150 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1617ms | 70.1762μs | 14.2498 KOps/s | 13.8319 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2931ms | 0.2081ms | 4.8058 KOps/s | 4.9326 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.3939ms | 1.2725ms | 785.8432 Ops/s | 775.7781 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3369ms | 0.2023ms | 4.9426 KOps/s | 4.9474 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.8951ms | 0.7796ms | 1.2827 KOps/s | 1.2741 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.8137ms | 0.4597ms | 2.1752 KOps/s | 2.1992 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.5410ms | 2.5908ms | 385.9865 Ops/s | 369.4862 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 87.2920μs | 35.1998μs | 28.4093 KOps/s | 27.7142 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.6016ms | 31.7192μs | 31.5267 KOps/s | 30.1840 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 95.6580μs | 28.6638μs | 34.8872 KOps/s | 34.1056 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 65.4920μs | 22.9795μs | 43.5170 KOps/s | 42.5717 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1109ms | 29.3536μs | 34.0674 KOps/s | 33.3315 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 67.2360μs | 23.0128μs | 43.4540 KOps/s | 43.0734 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1495ms | 51.6776μs | 19.3507 KOps/s | 19.3736 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.6417ms | 19.4892μs | 51.3105 KOps/s | 49.6281 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1114ms | 43.2634μs | 23.1142 KOps/s | 22.2244 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 78.2950μs | 18.8165μs | 53.1449 KOps/s | 51.9381 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1033ms | 44.0589μs | 22.6969 KOps/s | 22.2421 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 57.6670μs | 18.9524μs | 52.7638 KOps/s | 51.9635 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1276ms | 51.4473μs | 19.4374 KOps/s | 19.0789 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.1454ms | 19.3711μs | 51.6232 KOps/s | 50.2654 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 89.2460μs | 43.9166μs | 22.7704 KOps/s | 22.2599 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 81.6140μs | 18.6914μs | 53.5005 KOps/s | 52.8095 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1166ms | 43.7498μs | 22.8573 KOps/s | 21.8417 KOps/s | |
test_compile_indexing[int-pytree-eager] | 64.8910μs | 18.7417μs | 53.3571 KOps/s | 52.1695 KOps/s | |
test_mod_add[eager] | 90.8690μs | 25.7706μs | 38.8039 KOps/s | 35.9710 KOps/s | |
test_mod_add[compile] | 87.8530μs | 43.4176μs | 23.0322 KOps/s | 21.9549 KOps/s | |
test_mod_add[compile-overhead] | 0.1021ms | 43.2903μs | 23.0999 KOps/s | 21.4054 KOps/s | |
test_mod_wrap[eager] | 0.3638ms | 0.2191ms | 4.5635 KOps/s | 4.4886 KOps/s | |
test_mod_wrap[compile] | 1.5940ms | 0.2043ms | 4.8950 KOps/s | 4.8730 KOps/s | |
test_mod_wrap[compile-overhead] | 1.6964ms | 0.2026ms | 4.9349 KOps/s | 4.8873 KOps/s | |
test_mod_wrap_and_backward[eager] | 11.7567ms | 10.7441ms | 93.0747 Ops/s | 90.3604 Ops/s | |
test_mod_wrap_and_backward[compile] | 11.6715ms | 10.7399ms | 93.1111 Ops/s | 80.4137 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.1889ms | 10.6819ms | 93.6164 Ops/s | 77.3839 Ops/s | |
test_seq_add[eager] | 0.1999ms | 90.6525μs | 11.0311 KOps/s | 10.5091 KOps/s | |
test_seq_add[compile] | 0.1890ms | 59.1380μs | 16.9096 KOps/s | 16.1977 KOps/s | |
test_seq_add[compile-overhead] | 0.1316ms | 57.6315μs | 17.3516 KOps/s | 16.8013 KOps/s | |
test_seq_wrap[eager] | 0.6872ms | 0.3897ms | 2.5662 KOps/s | 2.4702 KOps/s | |
test_seq_wrap[compile] | 0.4333ms | 0.2278ms | 4.3893 KOps/s | 4.3981 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3289ms | 0.2271ms | 4.4029 KOps/s | 4.4551 KOps/s | |
test_func_call_runtime[False-eager] | 0.7871ms | 0.5617ms | 1.7804 KOps/s | 1.7892 KOps/s | |
test_func_call_runtime[False-compile] | 0.5919ms | 0.4278ms | 2.3376 KOps/s | 2.3540 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.6139ms | 0.4305ms | 2.3230 KOps/s | 2.3575 KOps/s | |
test_func_call_runtime[True-eager] | 0.9321ms | 0.7724ms | 1.2946 KOps/s | 1.2894 KOps/s | |
test_func_call_runtime[True-compile] | 0.6373ms | 0.4663ms | 2.1446 KOps/s | 2.1356 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5659ms | 0.4675ms | 2.1390 KOps/s | 2.1485 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8180ms | 0.5691ms | 1.7573 KOps/s | 1.7885 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5554ms | 0.4262ms | 2.3461 KOps/s | 2.3486 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5572ms | 0.4275ms | 2.3391 KOps/s | 2.3625 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.3896ms | 0.9097ms | 1.0993 KOps/s | 1.1164 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.7825ms | 0.4931ms | 2.0279 KOps/s | 2.0557 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5908ms | 0.4932ms | 2.0275 KOps/s | 2.0507 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5862ms | 1.8956ms | 527.5418 Ops/s | 531.4638 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0511ms | 0.5159ms | 1.9383 KOps/s | 1.9333 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 1.0253ms | 0.5202ms | 1.9225 KOps/s | 1.9223 KOps/s | |
test_distributed | 0.2753ms | 0.1250ms | 7.9995 KOps/s | 7.7682 KOps/s | |
test_tdmodule | 31.4880μs | 17.9181μs | 55.8095 KOps/s | 51.1669 KOps/s | |
test_tdmodule_dispatch | 57.4980μs | 35.4861μs | 28.1800 KOps/s | 26.2961 KOps/s | |
test_tdseq | 51.2550μs | 21.5349μs | 46.4362 KOps/s | 45.1365 KOps/s | |
test_tdseq_dispatch | 75.1000μs | 42.3549μs | 23.6100 KOps/s | 22.6023 KOps/s | |
test_instantiation_functorch | 1.6760ms | 1.5253ms | 655.6184 Ops/s | 653.5182 Ops/s | |
test_exec_functorch | 0.4293ms | 0.1794ms | 5.5752 KOps/s | 5.4410 KOps/s | |
test_exec_functional_call | 0.3291ms | 0.1765ms | 5.6662 KOps/s | 5.7696 KOps/s | |
test_exec_td_decorator | 0.5585ms | 0.2305ms | 4.3390 KOps/s | 4.3113 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9798ms | 0.6462ms | 1.5474 KOps/s | 1.5545 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9624ms | 0.6551ms | 1.5266 KOps/s | 1.5570 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7398ms | 0.5288ms | 1.8912 KOps/s | 1.8987 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8500ms | 0.5290ms | 1.8903 KOps/s | 1.9012 KOps/s | |
test_to_module_speed[True] | 1.8910ms | 1.3003ms | 769.0758 Ops/s | 777.6661 Ops/s | |
test_to_module_speed[False] | 2.0801ms | 1.2728ms | 785.6508 Ops/s | 796.6053 Ops/s | |
test_tc_init | 74.8900μs | 45.6754μs | 21.8936 KOps/s | 21.4395 KOps/s | |
test_tc_init_nested | 0.1645ms | 90.3840μs | 11.0639 KOps/s | 10.7376 KOps/s | |
test_tc_first_layer_tensor | 17.5530μs | 1.4985μs | 667.3461 KOps/s | 643.5403 KOps/s | |
test_tc_first_layer_nontensor | 45.3940μs | 4.6077μs | 217.0257 KOps/s | 208.0972 KOps/s | |
test_tc_second_layer_tensor | 28.9340μs | 2.8160μs | 355.1183 KOps/s | 346.3342 KOps/s | |
test_tc_second_layer_nontensor | 46.8970μs | 6.0028μs | 166.5897 KOps/s | 162.0180 KOps/s | |
test_unbind | 0.2175s | 13.7727ms | 72.6074 Ops/s | 83.4786 Ops/s | |
test_full_like | 7.9754ms | 7.0375ms | 142.0954 Ops/s | 128.2185 Ops/s | |
test_zeros_like | 3.2317ms | 2.6928ms | 371.3657 Ops/s | 338.3334 Ops/s | |
test_ones_like | 10.7908ms | 5.8517ms | 170.8912 Ops/s | 281.2960 Ops/s | |
test_clone | 13.3608ms | 7.7415ms | 129.1744 Ops/s | 192.8556 Ops/s | |
test_squeeze | 63.6990μs | 11.9462μs | 83.7087 KOps/s | 84.9024 KOps/s | |
test_unsqueeze | 0.1553ms | 88.3121μs | 11.3235 KOps/s | 11.2307 KOps/s | |
test_split | 0.4638ms | 0.1920ms | 5.2089 KOps/s | 5.2875 KOps/s | |
test_permute | 0.4282ms | 0.2199ms | 4.5465 KOps/s | 4.5940 KOps/s | |
test_stack | 25.5693ms | 24.2591ms | 41.2216 Ops/s | 40.7772 Ops/s | |
test_cat | 30.9139ms | 24.3608ms | 41.0495 Ops/s | 41.3629 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 42.1010μs | 10.5898μs | 94.4302 KOps/s | 97.9073 KOps/s | |
test_plain_set_stack_nested | 36.7110μs | 10.7423μs | 93.0899 KOps/s | 97.5513 KOps/s | |
test_plain_set_nested_inplace | 41.0210μs | 11.4740μs | 87.1536 KOps/s | 89.8276 KOps/s | |
test_plain_set_stack_nested_inplace | 42.8610μs | 11.5222μs | 86.7887 KOps/s | 90.3733 KOps/s | |
test_items | 21.4100μs | 2.9043μs | 344.3157 KOps/s | 346.0004 KOps/s | |
test_items_nested | 0.4351ms | 0.3207ms | 3.1178 KOps/s | 3.1294 KOps/s | |
test_items_nested_locked | 0.3494ms | 0.3216ms | 3.1091 KOps/s | 3.1308 KOps/s | |
test_items_nested_leaf | 84.5610μs | 58.1366μs | 17.2009 KOps/s | 17.2451 KOps/s | |
test_items_stack_nested | 0.3711ms | 0.3213ms | 3.1126 KOps/s | 3.1312 KOps/s | |
test_items_stack_nested_leaf | 0.1309ms | 59.5001μs | 16.8067 KOps/s | 17.1627 KOps/s | |
test_items_stack_nested_locked | 0.3475ms | 0.3232ms | 3.0938 KOps/s | 3.1242 KOps/s | |
test_keys | 23.7600μs | 3.4591μs | 289.0952 KOps/s | 288.6596 KOps/s | |
test_keys_nested | 95.8220μs | 70.6146μs | 14.1614 KOps/s | 14.2494 KOps/s | |
test_keys_nested_locked | 3.1593ms | 76.0749μs | 13.1449 KOps/s | 13.2378 KOps/s | |
test_keys_nested_leaf | 90.6110μs | 61.6784μs | 16.2131 KOps/s | 16.2490 KOps/s | |
test_keys_stack_nested | 0.1011ms | 70.4217μs | 14.2002 KOps/s | 14.2109 KOps/s | |
test_keys_stack_nested_leaf | 91.8210μs | 61.3179μs | 16.3085 KOps/s | 16.2758 KOps/s | |
test_keys_stack_nested_locked | 0.1074ms | 75.6164μs | 13.2246 KOps/s | 13.2612 KOps/s | |
test_values | 5.4585μs | 0.9341μs | 1.0706 MOps/s | 1.1464 MOps/s | |
test_values_nested | 58.8710μs | 31.3013μs | 31.9475 KOps/s | 32.1899 KOps/s | |
test_values_nested_locked | 57.0010μs | 32.6309μs | 30.6458 KOps/s | 30.6146 KOps/s | |
test_values_nested_leaf | 0.1984ms | 33.5670μs | 29.7912 KOps/s | 30.0851 KOps/s | |
test_values_stack_nested | 0.1196ms | 31.7771μs | 31.4692 KOps/s | 31.9348 KOps/s | |
test_values_stack_nested_leaf | 0.1483ms | 34.2344μs | 29.2104 KOps/s | 29.7283 KOps/s | |
test_values_stack_nested_locked | 0.2337ms | 33.1306μs | 30.1836 KOps/s | 30.4944 KOps/s | |
test_membership | 1.6221μs | 0.5054μs | 1.9788 MOps/s | 1.9690 MOps/s | |
test_membership_nested | 33.1000μs | 1.8925μs | 528.3930 KOps/s | 528.9927 KOps/s | |
test_membership_nested_leaf | 12.7037μs | 1.8137μs | 551.3695 KOps/s | 522.1154 KOps/s | |
test_membership_stacked_nested | 27.0500μs | 1.9222μs | 520.2345 KOps/s | 498.6146 KOps/s | |
test_membership_stacked_nested_leaf | 27.8610μs | 1.9296μs | 518.2512 KOps/s | 493.4354 KOps/s | |
test_membership_nested_last | 23.3400μs | 2.7993μs | 357.2370 KOps/s | 349.4448 KOps/s | |
test_membership_nested_leaf_last | 41.9210μs | 2.8104μs | 355.8191 KOps/s | 354.9403 KOps/s | |
test_membership_stacked_nested_last | 29.9500μs | 3.5532μs | 281.4351 KOps/s | 348.4933 KOps/s | |
test_membership_stacked_nested_leaf_last | 74.6520μs | 3.4879μs | 286.7060 KOps/s | 352.0206 KOps/s | |
test_nested_getleaf | 39.3200μs | 6.0043μs | 166.5470 KOps/s | 167.3115 KOps/s | |
test_nested_get | 33.7410μs | 5.6755μs | 176.1951 KOps/s | 176.5084 KOps/s | |
test_stacked_getleaf | 31.9210μs | 6.0248μs | 165.9801 KOps/s | 166.4377 KOps/s | |
test_stacked_get | 32.0200μs | 5.6599μs | 176.6826 KOps/s | 176.4181 KOps/s | |
test_nested_getitemleaf | 31.3800μs | 6.0626μs | 164.9466 KOps/s | 162.5772 KOps/s | |
test_nested_getitem | 36.4810μs | 5.7847μs | 172.8709 KOps/s | 172.3676 KOps/s | |
test_stacked_getitemleaf | 36.6110μs | 6.0679μs | 164.8013 KOps/s | 163.3819 KOps/s | |
test_stacked_getitem | 28.3300μs | 5.7696μs | 173.3210 KOps/s | 172.4130 KOps/s | |
test_lock_nested | 5.1704ms | 0.3706ms | 2.6983 KOps/s | 2.6332 KOps/s | |
test_lock_stack_nested | 0.3881ms | 0.3330ms | 3.0030 KOps/s | 2.8937 KOps/s | |
test_unlock_nested | 0.6707ms | 0.3055ms | 3.2737 KOps/s | 3.2275 KOps/s | |
test_unlock_stack_nested | 0.3779ms | 0.2714ms | 3.6850 KOps/s | 3.5649 KOps/s | |
test_flatten_speed | 0.1084ms | 73.1124μs | 13.6776 KOps/s | 13.7943 KOps/s | |
test_unflatten_speed | 0.4267ms | 0.2900ms | 3.4483 KOps/s | 3.4408 KOps/s | |
test_common_ops | 1.6744ms | 0.5966ms | 1.6761 KOps/s | 1.6852 KOps/s | |
test_creation | 0.1631ms | 1.4705μs | 680.0386 KOps/s | 671.3835 KOps/s | |
test_creation_empty | 30.3800μs | 7.5377μs | 132.6668 KOps/s | 152.1213 KOps/s | |
test_creation_nested_1 | 42.0110μs | 8.9762μs | 111.4056 KOps/s | 123.6918 KOps/s | |
test_creation_nested_2 | 31.1500μs | 11.5659μs | 86.4614 KOps/s | 93.3626 KOps/s | |
test_clone | 1.5569ms | 11.0270μs | 90.6866 KOps/s | 86.3306 KOps/s | |
test_getitem[int] | 0.8862ms | 11.1484μs | 89.6990 KOps/s | 87.0612 KOps/s | |
test_getitem[slice_int] | 0.1176ms | 21.4714μs | 46.5735 KOps/s | 46.3651 KOps/s | |
test_getitem[range] | 0.1460ms | 39.1576μs | 25.5378 KOps/s | 24.8796 KOps/s | |
test_getitem[tuple] | 0.1570ms | 19.4371μs | 51.4480 KOps/s | 52.9753 KOps/s | |
test_getitem[list] | 97.1108ms | 50.4760μs | 19.8114 KOps/s | 28.6736 KOps/s | |
test_setitem_dim[int] | 0.1237ms | 20.0925μs | 49.7697 KOps/s | 49.1229 KOps/s | |
test_setitem_dim[slice_int] | 57.7710μs | 38.3020μs | 26.1083 KOps/s | 25.7229 KOps/s | |
test_setitem_dim[range] | 85.1110μs | 54.6112μs | 18.3113 KOps/s | 18.3438 KOps/s | |
test_setitem_dim[tuple] | 52.6710μs | 32.6561μs | 30.6222 KOps/s | 30.4690 KOps/s | |
test_setitem | 94.3910μs | 15.3481μs | 65.1545 KOps/s | 64.3146 KOps/s | |
test_set | 79.0010μs | 14.7705μs | 67.7025 KOps/s | 66.6637 KOps/s | |
test_set_shared | 1.5436ms | 0.1480ms | 6.7589 KOps/s | 6.7262 KOps/s | |
test_update | 0.6447ms | 17.6616μs | 56.6199 KOps/s | 57.5146 KOps/s | |
test_update_nested | 60.3510μs | 23.4296μs | 42.6811 KOps/s | 44.8457 KOps/s | |
test_update__nested | 0.1359ms | 24.8480μs | 40.2447 KOps/s | 38.9808 KOps/s | |
test_set_nested | 0.1955ms | 15.8995μs | 62.8950 KOps/s | 61.4475 KOps/s | |
test_set_nested_new | 86.2010μs | 18.2359μs | 54.8368 KOps/s | 52.8503 KOps/s | |
test_select | 0.1083ms | 29.6260μs | 33.7542 KOps/s | 33.6068 KOps/s | |
test_select_nested | 74.3820μs | 41.3653μs | 24.1748 KOps/s | 23.7002 KOps/s | |
test_exclude_nested | 93.7720μs | 58.8109μs | 17.0037 KOps/s | 16.9312 KOps/s | |
test_empty[True] | 0.2927ms | 0.2540ms | 3.9366 KOps/s | 3.9049 KOps/s | |
test_empty[False] | 3.2811μs | 0.7491μs | 1.3349 MOps/s | 1.3378 MOps/s | |
test_to | 88.3220μs | 54.9153μs | 18.2099 KOps/s | 17.1465 KOps/s | |
test_to_nonblocking | 0.2114ms | 47.5253μs | 21.0414 KOps/s | 21.4238 KOps/s | |
test_unbind_speed | 1.5686ms | 0.2365ms | 4.2275 KOps/s | 4.2265 KOps/s | |
test_unbind_speed_stack0 | 0.4103ms | 0.2327ms | 4.2967 KOps/s | 4.2063 KOps/s | |
test_unbind_speed_stack1 | 98.9269ms | 0.6439ms | 1.5531 KOps/s | 1.6591 KOps/s | |
test_split | 0.1003s | 1.6366ms | 611.0139 Ops/s | 605.4534 Ops/s | |
test_chunk | 0.1036s | 1.6383ms | 610.3809 Ops/s | 608.8469 Ops/s | |
test_consolidate[False-None] | 0.1023s | 2.8987ms | 344.9876 Ops/s | 340.7227 Ops/s | |
test_consolidate[default-None] | 1.8815ms | 1.7615ms | 567.7073 Ops/s | 557.8984 Ops/s | |
test_consolidate[reduce-overhead-None] | 2.0680ms | 1.8126ms | 551.6848 Ops/s | 549.8873 Ops/s | |
test_consolidate_njt[False-None] | 6.9032ms | 6.5270ms | 153.2095 Ops/s | 152.0782 Ops/s | |
test_to[False-False-None] | 1.8284ms | 1.6652ms | 600.5425 Ops/s | 580.3871 Ops/s | |
test_to[True-False-None] | 1.4523ms | 1.3315ms | 751.0367 Ops/s | 723.2214 Ops/s | |
test_to[within-False-None] | 4.3494ms | 4.0865ms | 244.7054 Ops/s | 240.3531 Ops/s | |
test_to[True-default-None] | 5.5200ms | 5.2030ms | 192.1955 Ops/s | 189.3266 Ops/s | |
test_to_njt[False-False-None] | 7.2177ms | 6.9801ms | 143.2650 Ops/s | 142.1985 Ops/s | |
test_to_njt[True-False-None] | 5.8031ms | 5.5256ms | 180.9772 Ops/s | 180.4635 Ops/s | |
test_to_njt[within-False-None] | 12.6041ms | 12.2432ms | 81.6783 Ops/s | 81.9795 Ops/s | |
test_creation[device0] | 0.4873ms | 80.5699μs | 12.4116 KOps/s | 12.3573 KOps/s | |
test_creation_from_tensor | 0.5475ms | 83.2720μs | 12.0088 KOps/s | 11.9957 KOps/s | |
test_add_one[memmap_tensor0] | 0.4157ms | 7.2342μs | 138.2314 KOps/s | 133.4512 KOps/s | |
test_contiguous[memmap_tensor0] | 1.8741μs | 0.3980μs | 2.5127 MOps/s | 2.4376 MOps/s | |
test_stack[memmap_tensor0] | 41.0210μs | 4.9623μs | 201.5180 KOps/s | 206.0896 KOps/s | |
test_memmaptd_index | 1.6909ms | 0.2630ms | 3.8024 KOps/s | 3.8307 KOps/s | |
test_memmaptd_index_astensor | 0.6070ms | 0.3192ms | 3.1331 KOps/s | 3.1161 KOps/s | |
test_memmaptd_index_op | 1.0684ms | 0.6091ms | 1.6417 KOps/s | 1.6544 KOps/s | |
test_serialize_model | 0.1319s | 0.1311s | 7.6287 Ops/s | 7.6219 Ops/s | |
test_serialize_model_pickle | 1.3826s | 1.2171s | 0.8216 Ops/s | 0.8246 Ops/s | |
test_serialize_weights | 0.1320s | 0.1306s | 7.6564 Ops/s | 7.6745 Ops/s | |
test_serialize_weights_returnearly | 0.5298s | 74.1762ms | 13.4814 Ops/s | 13.7575 Ops/s | |
test_serialize_weights_pickle | 1.3687s | 1.2195s | 0.8200 Ops/s | 0.8182 Ops/s | |
test_reshape_pytree | 57.8310μs | 22.6349μs | 44.1796 KOps/s | 42.8554 KOps/s | |
test_reshape_td | 0.1010ms | 26.6236μs | 37.5606 KOps/s | 35.6026 KOps/s | |
test_view_pytree | 0.1569ms | 22.6622μs | 44.1263 KOps/s | 43.5040 KOps/s | |
test_view_td | 58.4610μs | 29.0752μs | 34.3936 KOps/s | 30.8339 KOps/s | |
test_unbind_pytree | 0.1204ms | 28.2575μs | 35.3889 KOps/s | 34.9203 KOps/s | |
test_unbind_td | 0.7748ms | 36.4075μs | 27.4669 KOps/s | 27.0062 KOps/s | |
test_split_pytree | 84.6010μs | 30.7229μs | 32.5490 KOps/s | 32.7309 KOps/s | |
test_split_td | 1.0562ms | 39.7221μs | 25.1749 KOps/s | 24.4990 KOps/s | |
test_add_pytree | 0.2000ms | 35.3654μs | 28.2762 KOps/s | 27.2143 KOps/s | |
test_add_td | 0.1494ms | 47.2925μs | 21.1450 KOps/s | 21.3780 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.3324ms | 0.1216ms | 8.2269 KOps/s | 7.9869 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2796ms | 0.1231ms | 8.1240 KOps/s | 7.9745 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2512ms | 96.0137μs | 10.4152 KOps/s | 10.3418 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3471ms | 0.1474ms | 6.7852 KOps/s | 6.4659 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1398ms | 22.3741μs | 44.6945 KOps/s | 43.8167 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1274ms | 26.6453μs | 37.5301 KOps/s | 36.7789 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3193ms | 64.8350μs | 15.4238 KOps/s | 15.2223 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.2037ms | 49.3667μs | 20.2566 KOps/s | 20.1096 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2873ms | 0.1430ms | 6.9915 KOps/s | 6.8629 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3252ms | 0.2063ms | 4.8471 KOps/s | 4.7983 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2338ms | 98.5393μs | 10.1482 KOps/s | 10.1096 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1845ms | 50.7667μs | 19.6979 KOps/s | 19.2446 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2962ms | 0.1368ms | 7.3087 KOps/s | 7.1962 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6519ms | 0.4758ms | 2.1019 KOps/s | 1.9755 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3912ms | 0.2452ms | 4.0787 KOps/s | 3.9977 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2597ms | 0.1442ms | 6.9359 KOps/s | 6.8704 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2205ms | 61.0380μs | 16.3832 KOps/s | 16.0334 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2714ms | 0.1044ms | 9.5831 KOps/s | 10.0341 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5786ms | 0.4078ms | 2.4521 KOps/s | 2.3778 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1742ms | 0.1371ms | 7.2925 KOps/s | 7.3199 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2176ms | 22.1742μs | 45.0974 KOps/s | 53.2201 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 49.9400μs | 26.8161μs | 37.2911 KOps/s | 36.6972 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1456ms | 70.1115μs | 14.2630 KOps/s | 14.3265 KOps/s | |
test_compile_copy_flat[pytree-eager] | 83.2810μs | 51.5369μs | 19.4036 KOps/s | 19.2916 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6952ms | 0.4642ms | 2.1541 KOps/s | 2.1246 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.9825ms | 2.6364ms | 379.3054 Ops/s | 371.8811 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6204ms | 0.4377ms | 2.2846 KOps/s | 2.1942 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8532ms | 2.6246ms | 381.0116 Ops/s | 360.2249 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.3103ms | 0.1187ms | 8.4217 KOps/s | 8.6964 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5700ms | 80.7235μs | 12.3880 KOps/s | 11.9265 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.7392ms | 0.1121ms | 8.9212 KOps/s | 9.2223 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.4613ms | 69.3723μs | 14.4150 KOps/s | 14.6455 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.5556ms | 0.1125ms | 8.8873 KOps/s | 9.3521 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.4764ms | 69.7625μs | 14.3344 KOps/s | 14.5316 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2547ms | 0.1055ms | 9.4829 KOps/s | 9.7875 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4133ms | 17.8855μs | 55.9111 KOps/s | 46.3890 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2589ms | 0.1036ms | 9.6528 KOps/s | 10.1905 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1613ms | 16.2170μs | 61.6636 KOps/s | 60.6959 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2952ms | 0.1041ms | 9.6061 KOps/s | 10.1258 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 93.1710μs | 16.3481μs | 61.1693 KOps/s | 61.4313 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2776ms | 0.1072ms | 9.3257 KOps/s | 9.5533 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5735ms | 17.8035μs | 56.1686 KOps/s | 55.0139 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2575ms | 0.1059ms | 9.4445 KOps/s | 10.1110 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.2083ms | 17.4727μs | 57.2320 KOps/s | 61.1883 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2857ms | 0.1049ms | 9.5368 KOps/s | 10.1756 KOps/s | |
test_compile_indexing[int-pytree-eager] | 94.7810μs | 16.2366μs | 61.5891 KOps/s | 60.8844 KOps/s | |
test_mod_add[eager] | 0.2175ms | 34.0800μs | 29.3427 KOps/s | 30.8485 KOps/s | |
test_mod_add[compile] | 0.3387ms | 81.9407μs | 12.2039 KOps/s | 12.7317 KOps/s | |
test_mod_add[compile-overhead] | 0.3401ms | 0.1687ms | 5.9269 KOps/s | 5.7340 KOps/s | |
test_mod_wrap[eager] | 0.3948ms | 0.2412ms | 4.1462 KOps/s | 4.0148 KOps/s | |
test_mod_wrap[compile] | 1.6625ms | 0.2804ms | 3.5659 KOps/s | 3.4749 KOps/s | |
test_mod_wrap[compile-overhead] | 7.3292ms | 3.8315ms | 260.9952 Ops/s | 261.0023 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.7188ms | 1.3454ms | 743.2967 Ops/s | 682.6280 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4360ms | 1.2787ms | 782.0293 Ops/s | 717.2098 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4374ms | 0.9401ms | 1.0637 KOps/s | 910.6140 Ops/s | |
test_seq_add[eager] | 0.2445ms | 95.0692μs | 10.5187 KOps/s | 10.3337 KOps/s | |
test_seq_add[compile] | 0.2392ms | 87.9840μs | 11.3657 KOps/s | 11.3276 KOps/s | |
test_seq_add[compile-overhead] | 0.2772ms | 0.1285ms | 7.7821 KOps/s | 7.7231 KOps/s | |
test_seq_wrap[eager] | 0.5133ms | 0.3781ms | 2.6447 KOps/s | 2.5862 KOps/s | |
test_seq_wrap[compile] | 0.4431ms | 0.2990ms | 3.3442 KOps/s | 3.1850 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3713ms | 0.2245ms | 4.4547 KOps/s | 4.4398 KOps/s | |
test_func_call_runtime[False-eager] | 0.9138ms | 0.7378ms | 1.3554 KOps/s | 1.2757 KOps/s | |
test_func_call_runtime[False-compile] | 0.9035ms | 0.7438ms | 1.3445 KOps/s | 1.3221 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4658ms | 0.3643ms | 2.7449 KOps/s | 2.7211 KOps/s | |
test_func_call_runtime[True-eager] | 1.0656ms | 0.9025ms | 1.1080 KOps/s | 1.0833 KOps/s | |
test_func_call_runtime[True-compile] | 0.9173ms | 0.7675ms | 1.3030 KOps/s | 1.2867 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5035ms | 0.3855ms | 2.5940 KOps/s | 2.5790 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9564ms | 0.7438ms | 1.3445 KOps/s | 1.3253 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9197ms | 0.7477ms | 1.3374 KOps/s | 1.3063 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4723ms | 0.3681ms | 2.7164 KOps/s | 2.6993 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1839ms | 1.0046ms | 995.3772 Ops/s | 967.4601 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9540ms | 0.7981ms | 1.2530 KOps/s | 1.2331 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5627ms | 0.4140ms | 2.4152 KOps/s | 2.4031 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5595ms | 2.0774ms | 481.3701 Ops/s | 472.0380 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9873ms | 0.8075ms | 1.2383 KOps/s | 1.2081 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5679ms | 0.4147ms | 2.4115 KOps/s | 2.3856 KOps/s | |
test_distributed | 3.4128ms | 0.1803ms | 5.5452 KOps/s | 8.6385 KOps/s | |
test_tdmodule | 0.3851ms | 13.9313μs | 71.7808 KOps/s | 73.3501 KOps/s | |
test_tdmodule_dispatch | 63.0110μs | 26.3490μs | 37.9521 KOps/s | 38.0048 KOps/s | |
test_tdseq | 36.7710μs | 15.2340μs | 65.6425 KOps/s | 65.2426 KOps/s | |
test_tdseq_dispatch | 53.0210μs | 30.0867μs | 33.2373 KOps/s | 33.8422 KOps/s | |
test_instantiation_functorch | 1.6651ms | 1.5316ms | 652.8919 Ops/s | 633.7152 Ops/s | |
test_exec_functorch | 0.2620ms | 0.1498ms | 6.6740 KOps/s | 6.6732 KOps/s | |
test_exec_functional_call | 0.3006ms | 0.1467ms | 6.8187 KOps/s | 6.9289 KOps/s | |
test_exec_td_decorator | 0.3783ms | 0.1898ms | 5.2685 KOps/s | 5.3083 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9067ms | 0.6681ms | 1.4967 KOps/s | 1.4588 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8315ms | 0.6680ms | 1.4969 KOps/s | 1.4601 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7699ms | 0.6127ms | 1.6322 KOps/s | 1.6016 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7888ms | 0.6167ms | 1.6215 KOps/s | 1.6081 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.0229ms | 19.1452ms | 52.2325 Ops/s | 51.6135 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.4057ms | 19.1904ms | 52.1093 Ops/s | 51.6580 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.2858ms | 19.0588ms | 52.4693 Ops/s | 52.1035 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.1988ms | 19.0649ms | 52.4523 Ops/s | 51.9336 Ops/s | |
test_to_module_speed[True] | 2.2066ms | 0.9299ms | 1.0754 KOps/s | 1.0678 KOps/s | |
test_to_module_speed[False] | 1.0117ms | 0.9214ms | 1.0853 KOps/s | 1.0955 KOps/s | |
test_tc_init | 0.1444ms | 34.6765μs | 28.8380 KOps/s | 29.1467 KOps/s | |
test_tc_init_nested | 0.1420ms | 71.9375μs | 13.9010 KOps/s | 14.2513 KOps/s | |
test_tc_first_layer_tensor | 4.0744μs | 0.6969μs | 1.4349 MOps/s | 1.4448 MOps/s | |
test_tc_first_layer_nontensor | 30.4600μs | 2.2921μs | 436.2817 KOps/s | 434.4751 KOps/s | |
test_tc_second_layer_tensor | 8.8775μs | 1.4183μs | 705.0807 KOps/s | 710.0806 KOps/s | |
test_tc_second_layer_nontensor | 28.4800μs | 3.0085μs | 332.3894 KOps/s | 325.8015 KOps/s | |
test_unbind | 0.2349s | 12.3199ms | 81.1692 Ops/s | 152.1887 Ops/s | |
test_full_like | 11.4392ms | 10.1223ms | 98.7913 Ops/s | 95.3636 Ops/s | |
test_zeros_like | 9.5740ms | 7.3986ms | 135.1605 Ops/s | 221.9968 Ops/s | |
test_ones_like | 9.9062ms | 7.4690ms | 133.8869 Ops/s | 220.1602 Ops/s | |
test_clone | 13.0623ms | 9.9583ms | 100.4188 Ops/s | 139.5947 Ops/s | |
test_squeeze | 0.1044ms | 9.3016μs | 107.5089 KOps/s | 106.2713 KOps/s | |
test_unsqueeze | 0.1970ms | 68.9073μs | 14.5123 KOps/s | 13.5698 KOps/s | |
test_split | 0.3886ms | 0.1548ms | 6.4591 KOps/s | 6.0751 KOps/s | |
test_permute | 0.3276ms | 0.1752ms | 5.7089 KOps/s | 5.4499 KOps/s | |
test_stack | 52.9674ms | 52.2200ms | 19.1498 Ops/s | 18.7691 Ops/s | |
test_cat | 52.6448ms | 52.2008ms | 19.1568 Ops/s | 19.0105 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):