Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix] Graceful attribute error exit in TensorDictParams #571

Merged
merged 1 commit into from
Nov 23, 2023

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 23, 2023

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 23, 2023
@vmoens vmoens added the bug Something isn't working label Nov 23, 2023
@vmoens vmoens marked this pull request as ready for review November 23, 2023 17:52
@vmoens vmoens merged commit 42bf143 into main Nov 23, 2023
39 of 41 checks passed
@vmoens vmoens deleted the fix_getattr_tdparams branch November 23, 2023 17:52
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 113. Improved: $\large\color{#35bf28}7$. Worsened: $\large\color{#d91a1a}5$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 37.1290μs 15.6458μs 63.9149 KOps/s 61.4645 KOps/s $\color{#35bf28}+3.99\%$
test_plain_set_stack_nested 0.2092ms 0.1475ms 6.7781 KOps/s 6.7851 KOps/s $\color{#d91a1a}-0.10\%$
test_plain_set_nested_inplace 64.2090μs 18.7377μs 53.3682 KOps/s 51.4915 KOps/s $\color{#35bf28}+3.64\%$
test_plain_set_stack_nested_inplace 0.3769ms 0.1722ms 5.8066 KOps/s 5.6601 KOps/s $\color{#35bf28}+2.59\%$
test_items 22.7420μs 2.4446μs 409.0565 KOps/s 412.4635 KOps/s $\color{#d91a1a}-0.83\%$
test_items_nested 0.3469ms 0.2673ms 3.7407 KOps/s 3.7314 KOps/s $\color{#35bf28}+0.25\%$
test_items_nested_locked 0.3398ms 0.2687ms 3.7212 KOps/s 3.7174 KOps/s $\color{#35bf28}+0.10\%$
test_items_nested_leaf 0.5869ms 0.1651ms 6.0586 KOps/s 6.0921 KOps/s $\color{#d91a1a}-0.55\%$
test_items_stack_nested 1.6155ms 1.4957ms 668.5708 Ops/s 682.1543 Ops/s $\color{#d91a1a}-1.99\%$
test_items_stack_nested_leaf 2.1230ms 1.3690ms 730.4814 Ops/s 752.1662 Ops/s $\color{#d91a1a}-2.88\%$
test_items_stack_nested_locked 0.9479ms 0.7693ms 1.2999 KOps/s 1.3157 KOps/s $\color{#d91a1a}-1.20\%$
test_keys 16.3510μs 3.8558μs 259.3462 KOps/s 259.4990 KOps/s $\color{#d91a1a}-0.06\%$
test_keys_nested 1.4419ms 0.1397ms 7.1565 KOps/s 6.7906 KOps/s $\textbf{\color{#35bf28}+5.39\%}$
test_keys_nested_locked 0.2710ms 0.1397ms 7.1566 KOps/s 7.1944 KOps/s $\color{#d91a1a}-0.53\%$
test_keys_nested_leaf 0.3182ms 0.1394ms 7.1742 KOps/s 7.2114 KOps/s $\color{#d91a1a}-0.52\%$
test_keys_stack_nested 2.3159ms 1.4278ms 700.3619 Ops/s 724.8409 Ops/s $\color{#d91a1a}-3.38\%$
test_keys_stack_nested_leaf 1.5425ms 1.4286ms 700.0098 Ops/s 725.3851 Ops/s $\color{#d91a1a}-3.50\%$
test_keys_stack_nested_locked 1.6182ms 0.6916ms 1.4459 KOps/s 1.4812 KOps/s $\color{#d91a1a}-2.38\%$
test_values 9.0142μs 1.1808μs 846.8981 KOps/s 895.2947 KOps/s $\textbf{\color{#d91a1a}-5.41\%}$
test_values_nested 84.5480μs 48.4567μs 20.6370 KOps/s 20.2489 KOps/s $\color{#35bf28}+1.92\%$
test_values_nested_locked 92.9130μs 49.0057μs 20.4058 KOps/s 20.1854 KOps/s $\color{#35bf28}+1.09\%$
test_values_nested_leaf 77.0440μs 43.4721μs 23.0033 KOps/s 22.5326 KOps/s $\color{#35bf28}+2.09\%$
test_values_stack_nested 1.5303ms 1.2106ms 826.0634 Ops/s 857.9231 Ops/s $\color{#d91a1a}-3.71\%$
test_values_stack_nested_leaf 1.2924ms 1.1983ms 834.5068 Ops/s 861.9399 Ops/s $\color{#d91a1a}-3.18\%$
test_values_stack_nested_locked 0.7480ms 0.5227ms 1.9131 KOps/s 1.9930 KOps/s $\color{#d91a1a}-4.01\%$
test_membership 19.5770μs 1.3493μs 741.1264 KOps/s 746.0425 KOps/s $\color{#d91a1a}-0.66\%$
test_membership_nested 25.9780μs 2.8353μs 352.6999 KOps/s 360.8546 KOps/s $\color{#d91a1a}-2.26\%$
test_membership_nested_leaf 20.4290μs 2.8772μs 347.5645 KOps/s 359.0237 KOps/s $\color{#d91a1a}-3.19\%$
test_membership_stacked_nested 43.5410μs 12.0491μs 82.9935 KOps/s 85.8505 KOps/s $\color{#d91a1a}-3.33\%$
test_membership_stacked_nested_leaf 44.7740μs 11.9993μs 83.3382 KOps/s 85.0918 KOps/s $\color{#d91a1a}-2.06\%$
test_membership_nested_last 33.4220μs 6.0428μs 165.4857 KOps/s 167.4294 KOps/s $\color{#d91a1a}-1.16\%$
test_membership_nested_leaf_last 29.5350μs 6.0349μs 165.7030 KOps/s 152.8394 KOps/s $\textbf{\color{#35bf28}+8.42\%}$
test_membership_stacked_nested_last 0.3117ms 0.1729ms 5.7849 KOps/s 5.8854 KOps/s $\color{#d91a1a}-1.71\%$
test_membership_stacked_nested_leaf_last 45.8260μs 14.1730μs 70.5566 KOps/s 73.2995 KOps/s $\color{#d91a1a}-3.74\%$
test_nested_getleaf 49.0820μs 10.6092μs 94.2578 KOps/s 94.4023 KOps/s $\color{#d91a1a}-0.15\%$
test_nested_get 39.5540μs 10.1834μs 98.1986 KOps/s 98.4857 KOps/s $\color{#d91a1a}-0.29\%$
test_stacked_getleaf 0.7791ms 0.6459ms 1.5482 KOps/s 1.5962 KOps/s $\color{#d91a1a}-3.01\%$
test_stacked_get 0.8523ms 0.6145ms 1.6272 KOps/s 1.6826 KOps/s $\color{#d91a1a}-3.29\%$
test_nested_getitemleaf 52.4480μs 11.0135μs 90.7980 KOps/s 90.7002 KOps/s $\color{#35bf28}+0.11\%$
test_nested_getitem 37.0200μs 10.3623μs 96.5034 KOps/s 95.8171 KOps/s $\color{#35bf28}+0.72\%$
test_stacked_getitemleaf 0.7482ms 0.6480ms 1.5431 KOps/s 1.5908 KOps/s $\color{#d91a1a}-3.00\%$
test_stacked_getitem 0.7327ms 0.6201ms 1.6126 KOps/s 1.6727 KOps/s $\color{#d91a1a}-3.59\%$
test_lock_nested 64.7735ms 0.5585ms 1.7905 KOps/s 2.0339 KOps/s $\textbf{\color{#d91a1a}-11.97\%}$
test_lock_stack_nested 88.9300ms 8.7997ms 113.6408 Ops/s 113.3699 Ops/s $\color{#35bf28}+0.24\%$
test_unlock_nested 65.5725ms 0.5131ms 1.9491 KOps/s 1.9464 KOps/s $\color{#35bf28}+0.14\%$
test_unlock_stack_nested 81.7981ms 8.6588ms 115.4896 Ops/s 201.3287 Ops/s $\textbf{\color{#d91a1a}-42.64\%}$
test_flatten_speed 0.6828ms 0.2698ms 3.7059 KOps/s 3.7008 KOps/s $\color{#35bf28}+0.14\%$
test_unflatten_speed 0.8044ms 0.4700ms 2.1278 KOps/s 2.0654 KOps/s $\color{#35bf28}+3.02\%$
test_common_ops 4.2657ms 0.6946ms 1.4398 KOps/s 1.4392 KOps/s $\color{#35bf28}+0.04\%$
test_creation 0.1073ms 2.3764μs 420.8099 KOps/s 421.5310 KOps/s $\color{#d91a1a}-0.17\%$
test_creation_empty 37.4000μs 8.0136μs 124.7883 KOps/s 113.4591 KOps/s $\textbf{\color{#35bf28}+9.99\%}$
test_creation_nested_1 46.2070μs 11.4722μs 87.1675 KOps/s 82.2039 KOps/s $\textbf{\color{#35bf28}+6.04\%}$
test_creation_nested_2 45.3950μs 14.9624μs 66.8344 KOps/s 62.7447 KOps/s $\textbf{\color{#35bf28}+6.52\%}$
test_clone 0.1690ms 13.5053μs 74.0451 KOps/s 76.2861 KOps/s $\color{#d91a1a}-2.94\%$
test_getitem[int] 41.4770μs 12.9318μs 77.3286 KOps/s 74.7256 KOps/s $\color{#35bf28}+3.48\%$
test_getitem[slice_int] 82.0640μs 25.2671μs 39.5772 KOps/s 39.6570 KOps/s $\color{#d91a1a}-0.20\%$
test_getitem[range] 0.1538ms 47.5352μs 21.0371 KOps/s 22.3648 KOps/s $\textbf{\color{#d91a1a}-5.94\%}$
test_getitem[tuple] 55.5940μs 20.4363μs 48.9326 KOps/s 48.8137 KOps/s $\color{#35bf28}+0.24\%$
test_getitem[list] 0.1023ms 40.7053μs 24.5668 KOps/s 25.2071 KOps/s $\color{#d91a1a}-2.54\%$
test_setitem_dim[int] 68.4580μs 28.2189μs 35.4373 KOps/s 35.1248 KOps/s $\color{#35bf28}+0.89\%$
test_setitem_dim[slice_int] 0.1169ms 51.7453μs 19.3254 KOps/s 19.0352 KOps/s $\color{#35bf28}+1.52\%$
test_setitem_dim[range] 0.1312ms 71.7309μs 13.9410 KOps/s 13.9857 KOps/s $\color{#d91a1a}-0.32\%$
test_setitem_dim[tuple] 71.2130μs 40.9866μs 24.3982 KOps/s 23.7395 KOps/s $\color{#35bf28}+2.77\%$
test_setitem 0.2225ms 18.4851μs 54.0977 KOps/s 54.8369 KOps/s $\color{#d91a1a}-1.35\%$
test_set 0.2060ms 17.8881μs 55.9031 KOps/s 55.9580 KOps/s $\color{#d91a1a}-0.10\%$
test_set_shared 1.6902ms 0.1383ms 7.2283 KOps/s 6.9355 KOps/s $\color{#35bf28}+4.22\%$
test_update 0.2065ms 23.9261μs 41.7954 KOps/s 41.3968 KOps/s $\color{#35bf28}+0.96\%$
test_update_nested 0.2007ms 34.0218μs 29.3929 KOps/s 29.2090 KOps/s $\color{#35bf28}+0.63\%$
test_set_nested 0.2040ms 19.6720μs 50.8337 KOps/s 50.9790 KOps/s $\color{#d91a1a}-0.29\%$
test_set_nested_new 0.1996ms 24.9781μs 40.0351 KOps/s 40.5078 KOps/s $\color{#d91a1a}-1.17\%$
test_select 1.1214ms 51.5633μs 19.3936 KOps/s 19.8472 KOps/s $\color{#d91a1a}-2.29\%$
test_unbind_speed 0.4500ms 0.3720ms 2.6879 KOps/s 2.6890 KOps/s $\color{#d91a1a}-0.04\%$
test_unbind_speed_stack0 69.6970ms 5.6788ms 176.0925 Ops/s 173.6200 Ops/s $\color{#35bf28}+1.42\%$
test_unbind_speed_stack1 2.5413μs 0.6427μs 1.5560 MOps/s 1.5164 MOps/s $\color{#35bf28}+2.61\%$
test_split 1.9705ms 1.6481ms 606.7510 Ops/s 604.9008 Ops/s $\color{#35bf28}+0.31\%$
test_chunk 63.0989ms 1.7561ms 569.4558 Ops/s 564.6467 Ops/s $\color{#35bf28}+0.85\%$
test_creation[device0] 2.6052ms 0.3026ms 3.3048 KOps/s 3.3481 KOps/s $\color{#d91a1a}-1.29\%$
test_creation_from_tensor 66.8442ms 0.3807ms 2.6270 KOps/s 2.6882 KOps/s $\color{#d91a1a}-2.28\%$
test_add_one[memmap_tensor0] 0.1001ms 25.8228μs 38.7255 KOps/s 38.6148 KOps/s $\color{#35bf28}+0.29\%$
test_contiguous[memmap_tensor0] 27.8020μs 5.9050μs 169.3486 KOps/s 176.1256 KOps/s $\color{#d91a1a}-3.85\%$
test_stack[memmap_tensor0] 0.1310ms 20.2278μs 49.4368 KOps/s 52.2832 KOps/s $\textbf{\color{#d91a1a}-5.44\%}$
test_memmaptd_index 0.4017ms 0.1943ms 5.1469 KOps/s 5.2260 KOps/s $\color{#d91a1a}-1.51\%$
test_memmaptd_index_astensor 0.4787ms 0.2577ms 3.8810 KOps/s 3.9799 KOps/s $\color{#d91a1a}-2.48\%$
test_memmaptd_index_op 1.1561ms 0.4888ms 2.0458 KOps/s 2.0226 KOps/s $\color{#35bf28}+1.15\%$
test_reshape_pytree 0.3164ms 23.5279μs 42.5028 KOps/s 43.3464 KOps/s $\color{#d91a1a}-1.95\%$
test_reshape_td 67.2350μs 31.4650μs 31.7814 KOps/s 31.3555 KOps/s $\color{#35bf28}+1.36\%$
test_view_pytree 58.2780μs 23.3900μs 42.7534 KOps/s 42.7576 KOps/s $-0.01\%$
test_view_td 29.7760μs 4.9806μs 200.7793 KOps/s 206.5375 KOps/s $\color{#d91a1a}-2.79\%$
test_unbind_pytree 61.3350μs 26.5152μs 37.7143 KOps/s 37.3471 KOps/s $\color{#35bf28}+0.98\%$
test_unbind_td 0.1144ms 59.8284μs 16.7145 KOps/s 16.9365 KOps/s $\color{#d91a1a}-1.31\%$
test_split_pytree 57.1860μs 26.5148μs 37.7148 KOps/s 37.9125 KOps/s $\color{#d91a1a}-0.52\%$
test_split_td 90.7890μs 46.5579μs 21.4786 KOps/s 21.5061 KOps/s $\color{#d91a1a}-0.13\%$
test_add_pytree 73.3970μs 32.6541μs 30.6240 KOps/s 31.3697 KOps/s $\color{#d91a1a}-2.38\%$
test_add_td 0.1331ms 44.9325μs 22.2556 KOps/s 22.0959 KOps/s $\color{#35bf28}+0.72\%$
test_distributed 27.1610μs 6.0429μs 165.4834 KOps/s 168.1664 KOps/s $\color{#d91a1a}-1.60\%$
test_tdmodule 0.1663ms 21.3706μs 46.7932 KOps/s 44.5006 KOps/s $\textbf{\color{#35bf28}+5.15\%}$
test_tdmodule_dispatch 0.1935ms 38.5509μs 25.9397 KOps/s 24.5864 KOps/s $\textbf{\color{#35bf28}+5.50\%}$
test_tdseq 0.1156ms 23.9746μs 41.7107 KOps/s 41.0530 KOps/s $\color{#35bf28}+1.60\%$
test_tdseq_dispatch 0.4441ms 42.6832μs 23.4284 KOps/s 22.5560 KOps/s $\color{#35bf28}+3.87\%$
test_instantiation_functorch 1.4615ms 1.3023ms 767.8948 Ops/s 758.4053 Ops/s $\color{#35bf28}+1.25\%$
test_instantiation_td 1.9294ms 1.0252ms 975.4529 Ops/s 953.2425 Ops/s $\color{#35bf28}+2.33\%$
test_exec_functorch 0.2479ms 0.1590ms 6.2884 KOps/s 6.2268 KOps/s $\color{#35bf28}+0.99\%$
test_exec_functional_call 0.2866ms 0.1476ms 6.7728 KOps/s 6.6991 KOps/s $\color{#35bf28}+1.10\%$
test_exec_td 0.3488ms 0.1451ms 6.8897 KOps/s 6.7469 KOps/s $\color{#35bf28}+2.12\%$
test_exec_td_decorator 0.9540ms 0.2177ms 4.5938 KOps/s 4.5237 KOps/s $\color{#35bf28}+1.55\%$
test_vmap_mlp_speed[True-True] 1.7367ms 0.9122ms 1.0962 KOps/s 1.0905 KOps/s $\color{#35bf28}+0.52\%$
test_vmap_mlp_speed[True-False] 0.5839ms 0.4738ms 2.1107 KOps/s 2.0685 KOps/s $\color{#35bf28}+2.04\%$
test_vmap_mlp_speed[False-True] 1.4360ms 0.7858ms 1.2726 KOps/s 1.2706 KOps/s $\color{#35bf28}+0.16\%$
test_vmap_mlp_speed[False-False] 6.5667ms 0.3973ms 2.5167 KOps/s 2.4726 KOps/s $\color{#35bf28}+1.79\%$
test_vmap_mlp_speed_decorator[True-True] 2.2762ms 1.5743ms 635.1924 Ops/s 619.5697 Ops/s $\color{#35bf28}+2.52\%$
test_vmap_mlp_speed_decorator[True-False] 1.1009ms 0.5555ms 1.8003 KOps/s 1.7439 KOps/s $\color{#35bf28}+3.23\%$
test_vmap_mlp_speed_decorator[False-True] 1.9138ms 1.3680ms 731.0147 Ops/s 726.9831 Ops/s $\color{#35bf28}+0.55\%$
test_vmap_mlp_speed_decorator[False-False] 1.0482ms 0.4319ms 2.3154 KOps/s 2.2868 KOps/s $\color{#35bf28}+1.25\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 127. Improved: $\large\color{#35bf28}4$. Worsened: $\large\color{#d91a1a}4$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 41.0610μs 19.6227μs 50.9614 KOps/s 49.8756 KOps/s $\color{#35bf28}+2.18\%$
test_plain_set_stack_nested 0.2133ms 0.1861ms 5.3725 KOps/s 5.3035 KOps/s $\color{#35bf28}+1.30\%$
test_plain_set_nested_inplace 40.1600μs 23.5078μs 42.5391 KOps/s 41.9294 KOps/s $\color{#35bf28}+1.45\%$
test_plain_set_stack_nested_inplace 0.2494ms 0.2167ms 4.6139 KOps/s 4.5186 KOps/s $\color{#35bf28}+2.11\%$
test_items 22.4600μs 4.0046μs 249.7127 KOps/s 245.6311 KOps/s $\color{#35bf28}+1.66\%$
test_items_nested 0.3655ms 0.3336ms 2.9975 KOps/s 2.9447 KOps/s $\color{#35bf28}+1.79\%$
test_items_nested_locked 0.3657ms 0.3317ms 3.0148 KOps/s 2.9854 KOps/s $\color{#35bf28}+0.98\%$
test_items_nested_leaf 0.2453ms 0.2038ms 4.9061 KOps/s 4.9213 KOps/s $\color{#d91a1a}-0.31\%$
test_items_stack_nested 1.7995ms 1.6589ms 602.8247 Ops/s 589.5080 Ops/s $\color{#35bf28}+2.26\%$
test_items_stack_nested_leaf 1.5879ms 1.5113ms 661.6813 Ops/s 653.8721 Ops/s $\color{#35bf28}+1.19\%$
test_items_stack_nested_locked 1.0975ms 0.8970ms 1.1148 KOps/s 1.1263 KOps/s $\color{#d91a1a}-1.02\%$
test_keys 28.0400μs 4.7202μs 211.8566 KOps/s 209.5411 KOps/s $\color{#35bf28}+1.11\%$
test_keys_nested 41.6896ms 0.1761ms 5.6791 KOps/s 5.9938 KOps/s $\textbf{\color{#d91a1a}-5.25\%}$
test_keys_nested_locked 0.2449ms 0.1656ms 6.0396 KOps/s 5.9967 KOps/s $\color{#35bf28}+0.72\%$
test_keys_nested_leaf 0.2248ms 0.1632ms 6.1256 KOps/s 6.1584 KOps/s $\color{#d91a1a}-0.53\%$
test_keys_stack_nested 1.6184ms 1.5446ms 647.4255 Ops/s 635.4227 Ops/s $\color{#35bf28}+1.89\%$
test_keys_stack_nested_leaf 1.7422ms 1.5626ms 639.9422 Ops/s 642.6017 Ops/s $\color{#d91a1a}-0.41\%$
test_keys_stack_nested_locked 0.8368ms 0.7402ms 1.3509 KOps/s 1.3589 KOps/s $\color{#d91a1a}-0.59\%$
test_values 10.5600μs 2.0162μs 495.9802 KOps/s 495.7317 KOps/s $\color{#35bf28}+0.05\%$
test_values_nested 86.2700μs 59.4348μs 16.8252 KOps/s 16.8571 KOps/s $\color{#d91a1a}-0.19\%$
test_values_nested_locked 82.1820μs 58.4739μs 17.1017 KOps/s 17.0091 KOps/s $\color{#35bf28}+0.54\%$
test_values_nested_leaf 69.0220μs 54.8287μs 18.2386 KOps/s 17.8769 KOps/s $\color{#35bf28}+2.02\%$
test_values_stack_nested 1.4007ms 1.3245ms 755.0295 Ops/s 743.4644 Ops/s $\color{#35bf28}+1.56\%$
test_values_stack_nested_leaf 1.5116ms 1.3303ms 751.7303 Ops/s 750.3769 Ops/s $\color{#35bf28}+0.18\%$
test_values_stack_nested_locked 0.6319ms 0.5845ms 1.7107 KOps/s 1.7023 KOps/s $\color{#35bf28}+0.50\%$
test_membership 6.3275μs 1.3490μs 741.2934 KOps/s 645.5020 KOps/s $\textbf{\color{#35bf28}+14.84\%}$
test_membership_nested 69.5220μs 3.0118μs 332.0228 KOps/s 321.8822 KOps/s $\color{#35bf28}+3.15\%$
test_membership_nested_leaf 31.1110μs 2.9923μs 334.1889 KOps/s 321.1703 KOps/s $\color{#35bf28}+4.05\%$
test_membership_stacked_nested 33.6900μs 11.7964μs 84.7717 KOps/s 83.2521 KOps/s $\color{#35bf28}+1.83\%$
test_membership_stacked_nested_leaf 43.7200μs 11.8252μs 84.5651 KOps/s 82.7469 KOps/s $\color{#35bf28}+2.20\%$
test_membership_nested_last 42.2810μs 6.8290μs 146.4348 KOps/s 143.4618 KOps/s $\color{#35bf28}+2.07\%$
test_membership_nested_leaf_last 34.3910μs 6.8323μs 146.3642 KOps/s 144.1049 KOps/s $\color{#35bf28}+1.57\%$
test_membership_stacked_nested_last 0.2747ms 0.1989ms 5.0269 KOps/s 4.9618 KOps/s $\color{#35bf28}+1.31\%$
test_membership_stacked_nested_leaf_last 31.1400μs 14.4041μs 69.4248 KOps/s 68.6916 KOps/s $\color{#35bf28}+1.07\%$
test_nested_getleaf 33.2310μs 13.1626μs 75.9727 KOps/s 76.2052 KOps/s $\color{#d91a1a}-0.31\%$
test_nested_get 34.3900μs 12.4449μs 80.3543 KOps/s 80.3338 KOps/s $\color{#35bf28}+0.03\%$
test_stacked_getleaf 0.7596ms 0.6994ms 1.4297 KOps/s 1.4000 KOps/s $\color{#35bf28}+2.12\%$
test_stacked_get 0.8016ms 0.6722ms 1.4878 KOps/s 1.4455 KOps/s $\color{#35bf28}+2.92\%$
test_nested_getitemleaf 33.0800μs 13.0990μs 76.3420 KOps/s 76.3895 KOps/s $\color{#d91a1a}-0.06\%$
test_nested_getitem 33.5210μs 12.4121μs 80.5665 KOps/s 80.5336 KOps/s $\color{#35bf28}+0.04\%$
test_stacked_getitemleaf 0.8809ms 0.7072ms 1.4140 KOps/s 1.3910 KOps/s $\color{#35bf28}+1.66\%$
test_stacked_getitem 0.7632ms 0.6642ms 1.5056 KOps/s 1.4449 KOps/s $\color{#35bf28}+4.21\%$
test_lock_nested 1.8369ms 0.5621ms 1.7789 KOps/s 1.7895 KOps/s $\color{#d91a1a}-0.59\%$
test_lock_stack_nested 71.8432ms 8.6942ms 115.0195 Ops/s 114.3942 Ops/s $\color{#35bf28}+0.55\%$
test_unlock_nested 60.2678ms 0.5636ms 1.7745 KOps/s 1.7553 KOps/s $\color{#35bf28}+1.09\%$
test_unlock_stack_nested 64.1273ms 8.2633ms 121.0168 Ops/s 120.7470 Ops/s $\color{#35bf28}+0.22\%$
test_flatten_speed 53.7817ms 0.3420ms 2.9237 KOps/s 2.9335 KOps/s $\color{#d91a1a}-0.33\%$
test_unflatten_speed 0.5859ms 0.5461ms 1.8312 KOps/s 1.8115 KOps/s $\color{#35bf28}+1.09\%$
test_common_ops 1.0826ms 0.6878ms 1.4539 KOps/s 1.4544 KOps/s $\color{#d91a1a}-0.04\%$
test_creation 31.5200μs 2.6947μs 371.0952 KOps/s 377.0327 KOps/s $\color{#d91a1a}-1.57\%$
test_creation_empty 24.9800μs 8.8666μs 112.7833 KOps/s 107.9493 KOps/s $\color{#35bf28}+4.48\%$
test_creation_nested_1 43.1000μs 12.4116μs 80.5697 KOps/s 78.5279 KOps/s $\color{#35bf28}+2.60\%$
test_creation_nested_2 35.7600μs 16.0000μs 62.5000 KOps/s 60.4822 KOps/s $\color{#35bf28}+3.34\%$
test_clone 0.1152ms 15.3423μs 65.1792 KOps/s 65.3362 KOps/s $\color{#d91a1a}-0.24\%$
test_getitem[int] 39.7100μs 14.8024μs 67.5565 KOps/s 66.6716 KOps/s $\color{#35bf28}+1.33\%$
test_getitem[slice_int] 53.1510μs 27.8408μs 35.9185 KOps/s 36.1871 KOps/s $\color{#d91a1a}-0.74\%$
test_getitem[range] 71.9810μs 42.5675μs 23.4921 KOps/s 24.0789 KOps/s $\color{#d91a1a}-2.44\%$
test_getitem[tuple] 42.1800μs 23.4488μs 42.6461 KOps/s 42.3299 KOps/s $\color{#35bf28}+0.75\%$
test_getitem[list] 0.3415ms 40.1467μs 24.9086 KOps/s 25.2391 KOps/s $\color{#d91a1a}-1.31\%$
test_setitem_dim[int] 58.9610μs 30.5796μs 32.7015 KOps/s 32.2911 KOps/s $\color{#35bf28}+1.27\%$
test_setitem_dim[slice_int] 79.1210μs 52.4493μs 19.0660 KOps/s 18.8732 KOps/s $\color{#35bf28}+1.02\%$
test_setitem_dim[range] 0.1007ms 69.3903μs 14.4112 KOps/s 14.2403 KOps/s $\color{#35bf28}+1.20\%$
test_setitem_dim[tuple] 66.2810μs 45.3374μs 22.0568 KOps/s 22.2463 KOps/s $\color{#d91a1a}-0.85\%$
test_setitem 0.1147ms 19.9304μs 50.1745 KOps/s 50.6274 KOps/s $\color{#d91a1a}-0.89\%$
test_set 0.1181ms 19.4858μs 51.3195 KOps/s 51.5876 KOps/s $\color{#d91a1a}-0.52\%$
test_set_shared 2.5174ms 0.1077ms 9.2827 KOps/s 9.3850 KOps/s $\color{#d91a1a}-1.09\%$
test_update 98.9820μs 26.3438μs 37.9596 KOps/s 38.6615 KOps/s $\color{#d91a1a}-1.82\%$
test_update_nested 0.1120ms 37.9918μs 26.3215 KOps/s 25.8611 KOps/s $\color{#35bf28}+1.78\%$
test_set_nested 0.1043ms 21.3497μs 46.8390 KOps/s 47.6625 KOps/s $\color{#d91a1a}-1.73\%$
test_set_nested_new 0.1056ms 26.9290μs 37.1347 KOps/s 36.5322 KOps/s $\color{#35bf28}+1.65\%$
test_select 0.1289ms 54.9265μs 18.2061 KOps/s 18.2821 KOps/s $\color{#d91a1a}-0.42\%$
test_to 82.0810μs 53.5387μs 18.6781 KOps/s 19.0453 KOps/s $\color{#d91a1a}-1.93\%$
test_to_nonblocking 63.9210μs 36.6565μs 27.2803 KOps/s 27.0315 KOps/s $\color{#35bf28}+0.92\%$
test_unbind_speed 0.5281ms 0.4278ms 2.3378 KOps/s 2.3118 KOps/s $\color{#35bf28}+1.12\%$
test_unbind_speed_stack0 56.1217ms 5.7865ms 172.8170 Ops/s 168.6071 Ops/s $\color{#35bf28}+2.50\%$
test_unbind_speed_stack1 2.1610μs 0.7750μs 1.2903 MOps/s 1.2895 MOps/s $\color{#35bf28}+0.06\%$
test_split 56.2668ms 2.1163ms 472.5159 Ops/s 478.3562 Ops/s $\color{#d91a1a}-1.22\%$
test_chunk 51.1851ms 2.0878ms 478.9628 Ops/s 507.0119 Ops/s $\textbf{\color{#d91a1a}-5.53\%}$
test_creation[device0] 0.4750ms 0.3115ms 3.2098 KOps/s 2.9663 KOps/s $\textbf{\color{#35bf28}+8.21\%}$
test_creation[device1] 0.6116ms 0.3139ms 3.1858 KOps/s 3.2128 KOps/s $\color{#d91a1a}-0.84\%$
test_creation_from_tensor 0.6118ms 0.3419ms 2.9251 KOps/s 2.9671 KOps/s $\color{#d91a1a}-1.42\%$
test_add_one[memmap_tensor0] 59.1510μs 24.6702μs 40.5347 KOps/s 41.8727 KOps/s $\color{#d91a1a}-3.20\%$
test_add_one[memmap_tensor1] 0.2081ms 74.6222μs 13.4008 KOps/s 13.7999 KOps/s $\color{#d91a1a}-2.89\%$
test_contiguous[memmap_tensor0] 28.4700μs 6.2531μs 159.9195 KOps/s 171.5847 KOps/s $\textbf{\color{#d91a1a}-6.80\%}$
test_contiguous[memmap_tensor1] 52.2210μs 21.5732μs 46.3539 KOps/s 46.3635 KOps/s $\color{#d91a1a}-0.02\%$
test_stack[memmap_tensor0] 49.7410μs 19.4946μs 51.2962 KOps/s 50.9002 KOps/s $\color{#35bf28}+0.78\%$
test_stack[memmap_tensor1] 0.1700ms 72.6947μs 13.7562 KOps/s 13.8665 KOps/s $\color{#d91a1a}-0.80\%$
test_memmaptd_index 0.2638ms 0.2301ms 4.3468 KOps/s 4.4759 KOps/s $\color{#d91a1a}-2.89\%$
test_memmaptd_index_astensor 0.3884ms 0.2861ms 3.4953 KOps/s 3.4130 KOps/s $\color{#35bf28}+2.41\%$
test_memmaptd_index_op 0.6373ms 0.5758ms 1.7367 KOps/s 1.7463 KOps/s $\color{#d91a1a}-0.55\%$
test_reshape_pytree 50.0700μs 24.7329μs 40.4320 KOps/s 40.0661 KOps/s $\color{#35bf28}+0.91\%$
test_reshape_td 65.0200μs 35.7742μs 27.9531 KOps/s 28.3148 KOps/s $\color{#d91a1a}-1.28\%$
test_view_pytree 40.2500μs 25.1900μs 39.6983 KOps/s 40.0880 KOps/s $\color{#d91a1a}-0.97\%$
test_view_td 20.9300μs 5.0093μs 199.6271 KOps/s 190.7653 KOps/s $\color{#35bf28}+4.65\%$
test_unbind_pytree 58.3710μs 29.8534μs 33.4970 KOps/s 33.1433 KOps/s $\color{#35bf28}+1.07\%$
test_unbind_td 0.1502ms 68.5611μs 14.5855 KOps/s 14.4372 KOps/s $\color{#35bf28}+1.03\%$
test_split_pytree 46.5810μs 27.8527μs 35.9032 KOps/s 35.4622 KOps/s $\color{#35bf28}+1.24\%$
test_split_td 76.9410μs 52.9287μs 18.8933 KOps/s 18.8148 KOps/s $\color{#35bf28}+0.42\%$
test_add_pytree 54.4200μs 36.1259μs 27.6810 KOps/s 27.2920 KOps/s $\color{#35bf28}+1.43\%$
test_add_td 86.5610μs 49.7096μs 20.1168 KOps/s 19.9848 KOps/s $\color{#35bf28}+0.66\%$
test_distributed 21.7700μs 6.4307μs 155.5039 KOps/s 155.7890 KOps/s $\color{#d91a1a}-0.18\%$
test_tdmodule 0.7284ms 25.7813μs 38.7878 KOps/s 11.2805 KOps/s $\textbf{\color{#35bf28}+243.85\%}$
test_tdmodule_dispatch 0.1402ms 40.0892μs 24.9444 KOps/s 25.7969 KOps/s $\color{#d91a1a}-3.30\%$
test_tdseq 0.3322ms 30.9827μs 32.2761 KOps/s 32.0195 KOps/s $\color{#35bf28}+0.80\%$
test_tdseq_dispatch 59.5110μs 42.6611μs 23.4406 KOps/s 23.3655 KOps/s $\color{#35bf28}+0.32\%$
test_instantiation_functorch 1.5342ms 1.4337ms 697.4876 Ops/s 700.9446 Ops/s $\color{#d91a1a}-0.49\%$
test_instantiation_td 1.5339ms 1.0849ms 921.7301 Ops/s 926.1864 Ops/s $\color{#d91a1a}-0.48\%$
test_exec_functorch 0.2222ms 0.1740ms 5.7470 KOps/s 5.7127 KOps/s $\color{#35bf28}+0.60\%$
test_exec_functional_call 0.2174ms 0.1682ms 5.9464 KOps/s 5.9312 KOps/s $\color{#35bf28}+0.26\%$
test_exec_td 0.1975ms 0.1610ms 6.2113 KOps/s 6.2290 KOps/s $\color{#d91a1a}-0.28\%$
test_exec_td_decorator 0.7612ms 0.2452ms 4.0784 KOps/s 4.0977 KOps/s $\color{#d91a1a}-0.47\%$
test_vmap_mlp_speed[True-True] 1.3032ms 1.1735ms 852.1416 Ops/s 849.2942 Ops/s $\color{#35bf28}+0.34\%$
test_vmap_mlp_speed[True-False] 0.7020ms 0.6549ms 1.5270 KOps/s 1.4933 KOps/s $\color{#35bf28}+2.26\%$
test_vmap_mlp_speed[False-True] 1.1401ms 1.0615ms 942.0723 Ops/s 933.7221 Ops/s $\color{#35bf28}+0.89\%$
test_vmap_mlp_speed[False-False] 0.7139ms 0.5718ms 1.7488 KOps/s 1.7159 KOps/s $\color{#35bf28}+1.92\%$
test_vmap_mlp_speed_decorator[True-True] 2.6865ms 1.9630ms 509.4328 Ops/s 509.1262 Ops/s $\color{#35bf28}+0.06\%$
test_vmap_mlp_speed_decorator[True-False] 65.6559ms 0.7808ms 1.2808 KOps/s 1.3605 KOps/s $\textbf{\color{#d91a1a}-5.86\%}$
test_vmap_mlp_speed_decorator[False-True] 2.1371ms 1.7570ms 569.1677 Ops/s 556.2812 Ops/s $\color{#35bf28}+2.32\%$
test_vmap_mlp_speed_decorator[False-False] 1.0105ms 0.6202ms 1.6123 KOps/s 1.6060 KOps/s $\color{#35bf28}+0.39\%$
test_vmap_transformer_speed[True-True] 13.3749ms 13.2534ms 75.4524 Ops/s 75.2711 Ops/s $\color{#35bf28}+0.24\%$
test_vmap_transformer_speed[True-False] 8.5186ms 8.4444ms 118.4212 Ops/s 118.1202 Ops/s $\color{#35bf28}+0.25\%$
test_vmap_transformer_speed[False-True] 13.2629ms 13.1586ms 75.9961 Ops/s 74.0776 Ops/s $\color{#35bf28}+2.59\%$
test_vmap_transformer_speed[False-False] 8.4778ms 8.3527ms 119.7222 Ops/s 119.2172 Ops/s $\color{#35bf28}+0.42\%$
test_vmap_transformer_speed_decorator[True-True] 46.9393ms 45.9596ms 21.7582 Ops/s 20.2932 Ops/s $\textbf{\color{#35bf28}+7.22\%}$
test_vmap_transformer_speed_decorator[True-False] 22.3272ms 20.8241ms 48.0214 Ops/s 47.8188 Ops/s $\color{#35bf28}+0.42\%$
test_vmap_transformer_speed_decorator[False-True] 0.1198s 48.8400ms 20.4750 Ops/s 20.4680 Ops/s $\color{#35bf28}+0.03\%$
test_vmap_transformer_speed_decorator[False-False] 21.8896ms 20.3794ms 49.0691 Ops/s 48.9990 Ops/s $\color{#35bf28}+0.14\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants