[Quality] Better error message for incongruent lists of keys #1077

Merged: 1 commit, Nov 6, 2024

vmoens (Contributor) commented Nov 6, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 6, 2024
ghstack-source-id: 34940a47d84bcf171bf4511187fcc82df88f801f
Pull Request resolved: #1077
facebook-github-bot added the CLA Signed label Nov 6, 2024

github-actions bot commented Nov 6, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}10$. Worsened: $\large\color{#d91a1a}40$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 35.5060μs 17.7287μs 56.4056 KOps/s 60.2000 KOps/s $\textbf{\color{#d91a1a}-6.30\%}$
test_plain_set_stack_nested 48.9710μs 17.9870μs 55.5958 KOps/s 60.2929 KOps/s $\textbf{\color{#d91a1a}-7.79\%}$
test_plain_set_nested_inplace 82.7320μs 19.2749μs 51.8809 KOps/s 54.3695 KOps/s $\color{#d91a1a}-4.58\%$
test_plain_set_stack_nested_inplace 64.3590μs 19.7052μs 50.7480 KOps/s 54.8633 KOps/s $\textbf{\color{#d91a1a}-7.50\%}$
test_items 34.0340μs 4.1637μs 240.1695 KOps/s 239.7203 KOps/s $\color{#35bf28}+0.19\%$
test_items_nested 0.7155ms 0.3390ms 2.9501 KOps/s 2.8387 KOps/s $\color{#35bf28}+3.92\%$
test_items_nested_locked 0.4872ms 0.3390ms 2.9502 KOps/s 2.9418 KOps/s $\color{#35bf28}+0.29\%$
test_items_nested_leaf 0.1330ms 71.1832μs 14.0483 KOps/s 13.8216 KOps/s $\color{#35bf28}+1.64\%$
test_items_stack_nested 0.5603ms 0.3412ms 2.9310 KOps/s 2.9377 KOps/s $\color{#d91a1a}-0.23\%$
test_items_stack_nested_leaf 0.1480ms 74.0077μs 13.5121 KOps/s 13.4135 KOps/s $\color{#35bf28}+0.74\%$
test_items_stack_nested_locked 0.5426ms 0.3442ms 2.9052 KOps/s 2.8997 KOps/s $\color{#35bf28}+0.19\%$
test_keys 32.8510μs 3.5929μs 278.3295 KOps/s 284.0466 KOps/s $\color{#d91a1a}-2.01\%$
test_keys_nested 0.2231ms 0.1405ms 7.1150 KOps/s 7.0776 KOps/s $\color{#35bf28}+0.53\%$
test_keys_nested_locked 0.6532ms 0.1468ms 6.8101 KOps/s 6.8591 KOps/s $\color{#d91a1a}-0.72\%$
test_keys_nested_leaf 0.2028ms 0.1216ms 8.2243 KOps/s 8.1876 KOps/s $\color{#35bf28}+0.45\%$
test_keys_stack_nested 0.2311ms 0.1397ms 7.1582 KOps/s 7.1188 KOps/s $\color{#35bf28}+0.55\%$
test_keys_stack_nested_leaf 0.2090ms 0.1187ms 8.4222 KOps/s 8.3001 KOps/s $\color{#35bf28}+1.47\%$
test_keys_stack_nested_locked 0.2449ms 0.1448ms 6.9081 KOps/s 6.8916 KOps/s $\color{#35bf28}+0.24\%$
test_values 9.2072μs 1.1226μs 890.8271 KOps/s 973.0364 KOps/s $\textbf{\color{#d91a1a}-8.45\%}$
test_values_nested 0.1015ms 55.8993μs 17.8893 KOps/s 18.1948 KOps/s $\color{#d91a1a}-1.68\%$
test_values_nested_locked 0.1118ms 56.1401μs 17.8126 KOps/s 18.2122 KOps/s $\color{#d91a1a}-2.19\%$
test_values_nested_leaf 0.1090ms 59.9425μs 16.6827 KOps/s 15.9846 KOps/s $\color{#35bf28}+4.37\%$
test_values_stack_nested 0.1118ms 58.0812μs 17.2173 KOps/s 17.2957 KOps/s $\color{#d91a1a}-0.45\%$
test_values_stack_nested_leaf 0.1172ms 60.8017μs 16.4469 KOps/s 16.6767 KOps/s $\color{#d91a1a}-1.38\%$
test_values_stack_nested_locked 0.1116ms 57.9222μs 17.2645 KOps/s 17.8613 KOps/s $\color{#d91a1a}-3.34\%$
test_membership 5.0079μs 0.7668μs 1.3041 MOps/s 1.1000 MOps/s $\textbf{\color{#35bf28}+18.56\%}$
test_membership_nested 32.2500μs 2.8182μs 354.8354 KOps/s 356.6127 KOps/s $\color{#d91a1a}-0.50\%$
test_membership_nested_leaf 24.8370μs 2.8282μs 353.5774 KOps/s 355.5978 KOps/s $\color{#d91a1a}-0.57\%$
test_membership_stacked_nested 33.3730μs 2.7641μs 361.7836 KOps/s 355.7091 KOps/s $\color{#35bf28}+1.71\%$
test_membership_stacked_nested_leaf 15.6990μs 2.7719μs 360.7687 KOps/s 358.0223 KOps/s $\color{#35bf28}+0.77\%$
test_membership_nested_last 34.9250μs 4.1471μs 241.1336 KOps/s 239.3512 KOps/s $\color{#35bf28}+0.74\%$
test_membership_nested_leaf_last 34.0030μs 4.1272μs 242.2924 KOps/s 239.4576 KOps/s $\color{#35bf28}+1.18\%$
test_membership_stacked_nested_last 33.8430μs 5.3294μs 187.6373 KOps/s 208.5667 KOps/s $\textbf{\color{#d91a1a}-10.03\%}$
test_membership_stacked_nested_leaf_last 19.6270μs 5.2884μs 189.0939 KOps/s 208.9162 KOps/s $\textbf{\color{#d91a1a}-9.49\%}$
test_nested_getleaf 38.4910μs 10.5460μs 94.8229 KOps/s 91.9924 KOps/s $\color{#35bf28}+3.08\%$
test_nested_get 37.5900μs 10.0813μs 99.1940 KOps/s 98.0216 KOps/s $\color{#35bf28}+1.20\%$
test_stacked_getleaf 37.7200μs 10.4855μs 95.3697 KOps/s 93.7733 KOps/s $\color{#35bf28}+1.70\%$
test_stacked_get 38.2810μs 9.9898μs 100.1023 KOps/s 99.5682 KOps/s $\color{#35bf28}+0.54\%$
test_nested_getitemleaf 46.1660μs 11.0292μs 90.6687 KOps/s 90.1624 KOps/s $\color{#35bf28}+0.56\%$
test_nested_getitem 37.2890μs 10.2202μs 97.8453 KOps/s 97.4498 KOps/s $\color{#35bf28}+0.41\%$
test_stacked_getitemleaf 37.2600μs 10.9757μs 91.1105 KOps/s 89.9784 KOps/s $\color{#35bf28}+1.26\%$
test_stacked_getitem 42.5390μs 10.3449μs 96.6658 KOps/s 96.5534 KOps/s $\color{#35bf28}+0.12\%$
test_lock_nested 5.7012ms 0.4503ms 2.2208 KOps/s 2.2843 KOps/s $\color{#d91a1a}-2.78\%$
test_lock_stack_nested 0.6777ms 0.4110ms 2.4330 KOps/s 2.4130 KOps/s $\color{#35bf28}+0.83\%$
test_unlock_nested 0.7288ms 0.3646ms 2.7427 KOps/s 2.7584 KOps/s $\color{#d91a1a}-0.57\%$
test_unlock_stack_nested 0.5386ms 0.3303ms 3.0273 KOps/s 2.9901 KOps/s $\color{#35bf28}+1.25\%$
test_flatten_speed 0.1488ms 92.7463μs 10.7821 KOps/s 10.7569 KOps/s $\color{#35bf28}+0.23\%$
test_unflatten_speed 0.7686ms 0.4732ms 2.1133 KOps/s 2.1153 KOps/s $\color{#d91a1a}-0.09\%$
test_common_ops 6.2783ms 0.7668ms 1.3042 KOps/s 1.4076 KOps/s $\textbf{\color{#d91a1a}-7.35\%}$
test_creation 26.2190μs 2.0910μs 478.2307 KOps/s 482.3252 KOps/s $\color{#d91a1a}-0.85\%$
test_creation_empty 49.6720μs 10.8118μs 92.4915 KOps/s 119.0075 KOps/s $\textbf{\color{#d91a1a}-22.28\%}$
test_creation_nested_1 47.4080μs 13.7479μs 72.7382 KOps/s 91.4983 KOps/s $\textbf{\color{#d91a1a}-20.50\%}$
test_creation_nested_2 82.1530μs 17.7953μs 56.1948 KOps/s 65.8700 KOps/s $\textbf{\color{#d91a1a}-14.69\%}$
test_clone 89.3860μs 13.0985μs 76.3445 KOps/s 76.5959 KOps/s $\color{#d91a1a}-0.33\%$
test_getitem[int] 1.5649ms 12.6062μs 79.3261 KOps/s 80.6071 KOps/s $\color{#d91a1a}-1.59\%$
test_getitem[slice_int] 0.1426ms 23.7646μs 42.0794 KOps/s 42.0194 KOps/s $\color{#35bf28}+0.14\%$
test_getitem[range] 0.1795ms 48.8712μs 20.4619 KOps/s 21.4152 KOps/s $\color{#d91a1a}-4.45\%$
test_getitem[tuple] 0.1290ms 19.6417μs 50.9122 KOps/s 51.5840 KOps/s $\color{#d91a1a}-1.30\%$
test_getitem[list] 0.1785ms 44.3155μs 22.5655 KOps/s 23.5866 KOps/s $\color{#d91a1a}-4.33\%$
test_setitem_dim[int] 48.3900μs 25.0673μs 39.8926 KOps/s 38.9192 KOps/s $\color{#35bf28}+2.50\%$
test_setitem_dim[slice_int] 90.8290μs 49.9834μs 20.0066 KOps/s 19.3065 KOps/s $\color{#35bf28}+3.63\%$
test_setitem_dim[range] 0.1229ms 74.5801μs 13.4084 KOps/s 13.8366 KOps/s $\color{#d91a1a}-3.09\%$
test_setitem_dim[tuple] 68.2070μs 39.0435μs 25.6124 KOps/s 24.6646 KOps/s $\color{#35bf28}+3.84\%$
test_setitem 85.9500μs 19.8909μs 50.2742 KOps/s 54.7712 KOps/s $\textbf{\color{#d91a1a}-8.21\%}$
test_set 79.0170μs 19.1426μs 52.2395 KOps/s 55.3334 KOps/s $\textbf{\color{#d91a1a}-5.59\%}$
test_set_shared 1.1702ms 0.1666ms 6.0025 KOps/s 5.9237 KOps/s $\color{#35bf28}+1.33\%$
test_update 0.7952ms 22.1891μs 45.0671 KOps/s 52.7807 KOps/s $\textbf{\color{#d91a1a}-14.61\%}$
test_update_nested 82.3530μs 31.5501μs 31.6956 KOps/s 34.5649 KOps/s $\textbf{\color{#d91a1a}-8.30\%}$
test_update__nested 0.1493ms 31.9376μs 31.3110 KOps/s 31.6658 KOps/s $\color{#d91a1a}-1.12\%$
test_set_nested 84.8580μs 21.2096μs 47.1484 KOps/s 50.8771 KOps/s $\textbf{\color{#d91a1a}-7.33\%}$
test_set_nested_new 0.1094ms 25.8649μs 38.6625 KOps/s 40.9269 KOps/s $\textbf{\color{#d91a1a}-5.53\%}$
test_select 0.2203ms 41.6108μs 24.0322 KOps/s 25.5339 KOps/s $\textbf{\color{#d91a1a}-5.88\%}$
test_select_nested 0.1423ms 60.2109μs 16.6083 KOps/s 16.8557 KOps/s $\color{#d91a1a}-1.47\%$
test_exclude_nested 0.1177ms 75.9672μs 13.1636 KOps/s 13.3532 KOps/s $\color{#d91a1a}-1.42\%$
test_empty[True] 0.5456ms 0.3534ms 2.8295 KOps/s 2.8128 KOps/s $\color{#35bf28}+0.59\%$
test_empty[False] 14.5673μs 1.3033μs 767.2588 KOps/s 798.4231 KOps/s $\color{#d91a1a}-3.90\%$
test_unbind_speed 0.5019ms 0.2708ms 3.6924 KOps/s 3.8473 KOps/s $\color{#d91a1a}-4.03\%$
test_unbind_speed_stack0 0.4667ms 0.2614ms 3.8259 KOps/s 3.8958 KOps/s $\color{#d91a1a}-1.80\%$
test_unbind_speed_stack1 0.1011s 0.7765ms 1.2878 KOps/s 1.4269 KOps/s $\textbf{\color{#d91a1a}-9.75\%}$
test_split 1.7910ms 1.5739ms 635.3529 Ops/s 583.6478 Ops/s $\textbf{\color{#35bf28}+8.86\%}$
test_chunk 0.1014s 1.8979ms 526.9111 Ops/s 578.7770 Ops/s $\textbf{\color{#d91a1a}-8.96\%}$
test_consolidate_njt[False-None] 8.9248ms 8.3358ms 119.9650 Ops/s 123.6269 Ops/s $\color{#d91a1a}-2.96\%$
test_creation[device0] 0.2701ms 91.6241μs 10.9142 KOps/s 10.9469 KOps/s $\color{#d91a1a}-0.30\%$
test_creation_from_tensor 3.8416ms 95.2757μs 10.4959 KOps/s 10.4582 KOps/s $\color{#35bf28}+0.36\%$
test_add_one[memmap_tensor0] 0.1730ms 5.1102μs 195.6861 KOps/s 208.3096 KOps/s $\textbf{\color{#d91a1a}-6.06\%}$
test_contiguous[memmap_tensor0] 23.5940μs 0.5253μs 1.9037 MOps/s 1.9761 MOps/s $\color{#d91a1a}-3.66\%$
test_stack[memmap_tensor0] 29.2540μs 3.3527μs 298.2709 KOps/s 303.8788 KOps/s $\color{#d91a1a}-1.85\%$
test_memmaptd_index 0.9478ms 0.2435ms 4.1064 KOps/s 4.2859 KOps/s $\color{#d91a1a}-4.19\%$
test_memmaptd_index_astensor 0.7201ms 0.3222ms 3.1034 KOps/s 3.2022 KOps/s $\color{#d91a1a}-3.08\%$
test_memmaptd_index_op 1.0151ms 0.5912ms 1.6916 KOps/s 1.8483 KOps/s $\textbf{\color{#d91a1a}-8.48\%}$
test_serialize_model 0.1245s 0.1163s 8.5998 Ops/s 7.6157 Ops/s $\textbf{\color{#35bf28}+12.92\%}$
test_serialize_model_pickle 0.4610s 0.3903s 2.5621 Ops/s 2.4835 Ops/s $\color{#35bf28}+3.16\%$
test_serialize_weights 0.2242s 0.1310s 7.6321 Ops/s 8.8643 Ops/s $\textbf{\color{#d91a1a}-13.90\%}$
test_serialize_weights_returnearly 0.1686s 0.1607s 6.2227 Ops/s 6.5843 Ops/s $\textbf{\color{#d91a1a}-5.49\%}$
test_serialize_weights_pickle 0.6311s 0.4558s 2.1941 Ops/s 2.5443 Ops/s $\textbf{\color{#d91a1a}-13.76\%}$
test_serialize_weights_filesystem 0.1490s 0.1406s 7.1119 Ops/s 7.1272 Ops/s $\color{#d91a1a}-0.22\%$
test_serialize_model_filesystem 0.1644s 0.1506s 6.6389 Ops/s 6.1530 Ops/s $\textbf{\color{#35bf28}+7.90\%}$
test_reshape_pytree 81.6120μs 26.7418μs 37.3947 KOps/s 36.0513 KOps/s $\color{#35bf28}+3.73\%$
test_reshape_td 87.4520μs 32.1421μs 31.1119 KOps/s 30.5749 KOps/s $\color{#35bf28}+1.76\%$
test_view_pytree 63.7990μs 26.9486μs 37.1077 KOps/s 37.5261 KOps/s $\color{#d91a1a}-1.12\%$
test_view_td 0.1011ms 37.7685μs 26.4771 KOps/s 26.8492 KOps/s $\color{#d91a1a}-1.39\%$
test_unbind_pytree 66.0430μs 30.2227μs 33.0877 KOps/s 33.4006 KOps/s $\color{#d91a1a}-0.94\%$
test_unbind_td 0.3222ms 39.3391μs 25.4200 KOps/s 25.7583 KOps/s $\color{#d91a1a}-1.31\%$
test_split_pytree 90.0880μs 30.2613μs 33.0455 KOps/s 33.8097 KOps/s $\color{#d91a1a}-2.26\%$
test_split_td 0.4858ms 44.2446μs 22.6016 KOps/s 22.9935 KOps/s $\color{#d91a1a}-1.70\%$
test_add_pytree 72.5550μs 36.7179μs 27.2347 KOps/s 27.3511 KOps/s $\color{#d91a1a}-0.43\%$
test_add_td 0.1186ms 53.8141μs 18.5825 KOps/s 19.9292 KOps/s $\textbf{\color{#d91a1a}-6.76\%}$
test_compile_add_one_nested[tensordict-compile] 0.2439ms 64.3184μs 15.5477 KOps/s 16.0773 KOps/s $\color{#d91a1a}-3.29\%$
test_compile_add_one_nested[tensordict-eager] 0.3732ms 0.1627ms 6.1446 KOps/s 6.2876 KOps/s $\color{#d91a1a}-2.27\%$
test_compile_add_one_nested[pytree-compile] 0.1083ms 46.5243μs 21.4942 KOps/s 21.7415 KOps/s $\color{#d91a1a}-1.14\%$
test_compile_add_one_nested[pytree-eager] 0.2399ms 0.1214ms 8.2405 KOps/s 8.4109 KOps/s $\color{#d91a1a}-2.03\%$
test_compile_copy_nested[tensordict-compile] 0.1176ms 26.6210μs 37.5643 KOps/s 39.3398 KOps/s $\color{#d91a1a}-4.51\%$
test_compile_copy_nested[tensordict-eager] 0.1053ms 54.0539μs 18.5001 KOps/s 18.7894 KOps/s $\color{#d91a1a}-1.54\%$
test_compile_copy_nested[pytree-compile] 0.1468ms 79.9046μs 12.5149 KOps/s 12.6127 KOps/s $\color{#d91a1a}-0.78\%$
test_compile_copy_nested[pytree-eager] 0.1304ms 68.7721μs 14.5408 KOps/s 14.7550 KOps/s $\color{#d91a1a}-1.45\%$
test_compile_add_one_flat[tensordict-compile] 0.2225ms 0.1060ms 9.4350 KOps/s 9.5666 KOps/s $\color{#d91a1a}-1.38\%$
test_compile_add_one_flat[tensordict-eager] 0.3334ms 0.2019ms 4.9522 KOps/s 4.9235 KOps/s $\color{#35bf28}+0.58\%$
test_compile_add_one_flat[tensorclass-compile] 0.1059ms 45.1827μs 22.1324 KOps/s 21.8517 KOps/s $\color{#35bf28}+1.28\%$
test_compile_add_one_flat[tensorclass-eager] 0.4497ms 61.4332μs 16.2778 KOps/s 16.6067 KOps/s $\color{#d91a1a}-1.98\%$
test_compile_add_one_flat[pytree-compile] 0.2331ms 0.1052ms 9.5046 KOps/s 9.7368 KOps/s $\color{#d91a1a}-2.39\%$
test_compile_add_one_flat[pytree-eager] 0.4143ms 0.2109ms 4.7407 KOps/s 5.0168 KOps/s $\textbf{\color{#d91a1a}-5.50\%}$
test_compile_add_self_flat[tensordict-eager] 0.3854ms 0.2134ms 4.6860 KOps/s 4.6991 KOps/s $\color{#d91a1a}-0.28\%$
test_compile_add_self_flat[tensordict-compile] 0.1825ms 0.1064ms 9.4005 KOps/s 9.4795 KOps/s $\color{#d91a1a}-0.83\%$
test_compile_add_self_flat[tensorclass-eager] 0.1233ms 54.2046μs 18.4486 KOps/s 18.7807 KOps/s $\color{#d91a1a}-1.77\%$
test_compile_add_self_flat[tensorclass-compile] 95.8390μs 45.5742μs 21.9422 KOps/s 22.0038 KOps/s $\color{#d91a1a}-0.28\%$
test_compile_add_self_flat[pytree-eager] 0.5933ms 0.1639ms 6.1021 KOps/s 6.3475 KOps/s $\color{#d91a1a}-3.87\%$
test_compile_add_self_flat[pytree-compile] 0.2015ms 0.1043ms 9.5891 KOps/s 9.3294 KOps/s $\color{#35bf28}+2.78\%$
test_compile_copy_flat[tensordict-compile] 55.1330μs 21.3119μs 46.9222 KOps/s 48.3580 KOps/s $\color{#d91a1a}-2.97\%$
test_compile_copy_flat[tensordict-eager] 0.1229ms 59.2460μs 16.8788 KOps/s 15.9795 KOps/s $\textbf{\color{#35bf28}+5.63\%}$
test_compile_copy_flat[pytree-compile] 0.1644ms 85.3151μs 11.7213 KOps/s 12.3039 KOps/s $\color{#d91a1a}-4.74\%$
test_compile_copy_flat[pytree-eager] 0.1450ms 72.8638μs 13.7242 KOps/s 14.3369 KOps/s $\color{#d91a1a}-4.27\%$
test_compile_assign_and_add[tensordict-compile] 0.3644ms 0.2102ms 4.7578 KOps/s 4.8428 KOps/s $\color{#d91a1a}-1.75\%$
test_compile_assign_and_add[tensordict-eager] 2.0627ms 1.2946ms 772.4627 Ops/s 782.4049 Ops/s $\color{#d91a1a}-1.27\%$
test_compile_assign_and_add[pytree-compile] 0.3752ms 0.2086ms 4.7931 KOps/s 4.9841 KOps/s $\color{#d91a1a}-3.83\%$
test_compile_assign_and_add[pytree-eager] 1.3521ms 0.7999ms 1.2501 KOps/s 1.3054 KOps/s $\color{#d91a1a}-4.23\%$
test_compile_assign_and_add_stack[compile] 0.8013ms 0.4683ms 2.1352 KOps/s 2.1400 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_assign_and_add_stack[eager] 3.3715ms 2.6234ms 381.1854 Ops/s 399.0769 Ops/s $\color{#d91a1a}-4.48\%$
test_compile_indexing[tensor-tensordict-compile] 98.0220μs 36.4809μs 27.4116 KOps/s 27.2631 KOps/s $\color{#35bf28}+0.54\%$
test_compile_indexing[tensor-tensordict-eager] 0.5838ms 33.7031μs 29.6709 KOps/s 31.6484 KOps/s $\textbf{\color{#d91a1a}-6.25\%}$
test_compile_indexing[tensor-tensorclass-compile] 84.6070μs 30.0483μs 33.2797 KOps/s 34.0752 KOps/s $\color{#d91a1a}-2.33\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1832ms 23.9402μs 41.7707 KOps/s 43.2867 KOps/s $\color{#d91a1a}-3.50\%$
test_compile_indexing[tensor-pytree-compile] 0.1200ms 30.3799μs 32.9165 KOps/s 33.3309 KOps/s $\color{#d91a1a}-1.24\%$
test_compile_indexing[tensor-pytree-eager] 81.2110μs 24.1760μs 41.3633 KOps/s 43.5952 KOps/s $\textbf{\color{#d91a1a}-5.12\%}$
test_compile_indexing[slice-tensordict-compile] 0.1536ms 52.4998μs 19.0477 KOps/s 19.4417 KOps/s $\color{#d91a1a}-2.03\%$
test_compile_indexing[slice-tensordict-eager] 0.5777ms 20.0699μs 49.8259 KOps/s 51.1617 KOps/s $\color{#d91a1a}-2.61\%$
test_compile_indexing[slice-tensorclass-compile] 0.1201ms 45.3024μs 22.0739 KOps/s 22.4237 KOps/s $\color{#d91a1a}-1.56\%$
test_compile_indexing[slice-tensorclass-eager] 67.9260μs 18.9718μs 52.7099 KOps/s 52.9038 KOps/s $\color{#d91a1a}-0.37\%$
test_compile_indexing[slice-pytree-compile] 0.1447ms 45.5717μs 21.9434 KOps/s 22.1375 KOps/s $\color{#d91a1a}-0.88\%$
test_compile_indexing[slice-pytree-eager] 77.9020μs 18.9376μs 52.8049 KOps/s 51.9187 KOps/s $\color{#35bf28}+1.71\%$
test_compile_indexing[int-tensordict-compile] 0.1283ms 53.9094μs 18.5496 KOps/s 19.3852 KOps/s $\color{#d91a1a}-4.31\%$
test_compile_indexing[int-tensordict-eager] 0.9406ms 19.9025μs 50.2450 KOps/s 51.8242 KOps/s $\color{#d91a1a}-3.05\%$
test_compile_indexing[int-tensorclass-compile] 0.1004ms 45.9777μs 21.7497 KOps/s 22.6064 KOps/s $\color{#d91a1a}-3.79\%$
test_compile_indexing[int-tensorclass-eager] 68.1670μs 19.3348μs 51.7201 KOps/s 54.1685 KOps/s $\color{#d91a1a}-4.52\%$
test_compile_indexing[int-pytree-compile] 0.1127ms 46.2440μs 21.6244 KOps/s 22.5784 KOps/s $\color{#d91a1a}-4.23\%$
test_compile_indexing[int-pytree-eager] 78.5960μs 19.0504μs 52.4923 KOps/s 54.0501 KOps/s $\color{#d91a1a}-2.88\%$
test_mod_add[eager] 75.5200μs 26.6177μs 37.5690 KOps/s 41.9757 KOps/s $\textbf{\color{#d91a1a}-10.50\%}$
test_mod_add[compile] 0.1097ms 45.7086μs 21.8777 KOps/s 21.9189 KOps/s $\color{#d91a1a}-0.19\%$
test_mod_add[compile-overhead] 0.1336ms 46.4343μs 21.5358 KOps/s 22.6114 KOps/s $\color{#d91a1a}-4.76\%$
test_mod_wrap[eager] 0.4446ms 0.2169ms 4.6103 KOps/s 4.7853 KOps/s $\color{#d91a1a}-3.66\%$
test_mod_wrap[compile] 1.7393ms 0.2085ms 4.7963 KOps/s 4.9876 KOps/s $\color{#d91a1a}-3.83\%$
test_mod_wrap[compile-overhead] 1.7946ms 0.2063ms 4.8464 KOps/s 4.7029 KOps/s $\color{#35bf28}+3.05\%$
test_mod_wrap_and_backward[eager] 21.7749ms 12.4455ms 80.3506 Ops/s 88.1953 Ops/s $\textbf{\color{#d91a1a}-8.89\%}$
test_mod_wrap_and_backward[compile] 18.8910ms 12.6831ms 78.8449 Ops/s 74.1376 Ops/s $\textbf{\color{#35bf28}+6.35\%}$
test_mod_wrap_and_backward[compile-overhead] 14.2256ms 11.9939ms 83.3756 Ops/s 75.9622 Ops/s $\textbf{\color{#35bf28}+9.76\%}$
test_seq_add[eager] 0.1764ms 92.6751μs 10.7904 KOps/s 11.8210 KOps/s $\textbf{\color{#d91a1a}-8.72\%}$
test_seq_add[compile] 0.1373ms 61.6539μs 16.2196 KOps/s 16.6889 KOps/s $\color{#d91a1a}-2.81\%$
test_seq_add[compile-overhead] 0.1786ms 60.6318μs 16.4930 KOps/s 16.7977 KOps/s $\color{#d91a1a}-1.81\%$
test_seq_wrap[eager] 0.7122ms 0.3905ms 2.5606 KOps/s 2.6427 KOps/s $\color{#d91a1a}-3.11\%$
test_seq_wrap[compile] 0.4285ms 0.2312ms 4.3252 KOps/s 4.4553 KOps/s $\color{#d91a1a}-2.92\%$
test_seq_wrap[compile-overhead] 0.3935ms 0.2306ms 4.3370 KOps/s 4.4763 KOps/s $\color{#d91a1a}-3.11\%$
test_func_call_runtime[False-eager] 0.7312ms 0.5526ms 1.8097 KOps/s 1.8884 KOps/s $\color{#d91a1a}-4.17\%$
test_func_call_runtime[False-compile] 0.5655ms 0.4331ms 2.3089 KOps/s 2.3276 KOps/s $\color{#d91a1a}-0.80\%$
test_func_call_runtime[False-compile-overhead] 0.8477ms 0.4348ms 2.2997 KOps/s 2.3470 KOps/s $\color{#d91a1a}-2.01\%$
test_func_call_runtime[True-eager] 1.2729ms 0.7697ms 1.2992 KOps/s 1.3517 KOps/s $\color{#d91a1a}-3.89\%$
test_func_call_runtime[True-compile] 0.8154ms 0.4715ms 2.1209 KOps/s 2.1468 KOps/s $\color{#d91a1a}-1.21\%$
test_func_call_runtime[True-compile-overhead] 0.9948ms 0.4735ms 2.1121 KOps/s 2.1480 KOps/s $\color{#d91a1a}-1.67\%$
test_func_call_cm_runtime[False-eager] 1.1829ms 0.5584ms 1.7909 KOps/s 1.8882 KOps/s $\textbf{\color{#d91a1a}-5.15\%}$
test_func_call_cm_runtime[False-compile] 0.7821ms 0.4303ms 2.3242 KOps/s 2.3653 KOps/s $\color{#d91a1a}-1.74\%$
test_func_call_cm_runtime[False-compile-overhead] 0.7824ms 0.4296ms 2.3279 KOps/s 2.3299 KOps/s $\color{#d91a1a}-0.09\%$
test_func_call_cm_runtime[True-eager] 1.5107ms 0.9040ms 1.1062 KOps/s 1.1441 KOps/s $\color{#d91a1a}-3.31\%$
test_func_call_cm_runtime[True-compile] 0.8050ms 0.4984ms 2.0066 KOps/s 2.0096 KOps/s $\color{#d91a1a}-0.15\%$
test_func_call_cm_runtime[True-compile-overhead] 0.6031ms 0.4945ms 2.0223 KOps/s 2.0317 KOps/s $\color{#d91a1a}-0.46\%$
test_vmap_func_call_cm_runtime[eager] 2.7126ms 1.9202ms 520.7659 Ops/s 535.6792 Ops/s $\color{#d91a1a}-2.78\%$
test_vmap_func_call_cm_runtime[compile] 0.9109ms 0.5284ms 1.8925 KOps/s 1.9129 KOps/s $\color{#d91a1a}-1.06\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.9293ms 0.5295ms 1.8885 KOps/s 1.9342 KOps/s $\color{#d91a1a}-2.36\%$
test_distributed 0.3219ms 0.1282ms 7.7984 KOps/s 7.7012 KOps/s $\color{#35bf28}+1.26\%$
test_tdmodule 60.0120μs 18.3585μs 54.4706 KOps/s 56.3563 KOps/s $\color{#d91a1a}-3.35\%$
test_tdmodule_dispatch 68.7170μs 35.7605μs 27.9638 KOps/s 28.7370 KOps/s $\color{#d91a1a}-2.69\%$
test_tdseq 47.0770μs 20.9232μs 47.7938 KOps/s 50.5921 KOps/s $\textbf{\color{#d91a1a}-5.53\%}$
test_tdseq_dispatch 82.6540μs 40.7365μs 24.5480 KOps/s 25.7711 KOps/s $\color{#d91a1a}-4.75\%$
test_instantiation_functorch 2.4289ms 1.5748ms 635.0176 Ops/s 657.3026 Ops/s $\color{#d91a1a}-3.39\%$
test_exec_functorch 0.3664ms 0.1824ms 5.4825 KOps/s 5.5575 KOps/s $\color{#d91a1a}-1.35\%$
test_exec_functional_call 0.4271ms 0.1737ms 5.7575 KOps/s 5.7482 KOps/s $\color{#35bf28}+0.16\%$
test_exec_td_decorator 0.5350ms 0.2303ms 4.3414 KOps/s 4.4024 KOps/s $\color{#d91a1a}-1.39\%$
test_vmap_mlp_speed_decorator[True-True] 0.8925ms 0.6396ms 1.5635 KOps/s 1.5844 KOps/s $\color{#d91a1a}-1.32\%$
test_vmap_mlp_speed_decorator[True-False] 1.0310ms 0.6420ms 1.5578 KOps/s 1.6005 KOps/s $\color{#d91a1a}-2.67\%$
test_vmap_mlp_speed_decorator[False-True] 0.7219ms 0.5264ms 1.8996 KOps/s 1.9351 KOps/s $\color{#d91a1a}-1.83\%$
test_vmap_mlp_speed_decorator[False-False] 0.8318ms 0.5280ms 1.8939 KOps/s 1.9298 KOps/s $\color{#d91a1a}-1.86\%$
test_to_module_speed[True] 1.4998ms 1.3202ms 757.4639 Ops/s 767.6307 Ops/s $\color{#d91a1a}-1.32\%$
test_to_module_speed[False] 2.0482ms 1.2926ms 773.6465 Ops/s 786.1529 Ops/s $\color{#d91a1a}-1.59\%$
test_tc_init 82.8540μs 44.9841μs 22.2301 KOps/s 24.8824 KOps/s $\textbf{\color{#d91a1a}-10.66\%}$
test_tc_init_nested 0.1509ms 89.6982μs 11.1485 KOps/s 12.5249 KOps/s $\textbf{\color{#d91a1a}-10.99\%}$
test_tc_first_layer_tensor 39.1930μs 1.5534μs 643.7531 KOps/s 674.8370 KOps/s $\color{#d91a1a}-4.61\%$
test_tc_first_layer_nontensor 25.1670μs 4.7421μs 210.8779 KOps/s 216.2479 KOps/s $\color{#d91a1a}-2.48\%$
test_tc_second_layer_tensor 44.1030μs 2.9277μs 341.5654 KOps/s 363.9477 KOps/s $\textbf{\color{#d91a1a}-6.15\%}$
test_tc_second_layer_nontensor 36.5780μs 6.0666μs 164.8365 KOps/s 166.4325 KOps/s $\color{#d91a1a}-0.96\%$
test_unbind 0.2368s 15.2605ms 65.5288 Ops/s 80.1158 Ops/s $\textbf{\color{#d91a1a}-18.21\%}$
test_full_like 9.8212ms 8.1753ms 122.3196 Ops/s 130.2432 Ops/s $\textbf{\color{#d91a1a}-6.08\%}$
test_zeros_like 4.1974ms 3.2412ms 308.5284 Ops/s 145.3925 Ops/s $\textbf{\color{#35bf28}+112.20\%}$
test_ones_like 4.0240ms 3.5565ms 281.1784 Ops/s 123.3444 Ops/s $\textbf{\color{#35bf28}+127.96\%}$
test_clone 8.2610ms 5.6871ms 175.8358 Ops/s 103.3186 Ops/s $\textbf{\color{#35bf28}+70.19\%}$
test_squeeze 63.6690μs 12.2968μs 81.3220 KOps/s 83.1657 KOps/s $\color{#d91a1a}-2.22\%$
test_unsqueeze 0.3201ms 91.7510μs 10.8991 KOps/s 11.2268 KOps/s $\color{#d91a1a}-2.92\%$
test_split 0.4355ms 0.1887ms 5.2991 KOps/s 5.2944 KOps/s $\color{#35bf28}+0.09\%$
test_permute 0.3647ms 0.2195ms 4.5553 KOps/s 4.5622 KOps/s $\color{#d91a1a}-0.15\%$
test_stack 31.0928ms 27.2412ms 36.7091 Ops/s 39.4929 Ops/s $\textbf{\color{#d91a1a}-7.05\%}$
test_cat 31.8017ms 27.1672ms 36.8090 Ops/s 39.5785 Ops/s $\textbf{\color{#d91a1a}-7.00\%}$
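
The Change column appears to be the relative difference between the Ops column and the Ops on Repo HEAD column, and Ops appears to be the reciprocal of Mean. The sketch below checks that reading against a few rows from the table. The function names `ops_per_second` and `percent_change` are hypothetical, and the formulas are inferred from the reported numbers rather than taken from the benchmark workflow itself.

```python
# Sketch only: the formulas below are inferred from the rows above,
# not taken from the benchmark workflow's source.


def ops_per_second(mean_seconds: float) -> float:
    """Throughput implied by a mean runtime (assumed: Ops = 1 / Mean)."""
    return 1.0 / mean_seconds


def percent_change(ops_pr: float, ops_head: float) -> float:
    """Relative change of the PR's throughput versus Repo HEAD, in percent."""
    return (ops_pr / ops_head - 1.0) * 100.0


# (name, Ops, Ops on Repo HEAD) copied from three rows of the CPU table.
rows = [
    ("test_plain_set_nested", 56.4056, 60.2000),  # KOps/s
    ("test_creation_empty", 92.4915, 119.0075),   # KOps/s
    ("test_zeros_like", 308.5284, 145.3925),      # Ops/s
]

for name, ops_pr, ops_head in rows:
    # Units cancel in the ratio, so KOps/s and Ops/s rows are handled alike.
    print(f"{name}: {percent_change(ops_pr, ops_head):+.2f}%")
```

Under this reading, the three rows reproduce the reported -6.30%, -22.28%, and +112.20%. Because the table shows rounded values, some rows can differ from the reported Change in the last digit.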

github-actions bot commented Nov 6, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}9$. Worsened: $\large\color{#d91a1a}17$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 30.2600μs 10.6968μs 93.4861 KOps/s 93.7473 KOps/s $\color{#d91a1a}-0.28\%$
test_plain_set_stack_nested 33.8810μs 10.8173μs 92.4445 KOps/s 93.0594 KOps/s $\color{#d91a1a}-0.66\%$
test_plain_set_nested_inplace 0.1200ms 11.7163μs 85.3515 KOps/s 86.0545 KOps/s $\color{#d91a1a}-0.82\%$
test_plain_set_stack_nested_inplace 73.8810μs 11.6196μs 86.0616 KOps/s 86.2080 KOps/s $\color{#d91a1a}-0.17\%$
test_items 35.1310μs 2.8858μs 346.5296 KOps/s 345.2307 KOps/s $\color{#35bf28}+0.38\%$
test_items_nested 0.3669ms 0.3177ms 3.1471 KOps/s 3.1343 KOps/s $\color{#35bf28}+0.41\%$
test_items_nested_locked 0.4247ms 0.3201ms 3.1240 KOps/s 3.1080 KOps/s $\color{#35bf28}+0.52\%$
test_items_nested_leaf 0.2293ms 58.0325μs 17.2317 KOps/s 17.2449 KOps/s $\color{#d91a1a}-0.08\%$
test_items_stack_nested 0.4926ms 0.3190ms 3.1348 KOps/s 3.1236 KOps/s $\color{#35bf28}+0.36\%$
test_items_stack_nested_leaf 0.2324ms 59.3958μs 16.8362 KOps/s 17.0688 KOps/s $\color{#d91a1a}-1.36\%$
test_items_stack_nested_locked 0.4925ms 0.3260ms 3.0677 KOps/s 3.1147 KOps/s $\color{#d91a1a}-1.51\%$
test_keys 29.7610μs 3.4562μs 289.3322 KOps/s 274.8032 KOps/s $\textbf{\color{#35bf28}+5.29\%}$
test_keys_nested 0.2550ms 70.1649μs 14.2521 KOps/s 14.1591 KOps/s $\color{#35bf28}+0.66\%$
test_keys_nested_locked 0.7587ms 75.4310μs 13.2572 KOps/s 13.0998 KOps/s $\color{#35bf28}+1.20\%$
test_keys_nested_leaf 91.8910μs 61.5048μs 16.2589 KOps/s 16.1461 KOps/s $\color{#35bf28}+0.70\%$
test_keys_stack_nested 0.1044ms 70.8285μs 14.1186 KOps/s 14.0356 KOps/s $\color{#35bf28}+0.59\%$
test_keys_stack_nested_leaf 0.1072ms 61.9022μs 16.1545 KOps/s 16.1150 KOps/s $\color{#35bf28}+0.25\%$
test_keys_stack_nested_locked 0.2028ms 75.9155μs 13.1725 KOps/s 13.0238 KOps/s $\color{#35bf28}+1.14\%$
test_values 7.5885μs 0.8419μs 1.1878 MOps/s 1.1847 MOps/s $\color{#35bf28}+0.26\%$
test_values_nested 66.4910μs 31.4193μs 31.8276 KOps/s 32.1473 KOps/s $\color{#d91a1a}-0.99\%$
test_values_nested_locked 64.9610μs 32.7059μs 30.5755 KOps/s 30.5962 KOps/s $\color{#d91a1a}-0.07\%$
test_values_nested_leaf 96.1020μs 33.7108μs 29.6641 KOps/s 29.5883 KOps/s $\color{#35bf28}+0.26\%$
test_values_stack_nested 69.7310μs 31.6601μs 31.5855 KOps/s 31.6664 KOps/s $\color{#d91a1a}-0.26\%$
test_values_stack_nested_leaf 71.0810μs 34.1626μs 29.2717 KOps/s 29.6287 KOps/s $\color{#d91a1a}-1.20\%$
test_values_stack_nested_locked 82.7410μs 33.2408μs 30.0835 KOps/s 30.0197 KOps/s $\color{#35bf28}+0.21\%$
test_membership 4.0791μs 0.5088μs 1.9655 MOps/s 1.9504 MOps/s $\color{#35bf28}+0.77\%$
test_membership_nested 16.1100μs 1.9199μs 520.8592 KOps/s 507.9010 KOps/s $\color{#35bf28}+2.55\%$
test_membership_nested_leaf 14.0267μs 1.8712μs 534.4187 KOps/s 530.0153 KOps/s $\color{#35bf28}+0.83\%$
test_membership_stacked_nested 22.2400μs 1.9544μs 511.6656 KOps/s 503.9158 KOps/s $\color{#35bf28}+1.54\%$
test_membership_stacked_nested_leaf 42.3610μs 1.9989μs 500.2811 KOps/s 507.7895 KOps/s $\color{#d91a1a}-1.48\%$
test_membership_nested_last 39.2700μs 2.8274μs 353.6855 KOps/s 353.5632 KOps/s $\color{#35bf28}+0.03\%$
test_membership_nested_leaf_last 37.8910μs 2.8493μs 350.9577 KOps/s 352.1119 KOps/s $\color{#d91a1a}-0.33\%$
test_membership_stacked_nested_last 31.6010μs 4.8175μs 207.5778 KOps/s 350.4738 KOps/s $\textbf{\color{#d91a1a}-40.77\%}$
test_membership_stacked_nested_leaf_last 73.8120μs 4.8097μs 207.9112 KOps/s 353.3274 KOps/s $\textbf{\color{#d91a1a}-41.16\%}$
test_nested_getleaf 28.0800μs 6.0135μs 166.2912 KOps/s 165.0153 KOps/s $\color{#35bf28}+0.77\%$
test_nested_get 38.8500μs 5.7326μs 174.4396 KOps/s 174.8017 KOps/s $\color{#d91a1a}-0.21\%$
test_stacked_getleaf 34.3710μs 6.0324μs 165.7704 KOps/s 164.5791 KOps/s $\color{#35bf28}+0.72\%$
test_stacked_get 56.1710μs 5.7381μs 174.2749 KOps/s 174.6535 KOps/s $\color{#d91a1a}-0.22\%$
test_nested_getitemleaf 59.8310μs 6.1012μs 163.9015 KOps/s 162.6326 KOps/s $\color{#35bf28}+0.78\%$
test_nested_getitem 28.9000μs 5.7659μs 173.4331 KOps/s 172.6132 KOps/s $\color{#35bf28}+0.48\%$
test_stacked_getitemleaf 36.6700μs 6.1021μs 163.8778 KOps/s 163.2860 KOps/s $\color{#35bf28}+0.36\%$
test_stacked_getitem 32.8600μs 5.7790μs 173.0403 KOps/s 172.4611 KOps/s $\color{#35bf28}+0.34\%$
test_lock_nested 1.1314ms 0.3717ms 2.6905 KOps/s 2.7201 KOps/s $\color{#d91a1a}-1.09\%$
test_lock_stack_nested 0.4190ms 0.3387ms 2.9523 KOps/s 2.9362 KOps/s $\color{#35bf28}+0.55\%$
test_unlock_nested 0.9971ms 0.3121ms 3.2040 KOps/s 3.2479 KOps/s $\color{#d91a1a}-1.35\%$
test_unlock_stack_nested 0.4159ms 0.2785ms 3.5903 KOps/s 3.5795 KOps/s $\color{#35bf28}+0.30\%$
test_flatten_speed 0.1093ms 72.8530μs 13.7263 KOps/s 13.9089 KOps/s $\color{#d91a1a}-1.31\%$
test_unflatten_speed 0.3376ms 0.2964ms 3.3733 KOps/s 3.4637 KOps/s $\color{#d91a1a}-2.61\%$
test_common_ops 93.9528ms 0.6836ms 1.4628 KOps/s 1.6665 KOps/s $\textbf{\color{#d91a1a}-12.22\%}$
test_creation 0.1044ms 1.4875μs 672.2634 KOps/s 680.3091 KOps/s $\color{#d91a1a}-1.18\%$
test_creation_empty 41.7810μs 7.8917μs 126.7151 KOps/s 131.9778 KOps/s $\color{#d91a1a}-3.99\%$
test_creation_nested_1 29.1410μs 9.3965μs 106.4224 KOps/s 110.5683 KOps/s $\color{#d91a1a}-3.75\%$
test_creation_nested_2 44.0100μs 11.8855μs 84.1359 KOps/s 85.8615 KOps/s $\color{#d91a1a}-2.01\%$
test_clone 83.4310μs 10.6807μs 93.6267 KOps/s 92.3588 KOps/s $\color{#35bf28}+1.37\%$
test_getitem[int] 1.5288ms 11.1798μs 89.4472 KOps/s 90.0601 KOps/s $\color{#d91a1a}-0.68\%$
test_getitem[slice_int] 0.1156ms 23.1505μs 43.1956 KOps/s 46.4019 KOps/s $\textbf{\color{#d91a1a}-6.91\%}$
test_getitem[range] 0.2184ms 43.7420μs 22.8613 KOps/s 25.9231 KOps/s $\textbf{\color{#d91a1a}-11.81\%}$
test_getitem[tuple] 0.2120ms 19.8974μs 50.2579 KOps/s 53.6210 KOps/s $\textbf{\color{#d91a1a}-6.27\%}$
test_getitem[list] 0.2030ms 39.0330μs 25.6194 KOps/s 28.9461 KOps/s $\textbf{\color{#d91a1a}-11.49\%}$
test_setitem_dim[int] 43.0600μs 21.0858μs 47.4253 KOps/s 50.6924 KOps/s $\textbf{\color{#d91a1a}-6.44\%}$
test_setitem_dim[slice_int] 68.4110μs 41.1780μs 24.2848 KOps/s 25.3953 KOps/s $\color{#d91a1a}-4.37\%$
test_setitem_dim[range] 0.2322ms 59.2750μs 16.8705 KOps/s 18.1629 KOps/s $\textbf{\color{#d91a1a}-7.12\%}$
test_setitem_dim[tuple] 0.2070ms 35.7067μs 28.0060 KOps/s 29.6580 KOps/s $\textbf{\color{#d91a1a}-5.57\%}$
test_setitem 0.1939ms 15.3444μs 65.1702 KOps/s 65.4970 KOps/s $\color{#d91a1a}-0.50\%$
test_set 0.2046ms 14.8882μs 67.1673 KOps/s 66.7760 KOps/s $\color{#35bf28}+0.59\%$
test_set_shared 1.7705ms 0.1491ms 6.7072 KOps/s 6.7375 KOps/s $\color{#d91a1a}-0.45\%$
test_update 0.3269ms 17.7050μs 56.4811 KOps/s 56.9642 KOps/s $\color{#d91a1a}-0.85\%$
test_update_nested 0.2085ms 23.4370μs 42.6676 KOps/s 45.5094 KOps/s $\textbf{\color{#d91a1a}-6.24\%}$
test_update__nested 0.9921ms 24.9687μs 40.0501 KOps/s 40.2242 KOps/s $\color{#d91a1a}-0.43\%$
test_set_nested 0.1247ms 15.9423μs 62.7263 KOps/s 62.0567 KOps/s $\color{#35bf28}+1.08\%$
test_set_nested_new 97.6410μs 18.4510μs 54.1975 KOps/s 55.0399 KOps/s $\color{#d91a1a}-1.53\%$
test_select 0.2112ms 31.8102μs 31.4364 KOps/s 32.5363 KOps/s $\color{#d91a1a}-3.38\%$
test_select_nested 0.1501ms 41.8311μs 23.9057 KOps/s 23.8648 KOps/s $\color{#35bf28}+0.17\%$
test_exclude_nested 87.3410μs 59.5302μs 16.7982 KOps/s 16.9406 KOps/s $\color{#d91a1a}-0.84\%$
test_empty[True] 0.6175ms 0.2570ms 3.8907 KOps/s 3.9082 KOps/s $\color{#d91a1a}-0.45\%$
test_empty[False] 3.3990μs 0.7368μs 1.3572 MOps/s 1.3453 MOps/s $\color{#35bf28}+0.88\%$
test_to 84.8610μs 56.5809μs 17.6738 KOps/s 17.9108 KOps/s $\color{#d91a1a}-1.32\%$
test_to_nonblocking 0.2196ms 48.2288μs 20.7345 KOps/s 20.8229 KOps/s $\color{#d91a1a}-0.42\%$
test_unbind_speed 1.6770ms 0.2362ms 4.2333 KOps/s 4.2843 KOps/s $\color{#d91a1a}-1.19\%$
test_unbind_speed_stack0 0.3696ms 0.2339ms 4.2754 KOps/s 4.2843 KOps/s $\color{#d91a1a}-0.21\%$
test_unbind_speed_stack1 91.6925ms 0.6470ms 1.5457 KOps/s 1.5303 KOps/s $\color{#35bf28}+1.01\%$
test_split 94.3599ms 1.6103ms 620.9917 Ops/s 572.6692 Ops/s $\textbf{\color{#35bf28}+8.44\%}$
test_chunk 93.6143ms 1.6154ms 619.0555 Ops/s 678.4630 Ops/s $\textbf{\color{#d91a1a}-8.76\%}$
test_consolidate[False-None] 96.9454ms 2.8809ms 347.1135 Ops/s 343.7817 Ops/s $\color{#35bf28}+0.97\%$
test_consolidate[default-None] 1.8294ms 1.6463ms 607.4317 Ops/s 604.5958 Ops/s $\color{#35bf28}+0.47\%$
test_consolidate[reduce-overhead-None] 1.8768ms 1.7013ms 587.7766 Ops/s 592.6926 Ops/s $\color{#d91a1a}-0.83\%$
test_consolidate_njt[False-None] 7.0160ms 6.7831ms 147.4262 Ops/s 149.1160 Ops/s $\color{#d91a1a}-1.13\%$
test_to[False-False-None] 2.0232ms 1.7433ms 573.6389 Ops/s 574.5831 Ops/s $\color{#d91a1a}-0.16\%$
test_to[True-False-None] 1.6382ms 1.3906ms 719.1343 Ops/s 726.4584 Ops/s $\color{#d91a1a}-1.01\%$
test_to[within-False-None] 4.4532ms 4.1889ms 238.7242 Ops/s 240.0281 Ops/s $\color{#d91a1a}-0.54\%$
test_to[True-default-None] 6.1463ms 5.5655ms 179.6771 Ops/s 190.6434 Ops/s $\textbf{\color{#d91a1a}-5.75\%}$
test_to_njt[False-False-None] 7.6517ms 7.2285ms 138.3405 Ops/s 140.6843 Ops/s $\color{#d91a1a}-1.67\%$
test_to_njt[True-False-None] 6.0578ms 5.6768ms 176.1543 Ops/s 175.7311 Ops/s $\color{#35bf28}+0.24\%$
test_to_njt[within-False-None] 13.3236ms 12.5593ms 79.6224 Ops/s 79.7344 Ops/s $\color{#d91a1a}-0.14\%$
test_creation[device0] 0.5413ms 81.1880μs 12.3171 KOps/s 12.4792 KOps/s $\color{#d91a1a}-1.30\%$
test_creation_from_tensor 0.5059ms 85.0310μs 11.7604 KOps/s 11.8567 KOps/s $\color{#d91a1a}-0.81\%$
test_add_one[memmap_tensor0] 0.2377ms 6.9971μs 142.9159 KOps/s 141.0380 KOps/s $\color{#35bf28}+1.33\%$
test_contiguous[memmap_tensor0] 1.7790μs 0.4306μs 2.3226 MOps/s 2.3149 MOps/s $\color{#35bf28}+0.33\%$
test_stack[memmap_tensor0] 0.1524ms 4.6689μs 214.1838 KOps/s 211.2203 KOps/s $\color{#35bf28}+1.40\%$
test_memmaptd_index 0.5989ms 0.2631ms 3.8008 KOps/s 3.7614 KOps/s $\color{#35bf28}+1.05\%$
test_memmaptd_index_astensor 0.6205ms 0.3212ms 3.1130 KOps/s 3.0676 KOps/s $\color{#35bf28}+1.48\%$
test_memmaptd_index_op 1.0210ms 0.6059ms 1.6503 KOps/s 1.6516 KOps/s $\color{#d91a1a}-0.08\%$
test_serialize_model 0.1315s 0.1305s 7.6636 Ops/s 7.6502 Ops/s $\color{#35bf28}+0.17\%$
test_serialize_model_pickle 1.3489s 1.1856s 0.8435 Ops/s 0.8242 Ops/s $\color{#35bf28}+2.35\%$
test_serialize_weights 0.1308s 0.1299s 7.6961 Ops/s 7.6712 Ops/s $\color{#35bf28}+0.33\%$
test_serialize_weights_returnearly 0.4986s 70.7026ms 14.1437 Ops/s 11.4118 Ops/s $\textbf{\color{#35bf28}+23.94\%}$
test_serialize_weights_pickle 1.4266s 1.2288s 0.8138 Ops/s 0.8390 Ops/s $\color{#d91a1a}-3.01\%$
test_reshape_pytree 0.1714ms 22.7370μs 43.9811 KOps/s 44.1881 KOps/s $\color{#d91a1a}-0.47\%$
test_reshape_td 0.1127ms 27.1200μs 36.8732 KOps/s 36.2601 KOps/s $\color{#35bf28}+1.69\%$
test_view_pytree 0.1166ms 22.5568μs 44.3325 KOps/s 44.2455 KOps/s $\color{#35bf28}+0.20\%$
test_view_td 0.1132ms 31.6150μs 31.6305 KOps/s 31.9939 KOps/s $\color{#d91a1a}-1.14\%$
test_unbind_pytree 0.1340ms 28.7224μs 34.8160 KOps/s 35.0979 KOps/s $\color{#d91a1a}-0.80\%$
test_unbind_td 0.9026ms 36.0296μs 27.7549 KOps/s 25.3295 KOps/s $\textbf{\color{#35bf28}+9.58\%}$
test_split_pytree 0.1620ms 30.1544μs 33.1627 KOps/s 32.9190 KOps/s $\color{#35bf28}+0.74\%$
test_split_td 0.9729ms 39.8147μs 25.1164 KOps/s 24.4085 KOps/s $\color{#35bf28}+2.90\%$
test_add_pytree 0.1654ms 35.1173μs 28.4760 KOps/s 27.5815 KOps/s $\color{#35bf28}+3.24\%$
test_add_td 0.2438ms 46.2695μs 21.6125 KOps/s 20.9735 KOps/s $\color{#35bf28}+3.05\%$
test_compile_add_one_nested[tensordict-compile] 0.3092ms 0.1276ms 7.8374 KOps/s 7.9632 KOps/s $\color{#d91a1a}-1.58\%$
test_compile_add_one_nested[tensordict-eager] 0.2737ms 0.1253ms 7.9830 KOps/s 7.8324 KOps/s $\color{#35bf28}+1.92\%$
test_compile_add_one_nested[pytree-compile] 0.2460ms 99.1460μs 10.0861 KOps/s 9.8040 KOps/s $\color{#35bf28}+2.88\%$
test_compile_add_one_nested[pytree-eager] 0.3241ms 0.1543ms 6.4806 KOps/s 6.3511 KOps/s $\color{#35bf28}+2.04\%$
test_compile_copy_nested[tensordict-compile] 0.1999ms 24.1621μs 41.3872 KOps/s 42.4644 KOps/s $\color{#d91a1a}-2.54\%$
test_compile_copy_nested[tensordict-eager] 0.1355ms 26.6643μs 37.5033 KOps/s 36.6481 KOps/s $\color{#35bf28}+2.33\%$
test_compile_copy_nested[pytree-compile] 0.2102ms 64.8373μs 15.4232 KOps/s 15.4153 KOps/s $\color{#35bf28}+0.05\%$
test_compile_copy_nested[pytree-eager] 0.1955ms 49.0278μs 20.3966 KOps/s 19.9708 KOps/s $\color{#35bf28}+2.13\%$
test_compile_add_one_flat[tensordict-compile] 0.3310ms 0.1497ms 6.6782 KOps/s 6.4960 KOps/s $\color{#35bf28}+2.80\%$
test_compile_add_one_flat[tensordict-eager] 0.3703ms 0.2106ms 4.7484 KOps/s 4.7526 KOps/s $\color{#d91a1a}-0.09\%$
test_compile_add_one_flat[tensorclass-compile] 0.2951ms 0.1041ms 9.6084 KOps/s 10.0061 KOps/s $\color{#d91a1a}-3.97\%$
test_compile_add_one_flat[tensorclass-eager] 0.2393ms 54.6157μs 18.3097 KOps/s 18.7456 KOps/s $\color{#d91a1a}-2.33\%$
test_compile_add_one_flat[pytree-compile] 0.2948ms 0.1474ms 6.7863 KOps/s 6.5158 KOps/s $\color{#35bf28}+4.15\%$
test_compile_add_one_flat[pytree-eager] 0.6964ms 0.5028ms 1.9887 KOps/s 1.8731 KOps/s $\textbf{\color{#35bf28}+6.18\%}$
test_compile_add_self_flat[tensordict-eager] 0.4154ms 0.2510ms 3.9843 KOps/s 3.9837 KOps/s $\color{#35bf28}+0.01\%$
test_compile_add_self_flat[tensordict-compile] 0.3429ms 0.1462ms 6.8382 KOps/s 6.8206 KOps/s $\color{#35bf28}+0.26\%$
test_compile_add_self_flat[tensorclass-eager] 0.2691ms 63.8121μs 15.6710 KOps/s 15.7078 KOps/s $\color{#d91a1a}-0.23\%$
test_compile_add_self_flat[tensorclass-compile] 0.2462ms 0.1004ms 9.9602 KOps/s 10.0180 KOps/s $\color{#d91a1a}-0.58\%$
test_compile_add_self_flat[pytree-eager] 0.6218ms 0.4230ms 2.3642 KOps/s 2.3446 KOps/s $\color{#35bf28}+0.84\%$
test_compile_add_self_flat[pytree-compile] 0.2949ms 0.1430ms 6.9930 KOps/s 7.0630 KOps/s $\color{#d91a1a}-0.99\%$
test_compile_copy_flat[tensordict-compile] 0.1632ms 19.9514μs 50.1218 KOps/s 52.0812 KOps/s $\color{#d91a1a}-3.76\%$
test_compile_copy_flat[tensordict-eager] 0.1308ms 26.8189μs 37.2871 KOps/s 37.1817 KOps/s $\color{#35bf28}+0.28\%$
test_compile_copy_flat[pytree-compile] 0.2001ms 70.0236μs 14.2809 KOps/s 14.3648 KOps/s $\color{#d91a1a}-0.58\%$
test_compile_copy_flat[pytree-eager] 82.0610μs 51.6348μs 19.3668 KOps/s 19.3351 KOps/s $\color{#35bf28}+0.16\%$
test_compile_assign_and_add[tensordict-compile] 1.6734ms 0.3996ms 2.5023 KOps/s 2.1543 KOps/s $\textbf{\color{#35bf28}+16.16\%}$
test_compile_assign_and_add[tensordict-eager] 3.1234ms 2.7923ms 358.1254 Ops/s 354.2340 Ops/s $\color{#35bf28}+1.10\%$
test_compile_assign_and_add[pytree-compile] 1.6101ms 0.4399ms 2.2734 KOps/s 2.2055 KOps/s $\color{#35bf28}+3.08\%$
test_compile_assign_and_add[pytree-eager] 3.1941ms 2.8022ms 356.8630 Ops/s 351.6482 Ops/s $\color{#35bf28}+1.48\%$
test_compile_indexing[tensor-tensordict-compile] 0.3288ms 0.1244ms 8.0408 KOps/s 8.3875 KOps/s $\color{#d91a1a}-4.13\%$
test_compile_indexing[tensor-tensordict-eager] 0.5938ms 86.3579μs 11.5797 KOps/s 11.5154 KOps/s $\color{#35bf28}+0.56\%$
test_compile_indexing[tensor-tensorclass-compile] 0.3078ms 0.1159ms 8.6298 KOps/s 8.9319 KOps/s $\color{#d91a1a}-3.38\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2555ms 74.1303μs 13.4898 KOps/s 13.9153 KOps/s $\color{#d91a1a}-3.06\%$
test_compile_indexing[tensor-pytree-compile] 0.2978ms 0.1137ms 8.7986 KOps/s 8.8496 KOps/s $\color{#d91a1a}-0.58\%$
test_compile_indexing[tensor-pytree-eager] 0.2556ms 74.4684μs 13.4285 KOps/s 13.5643 KOps/s $\color{#d91a1a}-1.00\%$
test_compile_indexing[slice-tensordict-compile] 0.3301ms 0.1079ms 9.2684 KOps/s 9.6353 KOps/s $\color{#d91a1a}-3.81\%$
test_compile_indexing[slice-tensordict-eager] 0.2233ms 18.1779μs 55.0120 KOps/s 48.0995 KOps/s $\textbf{\color{#35bf28}+14.37\%}$
test_compile_indexing[slice-tensorclass-compile] 0.2753ms 0.1021ms 9.7981 KOps/s 10.0277 KOps/s $\color{#d91a1a}-2.29\%$
test_compile_indexing[slice-tensorclass-eager] 0.1627ms 16.0402μs 62.3434 KOps/s 61.8433 KOps/s $\color{#35bf28}+0.81\%$
test_compile_indexing[slice-pytree-compile] 0.2868ms 0.1012ms 9.8791 KOps/s 9.9591 KOps/s $\color{#d91a1a}-0.80\%$
test_compile_indexing[slice-pytree-eager] 0.1597ms 16.0985μs 62.1177 KOps/s 61.6286 KOps/s $\color{#35bf28}+0.79\%$
test_compile_indexing[int-tensordict-compile] 0.3074ms 0.1081ms 9.2504 KOps/s 9.5827 KOps/s $\color{#d91a1a}-3.47\%$
test_compile_indexing[int-tensordict-eager] 0.5826ms 17.7408μs 56.3673 KOps/s 56.0950 KOps/s $\color{#35bf28}+0.49\%$
test_compile_indexing[int-tensorclass-compile] 0.2995ms 0.1040ms 9.6178 KOps/s 9.9697 KOps/s $\color{#d91a1a}-3.53\%$
test_compile_indexing[int-tensorclass-eager] 0.1648ms 16.1247μs 62.0168 KOps/s 61.0681 KOps/s $\color{#35bf28}+1.55\%$
test_compile_indexing[int-pytree-compile] 0.2613ms 99.7593μs 10.0241 KOps/s 9.9510 KOps/s $\color{#35bf28}+0.73\%$
test_compile_indexing[int-pytree-eager] 0.1568ms 16.0372μs 62.3551 KOps/s 61.5870 KOps/s $\color{#35bf28}+1.25\%$
test_mod_add[eager] 0.1912ms 32.7897μs 30.4974 KOps/s 30.7110 KOps/s $\color{#d91a1a}-0.70\%$
test_mod_add[compile] 0.2278ms 78.6156μs 12.7201 KOps/s 12.4856 KOps/s $\color{#35bf28}+1.88\%$
test_mod_add[compile-overhead] 0.3119ms 0.1664ms 6.0087 KOps/s 5.6961 KOps/s $\textbf{\color{#35bf28}+5.49\%}$
test_mod_wrap[eager] 0.4377ms 0.2509ms 3.9854 KOps/s 3.9386 KOps/s $\color{#35bf28}+1.19\%$
test_mod_wrap[compile] 1.5818ms 0.2994ms 3.3404 KOps/s 3.4418 KOps/s $\color{#d91a1a}-2.95\%$
test_mod_wrap[compile-overhead] 7.6891ms 4.0866ms 244.7026 Ops/s 249.7799 Ops/s $\color{#d91a1a}-2.03\%$
test_mod_wrap_and_backward[eager] 1.6648ms 1.3893ms 719.7669 Ops/s 670.1383 Ops/s $\textbf{\color{#35bf28}+7.41\%}$
test_mod_wrap_and_backward[compile] 1.4836ms 1.2830ms 779.4342 Ops/s 771.7806 Ops/s $\color{#35bf28}+0.99\%$
test_mod_wrap_and_backward[compile-overhead] 1.3802ms 0.9158ms 1.0919 KOps/s 1.0665 KOps/s $\color{#35bf28}+2.38\%$
test_seq_add[eager] 0.5152ms 98.4599μs 10.1564 KOps/s 10.0898 KOps/s $\color{#35bf28}+0.66\%$
test_seq_add[compile] 0.4829ms 88.2810μs 11.3275 KOps/s 11.1509 KOps/s $\color{#35bf28}+1.58\%$
test_seq_add[compile-overhead] 0.2760ms 0.1297ms 7.7095 KOps/s 7.7251 KOps/s $\color{#d91a1a}-0.20\%$
test_seq_wrap[eager] 0.7962ms 0.3918ms 2.5523 KOps/s 2.5162 KOps/s $\color{#35bf28}+1.43\%$
test_seq_wrap[compile] 0.7316ms 0.3204ms 3.1216 KOps/s 3.2487 KOps/s $\color{#d91a1a}-3.91\%$
test_seq_wrap[compile-overhead] 0.3909ms 0.2237ms 4.4695 KOps/s 4.3892 KOps/s $\color{#35bf28}+1.83\%$
test_func_call_runtime[False-eager] 1.2304ms 0.8146ms 1.2275 KOps/s 1.2975 KOps/s $\textbf{\color{#d91a1a}-5.39\%}$
test_func_call_runtime[False-compile] 1.1683ms 0.7578ms 1.3196 KOps/s 1.3014 KOps/s $\color{#35bf28}+1.40\%$
test_func_call_runtime[False-compile-overhead] 0.7576ms 0.3630ms 2.7545 KOps/s 2.7366 KOps/s $\color{#35bf28}+0.65\%$
test_func_call_runtime[True-eager] 1.3500ms 0.9261ms 1.0798 KOps/s 1.0660 KOps/s $\color{#35bf28}+1.30\%$
test_func_call_runtime[True-compile] 1.1977ms 0.7790ms 1.2836 KOps/s 1.2653 KOps/s $\color{#35bf28}+1.45\%$
test_func_call_runtime[True-compile-overhead] 0.5353ms 0.3870ms 2.5841 KOps/s 2.5990 KOps/s $\color{#d91a1a}-0.57\%$
test_func_call_cm_runtime[False-eager] 1.2093ms 0.7742ms 1.2917 KOps/s 1.2708 KOps/s $\color{#35bf28}+1.64\%$
test_func_call_cm_runtime[False-compile] 1.1728ms 0.7721ms 1.2952 KOps/s 1.2443 KOps/s $\color{#35bf28}+4.09\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5616ms 0.3651ms 2.7393 KOps/s 2.7117 KOps/s $\color{#35bf28}+1.02\%$
test_func_call_cm_runtime[True-eager] 1.4343ms 1.0283ms 972.4348 Ops/s 966.7460 Ops/s $\color{#35bf28}+0.59\%$
test_func_call_cm_runtime[True-compile] 1.2125ms 0.8181ms 1.2223 KOps/s 1.2158 KOps/s $\color{#35bf28}+0.54\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5655ms 0.4119ms 2.4280 KOps/s 2.4050 KOps/s $\color{#35bf28}+0.95\%$
test_vmap_func_call_cm_runtime[eager] 2.6758ms 2.1275ms 470.0312 Ops/s 468.1666 Ops/s $\color{#35bf28}+0.40\%$
test_vmap_func_call_cm_runtime[compile] 1.0857ms 0.8491ms 1.1777 KOps/s 1.1995 KOps/s $\color{#d91a1a}-1.82\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5588ms 0.4165ms 2.4008 KOps/s 2.4159 KOps/s $\color{#d91a1a}-0.62\%$
test_distributed 5.4844ms 0.2699ms 3.7045 KOps/s 8.4105 KOps/s $\textbf{\color{#d91a1a}-55.95\%}$
test_tdmodule 52.0010μs 14.0712μs 71.0672 KOps/s 71.8202 KOps/s $\color{#d91a1a}-1.05\%$
test_tdmodule_dispatch 0.1423ms 27.8612μs 35.8922 KOps/s 36.4481 KOps/s $\color{#d91a1a}-1.53\%$
test_tdseq 32.9900μs 15.2518μs 65.5660 KOps/s 62.5510 KOps/s $\color{#35bf28}+4.82\%$
test_tdseq_dispatch 0.1314ms 30.4899μs 32.7978 KOps/s 31.5678 KOps/s $\color{#35bf28}+3.90\%$
test_instantiation_functorch 1.7105ms 1.5583ms 641.7080 Ops/s 638.0219 Ops/s $\color{#35bf28}+0.58\%$
test_exec_functorch 0.2738ms 0.1531ms 6.5310 KOps/s 6.5210 KOps/s $\color{#35bf28}+0.15\%$
test_exec_functional_call 0.2739ms 0.1427ms 7.0100 KOps/s 6.8617 KOps/s $\color{#35bf28}+2.16\%$
test_exec_td_decorator 0.4114ms 0.1908ms 5.2408 KOps/s 5.2365 KOps/s $\color{#35bf28}+0.08\%$
test_vmap_mlp_speed_decorator[True-True] 0.8854ms 0.6768ms 1.4775 KOps/s 1.4658 KOps/s $\color{#35bf28}+0.80\%$
test_vmap_mlp_speed_decorator[True-False] 0.8990ms 0.6808ms 1.4689 KOps/s 1.4514 KOps/s $\color{#35bf28}+1.21\%$
test_vmap_mlp_speed_decorator[False-True] 0.9217ms 0.6226ms 1.6062 KOps/s 1.6750 KOps/s $\color{#d91a1a}-4.11\%$
test_vmap_mlp_speed_decorator[False-False] 0.7865ms 0.6070ms 1.6473 KOps/s 1.6723 KOps/s $\color{#d91a1a}-1.49\%$
test_vmap_transformer_speed_decorator[True-True] 19.8156ms 19.5862ms 51.0562 Ops/s 50.9604 Ops/s $\color{#35bf28}+0.19\%$
test_vmap_transformer_speed_decorator[True-False] 20.5012ms 19.7317ms 50.6798 Ops/s 50.9140 Ops/s $\color{#d91a1a}-0.46\%$
test_vmap_transformer_speed_decorator[False-True] 19.7473ms 19.5007ms 51.2802 Ops/s 51.2738 Ops/s $\color{#35bf28}+0.01\%$
test_vmap_transformer_speed_decorator[False-False] 19.7510ms 19.4871ms 51.3160 Ops/s 51.3100 Ops/s $\color{#35bf28}+0.01\%$
test_to_module_speed[True] 1.0779ms 0.9388ms 1.0652 KOps/s 1.0552 KOps/s $\color{#35bf28}+0.94\%$
test_to_module_speed[False] 1.2830ms 0.9272ms 1.0785 KOps/s 1.0808 KOps/s $\color{#d91a1a}-0.21\%$
test_tc_init 92.6610μs 35.2660μs 28.3560 KOps/s 29.2644 KOps/s $\color{#d91a1a}-3.10\%$
test_tc_init_nested 0.1668ms 70.8528μs 14.1138 KOps/s 14.0853 KOps/s $\color{#35bf28}+0.20\%$
test_tc_first_layer_tensor 4.7300μs 0.7064μs 1.4157 MOps/s 1.4253 MOps/s $\color{#d91a1a}-0.68\%$
test_tc_first_layer_nontensor 0.1661ms 2.3639μs 423.0369 KOps/s 432.0422 KOps/s $\color{#d91a1a}-2.08\%$
test_tc_second_layer_tensor 57.0910μs 1.4404μs 694.2418 KOps/s 698.8364 KOps/s $\color{#d91a1a}-0.66\%$
test_tc_second_layer_nontensor 35.2110μs 3.0912μs 323.4965 KOps/s 325.9764 KOps/s $\color{#d91a1a}-0.76\%$
test_unbind 0.2445s 10.1121ms 98.8912 Ops/s 146.3474 Ops/s $\textbf{\color{#d91a1a}-32.43\%}$
test_full_like 10.4107ms 9.4277ms 106.0706 Ops/s 105.2675 Ops/s $\color{#35bf28}+0.76\%$
test_zeros_like 4.9166ms 4.3646ms 229.1175 Ops/s 230.5094 Ops/s $\color{#d91a1a}-0.60\%$
test_ones_like 4.7033ms 4.3720ms 228.7307 Ops/s 229.7932 Ops/s $\color{#d91a1a}-0.46\%$
test_clone 12.0631ms 9.3757ms 106.6584 Ops/s 152.6799 Ops/s $\textbf{\color{#d91a1a}-30.14\%}$
test_squeeze 0.1534ms 9.5739μs 104.4504 KOps/s 100.4905 KOps/s $\color{#35bf28}+3.94\%$
test_unsqueeze 0.2142ms 71.2983μs 14.0256 KOps/s 13.8976 KOps/s $\color{#35bf28}+0.92\%$
test_split 0.3910ms 0.1600ms 6.2503 KOps/s 5.9646 KOps/s $\color{#35bf28}+4.79\%$
test_permute 0.3452ms 0.1820ms 5.4932 KOps/s 5.3308 KOps/s $\color{#35bf28}+3.05\%$
test_stack 52.1419ms 51.5381ms 19.4031 Ops/s 19.3841 Ops/s $\color{#35bf28}+0.10\%$
test_cat 51.9602ms 51.2580ms 19.5092 Ops/s 19.5693 Ops/s $\color{#d91a1a}-0.31\%$

@vmoens vmoens merged commit 4eb3af1 into gh/vmoens/36/base Nov 6, 2024
46 of 55 checks passed
vmoens added a commit that referenced this pull request Nov 6, 2024
ghstack-source-id: 34940a47d84bcf171bf4511187fcc82df88f801f
Pull Request resolved: #1077
@vmoens vmoens deleted the gh/vmoens/36/head branch November 6, 2024 17:36
vmoens added a commit that referenced this pull request Nov 14, 2024
ghstack-source-id: 34940a47d84bcf171bf4511187fcc82df88f801f
Pull Request resolved: #1077

(cherry picked from commit 78b7802)