Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Performance] Faster to #1073

Merged
merged 3 commits into from
Nov 5, 2024
Merged

[Performance] Faster to #1073

merged 3 commits into from
Nov 5, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 5, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 5, 2024
vmoens added a commit that referenced this pull request Nov 5, 2024
ghstack-source-id: 3ff1db59f081b75f24c34c5239f88b5c5de8dbe4
Pull Request resolved: #1073
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 5, 2024
ghstack-source-id: 63222e1497d3be45d831003c387926fbfaade67d
Pull Request resolved: #1073
Copy link

github-actions bot commented Nov 5, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}9$. Worsened: $\large\color{#d91a1a}7$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 42.2990μs 17.8200μs 56.1167 KOps/s 56.1628 KOps/s $\color{#d91a1a}-0.08\%$
test_plain_set_stack_nested 47.2490μs 17.7486μs 56.3423 KOps/s 54.9797 KOps/s $\color{#35bf28}+2.48\%$
test_plain_set_nested_inplace 44.3930μs 19.4382μs 51.4450 KOps/s 49.8002 KOps/s $\color{#35bf28}+3.30\%$
test_plain_set_stack_nested_inplace 43.5620μs 19.3934μs 51.5639 KOps/s 48.6930 KOps/s $\textbf{\color{#35bf28}+5.90\%}$
test_items 21.5410μs 4.1548μs 240.6866 KOps/s 239.4114 KOps/s $\color{#35bf28}+0.53\%$
test_items_nested 0.6309ms 0.3434ms 2.9122 KOps/s 2.9192 KOps/s $\color{#d91a1a}-0.24\%$
test_items_nested_locked 0.4906ms 0.3432ms 2.9137 KOps/s 2.9411 KOps/s $\color{#d91a1a}-0.93\%$
test_items_nested_leaf 0.1324ms 71.5837μs 13.9697 KOps/s 14.0722 KOps/s $\color{#d91a1a}-0.73\%$
test_items_stack_nested 0.6586ms 0.3457ms 2.8924 KOps/s 2.8903 KOps/s $\color{#35bf28}+0.07\%$
test_items_stack_nested_leaf 0.1340ms 70.9951μs 14.0855 KOps/s 13.6424 KOps/s $\color{#35bf28}+3.25\%$
test_items_stack_nested_locked 0.7924ms 0.3536ms 2.8281 KOps/s 2.9091 KOps/s $\color{#d91a1a}-2.79\%$
test_keys 44.2130μs 3.5082μs 285.0457 KOps/s 284.7310 KOps/s $\color{#35bf28}+0.11\%$
test_keys_nested 0.2257ms 0.1387ms 7.2081 KOps/s 7.2309 KOps/s $\color{#d91a1a}-0.32\%$
test_keys_nested_locked 0.9177ms 0.1419ms 7.0474 KOps/s 6.9348 KOps/s $\color{#35bf28}+1.62\%$
test_keys_nested_leaf 0.2047ms 0.1193ms 8.3791 KOps/s 8.5264 KOps/s $\color{#d91a1a}-1.73\%$
test_keys_stack_nested 0.2469ms 0.1391ms 7.1899 KOps/s 7.3426 KOps/s $\color{#d91a1a}-2.08\%$
test_keys_stack_nested_leaf 0.1776ms 0.1194ms 8.3727 KOps/s 8.5260 KOps/s $\color{#d91a1a}-1.80\%$
test_keys_stack_nested_locked 0.2668ms 0.1432ms 6.9852 KOps/s 7.0605 KOps/s $\color{#d91a1a}-1.07\%$
test_values 7.0212μs 1.0595μs 943.8252 KOps/s 966.4813 KOps/s $\color{#d91a1a}-2.34\%$
test_values_nested 0.1101ms 55.3433μs 18.0690 KOps/s 18.1599 KOps/s $\color{#d91a1a}-0.50\%$
test_values_nested_locked 0.1094ms 56.2113μs 17.7900 KOps/s 18.1499 KOps/s $\color{#d91a1a}-1.98\%$
test_values_nested_leaf 0.1119ms 60.4521μs 16.5420 KOps/s 16.0083 KOps/s $\color{#35bf28}+3.33\%$
test_values_stack_nested 0.1044ms 55.3497μs 18.0669 KOps/s 17.8017 KOps/s $\color{#35bf28}+1.49\%$
test_values_stack_nested_leaf 0.1169ms 60.9613μs 16.4038 KOps/s 16.4589 KOps/s $\color{#d91a1a}-0.33\%$
test_values_stack_nested_locked 90.4590μs 55.5665μs 17.9965 KOps/s 17.8069 KOps/s $\color{#35bf28}+1.06\%$
test_membership 23.7940μs 0.9062μs 1.1036 MOps/s 1.1441 MOps/s $\color{#d91a1a}-3.54\%$
test_membership_nested 26.0890μs 2.7585μs 362.5187 KOps/s 359.7531 KOps/s $\color{#35bf28}+0.77\%$
test_membership_nested_leaf 25.9780μs 2.7639μs 361.8013 KOps/s 362.4091 KOps/s $\color{#d91a1a}-0.17\%$
test_membership_stacked_nested 17.9840μs 2.7069μs 369.4254 KOps/s 363.5144 KOps/s $\color{#35bf28}+1.63\%$
test_membership_stacked_nested_leaf 22.3220μs 2.7457μs 364.2034 KOps/s 358.5334 KOps/s $\color{#35bf28}+1.58\%$
test_membership_nested_last 31.6090μs 4.1563μs 240.6006 KOps/s 239.5194 KOps/s $\color{#35bf28}+0.45\%$
test_membership_nested_leaf_last 29.5950μs 4.1653μs 240.0816 KOps/s 237.2419 KOps/s $\color{#35bf28}+1.20\%$
test_membership_stacked_nested_last 29.5750μs 4.1565μs 240.5845 KOps/s 207.1252 KOps/s $\textbf{\color{#35bf28}+16.15\%}$
test_membership_stacked_nested_leaf_last 38.6730μs 4.1579μs 240.5046 KOps/s 204.6098 KOps/s $\textbf{\color{#35bf28}+17.54\%}$
test_nested_getleaf 38.4630μs 10.8353μs 92.2909 KOps/s 94.2970 KOps/s $\color{#d91a1a}-2.13\%$
test_nested_get 41.3380μs 10.1073μs 98.9384 KOps/s 98.2556 KOps/s $\color{#35bf28}+0.69\%$
test_stacked_getleaf 40.2250μs 10.5488μs 94.7976 KOps/s 90.6993 KOps/s $\color{#35bf28}+4.52\%$
test_stacked_get 35.6270μs 10.1976μs 98.0624 KOps/s 94.9593 KOps/s $\color{#35bf28}+3.27\%$
test_nested_getitemleaf 36.4080μs 11.0517μs 90.4836 KOps/s 88.4393 KOps/s $\color{#35bf28}+2.31\%$
test_nested_getitem 35.2160μs 10.3491μs 96.6264 KOps/s 95.7829 KOps/s $\color{#35bf28}+0.88\%$
test_stacked_getitemleaf 35.1960μs 11.0328μs 90.6386 KOps/s 88.6585 KOps/s $\color{#35bf28}+2.23\%$
test_stacked_getitem 32.3710μs 10.2527μs 97.5355 KOps/s 96.6931 KOps/s $\color{#35bf28}+0.87\%$
test_lock_nested 0.8824ms 0.4344ms 2.3018 KOps/s 2.2647 KOps/s $\color{#35bf28}+1.64\%$
test_lock_stack_nested 0.6297ms 0.4085ms 2.4479 KOps/s 2.4165 KOps/s $\color{#35bf28}+1.30\%$
test_unlock_nested 0.7405ms 0.3503ms 2.8550 KOps/s 2.7929 KOps/s $\color{#35bf28}+2.22\%$
test_unlock_stack_nested 0.5797ms 0.3278ms 3.0506 KOps/s 3.0446 KOps/s $\color{#35bf28}+0.20\%$
test_flatten_speed 0.1709ms 92.4950μs 10.8114 KOps/s 10.7628 KOps/s $\color{#35bf28}+0.45\%$
test_unflatten_speed 0.6408ms 0.4780ms 2.0920 KOps/s 2.0711 KOps/s $\color{#35bf28}+1.01\%$
test_common_ops 3.9015ms 0.7705ms 1.2979 KOps/s 1.3131 KOps/s $\color{#d91a1a}-1.16\%$
test_creation 71.2230μs 2.1863μs 457.3941 KOps/s 473.2352 KOps/s $\color{#d91a1a}-3.35\%$
test_creation_empty 32.2800μs 10.5734μs 94.5766 KOps/s 89.4496 KOps/s $\textbf{\color{#35bf28}+5.73\%}$
test_creation_nested_1 67.2190μs 13.2542μs 75.4477 KOps/s 73.1560 KOps/s $\color{#35bf28}+3.13\%$
test_creation_nested_2 48.4610μs 17.8656μs 55.9734 KOps/s 56.1110 KOps/s $\color{#d91a1a}-0.25\%$
test_clone 0.1328ms 13.4580μs 74.3053 KOps/s 75.2244 KOps/s $\color{#d91a1a}-1.22\%$
test_getitem[int] 1.1134ms 12.7875μs 78.2016 KOps/s 80.0654 KOps/s $\color{#d91a1a}-2.33\%$
test_getitem[slice_int] 0.1420ms 24.7269μs 40.4418 KOps/s 42.0545 KOps/s $\color{#d91a1a}-3.83\%$
test_getitem[range] 0.3793ms 49.2948μs 20.2861 KOps/s 20.7958 KOps/s $\color{#d91a1a}-2.45\%$
test_getitem[tuple] 0.1408ms 19.8701μs 50.3269 KOps/s 51.2595 KOps/s $\color{#d91a1a}-1.82\%$
test_getitem[list] 0.3375ms 45.2292μs 22.1096 KOps/s 23.2676 KOps/s $\color{#d91a1a}-4.98\%$
test_setitem_dim[int] 45.6660μs 24.8899μs 40.1769 KOps/s 38.6312 KOps/s $\color{#35bf28}+4.00\%$
test_setitem_dim[slice_int] 90.7500μs 50.1290μs 19.9485 KOps/s 19.5636 KOps/s $\color{#35bf28}+1.97\%$
test_setitem_dim[range] 0.1366ms 73.8851μs 13.5345 KOps/s 13.7992 KOps/s $\color{#d91a1a}-1.92\%$
test_setitem_dim[tuple] 68.5990μs 39.4453μs 25.3516 KOps/s 24.2017 KOps/s $\color{#35bf28}+4.75\%$
test_setitem 0.1850ms 20.2834μs 49.3015 KOps/s 49.3389 KOps/s $\color{#d91a1a}-0.08\%$
test_set 0.1148ms 19.6692μs 50.8409 KOps/s 50.4272 KOps/s $\color{#35bf28}+0.82\%$
test_set_shared 1.1244ms 0.1681ms 5.9491 KOps/s 5.9586 KOps/s $\color{#d91a1a}-0.16\%$
test_update 0.1437ms 22.1836μs 45.0783 KOps/s 44.0594 KOps/s $\color{#35bf28}+2.31\%$
test_update_nested 0.1157ms 31.7233μs 31.5225 KOps/s 30.2316 KOps/s $\color{#35bf28}+4.27\%$
test_update__nested 0.5760ms 32.6600μs 30.6185 KOps/s 29.8200 KOps/s $\color{#35bf28}+2.68\%$
test_set_nested 0.1507ms 21.5254μs 46.4567 KOps/s 45.5103 KOps/s $\color{#35bf28}+2.08\%$
test_set_nested_new 0.1556ms 26.0534μs 38.3828 KOps/s 37.8636 KOps/s $\color{#35bf28}+1.37\%$
test_select 0.1335ms 41.8542μs 23.8925 KOps/s 23.8011 KOps/s $\color{#35bf28}+0.38\%$
test_select_nested 0.1205ms 60.0558μs 16.6512 KOps/s 16.9956 KOps/s $\color{#d91a1a}-2.03\%$
test_exclude_nested 0.1479ms 75.3266μs 13.2755 KOps/s 13.2264 KOps/s $\color{#35bf28}+0.37\%$
test_empty[True] 0.8190ms 0.3525ms 2.8371 KOps/s 2.8344 KOps/s $\color{#35bf28}+0.10\%$
test_empty[False] 6.1065μs 1.2100μs 826.4435 KOps/s 824.5927 KOps/s $\color{#35bf28}+0.22\%$
test_unbind_speed 0.4623ms 0.2582ms 3.8726 KOps/s 3.8686 KOps/s $\color{#35bf28}+0.10\%$
test_unbind_speed_stack0 0.4364ms 0.2567ms 3.8963 KOps/s 3.8522 KOps/s $\color{#35bf28}+1.15\%$
test_unbind_speed_stack1 97.9686ms 0.7531ms 1.3278 KOps/s 1.4432 KOps/s $\textbf{\color{#d91a1a}-8.00\%}$
test_split 0.1018s 1.7804ms 561.6864 Ops/s 578.0106 Ops/s $\color{#d91a1a}-2.82\%$
test_chunk 97.2289ms 1.7852ms 560.1719 Ops/s 578.1846 Ops/s $\color{#d91a1a}-3.12\%$
test_consolidate_njt[False-None] 8.3939ms 8.0624ms 124.0319 Ops/s 122.8318 Ops/s $\color{#35bf28}+0.98\%$
test_creation[device0] 3.3135ms 93.2586μs 10.7229 KOps/s 10.9147 KOps/s $\color{#d91a1a}-1.76\%$
test_creation_from_tensor 0.2296ms 93.9300μs 10.6462 KOps/s 10.4856 KOps/s $\color{#35bf28}+1.53\%$
test_add_one[memmap_tensor0] 0.2155ms 4.9572μs 201.7251 KOps/s 202.0930 KOps/s $\color{#d91a1a}-0.18\%$
test_contiguous[memmap_tensor0] 25.9480μs 0.5035μs 1.9862 MOps/s 1.9585 MOps/s $\color{#35bf28}+1.41\%$
test_stack[memmap_tensor0] 46.8780μs 3.4381μs 290.8581 KOps/s 310.5030 KOps/s $\textbf{\color{#d91a1a}-6.33\%}$
test_memmaptd_index 0.9030ms 0.2331ms 4.2899 KOps/s 4.3050 KOps/s $\color{#d91a1a}-0.35\%$
test_memmaptd_index_astensor 0.6107ms 0.3116ms 3.2092 KOps/s 3.2393 KOps/s $\color{#d91a1a}-0.93\%$
test_memmaptd_index_op 1.0679ms 0.5825ms 1.7168 KOps/s 1.7235 KOps/s $\color{#d91a1a}-0.39\%$
test_serialize_model 0.1310s 0.1149s 8.7065 Ops/s 7.6711 Ops/s $\textbf{\color{#35bf28}+13.50\%}$
test_serialize_model_pickle 0.4513s 0.3954s 2.5293 Ops/s 2.5263 Ops/s $\color{#35bf28}+0.12\%$
test_serialize_weights 0.2162s 0.1303s 7.6747 Ops/s 8.7639 Ops/s $\textbf{\color{#d91a1a}-12.43\%}$
test_serialize_weights_returnearly 0.1706s 0.1560s 6.4095 Ops/s 6.3970 Ops/s $\color{#35bf28}+0.20\%$
test_serialize_weights_pickle 1.0832s 0.7089s 1.4106 Ops/s 2.4931 Ops/s $\textbf{\color{#d91a1a}-43.42\%}$
test_serialize_weights_filesystem 0.1439s 0.1385s 7.2184 Ops/s 6.4490 Ops/s $\textbf{\color{#35bf28}+11.93\%}$
test_serialize_model_filesystem 0.2406s 0.1519s 6.5847 Ops/s 6.6488 Ops/s $\color{#d91a1a}-0.96\%$
test_reshape_pytree 57.2980μs 27.3174μs 36.6067 KOps/s 36.8492 KOps/s $\color{#d91a1a}-0.66\%$
test_reshape_td 67.5070μs 32.7840μs 30.5027 KOps/s 31.0518 KOps/s $\color{#d91a1a}-1.77\%$
test_view_pytree 62.3160μs 26.9517μs 37.1035 KOps/s 36.5710 KOps/s $\color{#35bf28}+1.46\%$
test_view_td 98.9360μs 38.4685μs 25.9953 KOps/s 26.5757 KOps/s $\color{#d91a1a}-2.18\%$
test_unbind_pytree 67.2860μs 29.8744μs 33.4735 KOps/s 33.4130 KOps/s $\color{#35bf28}+0.18\%$
test_unbind_td 0.3172ms 38.5889μs 25.9142 KOps/s 26.1629 KOps/s $\color{#d91a1a}-0.95\%$
test_split_pytree 0.1034ms 29.7083μs 33.6606 KOps/s 33.4438 KOps/s $\color{#35bf28}+0.65\%$
test_split_td 0.5381ms 44.9261μs 22.2588 KOps/s 22.3729 KOps/s $\color{#d91a1a}-0.51\%$
test_add_pytree 84.8080μs 36.8572μs 27.1317 KOps/s 27.2292 KOps/s $\color{#d91a1a}-0.36\%$
test_add_td 0.1553ms 57.4017μs 17.4211 KOps/s 17.9941 KOps/s $\color{#d91a1a}-3.18\%$
test_compile_add_one_nested[tensordict-compile] 0.1281ms 63.4179μs 15.7684 KOps/s 15.9544 KOps/s $\color{#d91a1a}-1.17\%$
test_compile_add_one_nested[tensordict-eager] 0.4826ms 0.1599ms 6.2544 KOps/s 6.2981 KOps/s $\color{#d91a1a}-0.69\%$
test_compile_add_one_nested[pytree-compile] 0.1331ms 47.2039μs 21.1847 KOps/s 21.5256 KOps/s $\color{#d91a1a}-1.58\%$
test_compile_add_one_nested[pytree-eager] 0.2285ms 0.1200ms 8.3356 KOps/s 8.5073 KOps/s $\color{#d91a1a}-2.02\%$
test_compile_copy_nested[tensordict-compile] 82.4640μs 26.3692μs 37.9231 KOps/s 38.5604 KOps/s $\color{#d91a1a}-1.65\%$
test_compile_copy_nested[tensordict-eager] 0.1145ms 54.8099μs 18.2449 KOps/s 18.7881 KOps/s $\color{#d91a1a}-2.89\%$
test_compile_copy_nested[pytree-compile] 0.1881ms 78.6969μs 12.7070 KOps/s 12.7077 KOps/s $-0.01\%$
test_compile_copy_nested[pytree-eager] 0.1426ms 68.8531μs 14.5237 KOps/s 14.8297 KOps/s $\color{#d91a1a}-2.06\%$
test_compile_add_one_flat[tensordict-compile] 0.2228ms 0.1061ms 9.4212 KOps/s 9.3660 KOps/s $\color{#35bf28}+0.59\%$
test_compile_add_one_flat[tensordict-eager] 0.3382ms 0.1983ms 5.0432 KOps/s 5.0629 KOps/s $\color{#d91a1a}-0.39\%$
test_compile_add_one_flat[tensorclass-compile] 0.1459ms 45.3763μs 22.0380 KOps/s 21.7243 KOps/s $\color{#35bf28}+1.44\%$
test_compile_add_one_flat[tensorclass-eager] 0.4868ms 61.5346μs 16.2510 KOps/s 16.0087 KOps/s $\color{#35bf28}+1.51\%$
test_compile_add_one_flat[pytree-compile] 0.1987ms 0.1033ms 9.6803 KOps/s 9.7822 KOps/s $\color{#d91a1a}-1.04\%$
test_compile_add_one_flat[pytree-eager] 0.4288ms 0.2031ms 4.9226 KOps/s 4.9738 KOps/s $\color{#d91a1a}-1.03\%$
test_compile_add_self_flat[tensordict-eager] 0.3860ms 0.2095ms 4.7737 KOps/s 4.7788 KOps/s $\color{#d91a1a}-0.11\%$
test_compile_add_self_flat[tensordict-compile] 0.1879ms 0.1055ms 9.4760 KOps/s 9.4921 KOps/s $\color{#d91a1a}-0.17\%$
test_compile_add_self_flat[tensorclass-eager] 0.2056ms 54.9300μs 18.2050 KOps/s 18.1000 KOps/s $\color{#35bf28}+0.58\%$
test_compile_add_self_flat[tensorclass-compile] 0.4249ms 47.8958μs 20.8787 KOps/s 20.7904 KOps/s $\color{#35bf28}+0.42\%$
test_compile_add_self_flat[pytree-eager] 0.2338ms 0.1600ms 6.2502 KOps/s 6.3361 KOps/s $\color{#d91a1a}-1.36\%$
test_compile_add_self_flat[pytree-compile] 0.5083ms 0.1065ms 9.3916 KOps/s 9.6113 KOps/s $\color{#d91a1a}-2.29\%$
test_compile_copy_flat[tensordict-compile] 67.8270μs 22.5115μs 44.4217 KOps/s 46.9900 KOps/s $\textbf{\color{#d91a1a}-5.47\%}$
test_compile_copy_flat[tensordict-eager] 0.1985ms 62.6575μs 15.9598 KOps/s 16.7792 KOps/s $\color{#d91a1a}-4.88\%$
test_compile_copy_flat[pytree-compile] 0.1676ms 81.1881μs 12.3171 KOps/s 12.2728 KOps/s $\color{#35bf28}+0.36\%$
test_compile_copy_flat[pytree-eager] 0.1302ms 69.3311μs 14.4235 KOps/s 14.6891 KOps/s $\color{#d91a1a}-1.81\%$
test_compile_assign_and_add[tensordict-compile] 0.3042ms 0.2114ms 4.7308 KOps/s 4.8012 KOps/s $\color{#d91a1a}-1.47\%$
test_compile_assign_and_add[tensordict-eager] 1.4144ms 1.2763ms 783.5191 Ops/s 770.4657 Ops/s $\color{#35bf28}+1.69\%$
test_compile_assign_and_add[pytree-compile] 0.3062ms 0.2059ms 4.8578 KOps/s 4.8872 KOps/s $\color{#d91a1a}-0.60\%$
test_compile_assign_and_add[pytree-eager] 1.3437ms 0.7864ms 1.2717 KOps/s 1.2816 KOps/s $\color{#d91a1a}-0.77\%$
test_compile_assign_and_add_stack[compile] 0.7030ms 0.4612ms 2.1681 KOps/s 2.1617 KOps/s $\color{#35bf28}+0.30\%$
test_compile_assign_and_add_stack[eager] 4.1288ms 2.6000ms 384.6108 Ops/s 382.4691 Ops/s $\color{#35bf28}+0.56\%$
test_compile_indexing[tensor-tensordict-compile] 0.1108ms 37.4728μs 26.6861 KOps/s 26.9911 KOps/s $\color{#d91a1a}-1.13\%$
test_compile_indexing[tensor-tensordict-eager] 0.4875ms 33.7251μs 29.6515 KOps/s 29.9533 KOps/s $\color{#d91a1a}-1.01\%$
test_compile_indexing[tensor-tensorclass-compile] 86.2420μs 29.8705μs 33.4779 KOps/s 33.9261 KOps/s $\color{#d91a1a}-1.32\%$
test_compile_indexing[tensor-tensorclass-eager] 72.3850μs 23.5050μs 42.5441 KOps/s 42.9644 KOps/s $\color{#d91a1a}-0.98\%$
test_compile_indexing[tensor-pytree-compile] 93.1040μs 30.4497μs 32.8411 KOps/s 33.3173 KOps/s $\color{#d91a1a}-1.43\%$
test_compile_indexing[tensor-pytree-eager] 68.2070μs 23.5981μs 42.3764 KOps/s 43.1632 KOps/s $\color{#d91a1a}-1.82\%$
test_compile_indexing[slice-tensordict-compile] 0.1151ms 52.1376μs 19.1800 KOps/s 19.0217 KOps/s $\color{#35bf28}+0.83\%$
test_compile_indexing[slice-tensordict-eager] 0.6453ms 20.5774μs 48.5970 KOps/s 48.4249 KOps/s $\color{#35bf28}+0.36\%$
test_compile_indexing[slice-tensorclass-compile] 0.1105ms 44.2860μs 22.5805 KOps/s 22.7310 KOps/s $\color{#d91a1a}-0.66\%$
test_compile_indexing[slice-tensorclass-eager] 78.0160μs 19.2471μs 51.9560 KOps/s 51.5905 KOps/s $\color{#35bf28}+0.71\%$
test_compile_indexing[slice-pytree-compile] 93.7160μs 45.2594μs 22.0949 KOps/s 22.2192 KOps/s $\color{#d91a1a}-0.56\%$
test_compile_indexing[slice-pytree-eager] 75.1810μs 19.0724μs 52.4319 KOps/s 51.4405 KOps/s $\color{#35bf28}+1.93\%$
test_compile_indexing[int-tensordict-compile] 0.1105ms 52.7182μs 18.9688 KOps/s 18.8541 KOps/s $\color{#35bf28}+0.61\%$
test_compile_indexing[int-tensordict-eager] 0.8565ms 19.9989μs 50.0027 KOps/s 50.8581 KOps/s $\color{#d91a1a}-1.68\%$
test_compile_indexing[int-tensorclass-compile] 95.6090μs 45.0318μs 22.2065 KOps/s 22.3106 KOps/s $\color{#d91a1a}-0.47\%$
test_compile_indexing[int-tensorclass-eager] 70.9430μs 19.0639μs 52.4552 KOps/s 52.9583 KOps/s $\color{#d91a1a}-0.95\%$
test_compile_indexing[int-pytree-compile] 0.1010ms 45.1441μs 22.1513 KOps/s 22.3579 KOps/s $\color{#d91a1a}-0.92\%$
test_compile_indexing[int-pytree-eager] 0.4547ms 19.2136μs 52.0464 KOps/s 52.5060 KOps/s $\color{#d91a1a}-0.88\%$
test_mod_add[eager] 63.4380μs 25.7472μs 38.8392 KOps/s 38.3506 KOps/s $\color{#35bf28}+1.27\%$
test_mod_add[compile] 85.5300μs 45.2368μs 22.1059 KOps/s 22.2236 KOps/s $\color{#d91a1a}-0.53\%$
test_mod_add[compile-overhead] 96.5300μs 44.9266μs 22.2585 KOps/s 22.3061 KOps/s $\color{#d91a1a}-0.21\%$
test_mod_wrap[eager] 0.4591ms 0.2134ms 4.6850 KOps/s 4.6366 KOps/s $\color{#35bf28}+1.04\%$
test_mod_wrap[compile] 1.5411ms 0.2107ms 4.7459 KOps/s 4.8365 KOps/s $\color{#d91a1a}-1.87\%$
test_mod_wrap[compile-overhead] 1.5693ms 0.2081ms 4.8051 KOps/s 4.8470 KOps/s $\color{#d91a1a}-0.86\%$
test_mod_wrap_and_backward[eager] 14.4694ms 12.6446ms 79.0853 Ops/s 83.3196 Ops/s $\textbf{\color{#d91a1a}-5.08\%}$
test_mod_wrap_and_backward[compile] 16.3591ms 12.7340ms 78.5299 Ops/s 79.0981 Ops/s $\color{#d91a1a}-0.72\%$
test_mod_wrap_and_backward[compile-overhead] 17.0178ms 12.7734ms 78.2876 Ops/s 83.1785 Ops/s $\textbf{\color{#d91a1a}-5.88\%}$
test_seq_add[eager] 0.2169ms 88.1651μs 11.3424 KOps/s 10.7916 KOps/s $\textbf{\color{#35bf28}+5.10\%}$
test_seq_add[compile] 0.1401ms 60.8539μs 16.4328 KOps/s 16.7219 KOps/s $\color{#d91a1a}-1.73\%$
test_seq_add[compile-overhead] 0.1181ms 59.1354μs 16.9103 KOps/s 16.7706 KOps/s $\color{#35bf28}+0.83\%$
test_seq_wrap[eager] 0.5478ms 0.3848ms 2.5990 KOps/s 2.5581 KOps/s $\color{#35bf28}+1.60\%$
test_seq_wrap[compile] 0.4271ms 0.2301ms 4.3464 KOps/s 4.3768 KOps/s $\color{#d91a1a}-0.69\%$
test_seq_wrap[compile-overhead] 0.4197ms 0.2273ms 4.3997 KOps/s 4.3984 KOps/s $\color{#35bf28}+0.03\%$
test_func_call_runtime[False-eager] 0.7829ms 0.5603ms 1.7847 KOps/s 1.8154 KOps/s $\color{#d91a1a}-1.70\%$
test_func_call_runtime[False-compile] 1.0595ms 0.4294ms 2.3288 KOps/s 2.3443 KOps/s $\color{#d91a1a}-0.66\%$
test_func_call_runtime[False-compile-overhead] 0.9045ms 0.4343ms 2.3028 KOps/s 2.3460 KOps/s $\color{#d91a1a}-1.84\%$
test_func_call_runtime[True-eager] 1.5158ms 0.7707ms 1.2976 KOps/s 1.2981 KOps/s $\color{#d91a1a}-0.04\%$
test_func_call_runtime[True-compile] 0.8386ms 0.4719ms 2.1192 KOps/s 2.1604 KOps/s $\color{#d91a1a}-1.91\%$
test_func_call_runtime[True-compile-overhead] 0.9125ms 0.4763ms 2.0994 KOps/s 2.1472 KOps/s $\color{#d91a1a}-2.23\%$
test_func_call_cm_runtime[False-eager] 1.0529ms 0.5587ms 1.7899 KOps/s 1.8141 KOps/s $\color{#d91a1a}-1.33\%$
test_func_call_cm_runtime[False-compile] 0.8634ms 0.4281ms 2.3361 KOps/s 2.3579 KOps/s $\color{#d91a1a}-0.93\%$
test_func_call_cm_runtime[False-compile-overhead] 0.9062ms 0.4307ms 2.3216 KOps/s 2.3632 KOps/s $\color{#d91a1a}-1.76\%$
test_func_call_cm_runtime[True-eager] 1.2821ms 0.9067ms 1.1029 KOps/s 1.1017 KOps/s $\color{#35bf28}+0.11\%$
test_func_call_cm_runtime[True-compile] 0.6268ms 0.4961ms 2.0156 KOps/s 2.0566 KOps/s $\color{#d91a1a}-1.99\%$
test_func_call_cm_runtime[True-compile-overhead] 0.6746ms 0.4957ms 2.0173 KOps/s 2.0438 KOps/s $\color{#d91a1a}-1.30\%$
test_vmap_func_call_cm_runtime[eager] 2.6459ms 1.8928ms 528.3296 Ops/s 529.2865 Ops/s $\color{#d91a1a}-0.18\%$
test_vmap_func_call_cm_runtime[compile] 0.7094ms 0.5215ms 1.9177 KOps/s 1.9053 KOps/s $\color{#35bf28}+0.65\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.7367ms 0.5284ms 1.8924 KOps/s 1.9165 KOps/s $\color{#d91a1a}-1.26\%$
test_distributed 0.2873ms 0.1285ms 7.7814 KOps/s 7.7080 KOps/s $\color{#35bf28}+0.95\%$
test_tdmodule 45.2340μs 18.5054μs 54.0383 KOps/s 51.6002 KOps/s $\color{#35bf28}+4.72\%$
test_tdmodule_dispatch 71.8440μs 37.1797μs 26.8964 KOps/s 27.1718 KOps/s $\color{#d91a1a}-1.01\%$
test_tdseq 36.7480μs 20.7307μs 48.2376 KOps/s 44.5433 KOps/s $\textbf{\color{#35bf28}+8.29\%}$
test_tdseq_dispatch 68.3280μs 41.1711μs 24.2889 KOps/s 23.2586 KOps/s $\color{#35bf28}+4.43\%$
test_instantiation_functorch 2.1661ms 1.5504ms 645.0150 Ops/s 660.3664 Ops/s $\color{#d91a1a}-2.32\%$
test_exec_functorch 0.2860ms 0.1816ms 5.5066 KOps/s 5.6163 KOps/s $\color{#d91a1a}-1.95\%$
test_exec_functional_call 0.4429ms 0.1724ms 5.8020 KOps/s 5.6822 KOps/s $\color{#35bf28}+2.11\%$
test_exec_td_decorator 0.5232ms 0.2263ms 4.4198 KOps/s 4.3143 KOps/s $\color{#35bf28}+2.45\%$
test_vmap_mlp_speed_decorator[True-True] 0.8752ms 0.6355ms 1.5735 KOps/s 1.5804 KOps/s $\color{#d91a1a}-0.44\%$
test_vmap_mlp_speed_decorator[True-False] 0.8777ms 0.6347ms 1.5756 KOps/s 1.5365 KOps/s $\color{#35bf28}+2.54\%$
test_vmap_mlp_speed_decorator[False-True] 0.9235ms 0.5225ms 1.9140 KOps/s 1.9125 KOps/s $\color{#35bf28}+0.08\%$
test_vmap_mlp_speed_decorator[False-False] 0.8155ms 0.5227ms 1.9130 KOps/s 1.9091 KOps/s $\color{#35bf28}+0.20\%$
test_to_module_speed[True] 1.3765ms 1.2878ms 776.5270 Ops/s 766.7625 Ops/s $\color{#35bf28}+1.27\%$
test_to_module_speed[False] 1.7717ms 1.2637ms 791.3442 Ops/s 777.5382 Ops/s $\color{#35bf28}+1.78\%$
test_tc_init 86.8430μs 42.9718μs 23.2711 KOps/s 22.7022 KOps/s $\color{#35bf28}+2.51\%$
test_tc_init_nested 0.1554ms 87.8757μs 11.3797 KOps/s 11.2877 KOps/s $\color{#35bf28}+0.81\%$
test_tc_first_layer_tensor 22.8130μs 1.5419μs 648.5586 KOps/s 659.2786 KOps/s $\color{#d91a1a}-1.63\%$
test_tc_first_layer_nontensor 43.5610μs 4.8714μs 205.2785 KOps/s 212.3511 KOps/s $\color{#d91a1a}-3.33\%$
test_tc_second_layer_tensor 44.4030μs 2.7882μs 358.6495 KOps/s 357.6847 KOps/s $\color{#35bf28}+0.27\%$
test_tc_second_layer_nontensor 43.3210μs 6.1785μs 161.8514 KOps/s 167.6861 KOps/s $\color{#d91a1a}-3.48\%$
test_unbind 0.2127s 12.2958ms 81.3285 Ops/s 78.9360 Ops/s $\color{#35bf28}+3.03\%$
test_full_like 17.7212ms 11.6405ms 85.9068 Ops/s 85.3003 Ops/s $\color{#35bf28}+0.71\%$
test_zeros_like 9.8019ms 7.1102ms 140.6421 Ops/s 133.7339 Ops/s $\textbf{\color{#35bf28}+5.17\%}$
test_ones_like 13.0645ms 7.8133ms 127.9874 Ops/s 131.5689 Ops/s $\color{#d91a1a}-2.72\%$
test_clone 14.0065ms 9.4608ms 105.6988 Ops/s 109.0471 Ops/s $\color{#d91a1a}-3.07\%$
test_squeeze 60.0530μs 12.0184μs 83.2056 KOps/s 83.5152 KOps/s $\color{#d91a1a}-0.37\%$
test_unsqueeze 0.1653ms 87.3775μs 11.4446 KOps/s 11.2195 KOps/s $\color{#35bf28}+2.01\%$
test_split 0.5998ms 0.1929ms 5.1842 KOps/s 5.3098 KOps/s $\color{#d91a1a}-2.36\%$
test_permute 0.4274ms 0.2194ms 4.5578 KOps/s 4.5810 KOps/s $\color{#d91a1a}-0.51\%$
test_stack 28.3028ms 23.9160ms 41.8131 Ops/s 42.1023 Ops/s $\color{#d91a1a}-0.69\%$
test_cat 27.3812ms 23.5713ms 42.4246 Ops/s 41.8822 Ops/s $\color{#35bf28}+1.29\%$

Copy link

github-actions bot commented Nov 5, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}19$. Worsened: $\large\color{#d91a1a}11$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 26.8710μs 10.5217μs 95.0421 KOps/s 93.8905 KOps/s $\color{#35bf28}+1.23\%$
test_plain_set_stack_nested 28.3500μs 10.6472μs 93.9213 KOps/s 93.0242 KOps/s $\color{#35bf28}+0.96\%$
test_plain_set_nested_inplace 0.1002ms 11.4753μs 87.1437 KOps/s 85.4243 KOps/s $\color{#35bf28}+2.01\%$
test_plain_set_stack_nested_inplace 57.2410μs 11.4953μs 86.9922 KOps/s 86.4631 KOps/s $\color{#35bf28}+0.61\%$
test_items 27.6110μs 2.8884μs 346.2156 KOps/s 344.1281 KOps/s $\color{#35bf28}+0.61\%$
test_items_nested 0.4581ms 0.3219ms 3.1061 KOps/s 3.1187 KOps/s $\color{#d91a1a}-0.40\%$
test_items_nested_locked 0.4806ms 0.3237ms 3.0894 KOps/s 3.1160 KOps/s $\color{#d91a1a}-0.85\%$
test_items_nested_leaf 0.1027ms 57.8740μs 17.2789 KOps/s 17.2580 KOps/s $\color{#35bf28}+0.12\%$
test_items_stack_nested 0.4341ms 0.3217ms 3.1087 KOps/s 3.0393 KOps/s $\color{#35bf28}+2.28\%$
test_items_stack_nested_leaf 92.6010μs 57.6787μs 17.3374 KOps/s 17.1618 KOps/s $\color{#35bf28}+1.02\%$
test_items_stack_nested_locked 0.4253ms 0.3249ms 3.0781 KOps/s 3.0941 KOps/s $\color{#d91a1a}-0.52\%$
test_keys 25.7900μs 3.4725μs 287.9769 KOps/s 290.3897 KOps/s $\color{#d91a1a}-0.83\%$
test_keys_nested 0.1315ms 70.4455μs 14.1954 KOps/s 14.1550 KOps/s $\color{#35bf28}+0.29\%$
test_keys_nested_locked 0.8795ms 75.8876μs 13.1774 KOps/s 13.0238 KOps/s $\color{#35bf28}+1.18\%$
test_keys_nested_leaf 0.1107ms 62.0557μs 16.1146 KOps/s 16.0852 KOps/s $\color{#35bf28}+0.18\%$
test_keys_stack_nested 0.1199ms 70.2233μs 14.2403 KOps/s 13.9370 KOps/s $\color{#35bf28}+2.18\%$
test_keys_stack_nested_leaf 91.0820μs 61.5284μs 16.2527 KOps/s 16.0384 KOps/s $\color{#35bf28}+1.34\%$
test_keys_stack_nested_locked 0.1227ms 75.7085μs 13.2086 KOps/s 13.0761 KOps/s $\color{#35bf28}+1.01\%$
test_values 7.2500μs 0.9565μs 1.0455 MOps/s 1.1818 MOps/s $\textbf{\color{#d91a1a}-11.53\%}$
test_values_nested 70.5210μs 31.3366μs 31.9116 KOps/s 32.0888 KOps/s $\color{#d91a1a}-0.55\%$
test_values_nested_locked 0.3976ms 32.6232μs 30.6530 KOps/s 30.3984 KOps/s $\color{#35bf28}+0.84\%$
test_values_nested_leaf 0.4183ms 33.6350μs 29.7309 KOps/s 29.6178 KOps/s $\color{#35bf28}+0.38\%$
test_values_stack_nested 65.5810μs 31.4300μs 31.8168 KOps/s 31.8268 KOps/s $\color{#d91a1a}-0.03\%$
test_values_stack_nested_leaf 94.2510μs 33.6985μs 29.6749 KOps/s 29.4258 KOps/s $\color{#35bf28}+0.85\%$
test_values_stack_nested_locked 0.4143ms 32.8537μs 30.4380 KOps/s 30.3574 KOps/s $\color{#35bf28}+0.27\%$
test_membership 18.9108μs 0.5204μs 1.9214 MOps/s 1.9742 MOps/s $\color{#d91a1a}-2.67\%$
test_membership_nested 0.1938ms 1.8762μs 533.0005 KOps/s 527.7844 KOps/s $\color{#35bf28}+0.99\%$
test_membership_nested_leaf 15.3655μs 1.9008μs 526.0910 KOps/s 534.9057 KOps/s $\color{#d91a1a}-1.65\%$
test_membership_stacked_nested 24.7810μs 1.9421μs 514.9099 KOps/s 503.5655 KOps/s $\color{#35bf28}+2.25\%$
test_membership_stacked_nested_leaf 17.0200μs 1.9254μs 519.3706 KOps/s 504.5934 KOps/s $\color{#35bf28}+2.93\%$
test_membership_nested_last 0.3864ms 2.8390μs 352.2412 KOps/s 355.2958 KOps/s $\color{#d91a1a}-0.86\%$
test_membership_nested_leaf_last 0.3814ms 2.8198μs 354.6384 KOps/s 352.7081 KOps/s $\color{#35bf28}+0.55\%$
test_membership_stacked_nested_last 30.4410μs 2.8300μs 353.3626 KOps/s 359.1944 KOps/s $\color{#d91a1a}-1.62\%$
test_membership_stacked_nested_leaf_last 43.3700μs 2.8224μs 354.3113 KOps/s 356.5444 KOps/s $\color{#d91a1a}-0.63\%$
test_nested_getleaf 0.3893ms 6.0453μs 165.4183 KOps/s 166.3351 KOps/s $\color{#d91a1a}-0.55\%$
test_nested_get 0.3912ms 5.7040μs 175.3140 KOps/s 175.0504 KOps/s $\color{#35bf28}+0.15\%$
test_stacked_getleaf 40.4410μs 6.0079μs 166.4474 KOps/s 166.7393 KOps/s $\color{#d91a1a}-0.18\%$
test_stacked_get 0.3876ms 5.7083μs 175.1833 KOps/s 175.0716 KOps/s $\color{#35bf28}+0.06\%$
test_nested_getitemleaf 33.9000μs 6.0865μs 164.2979 KOps/s 163.6521 KOps/s $\color{#35bf28}+0.39\%$
test_nested_getitem 0.3946ms 5.7711μs 173.2763 KOps/s 173.5886 KOps/s $\color{#d91a1a}-0.18\%$
test_stacked_getitemleaf 29.9900μs 6.1148μs 163.5375 KOps/s 164.0938 KOps/s $\color{#d91a1a}-0.34\%$
test_stacked_getitem 0.3859ms 5.7919μs 172.6563 KOps/s 173.1619 KOps/s $\color{#d91a1a}-0.29\%$
test_lock_nested 0.8724ms 0.3648ms 2.7411 KOps/s 2.6882 KOps/s $\color{#35bf28}+1.97\%$
test_lock_stack_nested 0.3683ms 0.3388ms 2.9513 KOps/s 2.9217 KOps/s $\color{#35bf28}+1.01\%$
test_unlock_nested 0.6587ms 0.3099ms 3.2266 KOps/s 3.2328 KOps/s $\color{#d91a1a}-0.19\%$
test_unlock_stack_nested 0.6511ms 0.2790ms 3.5847 KOps/s 3.5553 KOps/s $\color{#35bf28}+0.83\%$
test_flatten_speed 0.4677ms 72.2459μs 13.8416 KOps/s 13.9166 KOps/s $\color{#d91a1a}-0.54\%$
test_unflatten_speed 0.6550ms 0.2878ms 3.4751 KOps/s 3.4141 KOps/s $\color{#35bf28}+1.79\%$
test_common_ops 1.5473ms 0.5932ms 1.6858 KOps/s 1.6777 KOps/s $\color{#35bf28}+0.48\%$
test_creation 0.1810ms 1.4796μs 675.8383 KOps/s 664.5598 KOps/s $\color{#35bf28}+1.70\%$
test_creation_empty 28.4300μs 7.2499μs 137.9333 KOps/s 133.9479 KOps/s $\color{#35bf28}+2.98\%$
test_creation_nested_1 0.3906ms 8.8471μs 113.0312 KOps/s 109.8027 KOps/s $\color{#35bf28}+2.94\%$
test_creation_nested_2 34.2300μs 11.3687μs 87.9608 KOps/s 86.0556 KOps/s $\color{#35bf28}+2.21\%$
test_clone 37.8510μs 11.6057μs 86.1646 KOps/s 90.9333 KOps/s $\textbf{\color{#d91a1a}-5.24\%}$
test_getitem[int] 1.2811ms 11.0245μs 90.7072 KOps/s 89.7130 KOps/s $\color{#35bf28}+1.11\%$
test_getitem[slice_int] 0.4056ms 21.5146μs 46.4801 KOps/s 46.2337 KOps/s $\color{#35bf28}+0.53\%$
test_getitem[range] 0.1374ms 39.8716μs 25.0805 KOps/s 24.9862 KOps/s $\color{#35bf28}+0.38\%$
test_getitem[tuple] 0.1046ms 18.8382μs 53.0836 KOps/s 52.8734 KOps/s $\color{#35bf28}+0.40\%$
test_getitem[list] 0.4238ms 34.8759μs 28.6731 KOps/s 28.8505 KOps/s $\color{#d91a1a}-0.61\%$
test_setitem_dim[int] 39.9700μs 20.0659μs 49.8358 KOps/s 50.1193 KOps/s $\color{#d91a1a}-0.57\%$
test_setitem_dim[slice_int] 62.9210μs 38.7021μs 25.8384 KOps/s 25.6963 KOps/s $\color{#35bf28}+0.55\%$
test_setitem_dim[range] 77.5210μs 55.5497μs 18.0019 KOps/s 18.1281 KOps/s $\color{#d91a1a}-0.70\%$
test_setitem_dim[tuple] 55.2910μs 33.0631μs 30.2452 KOps/s 29.7989 KOps/s $\color{#35bf28}+1.50\%$
test_setitem 0.3991ms 15.9791μs 62.5819 KOps/s 64.6692 KOps/s $\color{#d91a1a}-3.23\%$
test_set 0.1207ms 15.0372μs 66.5016 KOps/s 66.8706 KOps/s $\color{#d91a1a}-0.55\%$
test_set_shared 1.8141ms 0.1486ms 6.7296 KOps/s 6.7358 KOps/s $\color{#d91a1a}-0.09\%$
test_update 1.2865ms 17.5105μs 57.1085 KOps/s 57.0048 KOps/s $\color{#35bf28}+0.18\%$
test_update_nested 0.4036ms 22.2373μs 44.9694 KOps/s 44.0000 KOps/s $\color{#35bf28}+2.20\%$
test_update__nested 0.1280ms 25.1719μs 39.7269 KOps/s 40.6120 KOps/s $\color{#d91a1a}-2.18\%$
test_set_nested 0.3999ms 16.4447μs 60.8100 KOps/s 57.7487 KOps/s $\textbf{\color{#35bf28}+5.30\%}$
test_set_nested_new 0.1165ms 18.6128μs 53.7264 KOps/s 49.7702 KOps/s $\textbf{\color{#35bf28}+7.95\%}$
test_select 0.4127ms 30.5320μs 32.7525 KOps/s 32.0495 KOps/s $\color{#35bf28}+2.19\%$
test_select_nested 76.9410μs 41.8571μs 23.8908 KOps/s 24.1279 KOps/s $\color{#d91a1a}-0.98\%$
test_exclude_nested 0.4404ms 59.4539μs 16.8197 KOps/s 16.9945 KOps/s $\color{#d91a1a}-1.03\%$
test_empty[True] 0.6283ms 0.2549ms 3.9231 KOps/s 3.9314 KOps/s $\color{#d91a1a}-0.21\%$
test_empty[False] 38.4136μs 0.7416μs 1.3484 MOps/s 1.3570 MOps/s $\color{#d91a1a}-0.63\%$
test_to 85.4320μs 55.9723μs 17.8660 KOps/s 16.8566 KOps/s $\textbf{\color{#35bf28}+5.99\%}$
test_to_nonblocking 0.4392ms 48.3636μs 20.6767 KOps/s 18.9736 KOps/s $\textbf{\color{#35bf28}+8.98\%}$
test_unbind_speed 0.2739ms 0.2352ms 4.2512 KOps/s 4.1930 KOps/s $\color{#35bf28}+1.39\%$
test_unbind_speed_stack0 0.6236ms 0.2353ms 4.2498 KOps/s 4.2043 KOps/s $\color{#35bf28}+1.08\%$
test_unbind_speed_stack1 92.3932ms 0.6543ms 1.5283 KOps/s 1.5028 KOps/s $\color{#35bf28}+1.70\%$
test_split 92.6383ms 1.6098ms 621.1975 Ops/s 575.0972 Ops/s $\textbf{\color{#35bf28}+8.02\%}$
test_chunk 94.9654ms 1.7403ms 574.6094 Ops/s 685.3991 Ops/s $\textbf{\color{#d91a1a}-16.16\%}$
test_consolidate[False-None] 3.2561ms 2.6545ms 376.7146 Ops/s 347.7775 Ops/s $\textbf{\color{#35bf28}+8.32\%}$
test_consolidate[default-None] 1.8207ms 1.7001ms 588.1983 Ops/s 598.8832 Ops/s $\color{#d91a1a}-1.78\%$
test_consolidate[reduce-overhead-None] 1.8401ms 1.6934ms 590.5216 Ops/s 584.9597 Ops/s $\color{#35bf28}+0.95\%$
test_consolidate_njt[False-None] 6.9644ms 6.6839ms 149.6134 Ops/s 148.5429 Ops/s $\color{#35bf28}+0.72\%$
test_to[False-False-None] 1.8091ms 1.7631ms 567.1864 Ops/s 482.8376 Ops/s $\textbf{\color{#35bf28}+17.47\%}$
test_to[True-False-None] 1.6258ms 1.3767ms 726.3558 Ops/s 715.5130 Ops/s $\color{#35bf28}+1.52\%$
test_to[within-False-None] 4.1863ms 4.0673ms 245.8608 Ops/s 241.2657 Ops/s $\color{#35bf28}+1.90\%$
test_to[True-default-None] 5.6343ms 5.2034ms 192.1813 Ops/s 187.6993 Ops/s $\color{#35bf28}+2.39\%$
test_to_njt[False-False-None] 7.2010ms 7.0748ms 141.3475 Ops/s 127.2648 Ops/s $\textbf{\color{#35bf28}+11.07\%}$
test_to_njt[True-False-None] 5.8147ms 5.6423ms 177.2341 Ops/s 166.0823 Ops/s $\textbf{\color{#35bf28}+6.71\%}$
test_to_njt[within-False-None] 12.5532ms 12.3906ms 80.7061 Ops/s 75.9551 Ops/s $\textbf{\color{#35bf28}+6.26\%}$
test_creation[device0] 0.3812ms 79.8014μs 12.5311 KOps/s 11.6946 KOps/s $\textbf{\color{#35bf28}+7.15\%}$
test_creation_from_tensor 0.6103ms 83.2098μs 12.0178 KOps/s 11.1977 KOps/s $\textbf{\color{#35bf28}+7.32\%}$
test_add_one[memmap_tensor0] 0.4208ms 7.2930μs 137.1171 KOps/s 137.9714 KOps/s $\color{#d91a1a}-0.62\%$
test_contiguous[memmap_tensor0] 1.7900μs 0.4346μs 2.3008 MOps/s 2.3042 MOps/s $\color{#d91a1a}-0.15\%$
test_stack[memmap_tensor0] 37.7810μs 4.8843μs 204.7377 KOps/s 201.2213 KOps/s $\color{#35bf28}+1.75\%$
test_memmaptd_index 1.9694ms 0.2585ms 3.8679 KOps/s 3.8556 KOps/s $\color{#35bf28}+0.32\%$
test_memmaptd_index_astensor 0.5986ms 0.3158ms 3.1662 KOps/s 3.1356 KOps/s $\color{#35bf28}+0.98\%$
test_memmaptd_index_op 1.0078ms 0.5940ms 1.6836 KOps/s 1.6822 KOps/s $\color{#35bf28}+0.09\%$
test_serialize_model 0.1305s 0.1295s 7.7245 Ops/s 7.6867 Ops/s $\color{#35bf28}+0.49\%$
test_serialize_model_pickle 1.3490s 1.2167s 0.8219 Ops/s 0.8419 Ops/s $\color{#d91a1a}-2.38\%$
test_serialize_weights 0.1304s 0.1291s 7.7455 Ops/s 7.7193 Ops/s $\color{#35bf28}+0.34\%$
test_serialize_weights_returnearly 0.3508s 65.7085ms 15.2187 Ops/s 23.3183 Ops/s $\textbf{\color{#d91a1a}-34.73\%}$
test_serialize_weights_pickle 1.3742s 1.2175s 0.8214 Ops/s 0.8399 Ops/s $\color{#d91a1a}-2.21\%$
test_reshape_pytree 53.3210μs 22.4613μs 44.5210 KOps/s 43.6607 KOps/s $\color{#35bf28}+1.97\%$
test_reshape_td 52.9800μs 27.3626μs 36.5463 KOps/s 36.9970 KOps/s $\color{#d91a1a}-1.22\%$
test_view_pytree 55.4710μs 22.1071μs 45.2343 KOps/s 43.9263 KOps/s $\color{#35bf28}+2.98\%$
test_view_td 58.6210μs 30.3173μs 32.9845 KOps/s 31.7099 KOps/s $\color{#35bf28}+4.02\%$
test_unbind_pytree 56.9210μs 28.1314μs 35.5475 KOps/s 35.2596 KOps/s $\color{#35bf28}+0.82\%$
test_unbind_td 0.8952ms 36.9144μs 27.0897 KOps/s 26.8170 KOps/s $\color{#35bf28}+1.02\%$
test_split_pytree 66.6210μs 30.3083μs 32.9942 KOps/s 32.9196 KOps/s $\color{#35bf28}+0.23\%$
test_split_td 0.1766ms 39.8841μs 25.0727 KOps/s 25.2286 KOps/s $\color{#d91a1a}-0.62\%$
test_add_pytree 76.8310μs 37.1182μs 26.9409 KOps/s 27.7704 KOps/s $\color{#d91a1a}-2.99\%$
test_add_td 86.9010μs 48.0388μs 20.8165 KOps/s 19.9195 KOps/s $\color{#35bf28}+4.50\%$
test_compile_add_one_nested[tensordict-compile] 0.1711ms 0.1244ms 8.0415 KOps/s 8.0422 KOps/s $-0.01\%$
test_compile_add_one_nested[tensordict-eager] 0.2701ms 0.1266ms 7.9006 KOps/s 7.7761 KOps/s $\color{#35bf28}+1.60\%$
test_compile_add_one_nested[pytree-compile] 0.1418ms 0.1037ms 9.6449 KOps/s 9.8664 KOps/s $\color{#d91a1a}-2.25\%$
test_compile_add_one_nested[pytree-eager] 0.5836ms 0.1611ms 6.2063 KOps/s 6.5024 KOps/s $\color{#d91a1a}-4.55\%$
test_compile_copy_nested[tensordict-compile] 68.3710μs 22.9738μs 43.5278 KOps/s 41.7793 KOps/s $\color{#35bf28}+4.19\%$
test_compile_copy_nested[tensordict-eager] 0.1081ms 26.8698μs 37.2165 KOps/s 36.4486 KOps/s $\color{#35bf28}+2.11\%$
test_compile_copy_nested[pytree-compile] 0.1421ms 63.3368μs 15.7886 KOps/s 15.3201 KOps/s $\color{#35bf28}+3.06\%$
test_compile_copy_nested[pytree-eager] 78.5610μs 49.1318μs 20.3534 KOps/s 20.0700 KOps/s $\color{#35bf28}+1.41\%$
test_compile_add_one_flat[tensordict-compile] 0.1819ms 0.1402ms 7.1343 KOps/s 6.8676 KOps/s $\color{#35bf28}+3.88\%$
test_compile_add_one_flat[tensordict-eager] 0.3590ms 0.2069ms 4.8329 KOps/s 4.7091 KOps/s $\color{#35bf28}+2.63\%$
test_compile_add_one_flat[tensorclass-compile] 0.1340ms 96.3000μs 10.3842 KOps/s 10.1660 KOps/s $\color{#35bf28}+2.15\%$
test_compile_add_one_flat[tensorclass-eager] 0.1353ms 53.0337μs 18.8559 KOps/s 18.9519 KOps/s $\color{#d91a1a}-0.51\%$
test_compile_add_one_flat[pytree-compile] 0.1791ms 0.1376ms 7.2662 KOps/s 6.9226 KOps/s $\color{#35bf28}+4.96\%$
test_compile_add_one_flat[pytree-eager] 0.5898ms 0.5219ms 1.9161 KOps/s 2.0243 KOps/s $\textbf{\color{#d91a1a}-5.35\%}$
test_compile_add_self_flat[tensordict-eager] 0.3544ms 0.2483ms 4.0280 KOps/s 4.0193 KOps/s $\color{#35bf28}+0.22\%$
test_compile_add_self_flat[tensordict-compile] 0.1822ms 0.1415ms 7.0651 KOps/s 6.9913 KOps/s $\color{#35bf28}+1.05\%$
test_compile_add_self_flat[tensorclass-eager] 0.1404ms 62.4288μs 16.0182 KOps/s 15.7654 KOps/s $\color{#35bf28}+1.60\%$
test_compile_add_self_flat[tensorclass-compile] 0.2317ms 97.2035μs 10.2877 KOps/s 9.8277 KOps/s $\color{#35bf28}+4.68\%$
test_compile_add_self_flat[pytree-eager] 0.4817ms 0.4388ms 2.2789 KOps/s 2.4126 KOps/s $\textbf{\color{#d91a1a}-5.54\%}$
test_compile_add_self_flat[pytree-compile] 0.1827ms 0.1381ms 7.2404 KOps/s 7.1308 KOps/s $\color{#35bf28}+1.54\%$
test_compile_copy_flat[tensordict-compile] 0.2022ms 27.6392μs 36.1805 KOps/s 52.3759 KOps/s $\textbf{\color{#d91a1a}-30.92\%}$
test_compile_copy_flat[tensordict-eager] 64.4710μs 27.0393μs 36.9832 KOps/s 36.6417 KOps/s $\color{#35bf28}+0.93\%$
test_compile_copy_flat[pytree-compile] 0.1283ms 69.3859μs 14.4121 KOps/s 14.3415 KOps/s $\color{#35bf28}+0.49\%$
test_compile_copy_flat[pytree-eager] 82.5610μs 51.6750μs 19.3517 KOps/s 19.4240 KOps/s $\color{#d91a1a}-0.37\%$
test_compile_assign_and_add[tensordict-compile] 1.6596ms 0.3963ms 2.5236 KOps/s 2.1927 KOps/s $\textbf{\color{#35bf28}+15.09\%}$
test_compile_assign_and_add[tensordict-eager] 3.2028ms 2.7944ms 357.8566 Ops/s 360.4011 Ops/s $\color{#d91a1a}-0.71\%$
test_compile_assign_and_add[pytree-compile] 1.6352ms 0.4407ms 2.2691 KOps/s 2.2206 KOps/s $\color{#35bf28}+2.18\%$
test_compile_assign_and_add[pytree-eager] 3.0330ms 2.8566ms 350.0643 Ops/s 354.7956 Ops/s $\color{#d91a1a}-1.33\%$
test_compile_indexing[tensor-tensordict-compile] 0.4100ms 0.1149ms 8.7055 KOps/s 8.1365 KOps/s $\textbf{\color{#35bf28}+6.99\%}$
test_compile_indexing[tensor-tensordict-eager] 0.5805ms 83.1589μs 12.0252 KOps/s 11.4177 KOps/s $\textbf{\color{#35bf28}+5.32\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.3953ms 0.1119ms 8.9362 KOps/s 9.1326 KOps/s $\color{#d91a1a}-2.15\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2099ms 70.0180μs 14.2820 KOps/s 14.1111 KOps/s $\color{#35bf28}+1.21\%$
test_compile_indexing[tensor-pytree-compile] 0.2066ms 0.1111ms 9.0022 KOps/s 9.2159 KOps/s $\color{#d91a1a}-2.32\%$
test_compile_indexing[tensor-pytree-eager] 0.1192ms 70.0232μs 14.2810 KOps/s 14.1374 KOps/s $\color{#35bf28}+1.02\%$
test_compile_indexing[slice-tensordict-compile] 0.2548ms 0.1075ms 9.2988 KOps/s 9.8400 KOps/s $\textbf{\color{#d91a1a}-5.50\%}$
test_compile_indexing[slice-tensordict-eager] 0.1538ms 18.0104μs 55.5235 KOps/s 54.5182 KOps/s $\color{#35bf28}+1.84\%$
test_compile_indexing[slice-tensorclass-compile] 0.2728ms 98.8945μs 10.1118 KOps/s 10.2490 KOps/s $\color{#d91a1a}-1.34\%$
test_compile_indexing[slice-tensorclass-eager] 0.1219ms 16.1665μs 61.8564 KOps/s 62.0558 KOps/s $\color{#d91a1a}-0.32\%$
test_compile_indexing[slice-pytree-compile] 0.2918ms 0.1023ms 9.7780 KOps/s 10.2091 KOps/s $\color{#d91a1a}-4.22\%$
test_compile_indexing[slice-pytree-eager] 48.6610μs 16.1136μs 62.0595 KOps/s 62.5605 KOps/s $\color{#d91a1a}-0.80\%$
test_compile_indexing[int-tensordict-compile] 0.2421ms 0.1063ms 9.4080 KOps/s 9.7531 KOps/s $\color{#d91a1a}-3.54\%$
test_compile_indexing[int-tensordict-eager] 0.5720ms 17.9233μs 55.7933 KOps/s 55.5097 KOps/s $\color{#35bf28}+0.51\%$
test_compile_indexing[int-tensorclass-compile] 0.2615ms 0.1032ms 9.6944 KOps/s 10.1572 KOps/s $\color{#d91a1a}-4.56\%$
test_compile_indexing[int-tensorclass-eager] 47.2610μs 16.0919μs 62.1432 KOps/s 62.0259 KOps/s $\color{#35bf28}+0.19\%$
test_compile_indexing[int-pytree-compile] 0.2018ms 99.2004μs 10.0806 KOps/s 10.1627 KOps/s $\color{#d91a1a}-0.81\%$
test_compile_indexing[int-pytree-eager] 46.1310μs 16.1806μs 61.8026 KOps/s 62.4001 KOps/s $\color{#d91a1a}-0.96\%$
test_mod_add[eager] 75.9210μs 31.9680μs 31.2813 KOps/s 30.7202 KOps/s $\color{#35bf28}+1.83\%$
test_mod_add[compile] 0.2473ms 77.7375μs 12.8638 KOps/s 12.7043 KOps/s $\color{#35bf28}+1.26\%$
test_mod_add[compile-overhead] 0.3106ms 0.1611ms 6.2068 KOps/s 5.8737 KOps/s $\textbf{\color{#35bf28}+5.67\%}$
test_mod_wrap[eager] 0.3262ms 0.2552ms 3.9188 KOps/s 3.8908 KOps/s $\color{#35bf28}+0.72\%$
test_mod_wrap[compile] 1.5851ms 0.2858ms 3.4990 KOps/s 3.4370 KOps/s $\color{#35bf28}+1.81\%$
test_mod_wrap[compile-overhead] 7.8839ms 4.0651ms 245.9975 Ops/s 255.6534 Ops/s $\color{#d91a1a}-3.78\%$
test_mod_wrap_and_backward[eager] 1.6328ms 1.5078ms 663.2340 Ops/s 682.0321 Ops/s $\color{#d91a1a}-2.76\%$
test_mod_wrap_and_backward[compile] 1.5596ms 1.4029ms 712.7888 Ops/s 719.0793 Ops/s $\color{#d91a1a}-0.87\%$
test_mod_wrap_and_backward[compile-overhead] 1.4938ms 1.0241ms 976.5118 Ops/s 968.8107 Ops/s $\color{#35bf28}+0.79\%$
test_seq_add[eager] 0.2399ms 96.2835μs 10.3860 KOps/s 9.8328 KOps/s $\textbf{\color{#35bf28}+5.63\%}$
test_seq_add[compile] 0.2189ms 92.1589μs 10.8508 KOps/s 10.7371 KOps/s $\color{#35bf28}+1.06\%$
test_seq_add[compile-overhead] 0.1698ms 0.1276ms 7.8385 KOps/s 7.7521 KOps/s $\color{#35bf28}+1.11\%$
test_seq_wrap[eager] 0.5133ms 0.3976ms 2.5149 KOps/s 2.4448 KOps/s $\color{#35bf28}+2.87\%$
test_seq_wrap[compile] 0.4351ms 0.3053ms 3.2758 KOps/s 3.2632 KOps/s $\color{#35bf28}+0.39\%$
test_seq_wrap[compile-overhead] 0.2950ms 0.2248ms 4.4483 KOps/s 4.4305 KOps/s $\color{#35bf28}+0.40\%$
test_func_call_runtime[False-eager] 0.8946ms 0.7535ms 1.3271 KOps/s 1.3058 KOps/s $\color{#35bf28}+1.63\%$
test_func_call_runtime[False-compile] 0.9822ms 0.7701ms 1.2986 KOps/s 1.2936 KOps/s $\color{#35bf28}+0.39\%$
test_func_call_runtime[False-compile-overhead] 0.5436ms 0.3639ms 2.7483 KOps/s 2.7270 KOps/s $\color{#35bf28}+0.78\%$
test_func_call_runtime[True-eager] 1.1954ms 0.9186ms 1.0886 KOps/s 1.0803 KOps/s $\color{#35bf28}+0.77\%$
test_func_call_runtime[True-compile] 0.8531ms 0.7839ms 1.2757 KOps/s 1.2612 KOps/s $\color{#35bf28}+1.14\%$
test_func_call_runtime[True-compile-overhead] 0.4628ms 0.3840ms 2.6042 KOps/s 2.5780 KOps/s $\color{#35bf28}+1.02\%$
test_func_call_cm_runtime[False-eager] 0.8326ms 0.7533ms 1.3276 KOps/s 1.3114 KOps/s $\color{#35bf28}+1.23\%$
test_func_call_cm_runtime[False-compile] 0.8386ms 0.7591ms 1.3174 KOps/s 1.3061 KOps/s $\color{#35bf28}+0.86\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4370ms 0.3650ms 2.7397 KOps/s 2.7204 KOps/s $\color{#35bf28}+0.71\%$
test_func_call_cm_runtime[True-eager] 1.1903ms 1.0204ms 979.9999 Ops/s 970.9429 Ops/s $\color{#35bf28}+0.93\%$
test_func_call_cm_runtime[True-compile] 0.8930ms 0.8113ms 1.2326 KOps/s 1.2234 KOps/s $\color{#35bf28}+0.76\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5348ms 0.4097ms 2.4407 KOps/s 2.4149 KOps/s $\color{#35bf28}+1.07\%$
test_vmap_func_call_cm_runtime[eager] 2.5776ms 2.1108ms 473.7562 Ops/s 472.4186 Ops/s $\color{#35bf28}+0.28\%$
test_vmap_func_call_cm_runtime[compile] 0.9731ms 0.8220ms 1.2165 KOps/s 1.1942 KOps/s $\color{#35bf28}+1.86\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4852ms 0.4132ms 2.4203 KOps/s 2.4113 KOps/s $\color{#35bf28}+0.37\%$
test_distributed 3.7597ms 0.1341ms 7.4567 KOps/s 8.7914 KOps/s $\textbf{\color{#d91a1a}-15.18\%}$
test_tdmodule 0.3124ms 14.5291μs 68.8276 KOps/s 72.7571 KOps/s $\textbf{\color{#d91a1a}-5.40\%}$
test_tdmodule_dispatch 46.7700μs 27.3677μs 36.5395 KOps/s 37.1916 KOps/s $\color{#d91a1a}-1.75\%$
test_tdseq 38.6410μs 15.6519μs 63.8899 KOps/s 65.5575 KOps/s $\color{#d91a1a}-2.54\%$
test_tdseq_dispatch 53.4700μs 30.4383μs 32.8534 KOps/s 33.0635 KOps/s $\color{#d91a1a}-0.64\%$
test_instantiation_functorch 1.7186ms 1.5747ms 635.0537 Ops/s 630.3275 Ops/s $\color{#35bf28}+0.75\%$
test_exec_functorch 0.2545ms 0.1505ms 6.6456 KOps/s 6.7186 KOps/s $\color{#d91a1a}-1.09\%$
test_exec_functional_call 0.1922ms 0.1430ms 6.9953 KOps/s 7.0488 KOps/s $\color{#d91a1a}-0.76\%$
test_exec_td_decorator 0.3758ms 0.1925ms 5.1944 KOps/s 5.3703 KOps/s $\color{#d91a1a}-3.28\%$
test_vmap_mlp_speed_decorator[True-True] 0.8242ms 0.7014ms 1.4257 KOps/s 1.4088 KOps/s $\color{#35bf28}+1.20\%$
test_vmap_mlp_speed_decorator[True-False] 0.8253ms 0.6930ms 1.4431 KOps/s 1.4069 KOps/s $\color{#35bf28}+2.57\%$
test_vmap_mlp_speed_decorator[False-True] 0.7571ms 0.6091ms 1.6418 KOps/s 1.5981 KOps/s $\color{#35bf28}+2.73\%$
test_vmap_mlp_speed_decorator[False-False] 0.7117ms 0.5951ms 1.6804 KOps/s 1.6323 KOps/s $\color{#35bf28}+2.95\%$
test_vmap_transformer_speed_decorator[True-True] 20.0284ms 19.7176ms 50.7161 Ops/s 50.9890 Ops/s $\color{#d91a1a}-0.54\%$
test_vmap_transformer_speed_decorator[True-False] 20.4206ms 19.7829ms 50.5487 Ops/s 51.0459 Ops/s $\color{#d91a1a}-0.97\%$
test_vmap_transformer_speed_decorator[False-True] 19.7722ms 19.6211ms 50.9655 Ops/s 51.4541 Ops/s $\color{#d91a1a}-0.95\%$
test_vmap_transformer_speed_decorator[False-False] 19.6675ms 19.5746ms 51.0866 Ops/s 51.3989 Ops/s $\color{#d91a1a}-0.61\%$
test_to_module_speed[True] 2.1616ms 0.9405ms 1.0632 KOps/s 1.0560 KOps/s $\color{#35bf28}+0.69\%$
test_to_module_speed[False] 1.0108ms 0.9166ms 1.0909 KOps/s 1.0937 KOps/s $\color{#d91a1a}-0.25\%$
test_tc_init 77.0210μs 37.0087μs 27.0207 KOps/s 29.4483 KOps/s $\textbf{\color{#d91a1a}-8.24\%}$
test_tc_init_nested 0.1150ms 68.6509μs 14.5665 KOps/s 13.9524 KOps/s $\color{#35bf28}+4.40\%$
test_tc_first_layer_tensor 4.7229μs 0.6898μs 1.4498 MOps/s 1.4425 MOps/s $\color{#35bf28}+0.51\%$
test_tc_first_layer_nontensor 25.8100μs 2.3107μs 432.7720 KOps/s 429.0044 KOps/s $\color{#35bf28}+0.88\%$
test_tc_second_layer_tensor 8.6277μs 1.4112μs 708.5949 KOps/s 704.2830 KOps/s $\color{#35bf28}+0.61\%$
test_tc_second_layer_nontensor 25.2510μs 3.0671μs 326.0420 KOps/s 325.8264 KOps/s $\color{#35bf28}+0.07\%$
test_unbind 7.0541ms 6.7325ms 148.5324 Ops/s 148.5034 Ops/s $\color{#35bf28}+0.02\%$
test_full_like 10.8952ms 9.3820ms 106.5872 Ops/s 103.9159 Ops/s $\color{#35bf28}+2.57\%$
test_zeros_like 9.1767ms 7.1421ms 140.0139 Ops/s 113.9894 Ops/s $\textbf{\color{#35bf28}+22.83\%}$
test_ones_like 4.9481ms 4.2736ms 233.9934 Ops/s 231.5253 Ops/s $\color{#35bf28}+1.07\%$
test_clone 6.8602ms 6.4225ms 155.7035 Ops/s 155.5442 Ops/s $\color{#35bf28}+0.10\%$
test_squeeze 78.7710μs 10.0152μs 99.8487 KOps/s 91.4597 KOps/s $\textbf{\color{#35bf28}+9.17\%}$
test_unsqueeze 0.1282ms 77.1532μs 12.9612 KOps/s 13.5811 KOps/s $\color{#d91a1a}-4.56\%$
test_split 0.3077ms 0.1731ms 5.7766 KOps/s 5.9984 KOps/s $\color{#d91a1a}-3.70\%$
test_permute 0.2368ms 0.1894ms 5.2809 KOps/s 5.3346 KOps/s $\color{#d91a1a}-1.01\%$
test_stack 50.8373ms 50.5695ms 19.7748 Ops/s 19.7770 Ops/s $\color{#d91a1a}-0.01\%$
test_cat 50.8380ms 50.4965ms 19.8033 Ops/s 19.8564 Ops/s $\color{#d91a1a}-0.27\%$

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 5, 2024
ghstack-source-id: 3dfb0b66fae82dc8cf5ef2a14eccb1bec5237ebb
Pull Request resolved: #1073
@vmoens vmoens merged commit 50c43f9 into gh/vmoens/35/base Nov 5, 2024
50 of 51 checks passed
vmoens added a commit that referenced this pull request Nov 5, 2024
ghstack-source-id: 3dfb0b66fae82dc8cf5ef2a14eccb1bec5237ebb
Pull Request resolved: #1073
@vmoens vmoens deleted the gh/vmoens/35/head branch November 5, 2024 15:28
vmoens added a commit that referenced this pull request Nov 14, 2024
ghstack-source-id: 3dfb0b66fae82dc8cf5ef2a14eccb1bec5237ebb
Pull Request resolved: #1073

(cherry picked from commit 6272510)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Performance
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants