[Versioning] python 3.8 compatibility fix #1127

Merged
merged 2 commits into from
Dec 4, 2024

Conversation

vmoens
Contributor

@vmoens vmoens commented Dec 4, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: b59931a5b6d3062593228cb16fb026438c3c6105
Pull Request resolved: #1127
@facebook-github-bot added the CLA Signed label Dec 4, 2024
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: ba7e9325c4125892522ee63253c148ed34adac7c
Pull Request resolved: #1127
@vmoens vmoens merged commit c4ef995 into gh/vmoens/35/base Dec 4, 2024
13 of 23 checks passed
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: ba7e9325c4125892522ee63253c148ed34adac7c
Pull Request resolved: #1127
@vmoens vmoens deleted the gh/vmoens/35/head branch December 4, 2024 11:33

github-actions bot commented Dec 4, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}11$. Worsened: $\large\color{#d91a1a}12$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 44.5740μs 18.5849μs 53.8070 KOps/s 54.1005 KOps/s $\color{#d91a1a}-0.54\%$
test_plain_set_stack_nested 43.8120μs 18.6885μs 53.5090 KOps/s 52.9763 KOps/s $\color{#35bf28}+1.01\%$
test_plain_set_nested_inplace 85.6890μs 20.3171μs 49.2196 KOps/s 47.9837 KOps/s $\color{#35bf28}+2.58\%$
test_plain_set_stack_nested_inplace 77.0540μs 20.3523μs 49.1345 KOps/s 49.0546 KOps/s $\color{#35bf28}+0.16\%$
test_items 20.6690μs 4.1807μs 239.1971 KOps/s 242.3602 KOps/s $\color{#d91a1a}-1.31\%$
test_items_nested 0.7226ms 0.3986ms 2.5090 KOps/s 2.5297 KOps/s $\color{#d91a1a}-0.82\%$
test_items_nested_locked 0.8204ms 0.3993ms 2.5043 KOps/s 2.5177 KOps/s $\color{#d91a1a}-0.53\%$
test_items_nested_leaf 0.1241ms 71.2917μs 14.0269 KOps/s 13.9613 KOps/s $\color{#35bf28}+0.47\%$
test_items_stack_nested 0.6131ms 0.3978ms 2.5136 KOps/s 2.5200 KOps/s $\color{#d91a1a}-0.25\%$
test_items_stack_nested_leaf 0.1633ms 74.2696μs 13.4645 KOps/s 13.1662 KOps/s $\color{#35bf28}+2.27\%$
test_items_stack_nested_locked 0.5212ms 0.4017ms 2.4892 KOps/s 2.4858 KOps/s $\color{#35bf28}+0.14\%$
test_keys 24.6660μs 3.4675μs 288.3900 KOps/s 287.7919 KOps/s $\color{#35bf28}+0.21\%$
test_keys_nested 0.6707ms 0.1431ms 6.9890 KOps/s 7.3559 KOps/s $\color{#d91a1a}-4.99\%$
test_keys_nested_locked 2.1093ms 0.1478ms 6.7662 KOps/s 7.0405 KOps/s $\color{#d91a1a}-3.90\%$
test_keys_nested_leaf 0.2299ms 0.1169ms 8.5533 KOps/s 8.5642 KOps/s $\color{#d91a1a}-0.13\%$
test_keys_stack_nested 0.2205ms 0.1356ms 7.3736 KOps/s 7.4071 KOps/s $\color{#d91a1a}-0.45\%$
test_keys_stack_nested_leaf 0.2265ms 0.1174ms 8.5191 KOps/s 8.6549 KOps/s $\color{#d91a1a}-1.57\%$
test_keys_stack_nested_locked 0.2769ms 0.1409ms 7.0949 KOps/s 7.0135 KOps/s $\color{#35bf28}+1.16\%$
test_values 26.3412μs 1.0758μs 929.5178 KOps/s 897.7016 KOps/s $\color{#35bf28}+3.54\%$
test_values_nested 0.1071ms 55.9141μs 17.8846 KOps/s 18.1807 KOps/s $\color{#d91a1a}-1.63\%$
test_values_nested_locked 0.2971ms 56.1038μs 17.8241 KOps/s 18.0973 KOps/s $\color{#d91a1a}-1.51\%$
test_values_nested_leaf 0.1166ms 60.5285μs 16.5212 KOps/s 16.5675 KOps/s $\color{#d91a1a}-0.28\%$
test_values_stack_nested 0.1072ms 56.7348μs 17.6259 KOps/s 17.7872 KOps/s $\color{#d91a1a}-0.91\%$
test_values_stack_nested_leaf 0.1391ms 60.3617μs 16.5668 KOps/s 15.9750 KOps/s $\color{#35bf28}+3.70\%$
test_values_stack_nested_locked 0.3346ms 57.0957μs 17.5145 KOps/s 17.2385 KOps/s $\color{#35bf28}+1.60\%$
test_membership 11.7920μs 0.8675μs 1.1528 MOps/s 1.1660 MOps/s $\color{#d91a1a}-1.13\%$
test_membership_nested 40.7760μs 2.9257μs 341.7939 KOps/s 345.3711 KOps/s $\color{#d91a1a}-1.04\%$
test_membership_nested_leaf 27.8210μs 3.0197μs 331.1593 KOps/s 339.3983 KOps/s $\color{#d91a1a}-2.43\%$
test_membership_stacked_nested 16.9220μs 2.9158μs 342.9570 KOps/s 342.1109 KOps/s $\color{#35bf28}+0.25\%$
test_membership_stacked_nested_leaf 16.5510μs 2.9197μs 342.5042 KOps/s 339.9051 KOps/s $\color{#35bf28}+0.76\%$
test_membership_nested_last 34.9750μs 4.1613μs 240.3082 KOps/s 235.4210 KOps/s $\color{#35bf28}+2.08\%$
test_membership_nested_leaf_last 31.6290μs 4.1966μs 238.2862 KOps/s 229.5091 KOps/s $\color{#35bf28}+3.82\%$
test_membership_stacked_nested_last 34.8850μs 6.3817μs 156.6981 KOps/s 234.5219 KOps/s $\textbf{\color{#d91a1a}-33.18\%}$
test_membership_stacked_nested_leaf_last 44.3720μs 6.2443μs 160.1454 KOps/s 236.1297 KOps/s $\textbf{\color{#d91a1a}-32.18\%}$
test_nested_getleaf 34.6550μs 10.7589μs 92.9465 KOps/s 91.6146 KOps/s $\color{#35bf28}+1.45\%$
test_nested_get 66.7850μs 10.1967μs 98.0712 KOps/s 96.2388 KOps/s $\color{#35bf28}+1.90\%$
test_stacked_getleaf 59.3110μs 10.6734μs 93.6905 KOps/s 93.7308 KOps/s $\color{#d91a1a}-0.04\%$
test_stacked_get 38.9130μs 10.2306μs 97.7464 KOps/s 95.0987 KOps/s $\color{#35bf28}+2.78\%$
test_nested_getitemleaf 0.2564ms 11.6034μs 86.1814 KOps/s 88.5762 KOps/s $\color{#d91a1a}-2.70\%$
test_nested_getitem 0.1416ms 10.8975μs 91.7645 KOps/s 95.8040 KOps/s $\color{#d91a1a}-4.22\%$
test_stacked_getitemleaf 34.6850μs 11.1740μs 89.4937 KOps/s 89.6071 KOps/s $\color{#d91a1a}-0.13\%$
test_stacked_getitem 69.6220μs 10.3326μs 96.7811 KOps/s 95.8049 KOps/s $\color{#35bf28}+1.02\%$
test_lock_nested 4.3557ms 0.4496ms 2.2242 KOps/s 2.1831 KOps/s $\color{#35bf28}+1.88\%$
test_lock_stack_nested 0.7435ms 0.4102ms 2.4376 KOps/s 2.3948 KOps/s $\color{#35bf28}+1.79\%$
test_unlock_nested 1.1233ms 0.3674ms 2.7215 KOps/s 2.6511 KOps/s $\color{#35bf28}+2.65\%$
test_unlock_stack_nested 0.5585ms 0.3254ms 3.0736 KOps/s 2.9405 KOps/s $\color{#35bf28}+4.53\%$
test_flatten_speed 0.5986ms 95.2225μs 10.5017 KOps/s 10.3943 KOps/s $\color{#35bf28}+1.03\%$
test_unflatten_speed 0.6192ms 0.4895ms 2.0429 KOps/s 2.0230 KOps/s $\color{#35bf28}+0.98\%$
test_common_ops 3.6704ms 0.8009ms 1.2486 KOps/s 1.2286 KOps/s $\color{#35bf28}+1.63\%$
test_creation 20.3680μs 2.0728μs 482.4357 KOps/s 497.5153 KOps/s $\color{#d91a1a}-3.03\%$
test_creation_empty 41.8180μs 12.2298μs 81.7676 KOps/s 82.1081 KOps/s $\color{#d91a1a}-0.41\%$
test_creation_nested_1 58.7690μs 15.1195μs 66.1399 KOps/s 64.9257 KOps/s $\color{#35bf28}+1.87\%$
test_creation_nested_2 0.1305ms 21.1487μs 47.2842 KOps/s 47.6513 KOps/s $\color{#d91a1a}-0.77\%$
test_clone 90.3090μs 12.8881μs 77.5910 KOps/s 75.3643 KOps/s $\color{#35bf28}+2.95\%$
test_getitem[int] 1.2710ms 12.9437μs 77.2578 KOps/s 80.1297 KOps/s $\color{#d91a1a}-3.58\%$
test_getitem[slice_int] 0.1791ms 24.6092μs 40.6351 KOps/s 40.1878 KOps/s $\color{#35bf28}+1.11\%$
test_getitem[range] 0.2897ms 49.8539μs 20.0586 KOps/s 19.8105 KOps/s $\color{#35bf28}+1.25\%$
test_getitem[tuple] 0.1452ms 20.2335μs 49.4231 KOps/s 49.3415 KOps/s $\color{#35bf28}+0.17\%$
test_getitem[list] 0.3595ms 44.3689μs 22.5383 KOps/s 22.2996 KOps/s $\color{#35bf28}+1.07\%$
test_setitem_dim[int] 78.7470μs 25.6312μs 39.0150 KOps/s 39.1122 KOps/s $\color{#d91a1a}-0.25\%$
test_setitem_dim[slice_int] 0.1348ms 51.2195μs 19.5238 KOps/s 18.5968 KOps/s $\color{#35bf28}+4.98\%$
test_setitem_dim[range] 0.1166ms 73.5504μs 13.5961 KOps/s 13.3606 KOps/s $\color{#35bf28}+1.76\%$
test_setitem_dim[tuple] 71.4640μs 40.8728μs 24.4662 KOps/s 23.6727 KOps/s $\color{#35bf28}+3.35\%$
test_setitem 0.1242ms 20.7587μs 48.1725 KOps/s 47.0215 KOps/s $\color{#35bf28}+2.45\%$
test_set 0.1137ms 20.5923μs 48.5619 KOps/s 47.2386 KOps/s $\color{#35bf28}+2.80\%$
test_set_shared 1.3548ms 0.1708ms 5.8554 KOps/s 5.5864 KOps/s $\color{#35bf28}+4.81\%$
test_update 0.1547ms 23.8911μs 41.8566 KOps/s 42.6568 KOps/s $\color{#d91a1a}-1.88\%$
test_update_nested 1.1011ms 34.4195μs 29.0533 KOps/s 28.7742 KOps/s $\color{#35bf28}+0.97\%$
test_update__nested 0.1513ms 32.4329μs 30.8329 KOps/s 30.4850 KOps/s $\color{#35bf28}+1.14\%$
test_set_nested 0.1456ms 22.9445μs 43.5834 KOps/s 44.2709 KOps/s $\color{#d91a1a}-1.55\%$
test_set_nested_new 0.1611ms 27.0903μs 36.9136 KOps/s 35.8039 KOps/s $\color{#35bf28}+3.10\%$
test_select 0.1419ms 42.4771μs 23.5421 KOps/s 22.2761 KOps/s $\textbf{\color{#35bf28}+5.68\%}$
test_select_nested 0.4560ms 61.6673μs 16.2161 KOps/s 16.7264 KOps/s $\color{#d91a1a}-3.05\%$
test_exclude_nested 0.1535ms 79.9095μs 12.5142 KOps/s 12.9048 KOps/s $\color{#d91a1a}-3.03\%$
test_empty[True] 0.4670ms 0.3826ms 2.6138 KOps/s 2.6549 KOps/s $\color{#d91a1a}-1.55\%$
test_empty[False] 9.0690μs 1.2046μs 830.1813 KOps/s 812.6427 KOps/s $\color{#35bf28}+2.16\%$
test_unbind_speed 0.3893ms 0.2596ms 3.8515 KOps/s 3.8126 KOps/s $\color{#35bf28}+1.02\%$
test_unbind_speed_stack0 0.5250ms 0.2566ms 3.8976 KOps/s 3.8291 KOps/s $\color{#35bf28}+1.79\%$
test_unbind_speed_stack1 0.1084s 0.7613ms 1.3136 KOps/s 1.4237 KOps/s $\textbf{\color{#d91a1a}-7.73\%}$
test_split 2.5048ms 1.5887ms 629.4260 Ops/s 557.9234 Ops/s $\textbf{\color{#35bf28}+12.82\%}$
test_chunk 0.1120s 1.9261ms 519.1776 Ops/s 558.7965 Ops/s $\textbf{\color{#d91a1a}-7.09\%}$
test_consolidate_njt[False-None] 10.6630ms 8.2288ms 121.5242 Ops/s 116.6861 Ops/s $\color{#35bf28}+4.15\%$
test_creation[device0] 0.2334ms 91.3167μs 10.9509 KOps/s 10.6104 KOps/s $\color{#35bf28}+3.21\%$
test_creation_from_tensor 3.3085ms 95.8635μs 10.4315 KOps/s 10.2018 KOps/s $\color{#35bf28}+2.25\%$
test_add_one[memmap_tensor0] 0.2364ms 4.7724μs 209.5400 KOps/s 207.4388 KOps/s $\color{#35bf28}+1.01\%$
test_contiguous[memmap_tensor0] 16.0700μs 0.5127μs 1.9506 MOps/s 1.9255 MOps/s $\color{#35bf28}+1.31\%$
test_stack[memmap_tensor0] 46.9780μs 3.5935μs 278.2794 KOps/s 282.5147 KOps/s $\color{#d91a1a}-1.50\%$
test_memmaptd_index 1.0082ms 0.2365ms 4.2280 KOps/s 4.1874 KOps/s $\color{#35bf28}+0.97\%$
test_memmaptd_index_astensor 0.5765ms 0.3111ms 3.2147 KOps/s 3.1287 KOps/s $\color{#35bf28}+2.75\%$
test_memmaptd_index_op 1.0498ms 0.5919ms 1.6896 KOps/s 1.6596 KOps/s $\color{#35bf28}+1.81\%$
test_serialize_model 0.1214s 0.1159s 8.6296 Ops/s 8.3853 Ops/s $\color{#35bf28}+2.91\%$
test_serialize_model_pickle 0.4454s 0.3859s 2.5913 Ops/s 2.5339 Ops/s $\color{#35bf28}+2.27\%$
test_serialize_weights 0.2190s 0.1268s 7.8882 Ops/s 7.2169 Ops/s $\textbf{\color{#35bf28}+9.30\%}$
test_serialize_weights_returnearly 0.1809s 0.1582s 6.3198 Ops/s 6.2881 Ops/s $\color{#35bf28}+0.50\%$
test_serialize_weights_pickle 0.5613s 0.4604s 2.1722 Ops/s 2.4635 Ops/s $\textbf{\color{#d91a1a}-11.83\%}$
test_serialize_weights_filesystem 0.1535s 0.1423s 7.0289 Ops/s 6.9098 Ops/s $\color{#35bf28}+1.72\%$
test_serialize_model_filesystem 0.1556s 0.1468s 6.8114 Ops/s 6.4313 Ops/s $\textbf{\color{#35bf28}+5.91\%}$
test_reshape_pytree 68.8390μs 27.2516μs 36.6951 KOps/s 37.7531 KOps/s $\color{#d91a1a}-2.80\%$
test_reshape_td 72.4650μs 33.0727μs 30.2364 KOps/s 30.2364 KOps/s $-0.00\%$
test_view_pytree 60.0130μs 26.6484μs 37.5256 KOps/s 35.6904 KOps/s $\textbf{\color{#35bf28}+5.14\%}$
test_view_td 85.3390μs 37.5263μs 26.6480 KOps/s 26.5131 KOps/s $\color{#35bf28}+0.51\%$
test_unbind_pytree 77.1150μs 29.9878μs 33.3469 KOps/s 33.8308 KOps/s $\color{#d91a1a}-1.43\%$
test_unbind_td 0.3240ms 37.7225μs 26.5094 KOps/s 25.7234 KOps/s $\color{#35bf28}+3.06\%$
test_split_pytree 73.9780μs 29.5100μs 33.8869 KOps/s 34.1553 KOps/s $\color{#d91a1a}-0.79\%$
test_split_td 0.1017s 53.2989μs 18.7621 KOps/s 22.6622 KOps/s $\textbf{\color{#d91a1a}-17.21\%}$
test_add_pytree 0.1054ms 35.6397μs 28.0586 KOps/s 28.0542 KOps/s $\color{#35bf28}+0.02\%$
test_add_td 0.1265ms 59.0482μs 16.9353 KOps/s 18.4918 KOps/s $\textbf{\color{#d91a1a}-8.42\%}$
test_compile_add_one_nested[tensordict-compile] 0.1364ms 63.4412μs 15.7626 KOps/s 15.5229 KOps/s $\color{#35bf28}+1.54\%$
test_compile_add_one_nested[tensordict-eager] 0.5195ms 0.1655ms 6.0408 KOps/s 6.2005 KOps/s $\color{#d91a1a}-2.58\%$
test_compile_add_one_nested[pytree-compile] 0.1061ms 46.3833μs 21.5595 KOps/s 21.5630 KOps/s $\color{#d91a1a}-0.02\%$
test_compile_add_one_nested[pytree-eager] 0.2217ms 0.1185ms 8.4365 KOps/s 8.4273 KOps/s $\color{#35bf28}+0.11\%$
test_compile_copy_nested[tensordict-compile] 88.9270μs 25.7728μs 38.8005 KOps/s 38.4174 KOps/s $\color{#35bf28}+1.00\%$
test_compile_copy_nested[tensordict-eager] 0.1028ms 53.1685μs 18.8081 KOps/s 18.4753 KOps/s $\color{#35bf28}+1.80\%$
test_compile_copy_nested[pytree-compile] 0.1442ms 77.5692μs 12.8917 KOps/s 12.7307 KOps/s $\color{#35bf28}+1.27\%$
test_compile_copy_nested[pytree-eager] 0.1523ms 65.9976μs 15.1521 KOps/s 14.7282 KOps/s $\color{#35bf28}+2.88\%$
test_compile_add_one_flat[tensordict-compile] 0.2369ms 0.1062ms 9.4194 KOps/s 9.6190 KOps/s $\color{#d91a1a}-2.07\%$
test_compile_add_one_flat[tensordict-eager] 0.3214ms 0.1970ms 5.0760 KOps/s 5.0852 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_add_one_flat[tensorclass-compile] 0.1114ms 45.3770μs 22.0376 KOps/s 22.6510 KOps/s $\color{#d91a1a}-2.71\%$
test_compile_add_one_flat[tensorclass-eager] 0.4853ms 61.4536μs 16.2724 KOps/s 16.2818 KOps/s $\color{#d91a1a}-0.06\%$
test_compile_add_one_flat[pytree-compile] 0.2178ms 0.1054ms 9.4841 KOps/s 9.5222 KOps/s $\color{#d91a1a}-0.40\%$
test_compile_add_one_flat[pytree-eager] 0.3506ms 0.2000ms 4.9991 KOps/s 5.0058 KOps/s $\color{#d91a1a}-0.13\%$
test_compile_add_self_flat[tensordict-eager] 0.4361ms 0.2086ms 4.7932 KOps/s 4.7901 KOps/s $\color{#35bf28}+0.06\%$
test_compile_add_self_flat[tensordict-compile] 0.1783ms 0.1060ms 9.4373 KOps/s 9.5990 KOps/s $\color{#d91a1a}-1.68\%$
test_compile_add_self_flat[tensorclass-eager] 0.1172ms 55.1597μs 18.1292 KOps/s 18.0475 KOps/s $\color{#35bf28}+0.45\%$
test_compile_add_self_flat[tensorclass-compile] 0.1028ms 47.7454μs 20.9444 KOps/s 22.1273 KOps/s $\textbf{\color{#d91a1a}-5.35\%}$
test_compile_add_self_flat[pytree-eager] 0.3360ms 0.1602ms 6.2416 KOps/s 6.3525 KOps/s $\color{#d91a1a}-1.74\%$
test_compile_add_self_flat[pytree-compile] 0.2485ms 0.1069ms 9.3560 KOps/s 9.7777 KOps/s $\color{#d91a1a}-4.31\%$
test_compile_copy_flat[tensordict-compile] 80.6700μs 21.0643μs 47.4738 KOps/s 47.7861 KOps/s $\color{#d91a1a}-0.65\%$
test_compile_copy_flat[tensordict-eager] 0.1352ms 60.3162μs 16.5793 KOps/s 17.2212 KOps/s $\color{#d91a1a}-3.73\%$
test_compile_copy_flat[pytree-compile] 0.1490ms 80.6929μs 12.3927 KOps/s 12.4618 KOps/s $\color{#d91a1a}-0.55\%$
test_compile_copy_flat[pytree-eager] 0.1244ms 67.6832μs 14.7747 KOps/s 14.5642 KOps/s $\color{#35bf28}+1.45\%$
test_compile_assign_and_add[tensordict-compile] 0.3247ms 0.2115ms 4.7287 KOps/s 4.9393 KOps/s $\color{#d91a1a}-4.26\%$
test_compile_assign_and_add[tensordict-eager] 1.4396ms 1.2891ms 775.7465 Ops/s 771.6264 Ops/s $\color{#35bf28}+0.53\%$
test_compile_assign_and_add[pytree-compile] 0.3001ms 0.2065ms 4.8430 KOps/s 4.9810 KOps/s $\color{#d91a1a}-2.77\%$
test_compile_assign_and_add[pytree-eager] 0.8738ms 0.7773ms 1.2866 KOps/s 1.2871 KOps/s $\color{#d91a1a}-0.04\%$
test_compile_assign_and_add_stack[compile] 0.5776ms 0.4633ms 2.1585 KOps/s 2.2286 KOps/s $\color{#d91a1a}-3.15\%$
test_compile_assign_and_add_stack[eager] 3.2840ms 2.6701ms 374.5175 Ops/s 368.7355 Ops/s $\color{#35bf28}+1.57\%$
test_compile_indexing[tensor-tensordict-compile] 0.1067ms 37.2024μs 26.8800 KOps/s 27.7460 KOps/s $\color{#d91a1a}-3.12\%$
test_compile_indexing[tensor-tensordict-eager] 0.6959ms 33.6023μs 29.7599 KOps/s 20.6146 KOps/s $\textbf{\color{#35bf28}+44.36\%}$
test_compile_indexing[tensor-tensorclass-compile] 89.9580μs 30.2915μs 33.0126 KOps/s 32.9360 KOps/s $\color{#35bf28}+0.23\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1077ms 23.7815μs 42.0494 KOps/s 43.0419 KOps/s $\color{#d91a1a}-2.31\%$
test_compile_indexing[tensor-pytree-compile] 0.1296ms 31.4687μs 31.7777 KOps/s 32.2791 KOps/s $\color{#d91a1a}-1.55\%$
test_compile_indexing[tensor-pytree-eager] 0.1890ms 24.6172μs 40.6221 KOps/s 42.2722 KOps/s $\color{#d91a1a}-3.90\%$
test_compile_indexing[slice-tensordict-compile] 0.1326ms 51.5790μs 19.3877 KOps/s 19.1011 KOps/s $\color{#35bf28}+1.50\%$
test_compile_indexing[slice-tensordict-eager] 0.5994ms 19.8568μs 50.3606 KOps/s 49.4737 KOps/s $\color{#35bf28}+1.79\%$
test_compile_indexing[slice-tensorclass-compile] 0.1501ms 44.0731μs 22.6896 KOps/s 22.7659 KOps/s $\color{#d91a1a}-0.34\%$
test_compile_indexing[slice-tensorclass-eager] 64.2400μs 18.8850μs 52.9521 KOps/s 53.9675 KOps/s $\color{#d91a1a}-1.88\%$
test_compile_indexing[slice-pytree-compile] 0.1170ms 45.2780μs 22.0858 KOps/s 22.0607 KOps/s $\color{#35bf28}+0.11\%$
test_compile_indexing[slice-pytree-eager] 68.1170μs 18.9864μs 52.6694 KOps/s 53.0921 KOps/s $\color{#d91a1a}-0.80\%$
test_compile_indexing[int-tensordict-compile] 0.1052ms 52.5850μs 19.0168 KOps/s 18.2448 KOps/s $\color{#35bf28}+4.23\%$
test_compile_indexing[int-tensordict-eager] 1.0700ms 19.5023μs 51.2760 KOps/s 50.3881 KOps/s $\color{#35bf28}+1.76\%$
test_compile_indexing[int-tensorclass-compile] 0.1342ms 43.8320μs 22.8144 KOps/s 21.6929 KOps/s $\textbf{\color{#35bf28}+5.17\%}$
test_compile_indexing[int-tensorclass-eager] 73.7670μs 18.6572μs 53.5987 KOps/s 53.7210 KOps/s $\color{#d91a1a}-0.23\%$
test_compile_indexing[int-pytree-compile] 97.9430μs 45.0525μs 22.1963 KOps/s 21.9676 KOps/s $\color{#35bf28}+1.04\%$
test_compile_indexing[int-pytree-eager] 68.0370μs 18.7029μs 53.4676 KOps/s 53.6262 KOps/s $\color{#d91a1a}-0.30\%$
test_mod_add[eager] 95.7390μs 35.9034μs 27.8525 KOps/s 27.9449 KOps/s $\color{#d91a1a}-0.33\%$
test_mod_add[compile] 0.1157ms 48.7420μs 20.5162 KOps/s 20.6276 KOps/s $\color{#d91a1a}-0.54\%$
test_mod_add[compile-overhead] 99.6560μs 49.0696μs 20.3792 KOps/s 20.6923 KOps/s $\color{#d91a1a}-1.51\%$
test_mod_wrap[eager] 0.5368ms 0.2321ms 4.3076 KOps/s 4.4375 KOps/s $\color{#d91a1a}-2.93\%$
test_mod_wrap[compile] 0.4169ms 0.2147ms 4.6566 KOps/s 4.8597 KOps/s $\color{#d91a1a}-4.18\%$
test_mod_wrap[compile-overhead] 0.3173ms 0.2096ms 4.7705 KOps/s 4.8573 KOps/s $\color{#d91a1a}-1.79\%$
test_mod_wrap_and_backward[eager] 13.4372ms 11.2529ms 88.8659 Ops/s 89.9260 Ops/s $\color{#d91a1a}-1.18\%$
test_mod_wrap_and_backward[compile] 16.1758ms 12.5156ms 79.9000 Ops/s 87.5073 Ops/s $\textbf{\color{#d91a1a}-8.69\%}$
test_mod_wrap_and_backward[compile-overhead] 18.2842ms 13.0361ms 76.7101 Ops/s 76.6244 Ops/s $\color{#35bf28}+0.11\%$
test_seq_add[eager] 0.3396ms 0.1161ms 8.6142 KOps/s 8.5644 KOps/s $\color{#35bf28}+0.58\%$
test_seq_add[compile] 0.1194ms 63.4331μs 15.7646 KOps/s 16.4601 KOps/s $\color{#d91a1a}-4.22\%$
test_seq_add[compile-overhead] 0.3202ms 63.1844μs 15.8267 KOps/s 16.5280 KOps/s $\color{#d91a1a}-4.24\%$
test_seq_wrap[eager] 0.6212ms 0.4479ms 2.2328 KOps/s 2.2443 KOps/s $\color{#d91a1a}-0.51\%$
test_seq_wrap[compile] 0.6495ms 0.2381ms 4.1995 KOps/s 4.3516 KOps/s $\color{#d91a1a}-3.49\%$
test_seq_wrap[compile-overhead] 0.5221ms 0.2319ms 4.3129 KOps/s 4.3661 KOps/s $\color{#d91a1a}-1.22\%$
test_func_call_runtime[False-eager] 1.0503ms 0.5545ms 1.8035 KOps/s 1.8315 KOps/s $\color{#d91a1a}-1.53\%$
test_func_call_runtime[False-compile] 0.8818ms 0.4331ms 2.3091 KOps/s 2.3660 KOps/s $\color{#d91a1a}-2.41\%$
test_func_call_runtime[False-compile-overhead] 0.8850ms 0.4343ms 2.3026 KOps/s 2.3694 KOps/s $\color{#d91a1a}-2.82\%$
test_func_call_runtime[True-eager] 0.8877ms 0.7510ms 1.3315 KOps/s 1.3362 KOps/s $\color{#d91a1a}-0.35\%$
test_func_call_runtime[True-compile] 0.7013ms 0.4739ms 2.1101 KOps/s 2.1373 KOps/s $\color{#d91a1a}-1.27\%$
test_func_call_runtime[True-compile-overhead] 0.6532ms 0.4725ms 2.1164 KOps/s 2.1220 KOps/s $\color{#d91a1a}-0.27\%$
test_func_call_cm_runtime[False-eager] 0.6944ms 0.5397ms 1.8528 KOps/s 1.8246 KOps/s $\color{#35bf28}+1.55\%$
test_func_call_cm_runtime[False-compile] 0.8519ms 0.4316ms 2.3171 KOps/s 2.3545 KOps/s $\color{#d91a1a}-1.59\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5886ms 0.4318ms 2.3159 KOps/s 2.3744 KOps/s $\color{#d91a1a}-2.47\%$
test_func_call_cm_runtime[True-eager] 1.4480ms 0.9091ms 1.1000 KOps/s 1.1315 KOps/s $\color{#d91a1a}-2.79\%$
test_func_call_cm_runtime[True-compile] 0.7105ms 0.5002ms 1.9992 KOps/s 2.0118 KOps/s $\color{#d91a1a}-0.63\%$
test_func_call_cm_runtime[True-compile-overhead] 0.8257ms 0.4982ms 2.0072 KOps/s 2.0169 KOps/s $\color{#d91a1a}-0.48\%$
test_vmap_func_call_cm_runtime[eager] 2.3760ms 1.9053ms 524.8501 Ops/s 533.6447 Ops/s $\color{#d91a1a}-1.65\%$
test_vmap_func_call_cm_runtime[compile] 0.8839ms 0.5229ms 1.9125 KOps/s 1.9169 KOps/s $\color{#d91a1a}-0.23\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.8664ms 0.5299ms 1.8870 KOps/s 1.9119 KOps/s $\color{#d91a1a}-1.30\%$
test_distributed 0.2480ms 0.1261ms 7.9314 KOps/s 7.3282 KOps/s $\textbf{\color{#35bf28}+8.23\%}$
test_tdmodule 81.1520μs 26.5097μs 37.7220 KOps/s 37.3176 KOps/s $\color{#35bf28}+1.08\%$
test_tdmodule_dispatch 77.8360μs 48.4609μs 20.6352 KOps/s 19.7202 KOps/s $\color{#35bf28}+4.64\%$
test_tdseq 47.6890μs 26.7877μs 37.3305 KOps/s 37.7595 KOps/s $\color{#d91a1a}-1.14\%$
test_tdseq_dispatch 82.0330μs 52.5882μs 19.0157 KOps/s 18.4661 KOps/s $\color{#35bf28}+2.98\%$
test_instantiation_functorch 2.3160ms 1.5518ms 644.4219 Ops/s 649.8235 Ops/s $\color{#d91a1a}-0.83\%$
test_exec_functorch 0.3019ms 0.1851ms 5.4031 KOps/s 5.4915 KOps/s $\color{#d91a1a}-1.61\%$
test_exec_functional_call 0.3250ms 0.1777ms 5.6265 KOps/s 5.5798 KOps/s $\color{#35bf28}+0.84\%$
test_exec_td_decorator 0.4884ms 0.2311ms 4.3265 KOps/s 4.3197 KOps/s $\color{#35bf28}+0.16\%$
test_vmap_mlp_speed_decorator[True-True] 0.9745ms 0.6671ms 1.4991 KOps/s 1.5303 KOps/s $\color{#d91a1a}-2.04\%$
test_vmap_mlp_speed_decorator[True-False] 0.9573ms 0.6567ms 1.5229 KOps/s 1.5472 KOps/s $\color{#d91a1a}-1.57\%$
test_vmap_mlp_speed_decorator[False-True] 0.8673ms 0.5275ms 1.8956 KOps/s 1.9209 KOps/s $\color{#d91a1a}-1.32\%$
test_vmap_mlp_speed_decorator[False-False] 0.7662ms 0.5238ms 1.9091 KOps/s 1.9255 KOps/s $\color{#d91a1a}-0.85\%$
test_to_module_speed[True] 1.9318ms 1.2968ms 771.1283 Ops/s 772.2169 Ops/s $\color{#d91a1a}-0.14\%$
test_to_module_speed[False] 1.6669ms 1.2778ms 782.5961 Ops/s 798.1392 Ops/s $\color{#d91a1a}-1.95\%$
test_tc_init 78.8170μs 46.8373μs 21.3505 KOps/s 20.8914 KOps/s $\color{#35bf28}+2.20\%$
test_tc_init_nested 0.1853ms 91.5961μs 10.9175 KOps/s 10.3477 KOps/s $\textbf{\color{#35bf28}+5.51\%}$
test_tc_first_layer_tensor 39.1130μs 1.5537μs 643.6288 KOps/s 655.0711 KOps/s $\color{#d91a1a}-1.75\%$
test_tc_first_layer_nontensor 29.3740μs 4.6914μs 213.1581 KOps/s 209.3773 KOps/s $\color{#35bf28}+1.81\%$
test_tc_second_layer_tensor 52.0980μs 2.8608μs 349.5532 KOps/s 358.6616 KOps/s $\color{#d91a1a}-2.54\%$
test_tc_second_layer_nontensor 50.4140μs 6.0955μs 164.0545 KOps/s 165.7680 KOps/s $\color{#d91a1a}-1.03\%$
test_unbind 0.2344s 13.4507ms 74.3457 Ops/s 75.5177 Ops/s $\color{#d91a1a}-1.55\%$
test_full_like 10.7647ms 8.4601ms 118.2022 Ops/s 126.8764 Ops/s $\textbf{\color{#d91a1a}-6.84\%}$
test_zeros_like 7.3960ms 3.3672ms 296.9840 Ops/s 343.0516 Ops/s $\textbf{\color{#d91a1a}-13.43\%}$
test_ones_like 6.0233ms 3.8060ms 262.7429 Ops/s 156.8629 Ops/s $\textbf{\color{#35bf28}+67.50\%}$
test_clone 6.2252ms 5.1592ms 193.8279 Ops/s 117.0705 Ops/s $\textbf{\color{#35bf28}+65.57\%}$
test_squeeze 68.6880μs 12.5040μs 79.9746 KOps/s 85.3968 KOps/s $\textbf{\color{#d91a1a}-6.35\%}$
test_unsqueeze 0.1586ms 88.3960μs 11.3127 KOps/s 11.2215 KOps/s $\color{#35bf28}+0.81\%$
test_split 0.3957ms 0.1944ms 5.1441 KOps/s 5.2550 KOps/s $\color{#d91a1a}-2.11\%$
test_permute 0.5744ms 0.2244ms 4.4571 KOps/s 4.5789 KOps/s $\color{#d91a1a}-2.66\%$
test_stack 33.2437ms 25.8809ms 38.6385 Ops/s 38.0484 Ops/s $\color{#35bf28}+1.55\%$
test_cat 32.0759ms 25.7658ms 38.8112 Ops/s 37.3259 Ops/s $\color{#35bf28}+3.98\%$
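
For readers checking the table by hand: the Change column appears to be the relative difference between the Ops column (this PR) and the Ops on Repo HEAD column. The sketch below reproduces that arithmetic on three rows copied from the table. The function names and the 5% cut-off are assumptions, not taken from the benchmark tooling; the cut-off is inferred from the fact that the bolded rows in the CPU table (11 positive, 12 negative) match the "Improved" and "Worsened" counts in the summary line. Both Ops values in a row share a unit (Ops/s, KOps/s or MOps/s), so no unit conversion is needed for the ratio.

```python
def percent_change(ops_pr: float, ops_head: float) -> float:
    """Relative throughput change of the PR versus repo HEAD, in percent."""
    return (ops_pr - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a change; the 5% threshold is an assumption inferred from the bolded rows."""
    if change_pct >= threshold:
        return "improved"
    if change_pct <= -threshold:
        return "worsened"
    return "within noise"


# (Ops, Ops on Repo HEAD) pairs copied from the CPU table above.
rows = {
    "test_plain_set_nested": (53.8070, 54.1005),
    "test_membership_stacked_nested_last": (156.6981, 234.5219),
    "test_ones_like": (262.7429, 156.8629),
}

for name, (ops_pr, ops_head) in rows.items():
    change = percent_change(ops_pr, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Small swings of a few percent in either direction are common for microbenchmarks on shared CI runners, so only rows beyond the assumed threshold are worth a closer look.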


github-actions bot commented Dec 4, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}26$. Worsened: $\large\color{#d91a1a}17$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 36.6510μs 10.9236μs 91.5453 KOps/s 96.4125 KOps/s $\textbf{\color{#d91a1a}-5.05\%}$
test_plain_set_stack_nested 36.1200μs 10.9013μs 91.7321 KOps/s 95.8888 KOps/s $\color{#d91a1a}-4.33\%$
test_plain_set_nested_inplace 59.5920μs 11.7704μs 84.9591 KOps/s 88.6062 KOps/s $\color{#d91a1a}-4.12\%$
test_plain_set_stack_nested_inplace 41.3510μs 11.8792μs 84.1810 KOps/s 88.8053 KOps/s $\textbf{\color{#d91a1a}-5.21\%}$
test_items 33.2210μs 2.8585μs 349.8287 KOps/s 338.8499 KOps/s $\color{#35bf28}+3.24\%$
test_items_nested 0.4474ms 0.3571ms 2.8007 KOps/s 2.8549 KOps/s $\color{#d91a1a}-1.90\%$
test_items_nested_locked 0.3932ms 0.3552ms 2.8151 KOps/s 2.8389 KOps/s $\color{#d91a1a}-0.84\%$
test_items_nested_leaf 91.0220μs 58.0494μs 17.2267 KOps/s 17.1664 KOps/s $\color{#35bf28}+0.35\%$
test_items_stack_nested 0.4775ms 0.3570ms 2.8011 KOps/s 2.8348 KOps/s $\color{#d91a1a}-1.19\%$
test_items_stack_nested_leaf 92.7420μs 57.8209μs 17.2948 KOps/s 16.8894 KOps/s $\color{#35bf28}+2.40\%$
test_items_stack_nested_locked 0.4011ms 0.3556ms 2.8123 KOps/s 2.8094 KOps/s $\color{#35bf28}+0.10\%$
test_keys 31.8700μs 3.5010μs 285.6336 KOps/s 288.3943 KOps/s $\color{#d91a1a}-0.96\%$
test_keys_nested 0.1045ms 70.0036μs 14.2850 KOps/s 14.0960 KOps/s $\color{#35bf28}+1.34\%$
test_keys_nested_locked 0.7718ms 76.0813μs 13.1438 KOps/s 13.1157 KOps/s $\color{#35bf28}+0.21\%$
test_keys_nested_leaf 97.8520μs 61.6643μs 16.2168 KOps/s 16.2765 KOps/s $\color{#d91a1a}-0.37\%$
test_keys_stack_nested 0.1115ms 70.3170μs 14.2213 KOps/s 14.1130 KOps/s $\color{#35bf28}+0.77\%$
test_keys_stack_nested_leaf 0.1004ms 61.3258μs 16.3063 KOps/s 15.9152 KOps/s $\color{#35bf28}+2.46\%$
test_keys_stack_nested_locked 0.1189ms 75.5893μs 13.2294 KOps/s 13.0306 KOps/s $\color{#35bf28}+1.53\%$
test_values 5.9452μs 0.8469μs 1.1808 MOps/s 1.1755 MOps/s $\color{#35bf28}+0.45\%$
test_values_nested 60.1820μs 31.1131μs 32.1408 KOps/s 32.2819 KOps/s $\color{#d91a1a}-0.44\%$
test_values_nested_locked 58.6010μs 32.7434μs 30.5405 KOps/s 30.6097 KOps/s $\color{#d91a1a}-0.23\%$
test_values_nested_leaf 73.4620μs 33.6024μs 29.7598 KOps/s 29.6940 KOps/s $\color{#35bf28}+0.22\%$
test_values_stack_nested 56.2910μs 31.3232μs 31.9252 KOps/s 31.2868 KOps/s $\color{#35bf28}+2.04\%$
test_values_stack_nested_leaf 60.9420μs 33.7606μs 29.6204 KOps/s 29.0493 KOps/s $\color{#35bf28}+1.97\%$
test_values_stack_nested_locked 68.0720μs 33.1322μs 30.1821 KOps/s 29.8655 KOps/s $\color{#35bf28}+1.06\%$
test_membership 2.0580μs 0.5100μs 1.9608 MOps/s 1.9660 MOps/s $\color{#d91a1a}-0.27\%$
test_membership_nested 20.4205μs 1.9655μs 508.7636 KOps/s 500.9542 KOps/s $\color{#35bf28}+1.56\%$
test_membership_nested_leaf 19.1005μs 1.9517μs 512.3644 KOps/s 491.9401 KOps/s $\color{#35bf28}+4.15\%$
test_membership_stacked_nested 26.1700μs 2.0669μs 483.8100 KOps/s 476.9157 KOps/s $\color{#35bf28}+1.45\%$
test_membership_stacked_nested_leaf 32.1100μs 2.0530μs 487.0888 KOps/s 480.8793 KOps/s $\color{#35bf28}+1.29\%$
test_membership_nested_last 41.1900μs 3.3615μs 297.4877 KOps/s 339.2930 KOps/s $\textbf{\color{#d91a1a}-12.32\%}$
test_membership_nested_leaf_last 35.5410μs 2.9630μs 337.4932 KOps/s 338.0500 KOps/s $\color{#d91a1a}-0.16\%$
test_membership_stacked_nested_last 26.9210μs 2.9295μs 341.3577 KOps/s 295.5853 KOps/s $\textbf{\color{#35bf28}+15.49\%}$
test_membership_stacked_nested_leaf_last 26.4200μs 3.0161μs 331.5495 KOps/s 297.0164 KOps/s $\textbf{\color{#35bf28}+11.63\%}$
test_nested_getleaf 36.4510μs 6.2174μs 160.8393 KOps/s 162.7424 KOps/s $\color{#d91a1a}-1.17\%$
test_nested_get 28.9600μs 5.9362μs 168.4584 KOps/s 171.2693 KOps/s $\color{#d91a1a}-1.64\%$
test_stacked_getleaf 29.9300μs 6.2584μs 159.7863 KOps/s 163.2343 KOps/s $\color{#d91a1a}-2.11\%$
test_stacked_get 0.1012ms 5.9898μs 166.9512 KOps/s 171.4864 KOps/s $\color{#d91a1a}-2.64\%$
test_nested_getitemleaf 75.2910μs 6.4535μs 154.9537 KOps/s 160.8501 KOps/s $\color{#d91a1a}-3.67\%$
test_nested_getitem 42.9010μs 6.0292μs 165.8598 KOps/s 170.5801 KOps/s $\color{#d91a1a}-2.77\%$
test_stacked_getitemleaf 28.2900μs 6.3255μs 158.0891 KOps/s 161.1957 KOps/s $\color{#d91a1a}-1.93\%$
test_stacked_getitem 28.6200μs 6.0047μs 166.5366 KOps/s 169.6416 KOps/s $\color{#d91a1a}-1.83\%$
test_lock_nested 8.8794ms 0.3820ms 2.6181 KOps/s 2.6273 KOps/s $\color{#d91a1a}-0.35\%$
test_lock_stack_nested 0.3839ms 0.3378ms 2.9605 KOps/s 2.9387 KOps/s $\color{#35bf28}+0.74\%$
test_unlock_nested 0.6518ms 0.3130ms 3.1946 KOps/s 3.1927 KOps/s $\color{#35bf28}+0.06\%$
test_unlock_stack_nested 0.3171ms 0.2795ms 3.5779 KOps/s 3.5994 KOps/s $\color{#d91a1a}-0.60\%$
test_flatten_speed 0.1085ms 76.8573μs 13.0111 KOps/s 13.4130 KOps/s $\color{#d91a1a}-3.00\%$
test_unflatten_speed 0.3587ms 0.3111ms 3.2146 KOps/s 3.2775 KOps/s $\color{#d91a1a}-1.92\%$
test_common_ops 1.7693ms 0.6084ms 1.6435 KOps/s 1.6750 KOps/s $\color{#d91a1a}-1.88\%$
test_creation 98.7920μs 1.4738μs 678.4954 KOps/s 682.4262 KOps/s $\color{#d91a1a}-0.58\%$
test_creation_empty 31.0310μs 7.8645μs 127.1536 KOps/s 143.6285 KOps/s $\textbf{\color{#d91a1a}-11.47\%}$
test_creation_nested_1 39.5410μs 9.3400μs 107.0668 KOps/s 118.8723 KOps/s $\textbf{\color{#d91a1a}-9.93\%}$
test_creation_nested_2 35.7700μs 11.8776μs 84.1922 KOps/s 91.5844 KOps/s $\textbf{\color{#d91a1a}-8.07\%}$
test_clone 50.1710μs 10.3974μs 96.1780 KOps/s 89.4349 KOps/s $\textbf{\color{#35bf28}+7.54\%}$
test_getitem[int] 91.7955ms 15.5332μs 64.3784 KOps/s 89.1582 KOps/s $\textbf{\color{#d91a1a}-27.79\%}$
test_getitem[slice_int] 0.1067ms 20.6927μs 48.3263 KOps/s 45.2534 KOps/s $\textbf{\color{#35bf28}+6.79\%}$
test_getitem[range] 0.1411ms 37.6094μs 26.5891 KOps/s 25.1131 KOps/s $\textbf{\color{#35bf28}+5.88\%}$
test_getitem[tuple] 0.1049ms 18.1289μs 55.1605 KOps/s 51.5144 KOps/s $\textbf{\color{#35bf28}+7.08\%}$
test_getitem[list] 0.2478ms 33.1237μs 30.1899 KOps/s 28.1421 KOps/s $\textbf{\color{#35bf28}+7.28\%}$
test_setitem_dim[int] 38.3210μs 18.8621μs 53.0164 KOps/s 51.1363 KOps/s $\color{#35bf28}+3.68\%$
test_setitem_dim[slice_int] 59.5910μs 38.3799μs 26.0553 KOps/s 25.2331 KOps/s $\color{#35bf28}+3.26\%$
test_setitem_dim[range] 76.0820μs 53.8240μs 18.5791 KOps/s 18.0086 KOps/s $\color{#35bf28}+3.17\%$
test_setitem_dim[tuple] 53.0910μs 32.7851μs 30.5016 KOps/s 29.5712 KOps/s $\color{#35bf28}+3.15\%$
test_setitem 83.0920μs 14.6621μs 68.2029 KOps/s 63.7553 KOps/s $\textbf{\color{#35bf28}+6.98\%}$
test_set 75.7020μs 14.2175μs 70.3359 KOps/s 65.6979 KOps/s $\textbf{\color{#35bf28}+7.06\%}$
test_set_shared 1.5815ms 0.1471ms 6.7993 KOps/s 6.7372 KOps/s $\color{#35bf28}+0.92\%$
test_update 0.2678ms 17.0207μs 58.7522 KOps/s 57.1006 KOps/s $\color{#35bf28}+2.89\%$
test_update_nested 76.3410μs 21.9656μs 45.5257 KOps/s 43.4126 KOps/s $\color{#35bf28}+4.87\%$
test_update__nested 1.0441ms 24.5400μs 40.7497 KOps/s 39.3447 KOps/s $\color{#35bf28}+3.57\%$
test_set_nested 79.3820μs 15.3208μs 65.2707 KOps/s 60.3033 KOps/s $\textbf{\color{#35bf28}+8.24\%}$
test_set_nested_new 86.4420μs 17.7360μs 56.3825 KOps/s 46.6023 KOps/s $\textbf{\color{#35bf28}+20.99\%}$
test_select 89.4820μs 29.8086μs 33.5474 KOps/s 29.6809 KOps/s $\textbf{\color{#35bf28}+13.03\%}$
test_select_nested 65.7510μs 41.5317μs 24.0780 KOps/s 23.8821 KOps/s $\color{#35bf28}+0.82\%$
test_exclude_nested 89.8520μs 61.9990μs 16.1293 KOps/s 16.3491 KOps/s $\color{#d91a1a}-1.34\%$
test_empty[True] 0.3035ms 0.2774ms 3.6046 KOps/s 3.6477 KOps/s $\color{#d91a1a}-1.18\%$
test_empty[False] 3.8961μs 0.7459μs 1.3407 MOps/s 1.3478 MOps/s $\color{#d91a1a}-0.53\%$
test_to 88.8810μs 55.7797μs 17.9277 KOps/s 16.9607 KOps/s $\textbf{\color{#35bf28}+5.70\%}$
test_to_nonblocking 0.1011ms 45.6197μs 21.9204 KOps/s 21.1427 KOps/s $\color{#35bf28}+3.68\%$
test_unbind_speed 1.4674ms 0.2321ms 4.3078 KOps/s 4.2742 KOps/s $\color{#35bf28}+0.78\%$
test_unbind_speed_stack0 0.2652ms 0.2327ms 4.2981 KOps/s 4.2779 KOps/s $\color{#35bf28}+0.47\%$
test_unbind_speed_stack1 93.3499ms 0.6514ms 1.5351 KOps/s 1.5387 KOps/s $\color{#d91a1a}-0.23\%$
test_split 93.6369ms 1.7158ms 582.8306 Ops/s 612.3013 Ops/s $\color{#d91a1a}-4.81\%$
test_chunk 94.8771ms 1.5975ms 625.9860 Ops/s 607.5958 Ops/s $\color{#35bf28}+3.03\%$
test_consolidate[False-None] 2.7123ms 2.6515ms 377.1496 Ops/s 344.3569 Ops/s $\textbf{\color{#35bf28}+9.52\%}$
test_consolidate[default-None] 1.8037ms 1.6856ms 593.2476 Ops/s 575.7367 Ops/s $\color{#35bf28}+3.04\%$
test_consolidate[reduce-overhead-None] 1.8419ms 1.7174ms 582.2699 Ops/s 566.8801 Ops/s $\color{#35bf28}+2.71\%$
test_consolidate_njt[False-None] 6.7285ms 6.5744ms 152.1052 Ops/s 147.3508 Ops/s $\color{#35bf28}+3.23\%$
test_to[False-False-None] 1.8362ms 1.7559ms 569.4935 Ops/s 579.4321 Ops/s $\color{#d91a1a}-1.72\%$
test_to[True-False-None] 1.5284ms 1.3000ms 769.2489 Ops/s 739.4569 Ops/s $\color{#35bf28}+4.03\%$
test_to[within-False-None] 4.4122ms 4.1010ms 243.8451 Ops/s 241.7887 Ops/s $\color{#35bf28}+0.85\%$
test_to[True-default-None] 5.3672ms 5.1981ms 192.3776 Ops/s 185.3742 Ops/s $\color{#35bf28}+3.78\%$
test_to_njt[False-False-None] 7.2287ms 7.0579ms 141.6848 Ops/s 139.6915 Ops/s $\color{#35bf28}+1.43\%$
test_to_njt[True-False-None] 5.6859ms 5.5733ms 179.4265 Ops/s 173.8803 Ops/s $\color{#35bf28}+3.19\%$
test_to_njt[within-False-None] 12.4311ms 12.2987ms 81.3096 Ops/s 78.9137 Ops/s $\color{#35bf28}+3.04\%$
test_creation[device0] 0.4503ms 80.2933μs 12.4543 KOps/s 11.9941 KOps/s $\color{#35bf28}+3.84\%$
test_creation_from_tensor 0.4924ms 83.1314μs 12.0291 KOps/s 11.7746 KOps/s $\color{#35bf28}+2.16\%$
test_add_one[memmap_tensor0] 0.2269ms 7.0049μs 142.7565 KOps/s 140.1507 KOps/s $\color{#35bf28}+1.86\%$
test_contiguous[memmap_tensor0] 1.7946μs 0.4123μs 2.4256 MOps/s 2.3963 MOps/s $\color{#35bf28}+1.22\%$
test_stack[memmap_tensor0] 38.9910μs 4.4475μs 224.8439 KOps/s 208.2513 KOps/s $\textbf{\color{#35bf28}+7.97\%}$
test_memmaptd_index 1.5280ms 0.2558ms 3.9096 KOps/s 3.8684 KOps/s $\color{#35bf28}+1.06\%$
test_memmaptd_index_astensor 0.6138ms 0.3131ms 3.1937 KOps/s 3.2000 KOps/s $\color{#d91a1a}-0.20\%$
test_memmaptd_index_op 1.0498ms 0.6043ms 1.6549 KOps/s 1.6992 KOps/s $\color{#d91a1a}-2.61\%$
test_serialize_model 0.1309s 0.1301s 7.6842 Ops/s 7.6845 Ops/s $-0.00\%$
test_serialize_model_pickle 1.3469s 1.2122s 0.8250 Ops/s 0.8256 Ops/s $\color{#d91a1a}-0.07\%$
test_serialize_weights 0.1307s 0.1297s 7.7078 Ops/s 7.7433 Ops/s $\color{#d91a1a}-0.46\%$
test_serialize_weights_returnearly 0.6094s 66.1437ms 15.1186 Ops/s 14.5139 Ops/s $\color{#35bf28}+4.17\%$
test_serialize_weights_pickle 1.3475s 1.1848s 0.8440 Ops/s 0.8382 Ops/s $\color{#35bf28}+0.69\%$
test_reshape_pytree 51.6410μs 22.4914μs 44.4614 KOps/s 43.6855 KOps/s $\color{#35bf28}+1.78\%$
test_reshape_td 58.1010μs 27.1978μs 36.7677 KOps/s 36.0648 KOps/s $\color{#35bf28}+1.95\%$
test_view_pytree 56.8510μs 22.5699μs 44.3068 KOps/s 44.1909 KOps/s $\color{#35bf28}+0.26\%$
test_view_td 66.7810μs 30.0467μs 33.2815 KOps/s 31.2316 KOps/s $\textbf{\color{#35bf28}+6.56\%}$
test_unbind_pytree 57.1810μs 28.2395μs 35.4114 KOps/s 35.0199 KOps/s $\color{#35bf28}+1.12\%$
test_unbind_td 0.7455ms 36.1225μs 27.6836 KOps/s 27.2387 KOps/s $\color{#35bf28}+1.63\%$
test_split_pytree 57.2610μs 29.9605μs 33.3773 KOps/s 32.0242 KOps/s $\color{#35bf28}+4.23\%$
test_split_td 0.9511ms 38.9699μs 25.6608 KOps/s 24.5917 KOps/s $\color{#35bf28}+4.35\%$
test_add_pytree 61.3210μs 35.3699μs 28.2726 KOps/s 27.8431 KOps/s $\color{#35bf28}+1.54\%$
test_add_td 79.3820μs 47.2286μs 21.1736 KOps/s 21.6484 KOps/s $\color{#d91a1a}-2.19\%$
test_compile_add_one_nested[tensordict-compile] 0.1731ms 0.1210ms 8.2626 KOps/s 7.9441 KOps/s $\color{#35bf28}+4.01\%$
test_compile_add_one_nested[tensordict-eager] 0.2240ms 0.1259ms 7.9401 KOps/s 7.7944 KOps/s $\color{#35bf28}+1.87\%$
test_compile_add_one_nested[pytree-compile] 0.1337ms 97.0072μs 10.3085 KOps/s 10.0355 KOps/s $\color{#35bf28}+2.72\%$
test_compile_add_one_nested[pytree-eager] 2.1413ms 0.1510ms 6.6239 KOps/s 6.5248 KOps/s $\color{#35bf28}+1.52\%$
test_compile_copy_nested[tensordict-compile] 54.9010μs 22.9002μs 43.6677 KOps/s 44.1255 KOps/s $\color{#d91a1a}-1.04\%$
test_compile_copy_nested[tensordict-eager] 57.0210μs 27.0809μs 36.9264 KOps/s 36.8101 KOps/s $\color{#35bf28}+0.32\%$
test_compile_copy_nested[pytree-compile] 0.1125ms 64.9945μs 15.3859 KOps/s 15.1319 KOps/s $\color{#35bf28}+1.68\%$
test_compile_copy_nested[pytree-eager] 78.9210μs 49.3220μs 20.2749 KOps/s 20.0085 KOps/s $\color{#35bf28}+1.33\%$
test_compile_add_one_flat[tensordict-compile] 0.1839ms 0.1442ms 6.9336 KOps/s 6.8638 KOps/s $\color{#35bf28}+1.02\%$
test_compile_add_one_flat[tensordict-eager] 0.3059ms 0.2107ms 4.7468 KOps/s 4.8505 KOps/s $\color{#d91a1a}-2.14\%$
test_compile_add_one_flat[tensorclass-compile] 0.1377ms 0.1004ms 9.9570 KOps/s 10.0396 KOps/s $\color{#d91a1a}-0.82\%$
test_compile_add_one_flat[tensorclass-eager] 0.1163ms 53.4990μs 18.6920 KOps/s 18.5612 KOps/s $\color{#35bf28}+0.70\%$
test_compile_add_one_flat[pytree-compile] 0.1825ms 0.1378ms 7.2562 KOps/s 7.2595 KOps/s $\color{#d91a1a}-0.04\%$
test_compile_add_one_flat[pytree-eager] 0.5562ms 0.4904ms 2.0390 KOps/s 2.0125 KOps/s $\color{#35bf28}+1.32\%$
test_compile_add_self_flat[tensordict-eager] 0.3673ms 0.2504ms 3.9932 KOps/s 4.0022 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_add_self_flat[tensordict-compile] 0.1867ms 0.1459ms 6.8539 KOps/s 6.6971 KOps/s $\color{#35bf28}+2.34\%$
test_compile_add_self_flat[tensorclass-eager] 0.1447ms 63.4521μs 15.7599 KOps/s 15.7533 KOps/s $\color{#35bf28}+0.04\%$
test_compile_add_self_flat[tensorclass-compile] 0.2369ms 0.1042ms 9.5924 KOps/s 10.0834 KOps/s $\color{#d91a1a}-4.87\%$
test_compile_add_self_flat[pytree-eager] 0.5874ms 0.4217ms 2.3716 KOps/s 2.3828 KOps/s $\color{#d91a1a}-0.47\%$
test_compile_add_self_flat[pytree-compile] 0.2581ms 0.1468ms 6.8133 KOps/s 7.2881 KOps/s $\textbf{\color{#d91a1a}-6.51\%}$
test_compile_copy_flat[tensordict-compile] 0.1268ms 19.9056μs 50.2372 KOps/s 53.8933 KOps/s $\textbf{\color{#d91a1a}-6.78\%}$
test_compile_copy_flat[tensordict-eager] 0.1129ms 26.6657μs 37.5013 KOps/s 36.9076 KOps/s $\color{#35bf28}+1.61\%$
test_compile_copy_flat[pytree-compile] 0.1433ms 69.3768μs 14.4140 KOps/s 14.1487 KOps/s $\color{#35bf28}+1.88\%$
test_compile_copy_flat[pytree-eager] 0.1368ms 50.6691μs 19.7359 KOps/s 19.2621 KOps/s $\color{#35bf28}+2.46\%$
test_compile_assign_and_add[tensordict-compile] 1.6470ms 0.3953ms 2.5300 KOps/s 2.1976 KOps/s $\textbf{\color{#35bf28}+15.13\%}$
test_compile_assign_and_add[tensordict-eager] 2.7143ms 2.6479ms 377.6524 Ops/s 370.1032 Ops/s $\color{#35bf28}+2.04\%$
test_compile_assign_and_add[pytree-compile] 1.6492ms 0.4447ms 2.2488 KOps/s 2.2282 KOps/s $\color{#35bf28}+0.92\%$
test_compile_assign_and_add[pytree-eager] 2.7596ms 2.7005ms 370.3036 Ops/s 363.5488 Ops/s $\color{#35bf28}+1.86\%$
test_compile_indexing[tensor-tensordict-compile] 0.2660ms 0.1219ms 8.2024 KOps/s 8.3066 KOps/s $\color{#d91a1a}-1.26\%$
test_compile_indexing[tensor-tensordict-eager] 0.5521ms 79.7277μs 12.5427 KOps/s 12.0229 KOps/s $\color{#35bf28}+4.32\%$
test_compile_indexing[tensor-tensorclass-compile] 0.6846ms 0.1147ms 8.7176 KOps/s 9.3619 KOps/s $\textbf{\color{#d91a1a}-6.88\%}$
test_compile_indexing[tensor-tensorclass-eager] 0.1270ms 73.8341μs 13.5439 KOps/s 14.3230 KOps/s $\textbf{\color{#d91a1a}-5.44\%}$
test_compile_indexing[tensor-pytree-compile] 0.3096ms 0.1163ms 8.5967 KOps/s 9.2857 KOps/s $\textbf{\color{#d91a1a}-7.42\%}$
test_compile_indexing[tensor-pytree-eager] 0.1792ms 74.0720μs 13.5004 KOps/s 14.3995 KOps/s $\textbf{\color{#d91a1a}-6.24\%}$
test_compile_indexing[slice-tensordict-compile] 0.1598ms 0.1050ms 9.5205 KOps/s 9.4527 KOps/s $\color{#35bf28}+0.72\%$
test_compile_indexing[slice-tensordict-eager] 0.1480ms 17.7178μs 56.4403 KOps/s 53.1103 KOps/s $\textbf{\color{#35bf28}+6.27\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1350ms 97.1268μs 10.2958 KOps/s 10.1438 KOps/s $\color{#35bf28}+1.50\%$
test_compile_indexing[slice-tensorclass-eager] 52.1110μs 15.9124μs 62.8440 KOps/s 60.2333 KOps/s $\color{#35bf28}+4.33\%$
test_compile_indexing[slice-pytree-compile] 0.1509ms 0.1014ms 9.8618 KOps/s 10.0784 KOps/s $\color{#d91a1a}-2.15\%$
test_compile_indexing[slice-pytree-eager] 60.2910μs 15.8182μs 63.2184 KOps/s 60.6951 KOps/s $\color{#35bf28}+4.16\%$
test_compile_indexing[int-tensordict-compile] 0.1481ms 0.1039ms 9.6278 KOps/s 9.6025 KOps/s $\color{#35bf28}+0.26\%$
test_compile_indexing[int-tensordict-eager] 0.5608ms 17.4891μs 57.1784 KOps/s 53.2374 KOps/s $\textbf{\color{#35bf28}+7.40\%}$
test_compile_indexing[int-tensorclass-compile] 0.1466ms 98.3315μs 10.1697 KOps/s 10.0728 KOps/s $\color{#35bf28}+0.96\%$
test_compile_indexing[int-tensorclass-eager] 44.6910μs 15.9960μs 62.5157 KOps/s 61.2090 KOps/s $\color{#35bf28}+2.13\%$
test_compile_indexing[int-pytree-compile] 0.1395ms 0.1023ms 9.7795 KOps/s 10.1117 KOps/s $\color{#d91a1a}-3.29\%$
test_compile_indexing[int-pytree-eager] 0.1281ms 16.9555μs 58.9780 KOps/s 61.1677 KOps/s $\color{#d91a1a}-3.58\%$
test_mod_add[eager] 83.8020μs 39.5854μs 25.2618 KOps/s 26.4372 KOps/s $\color{#d91a1a}-4.45\%$
test_mod_add[compile] 0.1198ms 80.8431μs 12.3696 KOps/s 12.0210 KOps/s $\color{#35bf28}+2.90\%$
test_mod_add[compile-overhead] 0.3241ms 0.1673ms 5.9757 KOps/s 5.6371 KOps/s $\textbf{\color{#35bf28}+6.01\%}$
test_mod_wrap[eager] 0.3475ms 0.2559ms 3.9071 KOps/s 3.8894 KOps/s $\color{#35bf28}+0.46\%$
test_mod_wrap[compile] 0.8058ms 0.2875ms 3.4786 KOps/s 3.4189 KOps/s $\color{#35bf28}+1.75\%$
test_mod_wrap[compile-overhead] 7.2105ms 3.8110ms 262.3961 Ops/s 264.0811 Ops/s $\color{#d91a1a}-0.64\%$
test_mod_wrap_and_backward[eager] 1.5111ms 1.3957ms 716.4813 Ops/s 667.6135 Ops/s $\textbf{\color{#35bf28}+7.32\%}$
test_mod_wrap_and_backward[compile] 1.3970ms 1.2768ms 783.2225 Ops/s 715.1015 Ops/s $\textbf{\color{#35bf28}+9.53\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3751ms 0.9258ms 1.0801 KOps/s 962.8859 Ops/s $\textbf{\color{#35bf28}+12.18\%}$
test_seq_add[eager] 0.1654ms 0.1139ms 8.7770 KOps/s 8.7420 KOps/s $\color{#35bf28}+0.40\%$
test_seq_add[compile] 0.1420ms 88.4623μs 11.3043 KOps/s 10.9937 KOps/s $\color{#35bf28}+2.83\%$
test_seq_add[compile-overhead] 0.1820ms 0.1290ms 7.7548 KOps/s 7.5946 KOps/s $\color{#35bf28}+2.11\%$
test_seq_wrap[eager] 0.5029ms 0.4343ms 2.3024 KOps/s 2.3623 KOps/s $\color{#d91a1a}-2.54\%$
test_seq_wrap[compile] 0.4234ms 0.3072ms 3.2555 KOps/s 3.2390 KOps/s $\color{#35bf28}+0.51\%$
test_seq_wrap[compile-overhead] 0.2808ms 0.2233ms 4.4780 KOps/s 4.3801 KOps/s $\color{#35bf28}+2.23\%$
test_func_call_runtime[False-eager] 0.8760ms 0.7746ms 1.2910 KOps/s 1.2998 KOps/s $\color{#d91a1a}-0.68\%$
test_func_call_runtime[False-compile] 0.8529ms 0.7601ms 1.3156 KOps/s 1.3104 KOps/s $\color{#35bf28}+0.40\%$
test_func_call_runtime[False-compile-overhead] 0.4358ms 0.3648ms 2.7413 KOps/s 2.7192 KOps/s $\color{#35bf28}+0.81\%$
test_func_call_runtime[True-eager] 1.0745ms 0.9236ms 1.0828 KOps/s 1.0744 KOps/s $\color{#35bf28}+0.78\%$
test_func_call_runtime[True-compile] 0.8119ms 0.7699ms 1.2989 KOps/s 1.2733 KOps/s $\color{#35bf28}+2.01\%$
test_func_call_runtime[True-compile-overhead] 0.4964ms 0.3853ms 2.5954 KOps/s 2.5797 KOps/s $\color{#35bf28}+0.61\%$
test_func_call_cm_runtime[False-eager] 0.8245ms 0.7521ms 1.3296 KOps/s 1.3117 KOps/s $\color{#35bf28}+1.37\%$
test_func_call_cm_runtime[False-compile] 0.8053ms 0.7503ms 1.3328 KOps/s 1.3031 KOps/s $\color{#35bf28}+2.28\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4286ms 0.3664ms 2.7295 KOps/s 2.7024 KOps/s $\color{#35bf28}+1.00\%$
test_func_call_cm_runtime[True-eager] 1.1211ms 1.0279ms 972.8375 Ops/s 968.7507 Ops/s $\color{#35bf28}+0.42\%$
test_func_call_cm_runtime[True-compile] 0.9102ms 0.7996ms 1.2506 KOps/s 1.2232 KOps/s $\color{#35bf28}+2.24\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4743ms 0.4116ms 2.4298 KOps/s 2.4094 KOps/s $\color{#35bf28}+0.85\%$
test_vmap_func_call_cm_runtime[eager] 2.5729ms 2.1029ms 475.5312 Ops/s 474.5265 Ops/s $\color{#35bf28}+0.21\%$
test_vmap_func_call_cm_runtime[compile] 0.8805ms 0.8113ms 1.2326 KOps/s 1.2034 KOps/s $\color{#35bf28}+2.43\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4868ms 0.4118ms 2.4282 KOps/s 2.3802 KOps/s $\color{#35bf28}+2.01\%$
test_distributed 4.0609ms 0.3173ms 3.1519 KOps/s 8.4837 KOps/s $\textbf{\color{#d91a1a}-62.85\%}$
test_tdmodule 57.4710μs 19.0081μs 52.6092 KOps/s 52.7672 KOps/s $\color{#d91a1a}-0.30\%$
test_tdmodule_dispatch 84.8120μs 34.5397μs 28.9522 KOps/s 29.7152 KOps/s $\color{#d91a1a}-2.57\%$
test_tdseq 38.8810μs 19.1828μs 52.1300 KOps/s 54.0071 KOps/s $\color{#d91a1a}-3.48\%$
test_tdseq_dispatch 58.9510μs 37.0301μs 27.0050 KOps/s 28.1755 KOps/s $\color{#d91a1a}-4.15\%$
test_instantiation_functorch 1.6204ms 1.5255ms 655.5436 Ops/s 633.2222 Ops/s $\color{#35bf28}+3.53\%$
test_exec_functorch 0.1980ms 0.1434ms 6.9749 KOps/s 6.6983 KOps/s $\color{#35bf28}+4.13\%$
test_exec_functional_call 0.1681ms 0.1392ms 7.1813 KOps/s 6.9568 KOps/s $\color{#35bf28}+3.23\%$
test_exec_td_decorator 0.3650ms 0.1841ms 5.4319 KOps/s 5.3385 KOps/s $\color{#35bf28}+1.75\%$
test_vmap_mlp_speed_decorator[True-True] 0.8189ms 0.6921ms 1.4449 KOps/s 1.4555 KOps/s $\color{#d91a1a}-0.72\%$
test_vmap_mlp_speed_decorator[True-False] 0.8535ms 0.6915ms 1.4461 KOps/s 1.4544 KOps/s $\color{#d91a1a}-0.57\%$
test_vmap_mlp_speed_decorator[False-True] 0.7176ms 0.5994ms 1.6684 KOps/s 1.6676 KOps/s $\color{#35bf28}+0.05\%$
test_vmap_mlp_speed_decorator[False-False] 0.7219ms 0.6004ms 1.6655 KOps/s 1.6631 KOps/s $\color{#35bf28}+0.15\%$
test_vmap_transformer_speed_decorator[True-True] 19.3975ms 19.3310ms 51.7305 Ops/s 51.8756 Ops/s $\color{#d91a1a}-0.28\%$
test_vmap_transformer_speed_decorator[True-False] 20.0542ms 19.4037ms 51.5367 Ops/s 51.8703 Ops/s $\color{#d91a1a}-0.64\%$
test_vmap_transformer_speed_decorator[False-True] 19.3583ms 19.2732ms 51.8854 Ops/s 52.2212 Ops/s $\color{#d91a1a}-0.64\%$
test_vmap_transformer_speed_decorator[False-False] 19.3063ms 19.2409ms 51.9727 Ops/s 52.2255 Ops/s $\color{#d91a1a}-0.48\%$
test_to_module_speed[True] 1.0324ms 0.9356ms 1.0689 KOps/s 1.0647 KOps/s $\color{#35bf28}+0.40\%$
test_to_module_speed[False] 1.3083ms 0.9308ms 1.0743 KOps/s 1.0945 KOps/s $\color{#d91a1a}-1.85\%$
test_tc_init 70.3510μs 36.7201μs 27.2330 KOps/s 28.7852 KOps/s $\textbf{\color{#d91a1a}-5.39\%}$
test_tc_init_nested 0.2097ms 74.3437μs 13.4510 KOps/s 13.9384 KOps/s $\color{#d91a1a}-3.50\%$
test_tc_first_layer_tensor 5.0744μs 0.6964μs 1.4359 MOps/s 1.4385 MOps/s $\color{#d91a1a}-0.18\%$
test_tc_first_layer_nontensor 22.0200μs 2.3128μs 432.3768 KOps/s 430.0189 KOps/s $\color{#35bf28}+0.55\%$
test_tc_second_layer_tensor 11.6570μs 1.4443μs 692.3877 KOps/s 712.0949 KOps/s $\color{#d91a1a}-2.77\%$
test_tc_second_layer_nontensor 26.0300μs 3.0778μs 324.9055 KOps/s 327.6916 KOps/s $\color{#d91a1a}-0.85\%$
test_unbind 0.2237s 9.7359ms 102.7123 Ops/s 148.0410 Ops/s $\textbf{\color{#d91a1a}-30.62\%}$
test_full_like 12.1973ms 9.1929ms 108.7791 Ops/s 105.5583 Ops/s $\color{#35bf28}+3.05\%$
test_zeros_like 5.3271ms 4.2142ms 237.2948 Ops/s 113.6337 Ops/s $\textbf{\color{#35bf28}+108.82\%}$
test_ones_like 4.9370ms 4.2756ms 233.8828 Ops/s 232.2611 Ops/s $\color{#35bf28}+0.70\%$
test_clone 11.4001ms 9.0161ms 110.9128 Ops/s 158.7720 Ops/s $\textbf{\color{#d91a1a}-30.14\%}$
test_squeeze 92.6210μs 9.5104μs 105.1476 KOps/s 104.3574 KOps/s $\color{#35bf28}+0.76\%$
test_unsqueeze 0.1200ms 71.0279μs 14.0790 KOps/s 13.5958 KOps/s $\color{#35bf28}+3.55\%$
test_split 0.3831ms 0.1588ms 6.2980 KOps/s 5.9449 KOps/s $\textbf{\color{#35bf28}+5.94\%}$
test_permute 0.3032ms 0.1776ms 5.6322 KOps/s 5.5119 KOps/s $\color{#35bf28}+2.18\%$
test_stack 50.6048ms 50.3333ms 19.8676 Ops/s 18.9092 Ops/s $\textbf{\color{#35bf28}+5.07\%}$
test_cat 51.5313ms 50.2388ms 19.9049 Ops/s 19.8153 Ops/s $\color{#35bf28}+0.45\%$
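
For anyone re-deriving the Change column: the values above are consistent with a relative difference in throughput, (Ops - Ops on Repo HEAD) / Ops on Repo HEAD. Bold entries appear to be those whose magnitude is at least 5%. Both the formula and the threshold are inferred from the rows, not taken from the benchmark workflow itself, and the helper names below are hypothetical. A minimal sketch:

```python
def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percent change in throughput of the PR run relative to repo HEAD."""
    return (ops_pr - ops_head) / ops_head * 100.0


def is_highlighted(change_pct: float, threshold_pct: float = 5.0) -> bool:
    """Assumed rule: the report bolds changes of at least 5% in magnitude."""
    return abs(change_pct) >= threshold_pct


# (Ops, Ops on Repo HEAD) copied from three rows of the table, in Ops/s.
rows = {
    "test_getitem[tuple]": (55.1605e3, 51.5144e3),
    "test_distributed": (3.1519e3, 8.4837e3),
    "test_zeros_like": (237.2948, 113.6337),
}

for name, (ops_pr, ops_head) in rows.items():
    change = relative_change(ops_pr, ops_head)
    flag = " (highlighted)" if is_highlighted(change) else ""
    print(f"{name}: {change:+.2f}%{flag}")
```

Because the table prints rounded throughput figures, a recomputed value can differ from the reported one in the last digit.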
