Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Update the download-artifacts version #1177

Merged
merged 1 commit into from
Jan 9, 2025
Merged

[CI] Update the download-artifacts version #1177

merged 1 commit into from
Jan 9, 2025

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Jan 9, 2025

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 9, 2025
@vmoens vmoens merged commit a63e7f3 into main Jan 9, 2025
10 of 24 checks passed
@vmoens vmoens added the CI label Jan 9, 2025
Copy link

github-actions bot commented Jan 9, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}6$. Worsened: $\large\color{#d91a1a}36$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 43.9330μs 21.5356μs 46.4348 KOps/s 51.6860 KOps/s $\textbf{\color{#d91a1a}-10.16\%}$
test_plain_set_stack_nested 86.1020μs 21.5298μs 46.4473 KOps/s 51.3868 KOps/s $\textbf{\color{#d91a1a}-9.61\%}$
test_plain_set_nested_inplace 67.0260μs 22.8477μs 43.7680 KOps/s 47.1527 KOps/s $\textbf{\color{#d91a1a}-7.18\%}$
test_plain_set_stack_nested_inplace 65.6730μs 23.1209μs 43.2508 KOps/s 47.4690 KOps/s $\textbf{\color{#d91a1a}-8.89\%}$
test_items 29.5450μs 4.1672μs 239.9667 KOps/s 235.6827 KOps/s $\color{#35bf28}+1.82\%$
test_items_nested 0.8662ms 0.3995ms 2.5030 KOps/s 2.5284 KOps/s $\color{#d91a1a}-1.00\%$
test_items_nested_locked 0.5229ms 0.4022ms 2.4862 KOps/s 2.4949 KOps/s $\color{#d91a1a}-0.35\%$
test_items_nested_leaf 0.1426ms 77.9328μs 12.8316 KOps/s 12.9473 KOps/s $\color{#d91a1a}-0.89\%$
test_items_stack_nested 0.5983ms 0.4013ms 2.4916 KOps/s 2.5031 KOps/s $\color{#d91a1a}-0.46\%$
test_items_stack_nested_leaf 0.1524ms 80.4205μs 12.4346 KOps/s 12.4096 KOps/s $\color{#35bf28}+0.20\%$
test_items_stack_nested_locked 0.6055ms 0.4023ms 2.4854 KOps/s 2.5085 KOps/s $\color{#d91a1a}-0.92\%$
test_keys 27.4820μs 3.4676μs 288.3813 KOps/s 290.0123 KOps/s $\color{#d91a1a}-0.56\%$
test_keys_nested 0.2278ms 0.1619ms 6.1763 KOps/s 6.1363 KOps/s $\color{#35bf28}+0.65\%$
test_keys_nested_locked 0.7708ms 0.1687ms 5.9264 KOps/s 5.9031 KOps/s $\color{#35bf28}+0.39\%$
test_keys_nested_leaf 0.2327ms 0.1408ms 7.0998 KOps/s 6.8923 KOps/s $\color{#35bf28}+3.01\%$
test_keys_stack_nested 0.2558ms 0.1600ms 6.2490 KOps/s 6.2711 KOps/s $\color{#d91a1a}-0.35\%$
test_keys_stack_nested_leaf 0.2494ms 0.1366ms 7.3201 KOps/s 7.1927 KOps/s $\color{#35bf28}+1.77\%$
test_keys_stack_nested_locked 0.2917ms 0.1646ms 6.0767 KOps/s 5.9696 KOps/s $\color{#35bf28}+1.79\%$
test_values 5.6386μs 1.0348μs 966.3895 KOps/s 974.5596 KOps/s $\color{#d91a1a}-0.84\%$
test_values_nested 0.1254ms 61.6976μs 16.2081 KOps/s 16.3977 KOps/s $\color{#d91a1a}-1.16\%$
test_values_nested_locked 0.1189ms 62.0541μs 16.1150 KOps/s 16.2692 KOps/s $\color{#d91a1a}-0.95\%$
test_values_nested_leaf 0.1331ms 71.5293μs 13.9803 KOps/s 13.8593 KOps/s $\color{#35bf28}+0.87\%$
test_values_stack_nested 0.1309ms 63.3530μs 15.7846 KOps/s 15.4836 KOps/s $\color{#35bf28}+1.94\%$
test_values_stack_nested_leaf 0.1380ms 69.6147μs 14.3648 KOps/s 14.3280 KOps/s $\color{#35bf28}+0.26\%$
test_values_stack_nested_locked 0.1139ms 63.6173μs 15.7190 KOps/s 15.6581 KOps/s $\color{#35bf28}+0.39\%$
test_membership 2.2222μs 0.6856μs 1.4587 MOps/s 1.1668 MOps/s $\textbf{\color{#35bf28}+25.01\%}$
test_membership_nested 15.1890μs 2.8598μs 349.6801 KOps/s 349.1445 KOps/s $\color{#35bf28}+0.15\%$
test_membership_nested_leaf 23.5240μs 2.9007μs 344.7452 KOps/s 352.1408 KOps/s $\color{#d91a1a}-2.10\%$
test_membership_stacked_nested 15.6990μs 2.8536μs 350.4355 KOps/s 351.2543 KOps/s $\color{#d91a1a}-0.23\%$
test_membership_stacked_nested_leaf 32.4110μs 2.9246μs 341.9243 KOps/s 346.1216 KOps/s $\color{#d91a1a}-1.21\%$
test_membership_nested_last 22.1420μs 4.3188μs 231.5469 KOps/s 228.1329 KOps/s $\color{#35bf28}+1.50\%$
test_membership_nested_leaf_last 28.3940μs 4.3559μs 229.5714 KOps/s 228.2604 KOps/s $\color{#35bf28}+0.57\%$
test_membership_stacked_nested_last 43.5210μs 13.0808μs 76.4479 KOps/s 233.1622 KOps/s $\textbf{\color{#d91a1a}-67.21\%}$
test_membership_stacked_nested_leaf_last 39.8650μs 13.1385μs 76.1120 KOps/s 233.1946 KOps/s $\textbf{\color{#d91a1a}-67.36\%}$
test_nested_getleaf 34.7450μs 10.6834μs 93.6035 KOps/s 94.4035 KOps/s $\color{#d91a1a}-0.85\%$
test_nested_get 37.5310μs 10.0113μs 99.8868 KOps/s 100.2566 KOps/s $\color{#d91a1a}-0.37\%$
test_stacked_getleaf 43.5620μs 10.3590μs 96.5346 KOps/s 94.0658 KOps/s $\color{#35bf28}+2.62\%$
test_stacked_get 33.6630μs 9.9795μs 100.2052 KOps/s 98.9774 KOps/s $\color{#35bf28}+1.24\%$
test_nested_getitemleaf 34.0730μs 11.0020μs 90.8922 KOps/s 91.7685 KOps/s $\color{#d91a1a}-0.95\%$
test_nested_getitem 49.1380μs 10.1964μs 98.0741 KOps/s 99.2889 KOps/s $\color{#d91a1a}-1.22\%$
test_stacked_getitemleaf 39.8950μs 10.8526μs 92.1440 KOps/s 91.8954 KOps/s $\color{#35bf28}+0.27\%$
test_stacked_getitem 46.3070μs 9.9738μs 100.2627 KOps/s 99.0707 KOps/s $\color{#35bf28}+1.20\%$
test_lock_nested 0.8031ms 0.4584ms 2.1816 KOps/s 1.7286 KOps/s $\textbf{\color{#35bf28}+26.20\%}$
test_lock_stack_nested 0.8291ms 0.4143ms 2.4138 KOps/s 2.3634 KOps/s $\color{#35bf28}+2.13\%$
test_unlock_nested 0.7168ms 0.3746ms 2.6697 KOps/s 2.6739 KOps/s $\color{#d91a1a}-0.16\%$
test_unlock_stack_nested 0.5235ms 0.3327ms 3.0061 KOps/s 2.9700 KOps/s $\color{#35bf28}+1.22\%$
test_flatten_speed 0.2209ms 0.1012ms 9.8852 KOps/s 10.1138 KOps/s $\color{#d91a1a}-2.26\%$
test_unflatten_speed 0.8646ms 0.5165ms 1.9363 KOps/s 1.9567 KOps/s $\color{#d91a1a}-1.04\%$
test_common_ops 4.2566ms 0.8348ms 1.1979 KOps/s 1.3003 KOps/s $\textbf{\color{#d91a1a}-7.88\%}$
test_creation 18.7350μs 2.4493μs 408.2789 KOps/s 408.3982 KOps/s $\color{#d91a1a}-0.03\%$
test_creation_empty 34.3840μs 13.6433μs 73.2960 KOps/s 101.6930 KOps/s $\textbf{\color{#d91a1a}-27.92\%}$
test_creation_nested_1 64.3110μs 16.4392μs 60.8301 KOps/s 78.2483 KOps/s $\textbf{\color{#d91a1a}-22.26\%}$
test_creation_nested_2 48.4710μs 20.9725μs 47.6816 KOps/s 57.9602 KOps/s $\textbf{\color{#d91a1a}-17.73\%}$
test_clone 62.4570μs 13.3716μs 74.7856 KOps/s 74.7824 KOps/s $+0.00\%$
test_getitem[int] 1.4208ms 13.0637μs 76.5482 KOps/s 76.1834 KOps/s $\color{#35bf28}+0.48\%$
test_getitem[slice_int] 0.1395ms 24.7633μs 40.3824 KOps/s 40.7582 KOps/s $\color{#d91a1a}-0.92\%$
test_getitem[range] 0.1901ms 49.6534μs 20.1396 KOps/s 21.6671 KOps/s $\textbf{\color{#d91a1a}-7.05\%}$
test_getitem[tuple] 0.1336ms 20.6717μs 48.3753 KOps/s 48.3900 KOps/s $\color{#d91a1a}-0.03\%$
test_getitem[list] 0.2395ms 44.7746μs 22.3341 KOps/s 23.7586 KOps/s $\textbf{\color{#d91a1a}-6.00\%}$
test_setitem_dim[int] 52.1380μs 25.2099μs 39.6669 KOps/s 39.9300 KOps/s $\color{#d91a1a}-0.66\%$
test_setitem_dim[slice_int] 92.4440μs 51.9456μs 19.2509 KOps/s 19.4512 KOps/s $\color{#d91a1a}-1.03\%$
test_setitem_dim[range] 0.1263ms 74.1750μs 13.4816 KOps/s 13.9563 KOps/s $\color{#d91a1a}-3.40\%$
test_setitem_dim[tuple] 68.7490μs 40.6881μs 24.5772 KOps/s 24.9575 KOps/s $\color{#d91a1a}-1.52\%$
test_setitem 81.7030μs 21.2939μs 46.9619 KOps/s 51.8026 KOps/s $\textbf{\color{#d91a1a}-9.34\%}$
test_set 82.4340μs 20.9863μs 47.6501 KOps/s 53.0830 KOps/s $\textbf{\color{#d91a1a}-10.23\%}$
test_set_shared 1.1979ms 0.1686ms 5.9309 KOps/s 5.8084 KOps/s $\color{#35bf28}+2.11\%$
test_update 0.1195ms 24.8309μs 40.2725 KOps/s 47.7991 KOps/s $\textbf{\color{#d91a1a}-15.75\%}$
test_update_nested 96.6110μs 35.0398μs 28.5390 KOps/s 31.8067 KOps/s $\textbf{\color{#d91a1a}-10.27\%}$
test_update__nested 0.7906ms 33.1291μs 30.1849 KOps/s 29.5755 KOps/s $\color{#35bf28}+2.06\%$
test_set_nested 74.6100μs 22.8726μs 43.7204 KOps/s 47.1654 KOps/s $\textbf{\color{#d91a1a}-7.30\%}$
test_set_nested_new 0.1054ms 29.6111μs 33.7711 KOps/s 37.7566 KOps/s $\textbf{\color{#d91a1a}-10.56\%}$
test_select 90.4890μs 43.1333μs 23.1839 KOps/s 23.4495 KOps/s $\color{#d91a1a}-1.13\%$
test_select_nested 0.1189ms 63.3603μs 15.7828 KOps/s 15.8559 KOps/s $\color{#d91a1a}-0.46\%$
test_exclude_nested 0.1347ms 81.8581μs 12.2163 KOps/s 12.2598 KOps/s $\color{#d91a1a}-0.36\%$
test_empty[True] 0.7141ms 0.4085ms 2.4482 KOps/s 2.4716 KOps/s $\color{#d91a1a}-0.95\%$
test_empty[False] 7.4565μs 1.4048μs 711.8251 KOps/s 703.4953 KOps/s $\color{#35bf28}+1.18\%$
test_unbind_speed 0.4425ms 0.2730ms 3.6632 KOps/s 3.7303 KOps/s $\color{#d91a1a}-1.80\%$
test_unbind_speed_stack0 0.4267ms 0.2594ms 3.8552 KOps/s 3.8393 KOps/s $\color{#35bf28}+0.41\%$
test_unbind_speed_stack1 0.1049s 0.8428ms 1.1865 KOps/s 1.4003 KOps/s $\textbf{\color{#d91a1a}-15.27\%}$
test_split 0.1130s 1.7983ms 556.0664 Ops/s 561.4347 Ops/s $\color{#d91a1a}-0.96\%$
test_chunk 0.1074s 1.7887ms 559.0616 Ops/s 562.3905 Ops/s $\color{#d91a1a}-0.59\%$
test_consolidate_njt[False-None] 8.9416ms 8.3092ms 120.3491 Ops/s 123.4744 Ops/s $\color{#d91a1a}-2.53\%$
test_creation[device0] 4.1446ms 92.8957μs 10.7648 KOps/s 10.9107 KOps/s $\color{#d91a1a}-1.34\%$
test_creation_from_tensor 0.2413ms 94.7141μs 10.5581 KOps/s 10.5521 KOps/s $\color{#35bf28}+0.06\%$
test_add_one[memmap_tensor0] 0.1653ms 5.0429μs 198.2991 KOps/s 206.5753 KOps/s $\color{#d91a1a}-4.01\%$
test_contiguous[memmap_tensor0] 13.8260μs 0.5108μs 1.9579 MOps/s 1.9370 MOps/s $\color{#35bf28}+1.08\%$
test_stack[memmap_tensor0] 41.2070μs 3.5403μs 282.4644 KOps/s 295.2695 KOps/s $\color{#d91a1a}-4.34\%$
test_memmaptd_index 0.8832ms 0.2365ms 4.2292 KOps/s 4.2456 KOps/s $\color{#d91a1a}-0.39\%$
test_memmaptd_index_astensor 0.5710ms 0.3221ms 3.1048 KOps/s 3.1195 KOps/s $\color{#d91a1a}-0.47\%$
test_memmaptd_index_op 0.9742ms 0.6192ms 1.6149 KOps/s 1.7832 KOps/s $\textbf{\color{#d91a1a}-9.44\%}$
test_serialize_model 0.1234s 0.1170s 8.5469 Ops/s 7.2912 Ops/s $\textbf{\color{#35bf28}+17.22\%}$
test_serialize_model_pickle 0.4670s 0.3943s 2.5359 Ops/s 2.5040 Ops/s $\color{#35bf28}+1.27\%$
test_serialize_weights 0.1185s 0.1132s 8.8349 Ops/s 8.6550 Ops/s $\color{#35bf28}+2.08\%$
test_serialize_weights_returnearly 0.1679s 0.1573s 6.3584 Ops/s 6.2151 Ops/s $\color{#35bf28}+2.31\%$
test_serialize_weights_pickle 0.4682s 0.4031s 2.4805 Ops/s 2.5078 Ops/s $\color{#d91a1a}-1.09\%$
test_serialize_weights_filesystem 0.1552s 0.1465s 6.8270 Ops/s 7.1960 Ops/s $\textbf{\color{#d91a1a}-5.13\%}$
test_serialize_model_filesystem 0.2550s 0.1616s 6.1889 Ops/s 6.1041 Ops/s $\color{#35bf28}+1.39\%$
test_reshape_pytree 57.7880μs 26.7413μs 37.3953 KOps/s 37.2102 KOps/s $\color{#35bf28}+0.50\%$
test_reshape_td 81.6930μs 32.2829μs 30.9761 KOps/s 29.8615 KOps/s $\color{#35bf28}+3.73\%$
test_view_pytree 61.9560μs 26.5738μs 37.6310 KOps/s 37.3521 KOps/s $\color{#35bf28}+0.75\%$
test_view_td 88.7570μs 38.3374μs 26.0842 KOps/s 26.3286 KOps/s $\color{#d91a1a}-0.93\%$
test_unbind_pytree 61.7770μs 29.8657μs 33.4832 KOps/s 33.9472 KOps/s $\color{#d91a1a}-1.37\%$
test_unbind_td 0.3524ms 40.2578μs 24.8399 KOps/s 25.6021 KOps/s $\color{#d91a1a}-2.98\%$
test_split_pytree 0.1048ms 29.5279μs 33.8663 KOps/s 33.8930 KOps/s $\color{#d91a1a}-0.08\%$
test_split_td 0.5273ms 45.2769μs 22.0863 KOps/s 22.3799 KOps/s $\color{#d91a1a}-1.31\%$
test_add_pytree 79.7400μs 35.2132μs 28.3984 KOps/s 28.0941 KOps/s $\color{#35bf28}+1.08\%$
test_add_td 0.1086ms 58.4734μs 17.1018 KOps/s 18.4331 KOps/s $\textbf{\color{#d91a1a}-7.22\%}$
test_compile_add_one_nested[tensordict-compile] 0.1181ms 62.9299μs 15.8907 KOps/s 15.9027 KOps/s $\color{#d91a1a}-0.08\%$
test_compile_add_one_nested[tensordict-eager] 0.5220ms 0.1737ms 5.7563 KOps/s 5.8139 KOps/s $\color{#d91a1a}-0.99\%$
test_compile_add_one_nested[pytree-compile] 0.1351ms 46.1152μs 21.6848 KOps/s 22.0736 KOps/s $\color{#d91a1a}-1.76\%$
test_compile_add_one_nested[pytree-eager] 0.2433ms 0.1190ms 8.4054 KOps/s 8.3327 KOps/s $\color{#35bf28}+0.87\%$
test_compile_copy_nested[tensordict-compile] 63.2490μs 26.7773μs 37.3450 KOps/s 39.1061 KOps/s $\color{#d91a1a}-4.50\%$
test_compile_copy_nested[tensordict-eager] 0.1177ms 58.1611μs 17.1936 KOps/s 16.8345 KOps/s $\color{#35bf28}+2.13\%$
test_compile_copy_nested[pytree-compile] 0.1459ms 77.2332μs 12.9478 KOps/s 12.6843 KOps/s $\color{#35bf28}+2.08\%$
test_compile_copy_nested[pytree-eager] 0.1470ms 67.6592μs 14.7799 KOps/s 14.9139 KOps/s $\color{#d91a1a}-0.90\%$
test_compile_add_one_flat[tensordict-compile] 0.2432ms 0.1076ms 9.2912 KOps/s 9.5740 KOps/s $\color{#d91a1a}-2.95\%$
test_compile_add_one_flat[tensordict-eager] 0.4389ms 0.2182ms 4.5829 KOps/s 4.7018 KOps/s $\color{#d91a1a}-2.53\%$
test_compile_add_one_flat[tensorclass-compile] 0.1700ms 45.5495μs 21.9541 KOps/s 22.1298 KOps/s $\color{#d91a1a}-0.79\%$
test_compile_add_one_flat[tensorclass-eager] 0.5039ms 67.5358μs 14.8070 KOps/s 15.1625 KOps/s $\color{#d91a1a}-2.34\%$
test_compile_add_one_flat[pytree-compile] 0.1772ms 0.1012ms 9.8834 KOps/s 9.9308 KOps/s $\color{#d91a1a}-0.48\%$
test_compile_add_one_flat[pytree-eager] 0.2928ms 0.2059ms 4.8564 KOps/s 4.9495 KOps/s $\color{#d91a1a}-1.88\%$
test_compile_add_self_flat[tensordict-eager] 0.4904ms 0.2349ms 4.2569 KOps/s 4.3321 KOps/s $\color{#d91a1a}-1.73\%$
test_compile_add_self_flat[tensordict-compile] 0.2168ms 0.1068ms 9.3640 KOps/s 9.6432 KOps/s $\color{#d91a1a}-2.90\%$
test_compile_add_self_flat[tensorclass-eager] 0.2112ms 67.7746μs 14.7548 KOps/s 15.4669 KOps/s $\color{#d91a1a}-4.60\%$
test_compile_add_self_flat[tensorclass-compile] 0.1879ms 47.2102μs 21.1819 KOps/s 21.4784 KOps/s $\color{#d91a1a}-1.38\%$
test_compile_add_self_flat[pytree-eager] 0.2906ms 0.1592ms 6.2813 KOps/s 6.3197 KOps/s $\color{#d91a1a}-0.61\%$
test_compile_add_self_flat[pytree-compile] 0.2177ms 0.1021ms 9.7919 KOps/s 9.9249 KOps/s $\color{#d91a1a}-1.34\%$
test_compile_copy_flat[tensordict-compile] 60.9130μs 21.1428μs 47.2974 KOps/s 47.9758 KOps/s $\color{#d91a1a}-1.41\%$
test_compile_copy_flat[tensordict-eager] 0.1648ms 67.3228μs 14.8538 KOps/s 15.0522 KOps/s $\color{#d91a1a}-1.32\%$
test_compile_copy_flat[pytree-compile] 0.1268ms 80.0453μs 12.4929 KOps/s 12.8099 KOps/s $\color{#d91a1a}-2.47\%$
test_compile_copy_flat[pytree-eager] 0.1153ms 68.5083μs 14.5968 KOps/s 14.6950 KOps/s $\color{#d91a1a}-0.67\%$
test_compile_assign_and_add[tensordict-compile] 1.3030ms 0.2093ms 4.7768 KOps/s 4.9276 KOps/s $\color{#d91a1a}-3.06\%$
test_compile_assign_and_add[tensordict-eager] 1.6118ms 1.3070ms 765.0888 Ops/s 764.6990 Ops/s $\color{#35bf28}+0.05\%$
test_compile_assign_and_add[pytree-compile] 0.2854ms 0.2004ms 4.9895 KOps/s 5.0153 KOps/s $\color{#d91a1a}-0.51\%$
test_compile_assign_and_add[pytree-eager] 0.9867ms 0.7851ms 1.2738 KOps/s 1.2824 KOps/s $\color{#d91a1a}-0.68\%$
test_compile_assign_and_add_stack[compile] 0.5781ms 0.4446ms 2.2493 KOps/s 2.2739 KOps/s $\color{#d91a1a}-1.08\%$
test_compile_assign_and_add_stack[eager] 4.1616ms 2.7987ms 357.3128 Ops/s 385.3752 Ops/s $\textbf{\color{#d91a1a}-7.28\%}$
test_compile_indexing[tensor-tensordict-compile] 87.4530μs 36.2777μs 27.5651 KOps/s 28.1363 KOps/s $\color{#d91a1a}-2.03\%$
test_compile_indexing[tensor-tensordict-eager] 0.4857ms 33.5752μs 29.7839 KOps/s 30.2033 KOps/s $\color{#d91a1a}-1.39\%$
test_compile_indexing[tensor-tensorclass-compile] 78.2870μs 28.5470μs 35.0299 KOps/s 33.7828 KOps/s $\color{#35bf28}+3.69\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1074ms 23.0856μs 43.3170 KOps/s 42.5111 KOps/s $\color{#35bf28}+1.90\%$
test_compile_indexing[tensor-pytree-compile] 0.1022ms 29.0069μs 34.4746 KOps/s 33.2998 KOps/s $\color{#35bf28}+3.53\%$
test_compile_indexing[tensor-pytree-eager] 68.2180μs 23.0748μs 43.3373 KOps/s 42.5169 KOps/s $\color{#35bf28}+1.93\%$
test_compile_indexing[slice-tensordict-compile] 0.1276ms 51.8916μs 19.2710 KOps/s 19.4963 KOps/s $\color{#d91a1a}-1.16\%$
test_compile_indexing[slice-tensordict-eager] 0.6079ms 20.6360μs 48.4589 KOps/s 49.8482 KOps/s $\color{#d91a1a}-2.79\%$
test_compile_indexing[slice-tensorclass-compile] 0.1207ms 44.8070μs 22.3179 KOps/s 22.8162 KOps/s $\color{#d91a1a}-2.18\%$
test_compile_indexing[slice-tensorclass-eager] 61.9560μs 18.7326μs 53.3828 KOps/s 53.4713 KOps/s $\color{#d91a1a}-0.17\%$
test_compile_indexing[slice-pytree-compile] 97.7330μs 45.9488μs 21.7634 KOps/s 22.4367 KOps/s $\color{#d91a1a}-3.00\%$
test_compile_indexing[slice-pytree-eager] 51.7270μs 18.7456μs 53.3457 KOps/s 53.4359 KOps/s $\color{#d91a1a}-0.17\%$
test_compile_indexing[int-tensordict-compile] 0.1084ms 53.3247μs 18.7530 KOps/s 19.1620 KOps/s $\color{#d91a1a}-2.13\%$
test_compile_indexing[int-tensordict-eager] 1.0079ms 20.3351μs 49.1760 KOps/s 50.8150 KOps/s $\color{#d91a1a}-3.23\%$
test_compile_indexing[int-tensorclass-compile] 91.5320μs 45.4845μs 21.9855 KOps/s 22.5322 KOps/s $\color{#d91a1a}-2.43\%$
test_compile_indexing[int-tensorclass-eager] 51.5870μs 18.5942μs 53.7802 KOps/s 53.5229 KOps/s $\color{#35bf28}+0.48\%$
test_compile_indexing[int-pytree-compile] 0.1196ms 45.5611μs 21.9485 KOps/s 22.3781 KOps/s $\color{#d91a1a}-1.92\%$
test_compile_indexing[int-pytree-eager] 67.6660μs 18.4124μs 54.3113 KOps/s 53.6981 KOps/s $\color{#35bf28}+1.14\%$
test_mod_add[eager] 0.1063ms 35.9032μs 27.8527 KOps/s 31.0872 KOps/s $\textbf{\color{#d91a1a}-10.40\%}$
test_mod_add[compile] 0.1026ms 47.2313μs 21.1724 KOps/s 21.1684 KOps/s $\color{#35bf28}+0.02\%$
test_mod_add[compile-overhead] 0.1047ms 47.5442μs 21.0331 KOps/s 21.5317 KOps/s $\color{#d91a1a}-2.32\%$
test_mod_wrap[eager] 0.3467ms 0.2221ms 4.5018 KOps/s 4.4961 KOps/s $\color{#35bf28}+0.13\%$
test_mod_wrap[compile] 0.3105ms 0.2037ms 4.9104 KOps/s 4.9066 KOps/s $\color{#35bf28}+0.08\%$
test_mod_wrap[compile-overhead] 0.4954ms 0.2154ms 4.6415 KOps/s 4.9791 KOps/s $\textbf{\color{#d91a1a}-6.78\%}$
test_mod_wrap_and_backward[eager] 15.1852ms 11.8938ms 84.0772 Ops/s 77.8764 Ops/s $\textbf{\color{#35bf28}+7.96\%}$
test_mod_wrap_and_backward[compile] 16.9878ms 13.6200ms 73.4214 Ops/s 81.1555 Ops/s $\textbf{\color{#d91a1a}-9.53\%}$
test_mod_wrap_and_backward[compile-overhead] 15.5957ms 12.9558ms 77.1854 Ops/s 82.8773 Ops/s $\textbf{\color{#d91a1a}-6.87\%}$
test_seq_add[eager] 0.1921ms 0.1188ms 8.4165 KOps/s 8.7457 KOps/s $\color{#d91a1a}-3.76\%$
test_seq_add[compile] 0.1087ms 62.9354μs 15.8893 KOps/s 16.8047 KOps/s $\textbf{\color{#d91a1a}-5.45\%}$
test_seq_add[compile-overhead] 0.1116ms 59.9163μs 16.6899 KOps/s 16.2575 KOps/s $\color{#35bf28}+2.66\%$
test_seq_wrap[eager] 0.7523ms 0.4505ms 2.2197 KOps/s 2.2435 KOps/s $\color{#d91a1a}-1.06\%$
test_seq_wrap[compile] 0.3081ms 0.2271ms 4.4028 KOps/s 4.4000 KOps/s $\color{#35bf28}+0.06\%$
test_seq_wrap[compile-overhead] 0.4376ms 0.2246ms 4.4529 KOps/s 4.3977 KOps/s $\color{#35bf28}+1.25\%$
test_func_call_runtime[False-eager] 0.9639ms 0.5429ms 1.8420 KOps/s 1.8297 KOps/s $\color{#35bf28}+0.67\%$
test_func_call_runtime[False-compile] 0.7649ms 0.4222ms 2.3686 KOps/s 2.3426 KOps/s $\color{#35bf28}+1.11\%$
test_func_call_runtime[False-compile-overhead] 0.5133ms 0.4191ms 2.3862 KOps/s 2.3421 KOps/s $\color{#35bf28}+1.88\%$
test_func_call_runtime[True-eager] 1.4104ms 0.7462ms 1.3400 KOps/s 1.3067 KOps/s $\color{#35bf28}+2.55\%$
test_func_call_runtime[True-compile] 0.6195ms 0.4616ms 2.1665 KOps/s 2.1583 KOps/s $\color{#35bf28}+0.38\%$
test_func_call_runtime[True-compile-overhead] 0.9180ms 0.4614ms 2.1672 KOps/s 2.1616 KOps/s $\color{#35bf28}+0.26\%$
test_func_call_cm_runtime[False-eager] 0.7806ms 0.5423ms 1.8439 KOps/s 1.8406 KOps/s $\color{#35bf28}+0.18\%$
test_func_call_cm_runtime[False-compile] 0.5215ms 0.4176ms 2.3946 KOps/s 2.3759 KOps/s $\color{#35bf28}+0.79\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6012ms 0.4178ms 2.3936 KOps/s 2.3608 KOps/s $\color{#35bf28}+1.39\%$
test_func_call_cm_runtime[True-eager] 0.9922ms 0.8944ms 1.1180 KOps/s 1.0947 KOps/s $\color{#35bf28}+2.13\%$
test_func_call_cm_runtime[True-compile] 0.5714ms 0.4787ms 2.0891 KOps/s 2.0625 KOps/s $\color{#35bf28}+1.29\%$
test_func_call_cm_runtime[True-compile-overhead] 0.8001ms 0.4841ms 2.0657 KOps/s 2.0546 KOps/s $\color{#35bf28}+0.54\%$
test_vmap_func_call_cm_runtime[eager] 2.3353ms 1.8737ms 533.7173 Ops/s 523.7140 Ops/s $\color{#35bf28}+1.91\%$
test_vmap_func_call_cm_runtime[compile] 0.7467ms 0.5137ms 1.9466 KOps/s 1.9160 KOps/s $\color{#35bf28}+1.60\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.9998ms 0.5166ms 1.9356 KOps/s 1.9283 KOps/s $\color{#35bf28}+0.38\%$
test_distributed 0.2337ms 0.1251ms 7.9922 KOps/s 7.8659 KOps/s $\color{#35bf28}+1.61\%$
test_tdmodule 51.9780μs 27.5359μs 36.3162 KOps/s 39.9478 KOps/s $\textbf{\color{#d91a1a}-9.09\%}$
test_tdmodule_dispatch 82.0140μs 50.1085μs 19.9567 KOps/s 22.3228 KOps/s $\textbf{\color{#d91a1a}-10.60\%}$
test_tdseq 60.3630μs 30.0239μs 33.3068 KOps/s 36.9900 KOps/s $\textbf{\color{#d91a1a}-9.96\%}$
test_tdseq_dispatch 89.9990μs 55.5714μs 17.9949 KOps/s 19.6776 KOps/s $\textbf{\color{#d91a1a}-8.55\%}$
test_instantiation_functorch 3.4666ms 1.5396ms 649.5137 Ops/s 660.0593 Ops/s $\color{#d91a1a}-1.60\%$
test_exec_functorch 0.3188ms 0.1815ms 5.5106 KOps/s 5.6263 KOps/s $\color{#d91a1a}-2.06\%$
test_exec_functional_call 0.3142ms 0.1752ms 5.7091 KOps/s 5.8221 KOps/s $\color{#d91a1a}-1.94\%$
test_exec_td_decorator 0.4411ms 0.2315ms 4.3201 KOps/s 4.4091 KOps/s $\color{#d91a1a}-2.02\%$
test_vmap_mlp_speed_decorator[True-True] 0.8949ms 0.6494ms 1.5400 KOps/s 1.5456 KOps/s $\color{#d91a1a}-0.36\%$
test_vmap_mlp_speed_decorator[True-False] 0.9882ms 0.6512ms 1.5356 KOps/s 1.5344 KOps/s $\color{#35bf28}+0.08\%$
test_vmap_mlp_speed_decorator[False-True] 0.7823ms 0.5204ms 1.9216 KOps/s 1.9065 KOps/s $\color{#35bf28}+0.79\%$
test_vmap_mlp_speed_decorator[False-False] 0.7697ms 0.5226ms 1.9135 KOps/s 1.9021 KOps/s $\color{#35bf28}+0.60\%$
test_to_module_speed[True] 2.0909ms 1.3493ms 741.1200 Ops/s 748.2388 Ops/s $\color{#d91a1a}-0.95\%$
test_to_module_speed[False] 2.4731ms 1.3109ms 762.8410 Ops/s 777.6265 Ops/s $\color{#d91a1a}-1.90\%$
test_tc_init 86.9130μs 48.3717μs 20.6732 KOps/s 22.4598 KOps/s $\textbf{\color{#d91a1a}-7.95\%}$
test_tc_init_nested 0.1973ms 94.6835μs 10.5615 KOps/s 11.4533 KOps/s $\textbf{\color{#d91a1a}-7.79\%}$
test_tc_first_layer_tensor 21.6810μs 1.5291μs 653.9796 KOps/s 656.0728 KOps/s $\color{#d91a1a}-0.32\%$
test_tc_first_layer_nontensor 21.2200μs 4.6126μs 216.7976 KOps/s 219.5368 KOps/s $\color{#d91a1a}-1.25\%$
test_tc_second_layer_tensor 38.7430μs 2.8539μs 350.4002 KOps/s 361.5192 KOps/s $\color{#d91a1a}-3.08\%$
test_tc_second_layer_nontensor 37.8510μs 5.9494μs 168.0835 KOps/s 169.2071 KOps/s $\color{#d91a1a}-0.66\%$
test_unbind 0.2333s 15.4756ms 64.6178 Ops/s 77.9538 Ops/s $\textbf{\color{#d91a1a}-17.11\%}$
test_full_like 9.5799ms 8.2079ms 121.8340 Ops/s 132.0870 Ops/s $\textbf{\color{#d91a1a}-7.76\%}$
test_zeros_like 3.6145ms 2.9316ms 341.1148 Ops/s 354.8626 Ops/s $\color{#d91a1a}-3.87\%$
test_ones_like 4.7861ms 3.4363ms 291.0082 Ops/s 156.2021 Ops/s $\textbf{\color{#35bf28}+86.30\%}$
test_clone 6.0699ms 5.4440ms 183.6883 Ops/s 119.5684 Ops/s $\textbf{\color{#35bf28}+53.63\%}$
test_squeeze 62.2160μs 12.3982μs 80.6572 KOps/s 79.8162 KOps/s $\color{#35bf28}+1.05\%$
test_unsqueeze 0.2988ms 92.3432μs 10.8292 KOps/s 11.0896 KOps/s $\color{#d91a1a}-2.35\%$
test_split 0.3743ms 0.1943ms 5.1468 KOps/s 5.2477 KOps/s $\color{#d91a1a}-1.92\%$
test_permute 0.2785ms 0.2012ms 4.9709 KOps/s 4.9110 KOps/s $\color{#35bf28}+1.22\%$
test_stack 29.9227ms 27.5885ms 36.2470 Ops/s 37.3067 Ops/s $\color{#d91a1a}-2.84\%$
test_cat 31.1309ms 26.7310ms 37.4097 Ops/s 37.9162 Ops/s $\color{#d91a1a}-1.34\%$

Copy link

github-actions bot commented Jan 9, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}41$. Worsened: $\large\color{#d91a1a}5$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 36.6310μs 12.3668μs 80.8613 KOps/s 76.6001 KOps/s $\textbf{\color{#35bf28}+5.56\%}$
test_plain_set_stack_nested 37.6500μs 12.4102μs 80.5787 KOps/s 76.1812 KOps/s $\textbf{\color{#35bf28}+5.77\%}$
test_plain_set_nested_inplace 45.2310μs 13.3300μs 75.0188 KOps/s 71.0366 KOps/s $\textbf{\color{#35bf28}+5.61\%}$
test_plain_set_stack_nested_inplace 44.0410μs 13.3107μs 75.1275 KOps/s 71.3264 KOps/s $\textbf{\color{#35bf28}+5.33\%}$
test_items 23.0600μs 2.8821μs 346.9729 KOps/s 341.7641 KOps/s $\color{#35bf28}+1.52\%$
test_items_nested 0.4341ms 0.3580ms 2.7930 KOps/s 2.8285 KOps/s $\color{#d91a1a}-1.25\%$
test_items_nested_locked 0.4246ms 0.3570ms 2.8008 KOps/s 2.8229 KOps/s $\color{#d91a1a}-0.79\%$
test_items_nested_leaf 81.3010μs 57.9626μs 17.2525 KOps/s 17.4158 KOps/s $\color{#d91a1a}-0.94\%$
test_items_stack_nested 0.4238ms 0.3623ms 2.7604 KOps/s 2.8186 KOps/s $\color{#d91a1a}-2.07\%$
test_items_stack_nested_leaf 89.8520μs 59.9555μs 16.6790 KOps/s 16.5865 KOps/s $\color{#35bf28}+0.56\%$
test_items_stack_nested_locked 0.4106ms 0.3603ms 2.7757 KOps/s 2.8220 KOps/s $\color{#d91a1a}-1.64\%$
test_keys 24.3000μs 3.4354μs 291.0901 KOps/s 288.0469 KOps/s $\color{#35bf28}+1.06\%$
test_keys_nested 0.1231ms 80.5583μs 12.4134 KOps/s 12.3712 KOps/s $\color{#35bf28}+0.34\%$
test_keys_nested_locked 0.7832ms 86.3885μs 11.5756 KOps/s 11.5457 KOps/s $\color{#35bf28}+0.26\%$
test_keys_nested_leaf 0.1121ms 71.9239μs 13.9036 KOps/s 13.9345 KOps/s $\color{#d91a1a}-0.22\%$
test_keys_stack_nested 0.1162ms 81.8166μs 12.2225 KOps/s 12.1504 KOps/s $\color{#35bf28}+0.59\%$
test_keys_stack_nested_leaf 0.1216ms 73.1626μs 13.6682 KOps/s 13.6102 KOps/s $\color{#35bf28}+0.43\%$
test_keys_stack_nested_locked 0.1415ms 89.1599μs 11.2158 KOps/s 11.3854 KOps/s $\color{#d91a1a}-1.49\%$
test_values 5.2902μs 0.8582μs 1.1653 MOps/s 1.1627 MOps/s $\color{#35bf28}+0.22\%$
test_values_nested 60.2310μs 34.4164μs 29.0559 KOps/s 29.4432 KOps/s $\color{#d91a1a}-1.32\%$
test_values_nested_locked 63.3210μs 35.8490μs 27.8948 KOps/s 28.0466 KOps/s $\color{#d91a1a}-0.54\%$
test_values_nested_leaf 72.3310μs 39.1172μs 25.5642 KOps/s 25.6237 KOps/s $\color{#d91a1a}-0.23\%$
test_values_stack_nested 60.4410μs 34.7527μs 28.7748 KOps/s 28.9204 KOps/s $\color{#d91a1a}-0.50\%$
test_values_stack_nested_leaf 73.4210μs 39.7739μs 25.1421 KOps/s 25.5711 KOps/s $\color{#d91a1a}-1.68\%$
test_values_stack_nested_locked 71.8110μs 36.5693μs 27.3453 KOps/s 27.6246 KOps/s $\color{#d91a1a}-1.01\%$
test_membership 7.1706μs 0.5043μs 1.9828 MOps/s 1.9893 MOps/s $\color{#d91a1a}-0.32\%$
test_membership_nested 13.3650μs 1.9362μs 516.4654 KOps/s 508.6961 KOps/s $\color{#35bf28}+1.53\%$
test_membership_nested_leaf 17.1455μs 1.9219μs 520.3251 KOps/s 510.6658 KOps/s $\color{#35bf28}+1.89\%$
test_membership_stacked_nested 27.9200μs 2.0301μs 492.5801 KOps/s 482.8565 KOps/s $\color{#35bf28}+2.01\%$
test_membership_stacked_nested_leaf 40.6310μs 2.0213μs 494.7292 KOps/s 487.1460 KOps/s $\color{#35bf28}+1.56\%$
test_membership_nested_last 35.0710μs 2.9909μs 334.3492 KOps/s 328.0863 KOps/s $\color{#35bf28}+1.91\%$
test_membership_nested_leaf_last 36.0610μs 2.9876μs 334.7201 KOps/s 333.7883 KOps/s $\color{#35bf28}+0.28\%$
test_membership_stacked_nested_last 27.4700μs 3.0507μs 327.7889 KOps/s 203.1771 KOps/s $\textbf{\color{#35bf28}+61.33\%}$
test_membership_stacked_nested_leaf_last 26.8100μs 2.9584μs 338.0259 KOps/s 204.3167 KOps/s $\textbf{\color{#35bf28}+65.44\%}$
test_nested_getleaf 33.7600μs 6.1089μs 163.6964 KOps/s 164.1616 KOps/s $\color{#d91a1a}-0.28\%$
test_nested_get 34.6000μs 5.8065μs 172.2215 KOps/s 173.4035 KOps/s $\color{#d91a1a}-0.68\%$
test_stacked_getleaf 43.7810μs 6.0587μs 165.0508 KOps/s 163.8273 KOps/s $\color{#35bf28}+0.75\%$
test_stacked_get 29.6100μs 5.7927μs 172.6311 KOps/s 172.7710 KOps/s $\color{#d91a1a}-0.08\%$
test_nested_getitemleaf 36.8400μs 6.1565μs 162.4310 KOps/s 162.0909 KOps/s $\color{#35bf28}+0.21\%$
test_nested_getitem 35.1610μs 5.9508μs 168.0457 KOps/s 170.5791 KOps/s $\color{#d91a1a}-1.49\%$
test_stacked_getitemleaf 30.6810μs 6.1389μs 162.8959 KOps/s 161.8054 KOps/s $\color{#35bf28}+0.67\%$
test_stacked_getitem 33.3300μs 5.8724μs 170.2879 KOps/s 171.3390 KOps/s $\color{#d91a1a}-0.61\%$
test_lock_nested 0.7460ms 0.3725ms 2.6847 KOps/s 2.7060 KOps/s $\color{#d91a1a}-0.79\%$
test_lock_stack_nested 0.3916ms 0.3409ms 2.9338 KOps/s 2.9422 KOps/s $\color{#d91a1a}-0.29\%$
test_unlock_nested 0.7867ms 0.3100ms 3.2259 KOps/s 3.2187 KOps/s $\color{#35bf28}+0.22\%$
test_unlock_stack_nested 0.3403ms 0.2791ms 3.5832 KOps/s 3.5961 KOps/s $\color{#d91a1a}-0.36\%$
test_flatten_speed 0.1038ms 75.4480μs 13.2542 KOps/s 13.3513 KOps/s $\color{#d91a1a}-0.73\%$
test_unflatten_speed 0.4017ms 0.3202ms 3.1230 KOps/s 3.1478 KOps/s $\color{#d91a1a}-0.79\%$
test_common_ops 93.8521ms 0.6844ms 1.4612 KOps/s 1.5554 KOps/s $\textbf{\color{#d91a1a}-6.05\%}$
test_creation 35.7110μs 1.7288μs 578.4443 KOps/s 577.7725 KOps/s $\color{#35bf28}+0.12\%$
test_creation_empty 34.3200μs 8.2950μs 120.5549 KOps/s 101.8570 KOps/s $\textbf{\color{#35bf28}+18.36\%}$
test_creation_nested_1 34.2010μs 9.7750μs 102.3013 KOps/s 87.3017 KOps/s $\textbf{\color{#35bf28}+17.18\%}$
test_creation_nested_2 36.6800μs 12.4502μs 80.3202 KOps/s 70.9365 KOps/s $\textbf{\color{#35bf28}+13.23\%}$
test_clone 76.2410μs 10.5459μs 94.8237 KOps/s 92.1476 KOps/s $\color{#35bf28}+2.90\%$
test_getitem[int] 1.5886ms 10.3647μs 96.4814 KOps/s 94.0231 KOps/s $\color{#35bf28}+2.61\%$
test_getitem[slice_int] 0.1254ms 20.5430μs 48.6784 KOps/s 47.0420 KOps/s $\color{#35bf28}+3.48\%$
test_getitem[range] 0.1289ms 36.4386μs 27.4434 KOps/s 27.0721 KOps/s $\color{#35bf28}+1.37\%$
test_getitem[tuple] 0.1047ms 17.8697μs 55.9605 KOps/s 55.6552 KOps/s $\color{#35bf28}+0.55\%$
test_getitem[list] 0.1732ms 32.4850μs 30.7834 KOps/s 30.4786 KOps/s $\color{#35bf28}+1.00\%$
test_setitem_dim[int] 41.9310μs 17.9757μs 55.6306 KOps/s 52.4268 KOps/s $\textbf{\color{#35bf28}+6.11\%}$
test_setitem_dim[slice_int] 62.8900μs 37.8121μs 26.4465 KOps/s 26.3414 KOps/s $\color{#35bf28}+0.40\%$
test_setitem_dim[range] 81.0710μs 51.1667μs 19.5439 KOps/s 19.2144 KOps/s $\color{#35bf28}+1.72\%$
test_setitem_dim[tuple] 51.7710μs 30.6861μs 32.5880 KOps/s 31.2856 KOps/s $\color{#35bf28}+4.16\%$
test_setitem 67.9510μs 15.1031μs 66.2117 KOps/s 61.7585 KOps/s $\textbf{\color{#35bf28}+7.21\%}$
test_set 65.6210μs 14.7684μs 67.7121 KOps/s 63.8176 KOps/s $\textbf{\color{#35bf28}+6.10\%}$
test_set_shared 1.6504ms 0.1504ms 6.6487 KOps/s 6.6312 KOps/s $\color{#35bf28}+0.26\%$
test_update 0.4598ms 17.7399μs 56.3702 KOps/s 51.0573 KOps/s $\textbf{\color{#35bf28}+10.41\%}$
test_update_nested 71.2710μs 23.4091μs 42.7185 KOps/s 40.3658 KOps/s $\textbf{\color{#35bf28}+5.83\%}$
test_update__nested 0.6048ms 24.8317μs 40.2711 KOps/s 39.4695 KOps/s $\color{#35bf28}+2.03\%$
test_set_nested 63.7610μs 15.8885μs 62.9386 KOps/s 59.1880 KOps/s $\textbf{\color{#35bf28}+6.34\%}$
test_set_nested_new 64.2410μs 18.2642μs 54.7519 KOps/s 52.6288 KOps/s $\color{#35bf28}+4.03\%$
test_select 92.8220μs 30.5647μs 32.7174 KOps/s 31.5357 KOps/s $\color{#35bf28}+3.75\%$
test_select_nested 69.3910μs 43.1145μs 23.1941 KOps/s 23.0045 KOps/s $\color{#35bf28}+0.82\%$
test_exclude_nested 89.2110μs 62.2525μs 16.0636 KOps/s 16.1394 KOps/s $\color{#d91a1a}-0.47\%$
test_empty[True] 0.3423ms 0.2908ms 3.4390 KOps/s 3.4831 KOps/s $\color{#d91a1a}-1.26\%$
test_empty[False] 3.3810μs 0.8183μs 1.2221 MOps/s 1.2195 MOps/s $\color{#35bf28}+0.21\%$
test_to 88.2310μs 55.7459μs 17.9385 KOps/s 17.1955 KOps/s $\color{#35bf28}+4.32\%$
test_to_nonblocking 93.6120μs 48.0740μs 20.8013 KOps/s 20.7056 KOps/s $\color{#35bf28}+0.46\%$
test_unbind_speed 1.3286ms 0.2315ms 4.3206 KOps/s 4.1765 KOps/s $\color{#35bf28}+3.45\%$
test_unbind_speed_stack0 0.2777ms 0.2350ms 4.2559 KOps/s 4.1849 KOps/s $\color{#35bf28}+1.70\%$
test_unbind_speed_stack1 93.4131ms 0.6632ms 1.5079 KOps/s 1.5025 KOps/s $\color{#35bf28}+0.36\%$
test_split 94.8115ms 1.5889ms 629.3507 Ops/s 630.1966 Ops/s $\color{#d91a1a}-0.13\%$
test_chunk 95.0579ms 1.5816ms 632.2675 Ops/s 631.5419 Ops/s $\color{#35bf28}+0.11\%$
test_consolidate[False-None] 97.6825ms 2.9447ms 339.5988 Ops/s 342.6524 Ops/s $\color{#d91a1a}-0.89\%$
test_consolidate[default-None] 1.7991ms 1.6434ms 608.5026 Ops/s 593.1332 Ops/s $\color{#35bf28}+2.59\%$
test_consolidate[reduce-overhead-None] 1.7639ms 1.6865ms 592.9537 Ops/s 584.3206 Ops/s $\color{#35bf28}+1.48\%$
test_consolidate_njt[False-None] 6.6411ms 6.3868ms 156.5735 Ops/s 152.3785 Ops/s $\color{#35bf28}+2.75\%$
test_to[False-False-None] 1.8675ms 1.7468ms 572.4883 Ops/s 567.1778 Ops/s $\color{#35bf28}+0.94\%$
test_to[True-False-None] 1.5795ms 1.3295ms 752.1849 Ops/s 738.1657 Ops/s $\color{#35bf28}+1.90\%$
test_to[within-False-None] 4.2230ms 4.1152ms 243.0007 Ops/s 242.3968 Ops/s $\color{#35bf28}+0.25\%$
test_to[True-default-None] 5.4110ms 5.1461ms 194.3203 Ops/s 185.5109 Ops/s $\color{#35bf28}+4.75\%$
test_to_njt[False-False-None] 7.0673ms 6.8507ms 145.9701 Ops/s 141.6678 Ops/s $\color{#35bf28}+3.04\%$
test_to_njt[True-False-None] 5.6395ms 5.3764ms 185.9977 Ops/s 186.1709 Ops/s $\color{#d91a1a}-0.09\%$
test_to_njt[within-False-None] 12.1684ms 11.9421ms 83.7376 Ops/s 82.4203 Ops/s $\color{#35bf28}+1.60\%$
test_creation[device0] 0.5389ms 82.1199μs 12.1773 KOps/s 12.2871 KOps/s $\color{#d91a1a}-0.89\%$
test_creation_from_tensor 0.6055ms 83.2384μs 12.0137 KOps/s 12.0270 KOps/s $\color{#d91a1a}-0.11\%$
test_add_one[memmap_tensor0] 0.2403ms 7.0214μs 142.4211 KOps/s 147.7350 KOps/s $\color{#d91a1a}-3.60\%$
test_contiguous[memmap_tensor0] 1.8530μs 0.4017μs 2.4892 MOps/s 2.4669 MOps/s $\color{#35bf28}+0.90\%$
test_stack[memmap_tensor0] 38.4510μs 4.2635μs 234.5485 KOps/s 231.7473 KOps/s $\color{#35bf28}+1.21\%$
test_memmaptd_index 0.5897ms 0.2465ms 4.0569 KOps/s 4.0386 KOps/s $\color{#35bf28}+0.45\%$
test_memmaptd_index_astensor 0.5890ms 0.3099ms 3.2271 KOps/s 3.2275 KOps/s $\color{#d91a1a}-0.01\%$
test_memmaptd_index_op 1.0557ms 0.5793ms 1.7261 KOps/s 1.6267 KOps/s $\textbf{\color{#35bf28}+6.11\%}$
test_serialize_model 0.1324s 0.1315s 7.6072 Ops/s 7.6078 Ops/s $-0.01\%$
test_serialize_model_pickle 1.3490s 1.2116s 0.8254 Ops/s 0.8242 Ops/s $\color{#35bf28}+0.15\%$
test_serialize_weights 0.1324s 0.1311s 7.6281 Ops/s 7.6614 Ops/s $\color{#d91a1a}-0.43\%$
test_serialize_weights_returnearly 0.4320s 67.8227ms 14.7443 Ops/s 16.3354 Ops/s $\textbf{\color{#d91a1a}-9.74\%}$
test_serialize_weights_pickle 1.3762s 1.2170s 0.8217 Ops/s 0.8219 Ops/s $\color{#d91a1a}-0.02\%$
test_reshape_pytree 53.6810μs 22.0362μs 45.3799 KOps/s 43.3117 KOps/s $\color{#35bf28}+4.78\%$
test_reshape_td 53.6410μs 26.5022μs 37.7327 KOps/s 34.4201 KOps/s $\textbf{\color{#35bf28}+9.62\%}$
test_view_pytree 47.9900μs 21.9880μs 45.4795 KOps/s 43.0702 KOps/s $\textbf{\color{#35bf28}+5.59\%}$
test_view_td 57.0510μs 30.5654μs 32.7168 KOps/s 29.2950 KOps/s $\textbf{\color{#35bf28}+11.68\%}$
test_unbind_pytree 68.5310μs 27.5430μs 36.3068 KOps/s 33.7785 KOps/s $\textbf{\color{#35bf28}+7.49\%}$
test_unbind_td 0.7740ms 35.2392μs 28.3775 KOps/s 26.1719 KOps/s $\textbf{\color{#35bf28}+8.43\%}$
test_split_pytree 73.7610μs 29.4550μs 33.9501 KOps/s 32.0377 KOps/s $\textbf{\color{#35bf28}+5.97\%}$
test_split_td 0.9136ms 37.5017μs 26.6655 KOps/s 25.7449 KOps/s $\color{#35bf28}+3.58\%$
test_add_pytree 70.8810μs 34.4302μs 29.0443 KOps/s 27.1144 KOps/s $\textbf{\color{#35bf28}+7.12\%}$
test_add_td 0.1894ms 51.0607μs 19.5845 KOps/s 18.6118 KOps/s $\textbf{\color{#35bf28}+5.23\%}$
test_compile_add_one_nested[tensordict-compile] 0.1760ms 0.1217ms 8.2170 KOps/s 7.6966 KOps/s $\textbf{\color{#35bf28}+6.76\%}$
test_compile_add_one_nested[tensordict-eager] 0.2262ms 0.1293ms 7.7345 KOps/s 7.2339 KOps/s $\textbf{\color{#35bf28}+6.92\%}$
test_compile_add_one_nested[pytree-compile] 0.1399ms 95.7167μs 10.4475 KOps/s 10.3243 KOps/s $\color{#35bf28}+1.19\%$
test_compile_add_one_nested[pytree-eager] 0.2066ms 0.1480ms 6.7569 KOps/s 6.6785 KOps/s $\color{#35bf28}+1.17\%$
test_compile_copy_nested[tensordict-compile] 57.4500μs 22.5039μs 44.4368 KOps/s 44.4647 KOps/s $\color{#d91a1a}-0.06\%$
test_compile_copy_nested[tensordict-eager] 53.4310μs 29.1839μs 34.2655 KOps/s 34.8043 KOps/s $\color{#d91a1a}-1.55\%$
test_compile_copy_nested[pytree-compile] 0.3363ms 64.6960μs 15.4569 KOps/s 15.2747 KOps/s $\color{#35bf28}+1.19\%$
test_compile_copy_nested[pytree-eager] 87.5310μs 49.2459μs 20.3063 KOps/s 20.0396 KOps/s $\color{#35bf28}+1.33\%$
test_compile_add_one_flat[tensordict-compile] 0.1871ms 0.1418ms 7.0510 KOps/s 6.9410 KOps/s $\color{#35bf28}+1.58\%$
test_compile_add_one_flat[tensordict-eager] 0.3050ms 0.2152ms 4.6458 KOps/s 4.6346 KOps/s $\color{#35bf28}+0.24\%$
test_compile_add_one_flat[tensorclass-compile] 0.1355ms 97.9767μs 10.2065 KOps/s 10.1378 KOps/s $\color{#35bf28}+0.68\%$
test_compile_add_one_flat[tensorclass-eager] 0.1106ms 54.3187μs 18.4099 KOps/s 17.9494 KOps/s $\color{#35bf28}+2.57\%$
test_compile_add_one_flat[pytree-compile] 0.1825ms 0.1364ms 7.3331 KOps/s 7.3134 KOps/s $\color{#35bf28}+0.27\%$
test_compile_add_one_flat[pytree-eager] 0.5654ms 0.4816ms 2.0765 KOps/s 2.0643 KOps/s $\color{#35bf28}+0.59\%$
test_compile_add_self_flat[tensordict-eager] 0.3711ms 0.2592ms 3.8581 KOps/s 3.8383 KOps/s $\color{#35bf28}+0.52\%$
test_compile_add_self_flat[tensordict-compile] 0.1904ms 0.1430ms 6.9920 KOps/s 6.9828 KOps/s $\color{#35bf28}+0.13\%$
test_compile_add_self_flat[tensorclass-eager] 0.1508ms 67.4367μs 14.8287 KOps/s 14.3487 KOps/s $\color{#35bf28}+3.35\%$
test_compile_add_self_flat[tensorclass-compile] 0.1424ms 0.1012ms 9.8806 KOps/s 9.9060 KOps/s $\color{#d91a1a}-0.26\%$
test_compile_add_self_flat[pytree-eager] 0.4645ms 0.4109ms 2.4335 KOps/s 2.4154 KOps/s $\color{#35bf28}+0.75\%$
test_compile_add_self_flat[pytree-compile] 0.1762ms 0.1359ms 7.3577 KOps/s 7.3812 KOps/s $\color{#d91a1a}-0.32\%$
test_compile_copy_flat[tensordict-compile] 60.2410μs 19.2404μs 51.9740 KOps/s 56.0618 KOps/s $\textbf{\color{#d91a1a}-7.29\%}$
test_compile_copy_flat[tensordict-eager] 57.2600μs 31.4180μs 31.8289 KOps/s 32.5048 KOps/s $\color{#d91a1a}-2.08\%$
test_compile_copy_flat[pytree-compile] 0.1019ms 71.3401μs 14.0174 KOps/s 14.0195 KOps/s $\color{#d91a1a}-0.02\%$
test_compile_copy_flat[pytree-eager] 0.1629ms 52.0375μs 19.2169 KOps/s 19.0636 KOps/s $\color{#35bf28}+0.80\%$
test_compile_assign_and_add[tensordict-compile] 1.6055ms 0.3894ms 2.5683 KOps/s 2.2536 KOps/s $\textbf{\color{#35bf28}+13.96\%}$
test_compile_assign_and_add[tensordict-eager] 3.1071ms 2.6696ms 374.5881 Ops/s 382.0570 Ops/s $\color{#d91a1a}-1.95\%$
test_compile_assign_and_add[pytree-compile] 1.5778ms 0.4309ms 2.3208 KOps/s 2.2928 KOps/s $\color{#35bf28}+1.22\%$
test_compile_assign_and_add[pytree-eager] 2.7938ms 2.6880ms 372.0237 Ops/s 372.7449 Ops/s $\color{#d91a1a}-0.19\%$
test_compile_indexing[tensor-tensordict-compile] 0.1848ms 0.1137ms 8.7948 KOps/s 8.3669 KOps/s $\textbf{\color{#35bf28}+5.11\%}$
test_compile_indexing[tensor-tensordict-eager] 0.5658ms 78.8906μs 12.6758 KOps/s 12.0288 KOps/s $\textbf{\color{#35bf28}+5.38\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.1545ms 0.1066ms 9.3831 KOps/s 9.2339 KOps/s $\color{#35bf28}+1.62\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1257ms 68.4332μs 14.6128 KOps/s 14.1700 KOps/s $\color{#35bf28}+3.12\%$
test_compile_indexing[tensor-pytree-compile] 0.2033ms 0.1074ms 9.3089 KOps/s 8.7677 KOps/s $\textbf{\color{#35bf28}+6.17\%}$
test_compile_indexing[tensor-pytree-eager] 0.1193ms 70.0244μs 14.2807 KOps/s 13.7522 KOps/s $\color{#35bf28}+3.84\%$
test_compile_indexing[slice-tensordict-compile] 0.2412ms 99.9012μs 10.0099 KOps/s 9.9399 KOps/s $\color{#35bf28}+0.70\%$
test_compile_indexing[slice-tensordict-eager] 0.1388ms 17.1034μs 58.4680 KOps/s 56.5867 KOps/s $\color{#35bf28}+3.32\%$
test_compile_indexing[slice-tensorclass-compile] 0.1389ms 96.2666μs 10.3878 KOps/s 10.2532 KOps/s $\color{#35bf28}+1.31\%$
test_compile_indexing[slice-tensorclass-eager] 50.9610μs 15.8774μs 62.9827 KOps/s 60.9758 KOps/s $\color{#35bf28}+3.29\%$
test_compile_indexing[slice-pytree-compile] 0.1425ms 0.1017ms 9.8322 KOps/s 10.2508 KOps/s $\color{#d91a1a}-4.08\%$
test_compile_indexing[slice-pytree-eager] 49.4810μs 16.5231μs 60.5215 KOps/s 61.3113 KOps/s $\color{#d91a1a}-1.29\%$
test_compile_indexing[int-tensordict-compile] 0.1622ms 0.1060ms 9.4318 KOps/s 9.4169 KOps/s $\color{#35bf28}+0.16\%$
test_compile_indexing[int-tensordict-eager] 0.5849ms 16.9792μs 58.8955 KOps/s 56.5328 KOps/s $\color{#35bf28}+4.18\%$
test_compile_indexing[int-tensorclass-compile] 0.1484ms 0.1014ms 9.8591 KOps/s 10.1993 KOps/s $\color{#d91a1a}-3.34\%$
test_compile_indexing[int-tensorclass-eager] 61.3510μs 16.9517μs 58.9912 KOps/s 61.5631 KOps/s $\color{#d91a1a}-4.18\%$
test_compile_indexing[int-pytree-compile] 0.1572ms 0.1022ms 9.7863 KOps/s 9.7675 KOps/s $\color{#35bf28}+0.19\%$
test_compile_indexing[int-pytree-eager] 53.6110μs 16.5064μs 60.5826 KOps/s 61.1322 KOps/s $\color{#d91a1a}-0.90\%$
test_mod_add[eager] 0.1709ms 40.0984μs 24.9386 KOps/s 25.6218 KOps/s $\color{#d91a1a}-2.67\%$
test_mod_add[compile] 0.1311ms 85.2523μs 11.7299 KOps/s 10.3918 KOps/s $\textbf{\color{#35bf28}+12.88\%}$
test_mod_add[compile-overhead] 0.3365ms 0.1748ms 5.7206 KOps/s 5.7003 KOps/s $\color{#35bf28}+0.36\%$
test_mod_wrap[eager] 0.3497ms 0.2464ms 4.0586 KOps/s 3.9487 KOps/s $\color{#35bf28}+2.78\%$
test_mod_wrap[compile] 0.3386ms 0.2796ms 3.5768 KOps/s 3.5124 KOps/s $\color{#35bf28}+1.84\%$
test_mod_wrap[compile-overhead] 7.1527ms 3.6855ms 271.3313 Ops/s 268.5897 Ops/s $\color{#35bf28}+1.02\%$
test_mod_wrap_and_backward[eager] 1.6043ms 1.3656ms 732.2906 Ops/s 686.8014 Ops/s $\textbf{\color{#35bf28}+6.62\%}$
test_mod_wrap_and_backward[compile] 1.3860ms 1.2677ms 788.8409 Ops/s 729.2499 Ops/s $\textbf{\color{#35bf28}+8.17\%}$
test_mod_wrap_and_backward[compile-overhead] 1.4189ms 0.9295ms 1.0759 KOps/s 938.6712 Ops/s $\textbf{\color{#35bf28}+14.61\%}$
test_seq_add[eager] 0.1742ms 0.1165ms 8.5866 KOps/s 8.3559 KOps/s $\color{#35bf28}+2.76\%$
test_seq_add[compile] 0.1275ms 86.8857μs 11.5094 KOps/s 11.4642 KOps/s $\color{#35bf28}+0.39\%$
test_seq_add[compile-overhead] 0.1763ms 0.1297ms 7.7110 KOps/s 7.7452 KOps/s $\color{#d91a1a}-0.44\%$
test_seq_wrap[eager] 0.5037ms 0.4352ms 2.2978 KOps/s 2.2283 KOps/s $\color{#35bf28}+3.12\%$
test_seq_wrap[compile] 0.3637ms 0.3013ms 3.3188 KOps/s 3.1819 KOps/s $\color{#35bf28}+4.30\%$
test_seq_wrap[compile-overhead] 0.3286ms 0.2254ms 4.4370 KOps/s 4.3771 KOps/s $\color{#35bf28}+1.37\%$
test_func_call_runtime[False-eager] 0.9327ms 0.7815ms 1.2795 KOps/s 1.2750 KOps/s $\color{#35bf28}+0.35\%$
test_func_call_runtime[False-compile] 0.7946ms 0.7370ms 1.3569 KOps/s 1.3378 KOps/s $\color{#35bf28}+1.43\%$
test_func_call_runtime[False-compile-overhead] 0.4358ms 0.3618ms 2.7642 KOps/s 2.7630 KOps/s $\color{#35bf28}+0.04\%$
test_func_call_runtime[True-eager] 0.9778ms 0.9034ms 1.1069 KOps/s 1.0852 KOps/s $\color{#35bf28}+2.00\%$
test_func_call_runtime[True-compile] 0.8052ms 0.7586ms 1.3181 KOps/s 1.3070 KOps/s $\color{#35bf28}+0.85\%$
test_func_call_runtime[True-compile-overhead] 0.4670ms 0.3815ms 2.6214 KOps/s 2.6124 KOps/s $\color{#35bf28}+0.35\%$
test_func_call_cm_runtime[False-eager] 0.7918ms 0.7283ms 1.3731 KOps/s 1.3385 KOps/s $\color{#35bf28}+2.58\%$
test_func_call_cm_runtime[False-compile] 0.8264ms 0.7433ms 1.3454 KOps/s 1.3455 KOps/s $\color{#d91a1a}-0.01\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4134ms 0.3609ms 2.7707 KOps/s 2.7621 KOps/s $\color{#35bf28}+0.31\%$
test_func_call_cm_runtime[True-eager] 1.1019ms 1.0025ms 997.5190 Ops/s 983.4458 Ops/s $\color{#35bf28}+1.43\%$
test_func_call_cm_runtime[True-compile] 0.8542ms 0.7836ms 1.2761 KOps/s 1.2211 KOps/s $\color{#35bf28}+4.51\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4574ms 0.4043ms 2.4735 KOps/s 2.4319 KOps/s $\color{#35bf28}+1.71\%$
test_vmap_func_call_cm_runtime[eager] 2.6550ms 2.1045ms 475.1801 Ops/s 478.2714 Ops/s $\color{#d91a1a}-0.65\%$
test_vmap_func_call_cm_runtime[compile] 0.9111ms 0.8029ms 1.2454 KOps/s 1.2346 KOps/s $\color{#35bf28}+0.88\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4884ms 0.4085ms 2.4478 KOps/s 2.4285 KOps/s $\color{#35bf28}+0.79\%$
test_distributed 1.8869ms 0.1757ms 5.6921 KOps/s 8.5579 KOps/s $\textbf{\color{#d91a1a}-33.49\%}$
test_tdmodule 72.5510μs 19.7063μs 50.7453 KOps/s 48.9283 KOps/s $\color{#35bf28}+3.71\%$
test_tdmodule_dispatch 72.6010μs 34.7456μs 28.7807 KOps/s 26.9289 KOps/s $\textbf{\color{#35bf28}+6.88\%}$
test_tdseq 40.1400μs 20.7982μs 48.0810 KOps/s 46.3727 KOps/s $\color{#35bf28}+3.68\%$
test_tdseq_dispatch 59.7110μs 38.6774μs 25.8549 KOps/s 24.4206 KOps/s $\textbf{\color{#35bf28}+5.87\%}$
test_instantiation_functorch 1.6119ms 1.5235ms 656.3763 Ops/s 644.7839 Ops/s $\color{#35bf28}+1.80\%$
test_exec_functorch 0.1856ms 0.1445ms 6.9217 KOps/s 6.8479 KOps/s $\color{#35bf28}+1.08\%$
test_exec_functional_call 0.1873ms 0.1453ms 6.8801 KOps/s 7.1692 KOps/s $\color{#d91a1a}-4.03\%$
test_exec_td_decorator 0.4117ms 0.1887ms 5.2997 KOps/s 5.3242 KOps/s $\color{#d91a1a}-0.46\%$
test_vmap_mlp_speed_decorator[True-True] 0.7810ms 0.6892ms 1.4509 KOps/s 1.4619 KOps/s $\color{#d91a1a}-0.76\%$
test_vmap_mlp_speed_decorator[True-False] 0.7991ms 0.6988ms 1.4310 KOps/s 1.4590 KOps/s $\color{#d91a1a}-1.92\%$
test_vmap_mlp_speed_decorator[False-True] 0.7271ms 0.6171ms 1.6204 KOps/s 1.6919 KOps/s $\color{#d91a1a}-4.22\%$
test_vmap_mlp_speed_decorator[False-False] 0.7347ms 0.6212ms 1.6099 KOps/s 1.6842 KOps/s $\color{#d91a1a}-4.41\%$
test_vmap_transformer_speed_decorator[True-True] 19.8250ms 19.3728ms 51.6189 Ops/s 51.9006 Ops/s $\color{#d91a1a}-0.54\%$
test_vmap_transformer_speed_decorator[True-False] 20.0861ms 19.3750ms 51.6130 Ops/s 52.0240 Ops/s $\color{#d91a1a}-0.79\%$
test_vmap_transformer_speed_decorator[False-True] 20.1035ms 19.5923ms 51.0404 Ops/s 52.2746 Ops/s $\color{#d91a1a}-2.36\%$
test_vmap_transformer_speed_decorator[False-False] 19.9466ms 19.2363ms 51.9850 Ops/s 52.4007 Ops/s $\color{#d91a1a}-0.79\%$
test_to_module_speed[True] 1.0699ms 0.9646ms 1.0367 KOps/s 1.0290 KOps/s $\color{#35bf28}+0.76\%$
test_to_module_speed[False] 1.5323ms 0.9464ms 1.0567 KOps/s 1.0535 KOps/s $\color{#35bf28}+0.30\%$
test_tc_init 75.6810μs 37.6832μs 26.5370 KOps/s 25.0757 KOps/s $\textbf{\color{#35bf28}+5.83\%}$
test_tc_init_nested 0.1213ms 75.4768μs 13.2491 KOps/s 12.3181 KOps/s $\textbf{\color{#35bf28}+7.56\%}$
test_tc_first_layer_tensor 6.4257μs 0.7301μs 1.3697 MOps/s 1.3617 MOps/s $\color{#35bf28}+0.59\%$
test_tc_first_layer_nontensor 25.4710μs 2.2403μs 446.3652 KOps/s 437.7048 KOps/s $\color{#35bf28}+1.98\%$
test_tc_second_layer_tensor 10.3800μs 1.4721μs 679.2925 KOps/s 640.7627 KOps/s $\textbf{\color{#35bf28}+6.01\%}$
test_tc_second_layer_nontensor 95.8010μs 3.0332μs 329.6795 KOps/s 326.6603 KOps/s $\color{#35bf28}+0.92\%$
test_unbind 0.2190s 9.8141ms 101.8939 Ops/s 145.2682 Ops/s $\textbf{\color{#d91a1a}-29.86\%}$
test_full_like 9.2458ms 9.1007ms 109.8818 Ops/s 107.8879 Ops/s $\color{#35bf28}+1.85\%$
test_zeros_like 5.4516ms 4.3232ms 231.3127 Ops/s 137.2848 Ops/s $\textbf{\color{#35bf28}+68.49\%}$
test_ones_like 4.4214ms 4.3137ms 231.8178 Ops/s 231.8823 Ops/s $\color{#d91a1a}-0.03\%$
test_clone 6.4669ms 6.3566ms 157.3180 Ops/s 157.3798 Ops/s $\color{#d91a1a}-0.04\%$
test_squeeze 60.4310μs 9.4524μs 105.7936 KOps/s 106.1756 KOps/s $\color{#d91a1a}-0.36\%$
test_unsqueeze 0.1205ms 70.6683μs 14.1506 KOps/s 13.0179 KOps/s $\textbf{\color{#35bf28}+8.70\%}$
test_split 0.3867ms 0.1586ms 6.3055 KOps/s 6.0655 KOps/s $\color{#35bf28}+3.96\%$
test_permute 0.2307ms 0.1752ms 5.7080 KOps/s 5.4757 KOps/s $\color{#35bf28}+4.24\%$
test_stack 50.6073ms 50.4259ms 19.8311 Ops/s 19.8668 Ops/s $\color{#d91a1a}-0.18\%$
test_cat 50.7066ms 50.4660ms 19.8153 Ops/s 19.9065 Ops/s $\color{#d91a1a}-0.46\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants