[BE] TensorClass stub method check #1174

Merged (2 commits) on Jan 9, 2025

vmoens (Contributor) commented Jan 9, 2025

[ghstack-poisoned]
facebook-github-bot added the CLA Signed label Jan 9, 2025
github-actions bot commented Jan 9, 2025

⚠ Warning: Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: 3. Worsened: 40.

Detailed results:
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 47.4380μs 21.4015μs 46.7257 KOps/s 49.6161 KOps/s $\textbf{\color{#d91a1a}-5.83\%}$
test_plain_set_stack_nested 77.4450μs 21.4491μs 46.6220 KOps/s 49.3497 KOps/s $\textbf{\color{#d91a1a}-5.53\%}$
test_plain_set_nested_inplace 76.6230μs 23.7782μs 42.0553 KOps/s 45.7764 KOps/s $\textbf{\color{#d91a1a}-8.13\%}$
test_plain_set_stack_nested_inplace 56.9760μs 23.6154μs 42.3453 KOps/s 45.8826 KOps/s $\textbf{\color{#d91a1a}-7.71\%}$
test_items 39.2730μs 4.1966μs 238.2887 KOps/s 239.2115 KOps/s $\color{#d91a1a}-0.39\%$
test_items_nested 0.5904ms 0.4052ms 2.4679 KOps/s 2.5362 KOps/s $\color{#d91a1a}-2.69\%$
test_items_nested_locked 0.8351ms 0.4095ms 2.4421 KOps/s 2.5215 KOps/s $\color{#d91a1a}-3.15\%$
test_items_nested_leaf 0.1474ms 78.0380μs 12.8143 KOps/s 13.0584 KOps/s $\color{#d91a1a}-1.87\%$
test_items_stack_nested 0.7099ms 0.4129ms 2.4219 KOps/s 2.5225 KOps/s $\color{#d91a1a}-3.99\%$
test_items_stack_nested_leaf 0.1364ms 79.1647μs 12.6319 KOps/s 12.6807 KOps/s $\color{#d91a1a}-0.38\%$
test_items_stack_nested_locked 0.5644ms 0.4091ms 2.4441 KOps/s 2.5108 KOps/s $\color{#d91a1a}-2.66\%$
test_keys 40.2650μs 3.4845μs 286.9869 KOps/s 276.1071 KOps/s $\color{#35bf28}+3.94\%$
test_keys_nested 0.2612ms 0.1652ms 6.0543 KOps/s 6.1629 KOps/s $\color{#d91a1a}-1.76\%$
test_keys_nested_locked 0.7214ms 0.1707ms 5.8585 KOps/s 5.9392 KOps/s $\color{#d91a1a}-1.36\%$
test_keys_nested_leaf 0.2292ms 0.1443ms 6.9283 KOps/s 7.0651 KOps/s $\color{#d91a1a}-1.94\%$
test_keys_stack_nested 0.3000ms 0.1641ms 6.0937 KOps/s 6.2169 KOps/s $\color{#d91a1a}-1.98\%$
test_keys_stack_nested_leaf 0.2332ms 0.1437ms 6.9577 KOps/s 7.2293 KOps/s $\color{#d91a1a}-3.76\%$
test_keys_stack_nested_locked 0.2786ms 0.1707ms 5.8579 KOps/s 6.0485 KOps/s $\color{#d91a1a}-3.15\%$
test_values 7.8968μs 1.0379μs 963.4824 KOps/s 969.2192 KOps/s $\color{#d91a1a}-0.59\%$
test_values_nested 0.1179ms 63.2223μs 15.8172 KOps/s 16.2925 KOps/s $\color{#d91a1a}-2.92\%$
test_values_nested_locked 0.1084ms 62.1358μs 16.0938 KOps/s 16.4893 KOps/s $\color{#d91a1a}-2.40\%$
test_values_nested_leaf 0.1244ms 71.6901μs 13.9489 KOps/s 13.9757 KOps/s $\color{#d91a1a}-0.19\%$
test_values_stack_nested 0.1342ms 63.2297μs 15.8154 KOps/s 15.9227 KOps/s $\color{#d91a1a}-0.67\%$
test_values_stack_nested_leaf 0.1257ms 72.3124μs 13.8289 KOps/s 13.7696 KOps/s $\color{#35bf28}+0.43\%$
test_values_stack_nested_locked 0.1139ms 63.4596μs 15.7581 KOps/s 16.1076 KOps/s $\color{#d91a1a}-2.17\%$
test_membership 5.3284μs 0.7238μs 1.3816 MOps/s 1.1438 MOps/s $\textbf{\color{#35bf28}+20.78\%}$
test_membership_nested 25.2980μs 2.8939μs 345.5592 KOps/s 344.8477 KOps/s $\color{#35bf28}+0.21\%$
test_membership_nested_leaf 23.3540μs 2.9157μs 342.9736 KOps/s 347.0252 KOps/s $\color{#d91a1a}-1.17\%$
test_membership_stacked_nested 21.3590μs 2.9083μs 343.8396 KOps/s 349.5161 KOps/s $\color{#d91a1a}-1.62\%$
test_membership_stacked_nested_leaf 30.0160μs 2.8790μs 347.3447 KOps/s 345.7303 KOps/s $\color{#35bf28}+0.47\%$
test_membership_nested_last 32.7520μs 4.4048μs 227.0270 KOps/s 233.3854 KOps/s $\color{#d91a1a}-2.72\%$
test_membership_nested_leaf_last 23.5730μs 4.4248μs 226.0006 KOps/s 231.4922 KOps/s $\color{#d91a1a}-2.37\%$
test_membership_stacked_nested_last 30.1560μs 4.3450μs 230.1516 KOps/s 228.0018 KOps/s $\color{#35bf28}+0.94\%$
test_membership_stacked_nested_leaf_last 25.5180μs 4.3697μs 228.8478 KOps/s 229.7119 KOps/s $\color{#d91a1a}-0.38\%$
test_nested_getleaf 42.4290μs 10.7394μs 93.1155 KOps/s 94.4103 KOps/s $\color{#d91a1a}-1.37\%$
test_nested_get 31.9390μs 10.3129μs 96.9655 KOps/s 101.2444 KOps/s $\color{#d91a1a}-4.23\%$
test_stacked_getleaf 36.8480μs 10.8499μs 92.1671 KOps/s 94.1695 KOps/s $\color{#d91a1a}-2.13\%$
test_stacked_get 33.5430μs 10.3512μs 96.6068 KOps/s 99.7181 KOps/s $\color{#d91a1a}-3.12\%$
test_nested_getitemleaf 36.0880μs 11.1735μs 89.4971 KOps/s 90.0287 KOps/s $\color{#d91a1a}-0.59\%$
test_nested_getitem 32.7410μs 10.3895μs 96.2508 KOps/s 96.7498 KOps/s $\color{#d91a1a}-0.52\%$
test_stacked_getitemleaf 33.3120μs 11.3044μs 88.4614 KOps/s 89.0186 KOps/s $\color{#d91a1a}-0.63\%$
test_stacked_getitem 38.1910μs 10.5077μs 95.1679 KOps/s 96.8100 KOps/s $\color{#d91a1a}-1.70\%$
test_lock_nested 0.7972ms 0.4577ms 2.1849 KOps/s 1.8570 KOps/s $\textbf{\color{#35bf28}+17.66\%}$
test_lock_stack_nested 0.7115ms 0.4312ms 2.3189 KOps/s 2.4175 KOps/s $\color{#d91a1a}-4.08\%$
test_unlock_nested 0.6898ms 0.3732ms 2.6796 KOps/s 2.7290 KOps/s $\color{#d91a1a}-1.81\%$
test_unlock_stack_nested 0.5968ms 0.3484ms 2.8704 KOps/s 3.0011 KOps/s $\color{#d91a1a}-4.35\%$
test_flatten_speed 0.2132ms 0.1018ms 9.8271 KOps/s 10.1516 KOps/s $\color{#d91a1a}-3.20\%$
test_unflatten_speed 0.9084ms 0.5218ms 1.9166 KOps/s 1.9504 KOps/s $\color{#d91a1a}-1.73\%$
test_common_ops 1.7442ms 0.8287ms 1.2067 KOps/s 1.3157 KOps/s $\textbf{\color{#d91a1a}-8.28\%}$
test_creation 19.8380μs 2.5150μs 397.6207 KOps/s 403.8572 KOps/s $\color{#d91a1a}-1.54\%$
test_creation_empty 37.8710μs 13.4495μs 74.3521 KOps/s 91.1421 KOps/s $\textbf{\color{#d91a1a}-18.42\%}$
test_creation_nested_1 48.1100μs 16.5407μs 60.4568 KOps/s 72.9855 KOps/s $\textbf{\color{#d91a1a}-17.17\%}$
test_creation_nested_2 61.2950μs 21.4645μs 46.5885 KOps/s 54.7369 KOps/s $\textbf{\color{#d91a1a}-14.89\%}$
test_clone 1.3768ms 13.8550μs 72.1759 KOps/s 76.5450 KOps/s $\textbf{\color{#d91a1a}-5.71\%}$
test_getitem[int] 0.8208ms 13.2924μs 75.2308 KOps/s 78.1577 KOps/s $\color{#d91a1a}-3.74\%$
test_getitem[slice_int] 0.1369ms 25.1817μs 39.7113 KOps/s 40.2336 KOps/s $\color{#d91a1a}-1.30\%$
test_getitem[range] 0.1765ms 48.6599μs 20.5508 KOps/s 21.5294 KOps/s $\color{#d91a1a}-4.55\%$
test_getitem[tuple] 0.1305ms 21.2986μs 46.9515 KOps/s 48.9440 KOps/s $\color{#d91a1a}-4.07\%$
test_getitem[list] 0.1816ms 43.2079μs 23.1439 KOps/s 23.8982 KOps/s $\color{#d91a1a}-3.16\%$
test_setitem_dim[int] 44.8040μs 24.5265μs 40.7722 KOps/s 40.6030 KOps/s $\color{#35bf28}+0.42\%$
test_setitem_dim[slice_int] 96.4700μs 51.2435μs 19.5147 KOps/s 19.6723 KOps/s $\color{#d91a1a}-0.80\%$
test_setitem_dim[range] 0.1202ms 72.1405μs 13.8618 KOps/s 13.9816 KOps/s $\color{#d91a1a}-0.86\%$
test_setitem_dim[tuple] 75.6010μs 39.6593μs 25.2148 KOps/s 25.0529 KOps/s $\color{#35bf28}+0.65\%$
test_setitem 74.3790μs 22.1176μs 45.2129 KOps/s 52.0133 KOps/s $\textbf{\color{#d91a1a}-13.07\%}$
test_set 75.0000μs 21.4374μs 46.6474 KOps/s 53.0557 KOps/s $\textbf{\color{#d91a1a}-12.08\%}$
test_set_shared 1.2790ms 0.1691ms 5.9121 KOps/s 5.9609 KOps/s $\color{#d91a1a}-0.82\%$
test_update 0.1276ms 24.3884μs 41.0031 KOps/s 46.7085 KOps/s $\textbf{\color{#d91a1a}-12.22\%}$
test_update_nested 1.0652ms 34.3158μs 29.1411 KOps/s 31.9608 KOps/s $\textbf{\color{#d91a1a}-8.82\%}$
test_update__nested 88.3950μs 34.4645μs 29.0154 KOps/s 29.8554 KOps/s $\color{#d91a1a}-2.81\%$
test_set_nested 77.5550μs 23.5497μs 42.4634 KOps/s 48.4122 KOps/s $\textbf{\color{#d91a1a}-12.29\%}$
test_set_nested_new 88.2050μs 29.2143μs 34.2298 KOps/s 40.2276 KOps/s $\textbf{\color{#d91a1a}-14.91\%}$
test_select 0.1017ms 45.3424μs 22.0544 KOps/s 24.2813 KOps/s $\textbf{\color{#d91a1a}-9.17\%}$
test_select_nested 0.1437ms 66.3813μs 15.0645 KOps/s 15.9444 KOps/s $\textbf{\color{#d91a1a}-5.52\%}$
test_exclude_nested 0.1740ms 84.7332μs 11.8018 KOps/s 12.3385 KOps/s $\color{#d91a1a}-4.35\%$
test_empty[True] 0.5505ms 0.4171ms 2.3977 KOps/s 2.4762 KOps/s $\color{#d91a1a}-3.17\%$
test_empty[False] 6.8352μs 1.3574μs 736.7214 KOps/s 727.9972 KOps/s $\color{#35bf28}+1.20\%$
test_unbind_speed 0.3511ms 0.2711ms 3.6881 KOps/s 3.7980 KOps/s $\color{#d91a1a}-2.90\%$
test_unbind_speed_stack0 0.3751ms 0.2688ms 3.7198 KOps/s 3.8787 KOps/s $\color{#d91a1a}-4.10\%$
test_unbind_speed_stack1 94.1638ms 0.7899ms 1.2659 KOps/s 1.4189 KOps/s $\textbf{\color{#d91a1a}-10.78\%}$
test_split 97.6400ms 1.7846ms 560.3523 Ops/s 562.3446 Ops/s $\color{#d91a1a}-0.35\%$
test_chunk 97.4766ms 1.8043ms 554.2194 Ops/s 557.3825 Ops/s $\color{#d91a1a}-0.57\%$
test_consolidate_njt[False-None] 11.3821ms 8.2833ms 120.7255 Ops/s 123.8495 Ops/s $\color{#d91a1a}-2.52\%$
test_creation[device0] 0.2525ms 91.8676μs 10.8852 KOps/s 11.1122 KOps/s $\color{#d91a1a}-2.04\%$
test_creation_from_tensor 3.2993ms 96.0301μs 10.4134 KOps/s 10.6661 KOps/s $\color{#d91a1a}-2.37\%$
test_add_one[memmap_tensor0] 0.1633ms 4.8550μs 205.9752 KOps/s 207.5331 KOps/s $\color{#d91a1a}-0.75\%$
test_contiguous[memmap_tensor0] 12.4230μs 0.5163μs 1.9369 MOps/s 1.9335 MOps/s $\color{#35bf28}+0.17\%$
test_stack[memmap_tensor0] 39.4040μs 3.3067μs 302.4184 KOps/s 289.4768 KOps/s $\color{#35bf28}+4.47\%$
test_memmaptd_index 0.9164ms 0.2400ms 4.1667 KOps/s 4.1736 KOps/s $\color{#d91a1a}-0.17\%$
test_memmaptd_index_astensor 0.6618ms 0.3273ms 3.0556 KOps/s 3.0397 KOps/s $\color{#35bf28}+0.52\%$
test_memmaptd_index_op 0.9548ms 0.6126ms 1.6324 KOps/s 1.7041 KOps/s $\color{#d91a1a}-4.20\%$
test_serialize_model 0.1205s 0.1126s 8.8843 Ops/s 7.6311 Ops/s $\textbf{\color{#35bf28}+16.42\%}$
test_serialize_model_pickle 0.4554s 0.3881s 2.5767 Ops/s 2.5488 Ops/s $\color{#35bf28}+1.09\%$
test_serialize_weights 0.1209s 0.1142s 8.7576 Ops/s 8.8320 Ops/s $\color{#d91a1a}-0.84\%$
test_serialize_weights_returnearly 0.2571s 0.1774s 5.6364 Ops/s 6.3967 Ops/s $\textbf{\color{#d91a1a}-11.89\%}$
test_serialize_weights_pickle 0.5357s 0.4282s 2.3355 Ops/s 2.4377 Ops/s $\color{#d91a1a}-4.19\%$
test_serialize_weights_filesystem 0.1527s 0.1417s 7.0550 Ops/s 6.8480 Ops/s $\color{#35bf28}+3.02\%$
test_serialize_model_filesystem 0.1538s 0.1449s 6.9020 Ops/s 6.6680 Ops/s $\color{#35bf28}+3.51\%$
test_reshape_pytree 68.4380μs 26.9929μs 37.0468 KOps/s 37.8559 KOps/s $\color{#d91a1a}-2.14\%$
test_reshape_td 75.8610μs 33.4430μs 29.9016 KOps/s 30.5630 KOps/s $\color{#d91a1a}-2.16\%$
test_view_pytree 68.5500μs 27.0820μs 36.9249 KOps/s 38.2888 KOps/s $\color{#d91a1a}-3.56\%$
test_view_td 74.1280μs 38.0696μs 26.2677 KOps/s 27.1121 KOps/s $\color{#d91a1a}-3.11\%$
test_unbind_pytree 65.9730μs 30.3847μs 32.9113 KOps/s 33.9499 KOps/s $\color{#d91a1a}-3.06\%$
test_unbind_td 0.3045ms 39.8671μs 25.0834 KOps/s 25.5407 KOps/s $\color{#d91a1a}-1.79\%$
test_split_pytree 69.2500μs 30.7050μs 32.5680 KOps/s 34.0224 KOps/s $\color{#d91a1a}-4.27\%$
test_split_td 0.4692ms 46.0266μs 21.7266 KOps/s 21.9359 KOps/s $\color{#d91a1a}-0.95\%$
test_add_pytree 92.3120μs 36.9713μs 27.0480 KOps/s 28.8264 KOps/s $\textbf{\color{#d91a1a}-6.17\%}$
test_add_td 0.1660ms 59.4363μs 16.8247 KOps/s 18.8392 KOps/s $\textbf{\color{#d91a1a}-10.69\%}$
test_compile_add_one_nested[tensordict-compile] 0.1269ms 64.4846μs 15.5076 KOps/s 16.2509 KOps/s $\color{#d91a1a}-4.57\%$
test_compile_add_one_nested[tensordict-eager] 1.3788ms 0.1747ms 5.7231 KOps/s 5.8513 KOps/s $\color{#d91a1a}-2.19\%$
test_compile_add_one_nested[pytree-compile] 0.1097ms 46.6204μs 21.4498 KOps/s 22.3911 KOps/s $\color{#d91a1a}-4.20\%$
test_compile_add_one_nested[pytree-eager] 0.2263ms 0.1203ms 8.3157 KOps/s 8.6083 KOps/s $\color{#d91a1a}-3.40\%$
test_compile_copy_nested[tensordict-compile] 96.1020μs 27.6081μs 36.2212 KOps/s 39.0490 KOps/s $\textbf{\color{#d91a1a}-7.24\%}$
test_compile_copy_nested[tensordict-eager] 0.1238ms 59.2666μs 16.8729 KOps/s 16.9562 KOps/s $\color{#d91a1a}-0.49\%$
test_compile_copy_nested[pytree-compile] 0.1745ms 80.3302μs 12.4486 KOps/s 12.7856 KOps/s $\color{#d91a1a}-2.64\%$
test_compile_copy_nested[pytree-eager] 0.1503ms 69.5154μs 14.3853 KOps/s 14.9062 KOps/s $\color{#d91a1a}-3.49\%$
test_compile_add_one_flat[tensordict-compile] 0.2254ms 0.1071ms 9.3359 KOps/s 9.7384 KOps/s $\color{#d91a1a}-4.13\%$
test_compile_add_one_flat[tensordict-eager] 0.4506ms 0.2181ms 4.5858 KOps/s 4.6398 KOps/s $\color{#d91a1a}-1.17\%$
test_compile_add_one_flat[tensorclass-compile] 99.9770μs 45.6391μs 21.9111 KOps/s 23.6063 KOps/s $\textbf{\color{#d91a1a}-7.18\%}$
test_compile_add_one_flat[tensorclass-eager] 0.4905ms 67.5460μs 14.8047 KOps/s 15.1368 KOps/s $\color{#d91a1a}-2.19\%$
test_compile_add_one_flat[pytree-compile] 0.2228ms 0.1029ms 9.7137 KOps/s 9.8088 KOps/s $\color{#d91a1a}-0.97\%$
test_compile_add_one_flat[pytree-eager] 0.3549ms 0.2017ms 4.9574 KOps/s 5.0713 KOps/s $\color{#d91a1a}-2.24\%$
test_compile_add_self_flat[tensordict-eager] 0.3815ms 0.2366ms 4.2263 KOps/s 4.2137 KOps/s $\color{#35bf28}+0.30\%$
test_compile_add_self_flat[tensordict-compile] 0.1951ms 0.1074ms 9.3129 KOps/s 9.6532 KOps/s $\color{#d91a1a}-3.53\%$
test_compile_add_self_flat[tensorclass-eager] 0.1497ms 68.8086μs 14.5331 KOps/s 15.6167 KOps/s $\textbf{\color{#d91a1a}-6.94\%}$
test_compile_add_self_flat[tensorclass-compile] 0.1080ms 47.7186μs 20.9562 KOps/s 21.7095 KOps/s $\color{#d91a1a}-3.47\%$
test_compile_add_self_flat[pytree-eager] 0.3280ms 0.1630ms 6.1352 KOps/s 6.3892 KOps/s $\color{#d91a1a}-3.98\%$
test_compile_add_self_flat[pytree-compile] 0.2138ms 0.1024ms 9.7631 KOps/s 9.4781 KOps/s $\color{#35bf28}+3.01\%$
test_compile_copy_flat[tensordict-compile] 59.4710μs 21.8198μs 45.8300 KOps/s 46.6617 KOps/s $\color{#d91a1a}-1.78\%$
test_compile_copy_flat[tensordict-eager] 0.1647ms 69.4427μs 14.4004 KOps/s 15.1096 KOps/s $\color{#d91a1a}-4.69\%$
test_compile_copy_flat[pytree-compile] 0.1575ms 81.4561μs 12.2766 KOps/s 12.7649 KOps/s $\color{#d91a1a}-3.83\%$
test_compile_copy_flat[pytree-eager] 0.1492ms 68.9261μs 14.5083 KOps/s 14.5816 KOps/s $\color{#d91a1a}-0.50\%$
test_compile_assign_and_add[tensordict-compile] 0.3999ms 0.2071ms 4.8286 KOps/s 4.8885 KOps/s $\color{#d91a1a}-1.22\%$
test_compile_assign_and_add[tensordict-eager] 1.5562ms 1.3230ms 755.8743 Ops/s 769.9825 Ops/s $\color{#d91a1a}-1.83\%$
test_compile_assign_and_add[pytree-compile] 0.3267ms 0.2036ms 4.9124 KOps/s 4.8928 KOps/s $\color{#35bf28}+0.40\%$
test_compile_assign_and_add[pytree-eager] 0.9664ms 0.7714ms 1.2964 KOps/s 1.2957 KOps/s $\color{#35bf28}+0.05\%$
test_compile_assign_and_add_stack[compile] 0.5896ms 0.4503ms 2.2207 KOps/s 2.2348 KOps/s $\color{#d91a1a}-0.63\%$
test_compile_assign_and_add_stack[eager] 2.9398ms 2.7444ms 364.3729 Ops/s 386.9289 Ops/s $\textbf{\color{#d91a1a}-5.83\%}$
test_compile_indexing[tensor-tensordict-compile] 0.1279ms 36.1863μs 27.6348 KOps/s 28.2233 KOps/s $\color{#d91a1a}-2.09\%$
test_compile_indexing[tensor-tensordict-eager] 0.4925ms 33.8353μs 29.5549 KOps/s 30.6823 KOps/s $\color{#d91a1a}-3.67\%$
test_compile_indexing[tensor-tensorclass-compile] 76.5920μs 29.2609μs 34.1753 KOps/s 35.0984 KOps/s $\color{#d91a1a}-2.63\%$
test_compile_indexing[tensor-tensorclass-eager] 66.5840μs 23.2942μs 42.9292 KOps/s 44.3147 KOps/s $\color{#d91a1a}-3.13\%$
test_compile_indexing[tensor-pytree-compile] 69.0580μs 29.4407μs 33.9666 KOps/s 33.2241 KOps/s $\color{#35bf28}+2.23\%$
test_compile_indexing[tensor-pytree-eager] 73.8180μs 23.1663μs 43.1661 KOps/s 44.2538 KOps/s $\color{#d91a1a}-2.46\%$
test_compile_indexing[slice-tensordict-compile] 0.1177ms 52.3933μs 19.0864 KOps/s 19.6162 KOps/s $\color{#d91a1a}-2.70\%$
test_compile_indexing[slice-tensordict-eager] 0.5472ms 20.6039μs 48.5345 KOps/s 49.6388 KOps/s $\color{#d91a1a}-2.22\%$
test_compile_indexing[slice-tensorclass-compile] 90.4890μs 44.5429μs 22.4503 KOps/s 22.8648 KOps/s $\color{#d91a1a}-1.81\%$
test_compile_indexing[slice-tensorclass-eager] 57.1260μs 19.0302μs 52.5479 KOps/s 54.4912 KOps/s $\color{#d91a1a}-3.57\%$
test_compile_indexing[slice-pytree-compile] 0.1243ms 45.0494μs 22.1979 KOps/s 22.4582 KOps/s $\color{#d91a1a}-1.16\%$
test_compile_indexing[slice-pytree-eager] 52.6780μs 19.3021μs 51.8077 KOps/s 54.2601 KOps/s $\color{#d91a1a}-4.52\%$
test_compile_indexing[int-tensordict-compile] 0.1081ms 52.5680μs 19.0230 KOps/s 18.9562 KOps/s $\color{#35bf28}+0.35\%$
test_compile_indexing[int-tensordict-eager] 0.9948ms 20.3944μs 49.0332 KOps/s 49.7878 KOps/s $\color{#d91a1a}-1.52\%$
test_compile_indexing[int-tensorclass-compile] 0.1078ms 45.0412μs 22.2019 KOps/s 22.5061 KOps/s $\color{#d91a1a}-1.35\%$
test_compile_indexing[int-tensorclass-eager] 0.2868ms 19.2248μs 52.0163 KOps/s 54.1987 KOps/s $\color{#d91a1a}-4.03\%$
test_compile_indexing[int-pytree-compile] 99.3850μs 44.9065μs 22.2685 KOps/s 22.2967 KOps/s $\color{#d91a1a}-0.13\%$
test_compile_indexing[int-pytree-eager] 68.6380μs 19.1285μs 52.2780 KOps/s 53.3176 KOps/s $\color{#d91a1a}-1.95\%$
test_mod_add[eager] 74.3280μs 35.3653μs 28.2763 KOps/s 30.0758 KOps/s $\textbf{\color{#d91a1a}-5.98\%}$
test_mod_add[compile] 0.1122ms 48.9510μs 20.4286 KOps/s 20.8451 KOps/s $\color{#d91a1a}-2.00\%$
test_mod_add[compile-overhead] 96.6010μs 49.6721μs 20.1320 KOps/s 20.7445 KOps/s $\color{#d91a1a}-2.95\%$
test_mod_wrap[eager] 0.4203ms 0.2259ms 4.4275 KOps/s 4.6031 KOps/s $\color{#d91a1a}-3.82\%$
test_mod_wrap[compile] 0.3014ms 0.2077ms 4.8149 KOps/s 4.9218 KOps/s $\color{#d91a1a}-2.17\%$
test_mod_wrap[compile-overhead] 0.4402ms 0.2064ms 4.8441 KOps/s 4.9275 KOps/s $\color{#d91a1a}-1.69\%$
test_mod_wrap_and_backward[eager] 16.3471ms 11.5209ms 86.7991 Ops/s 94.8103 Ops/s $\textbf{\color{#d91a1a}-8.45\%}$
test_mod_wrap_and_backward[compile] 15.4413ms 12.5820ms 79.4789 Ops/s 80.9722 Ops/s $\color{#d91a1a}-1.84\%$
test_mod_wrap_and_backward[compile-overhead] 15.8466ms 12.3760ms 80.8015 Ops/s 78.7524 Ops/s $\color{#35bf28}+2.60\%$
test_seq_add[eager] 0.2755ms 0.1191ms 8.3947 KOps/s 8.8965 KOps/s $\textbf{\color{#d91a1a}-5.64\%}$
test_seq_add[compile] 0.1340ms 64.4732μs 15.5103 KOps/s 16.2601 KOps/s $\color{#d91a1a}-4.61\%$
test_seq_add[compile-overhead] 0.1233ms 62.9857μs 15.8766 KOps/s 16.7808 KOps/s $\textbf{\color{#d91a1a}-5.39\%}$
test_seq_wrap[eager] 0.6286ms 0.4537ms 2.2042 KOps/s 2.2957 KOps/s $\color{#d91a1a}-3.98\%$
test_seq_wrap[compile] 0.4257ms 0.2293ms 4.3614 KOps/s 4.3436 KOps/s $\color{#35bf28}+0.41\%$
test_seq_wrap[compile-overhead] 0.4254ms 0.2305ms 4.3375 KOps/s 4.3941 KOps/s $\color{#d91a1a}-1.29\%$
test_func_call_runtime[False-eager] 0.8140ms 0.5527ms 1.8093 KOps/s 1.8910 KOps/s $\color{#d91a1a}-4.32\%$
test_func_call_runtime[False-compile] 0.5659ms 0.4275ms 2.3393 KOps/s 2.3878 KOps/s $\color{#d91a1a}-2.03\%$
test_func_call_runtime[False-compile-overhead] 0.8931ms 0.4345ms 2.3013 KOps/s 2.3181 KOps/s $\color{#d91a1a}-0.73\%$
test_func_call_runtime[True-eager] 1.0335ms 0.7760ms 1.2887 KOps/s 1.3385 KOps/s $\color{#d91a1a}-3.72\%$
test_func_call_runtime[True-compile] 0.7720ms 0.4687ms 2.1334 KOps/s 2.2109 KOps/s $\color{#d91a1a}-3.50\%$
test_func_call_runtime[True-compile-overhead] 0.8841ms 0.4729ms 2.1145 KOps/s 2.2011 KOps/s $\color{#d91a1a}-3.93\%$
test_func_call_cm_runtime[False-eager] 0.6679ms 0.5548ms 1.8026 KOps/s 1.9188 KOps/s $\textbf{\color{#d91a1a}-6.06\%}$
test_func_call_cm_runtime[False-compile] 0.5132ms 0.4263ms 2.3457 KOps/s 2.4062 KOps/s $\color{#d91a1a}-2.52\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5191ms 0.4258ms 2.3487 KOps/s 2.3900 KOps/s $\color{#d91a1a}-1.73\%$
test_func_call_cm_runtime[True-eager] 1.4706ms 0.9260ms 1.0799 KOps/s 1.1404 KOps/s $\textbf{\color{#d91a1a}-5.30\%}$
test_func_call_cm_runtime[True-compile] 0.6915ms 0.4953ms 2.0188 KOps/s 2.0845 KOps/s $\color{#d91a1a}-3.15\%$
test_func_call_cm_runtime[True-compile-overhead] 0.9018ms 0.4960ms 2.0161 KOps/s 2.0953 KOps/s $\color{#d91a1a}-3.78\%$
test_vmap_func_call_cm_runtime[eager] 2.3595ms 1.8947ms 527.7982 Ops/s 531.7989 Ops/s $\color{#d91a1a}-0.75\%$
test_vmap_func_call_cm_runtime[compile] 1.0478ms 0.5190ms 1.9269 KOps/s 1.9341 KOps/s $\color{#d91a1a}-0.37\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.7556ms 0.5222ms 1.9148 KOps/s 1.9193 KOps/s $\color{#d91a1a}-0.23\%$
test_distributed 0.2607ms 0.1290ms 7.7492 KOps/s 7.9009 KOps/s $\color{#d91a1a}-1.92\%$
test_tdmodule 85.8000μs 27.7469μs 36.0401 KOps/s 38.3294 KOps/s $\textbf{\color{#d91a1a}-5.97\%}$
test_tdmodule_dispatch 94.5360μs 50.8953μs 19.6482 KOps/s 21.0171 KOps/s $\textbf{\color{#d91a1a}-6.51\%}$
test_tdseq 59.3810μs 31.0231μs 32.2341 KOps/s 33.3118 KOps/s $\color{#d91a1a}-3.24\%$
test_tdseq_dispatch 94.7960μs 57.1598μs 17.4948 KOps/s 18.2374 KOps/s $\color{#d91a1a}-4.07\%$
test_instantiation_functorch 1.7987ms 1.5517ms 644.4463 Ops/s 655.2836 Ops/s $\color{#d91a1a}-1.65\%$
test_exec_functorch 0.4483ms 0.1898ms 5.2697 KOps/s 5.6024 KOps/s $\textbf{\color{#d91a1a}-5.94\%}$
test_exec_functional_call 0.4090ms 0.1818ms 5.5008 KOps/s 5.8088 KOps/s $\textbf{\color{#d91a1a}-5.30\%}$
test_exec_td_decorator 0.4531ms 0.2445ms 4.0898 KOps/s 4.3811 KOps/s $\textbf{\color{#d91a1a}-6.65\%}$
test_vmap_mlp_speed_decorator[True-True] 1.0259ms 0.6622ms 1.5102 KOps/s 1.5415 KOps/s $\color{#d91a1a}-2.03\%$
test_vmap_mlp_speed_decorator[True-False] 0.9605ms 0.6619ms 1.5108 KOps/s 1.5532 KOps/s $\color{#d91a1a}-2.73\%$
test_vmap_mlp_speed_decorator[False-True] 0.8201ms 0.5303ms 1.8857 KOps/s 1.9356 KOps/s $\color{#d91a1a}-2.58\%$
test_vmap_mlp_speed_decorator[False-False] 0.7623ms 0.5295ms 1.8886 KOps/s 1.9369 KOps/s $\color{#d91a1a}-2.49\%$
test_to_module_speed[True] 2.2096ms 1.3738ms 727.9309 Ops/s 747.3703 Ops/s $\color{#d91a1a}-2.60\%$
test_to_module_speed[False] 1.7062ms 1.3439ms 744.1123 Ops/s 768.0111 Ops/s $\color{#d91a1a}-3.11\%$
test_tc_init 93.7640μs 49.8490μs 20.0606 KOps/s 21.6590 KOps/s $\textbf{\color{#d91a1a}-7.38\%}$
test_tc_init_nested 0.1693ms 0.1001ms 9.9949 KOps/s 11.1928 KOps/s $\textbf{\color{#d91a1a}-10.70\%}$
test_tc_first_layer_tensor 24.1350μs 1.4934μs 669.6313 KOps/s 661.4411 KOps/s $\color{#35bf28}+1.24\%$
test_tc_first_layer_nontensor 24.8770μs 4.7462μs 210.6951 KOps/s 218.6528 KOps/s $\color{#d91a1a}-3.64\%$
test_tc_second_layer_tensor 18.8950μs 2.8212μs 354.4581 KOps/s 357.8768 KOps/s $\color{#d91a1a}-0.96\%$
test_tc_second_layer_nontensor 24.6760μs 6.1016μs 163.8911 KOps/s 169.0527 KOps/s $\color{#d91a1a}-3.05\%$
test_unbind 0.2106s 12.9726ms 77.0855 Ops/s 80.8176 Ops/s $\color{#d91a1a}-4.62\%$
test_full_like 7.7038ms 6.9021ms 144.8838 Ops/s 146.1678 Ops/s $\color{#d91a1a}-0.88\%$
test_zeros_like 3.0117ms 2.6713ms 374.3545 Ops/s 366.6089 Ops/s $\color{#35bf28}+2.11\%$
test_ones_like 3.5745ms 3.0980ms 322.7940 Ops/s 321.3847 Ops/s $\color{#35bf28}+0.44\%$
test_clone 5.5171ms 4.8442ms 206.4325 Ops/s 207.0979 Ops/s $\color{#d91a1a}-0.32\%$
test_squeeze 58.9200μs 12.8880μs 77.5917 KOps/s 82.6628 KOps/s $\textbf{\color{#d91a1a}-6.13\%}$
test_unsqueeze 0.1868ms 95.4884μs 10.4725 KOps/s 11.0306 KOps/s $\textbf{\color{#d91a1a}-5.06\%}$
test_split 0.5545ms 0.1987ms 5.0315 KOps/s 5.1146 KOps/s $\color{#d91a1a}-1.62\%$
test_permute 0.3157ms 0.2093ms 4.7772 KOps/s 4.8293 KOps/s $\color{#d91a1a}-1.08\%$
test_stack 28.1191ms 24.0537ms 41.5737 Ops/s 40.4973 Ops/s $\color{#35bf28}+2.66\%$
test_cat 27.9462ms 23.9775ms 41.7058 Ops/s 41.1419 Ops/s $\color{#35bf28}+1.37\%$
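
The Change column in the CPU table above appears to be the relative difference between the Ops value for this PR and the Ops value on repo HEAD. The sketch below checks that reading against three rows copied from the table. The function name `relative_change` is hypothetical and is not part of the benchmark tooling, and small differences in the last digit are expected because the table shows rounded values.

```python
def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percent change in throughput of the PR relative to repo HEAD.

    Assumed formula (inferred from the table, not from the tooling):
        (ops_pr / ops_head - 1) * 100
    """
    if ops_head <= 0:
        raise ValueError("ops_head must be positive")
    return (ops_pr / ops_head - 1.0) * 100.0


# (Ops, Ops on Repo HEAD) pairs copied from the CPU table, both in KOps/s.
rows = {
    "test_plain_set_nested": (46.7257, 49.6161),
    "test_creation_empty": (74.3521, 91.1421),
    "test_keys": (286.9869, 276.1071),
}

changes = {name: relative_change(pr, head) for name, (pr, head) in rows.items()}

for name, pct in changes.items():
    # Negative values mean the PR is slower than HEAD for that benchmark.
    print(f"{name}: {pct:+.2f}%")
```

Both operands must be in the same unit (for example, both KOps/s) before the ratio is taken; rows that mix Ops/s and KOps/s would need converting first.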

github-actions bot commented Jan 9, 2025

⚠ Warning: Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: 60. Worsened: 7.

Detailed results:
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 51.2410μs 11.3901μs 87.7958 KOps/s 74.9412 KOps/s $\textbf{\color{#35bf28}+17.15\%}$
test_plain_set_stack_nested 50.9910μs 11.6182μs 86.0716 KOps/s 74.0522 KOps/s $\textbf{\color{#35bf28}+16.23\%}$
test_plain_set_nested_inplace 40.4110μs 12.5977μs 79.3795 KOps/s 69.1061 KOps/s $\textbf{\color{#35bf28}+14.87\%}$
test_plain_set_stack_nested_inplace 41.8710μs 12.5783μs 79.5018 KOps/s 69.0149 KOps/s $\textbf{\color{#35bf28}+15.20\%}$
test_items 34.2200μs 2.9089μs 343.7744 KOps/s 334.7314 KOps/s $\color{#35bf28}+2.70\%$
test_items_nested 0.3993ms 0.3601ms 2.7771 KOps/s 2.8272 KOps/s $\color{#d91a1a}-1.77\%$
test_items_nested_locked 0.4020ms 0.3613ms 2.7678 KOps/s 2.8161 KOps/s $\color{#d91a1a}-1.71\%$
test_items_nested_leaf 84.8710μs 58.1056μs 17.2100 KOps/s 17.2384 KOps/s $\color{#d91a1a}-0.16\%$
test_items_stack_nested 0.4007ms 0.3631ms 2.7543 KOps/s 2.8301 KOps/s $\color{#d91a1a}-2.68\%$
test_items_stack_nested_leaf 83.5820μs 59.1667μs 16.9014 KOps/s 16.5065 KOps/s $\color{#35bf28}+2.39\%$
test_items_stack_nested_locked 0.3915ms 0.3604ms 2.7743 KOps/s 2.7730 KOps/s $\color{#35bf28}+0.05\%$
test_keys 26.5610μs 3.4707μs 288.1282 KOps/s 287.7319 KOps/s $\color{#35bf28}+0.14\%$
test_keys_nested 0.1077ms 81.4496μs 12.2775 KOps/s 12.2149 KOps/s $\color{#35bf28}+0.51\%$
test_keys_nested_locked 0.7686ms 87.4425μs 11.4361 KOps/s 11.3565 KOps/s $\color{#35bf28}+0.70\%$
test_keys_nested_leaf 0.1072ms 72.5706μs 13.7797 KOps/s 13.7909 KOps/s $\color{#d91a1a}-0.08\%$
test_keys_stack_nested 0.1141ms 81.9296μs 12.2056 KOps/s 11.9734 KOps/s $\color{#35bf28}+1.94\%$
test_keys_stack_nested_leaf 0.1057ms 73.3212μs 13.6386 KOps/s 13.4472 KOps/s $\color{#35bf28}+1.42\%$
test_keys_stack_nested_locked 0.1278ms 87.7609μs 11.3946 KOps/s 11.0968 KOps/s $\color{#35bf28}+2.68\%$
test_values 8.0335μs 0.8518μs 1.1740 MOps/s 1.1750 MOps/s $\color{#d91a1a}-0.08\%$
test_values_nested 0.1194ms 34.4298μs 29.0446 KOps/s 29.0547 KOps/s $\color{#d91a1a}-0.03\%$
test_values_nested_locked 60.5810μs 36.0736μs 27.7211 KOps/s 27.8276 KOps/s $\color{#d91a1a}-0.38\%$
test_values_nested_leaf 78.2910μs 39.1714μs 25.5288 KOps/s 25.9771 KOps/s $\color{#d91a1a}-1.73\%$
test_values_stack_nested 76.9610μs 34.8549μs 28.6904 KOps/s 28.5162 KOps/s $\color{#35bf28}+0.61\%$
test_values_stack_nested_leaf 72.8210μs 39.4899μs 25.3229 KOps/s 25.5593 KOps/s $\color{#d91a1a}-0.92\%$
test_values_stack_nested_locked 61.6910μs 36.4452μs 27.4385 KOps/s 27.1329 KOps/s $\color{#35bf28}+1.13\%$
test_membership 1.6560μs 0.5077μs 1.9696 MOps/s 1.9517 MOps/s $\color{#35bf28}+0.92\%$
test_membership_nested 16.5805μs 2.0431μs 489.4571 KOps/s 478.0138 KOps/s $\color{#35bf28}+2.39\%$
test_membership_nested_leaf 14.8600μs 2.0791μs 480.9676 KOps/s 493.0967 KOps/s $\color{#d91a1a}-2.46\%$
test_membership_stacked_nested 39.8310μs 2.1099μs 473.9570 KOps/s 475.3271 KOps/s $\color{#d91a1a}-0.29\%$
test_membership_stacked_nested_leaf 34.3110μs 2.0992μs 476.3774 KOps/s 473.9294 KOps/s $\color{#35bf28}+0.52\%$
test_membership_nested_last 40.5610μs 3.1003μs 322.5521 KOps/s 317.3517 KOps/s $\color{#35bf28}+1.64\%$
test_membership_nested_leaf_last 31.9610μs 3.1200μs 320.5119 KOps/s 317.3139 KOps/s $\color{#35bf28}+1.01\%$
test_membership_stacked_nested_last 44.1410μs 3.1664μs 315.8124 KOps/s 275.1589 KOps/s $\textbf{\color{#35bf28}+14.77\%}$
test_membership_stacked_nested_leaf_last 34.8710μs 3.1425μs 318.2209 KOps/s 277.9926 KOps/s $\textbf{\color{#35bf28}+14.47\%}$
test_nested_getleaf 31.7210μs 6.1736μs 161.9793 KOps/s 160.3302 KOps/s $\color{#35bf28}+1.03\%$
test_nested_get 61.4620μs 5.8435μs 171.1310 KOps/s 170.9814 KOps/s $\color{#35bf28}+0.09\%$
test_stacked_getleaf 32.4010μs 6.1803μs 161.8032 KOps/s 162.5764 KOps/s $\color{#d91a1a}-0.48\%$
test_stacked_get 27.2200μs 5.8356μs 171.3631 KOps/s 171.4996 KOps/s $\color{#d91a1a}-0.08\%$
test_nested_getitemleaf 28.6100μs 6.2449μs 160.1315 KOps/s 158.7396 KOps/s $\color{#35bf28}+0.88\%$
test_nested_getitem 30.8210μs 5.9663μs 167.6079 KOps/s 168.6622 KOps/s $\color{#d91a1a}-0.63\%$
test_stacked_getitemleaf 35.3410μs 6.2044μs 161.1755 KOps/s 160.4387 KOps/s $\color{#35bf28}+0.46\%$
test_stacked_getitem 32.1110μs 5.8946μs 169.6460 KOps/s 168.7471 KOps/s $\color{#35bf28}+0.53\%$
test_lock_nested 0.7618ms 0.3730ms 2.6811 KOps/s 2.5979 KOps/s $\color{#35bf28}+3.20\%$
test_lock_stack_nested 0.3726ms 0.3443ms 2.9041 KOps/s 2.7985 KOps/s $\color{#35bf28}+3.77\%$
test_unlock_nested 0.6060ms 0.3127ms 3.1982 KOps/s 3.0457 KOps/s $\textbf{\color{#35bf28}+5.01\%}$
test_unlock_stack_nested 0.3135ms 0.2811ms 3.5573 KOps/s 3.3894 KOps/s $\color{#35bf28}+4.95\%$
test_flatten_speed 0.1305ms 76.1718μs 13.1282 KOps/s 13.1445 KOps/s $\color{#d91a1a}-0.12\%$
test_unflatten_speed 0.3515ms 0.3197ms 3.1275 KOps/s 3.0878 KOps/s $\color{#35bf28}+1.29\%$
test_common_ops 95.2309ms 0.6562ms 1.5238 KOps/s 1.4885 KOps/s $\color{#35bf28}+2.37\%$
test_creation 0.1024ms 1.7413μs 574.2704 KOps/s 565.7573 KOps/s $\color{#35bf28}+1.50\%$
test_creation_empty 34.5500μs 6.9486μs 143.9140 KOps/s 93.0659 KOps/s $\textbf{\color{#35bf28}+54.64\%}$
test_creation_nested_1 31.2300μs 8.6117μs 116.1204 KOps/s 80.6591 KOps/s $\textbf{\color{#35bf28}+43.96\%}$
test_creation_nested_2 34.3010μs 11.3114μs 88.4061 KOps/s 65.9481 KOps/s $\textbf{\color{#35bf28}+34.05\%}$
test_clone 75.8120μs 10.4065μs 96.0938 KOps/s 89.2401 KOps/s $\textbf{\color{#35bf28}+7.68\%}$
test_getitem[int] 2.2641ms 10.6176μs 94.1829 KOps/s 87.6795 KOps/s $\textbf{\color{#35bf28}+7.42\%}$
test_getitem[slice_int] 0.1092ms 20.3107μs 49.2351 KOps/s 44.6123 KOps/s $\textbf{\color{#35bf28}+10.36\%}$
test_getitem[range] 0.1339ms 38.0726μs 26.2656 KOps/s 25.2452 KOps/s $\color{#35bf28}+4.04\%$
test_getitem[tuple] 0.1066ms 18.0622μs 55.3642 KOps/s 50.3992 KOps/s $\textbf{\color{#35bf28}+9.85\%}$
test_getitem[list] 0.2043ms 34.6095μs 28.8938 KOps/s 28.5676 KOps/s $\color{#35bf28}+1.14\%$
test_setitem_dim[int] 52.0410μs 20.2068μs 49.4882 KOps/s 48.0335 KOps/s $\color{#35bf28}+3.03\%$
test_setitem_dim[slice_int] 75.1920μs 38.6150μs 25.8966 KOps/s 24.9038 KOps/s $\color{#35bf28}+3.99\%$
test_setitem_dim[range] 85.3310μs 53.2256μs 18.7879 KOps/s 18.1487 KOps/s $\color{#35bf28}+3.52\%$
test_setitem_dim[tuple] 51.4810μs 32.4804μs 30.7878 KOps/s 28.9878 KOps/s $\textbf{\color{#35bf28}+6.21\%}$
test_setitem 83.0420μs 14.0810μs 71.0178 KOps/s 57.7474 KOps/s $\textbf{\color{#35bf28}+22.98\%}$
test_set 84.6510μs 13.7445μs 72.7565 KOps/s 59.3364 KOps/s $\textbf{\color{#35bf28}+22.62\%}$
test_set_shared 1.6713ms 0.1516ms 6.5941 KOps/s 6.5254 KOps/s $\color{#35bf28}+1.05\%$
test_update 0.4980ms 15.8051μs 63.2707 KOps/s 47.7210 KOps/s $\textbf{\color{#35bf28}+32.58\%}$
test_update_nested 81.3520μs 21.4688μs 46.5793 KOps/s 37.3340 KOps/s $\textbf{\color{#35bf28}+24.76\%}$
test_update__nested 0.4967ms 25.2294μs 39.6363 KOps/s 37.9863 KOps/s $\color{#35bf28}+4.34\%$
test_set_nested 89.4620μs 14.8902μs 67.1585 KOps/s 54.6283 KOps/s $\textbf{\color{#35bf28}+22.94\%}$
test_set_nested_new 78.9710μs 16.9519μs 58.9905 KOps/s 48.5632 KOps/s $\textbf{\color{#35bf28}+21.47\%}$
test_select 0.2253ms 29.6654μs 33.7093 KOps/s 30.9766 KOps/s $\textbf{\color{#35bf28}+8.82\%}$
test_select_nested 78.7120μs 43.4638μs 23.0077 KOps/s 22.6067 KOps/s $\color{#35bf28}+1.77\%$
test_exclude_nested 96.4720μs 62.6069μs 15.9727 KOps/s 15.7629 KOps/s $\color{#35bf28}+1.33\%$
test_empty[True] 0.3559ms 0.2863ms 3.4924 KOps/s 3.4242 KOps/s $\color{#35bf28}+1.99\%$
test_empty[False] 4.4081μs 0.8228μs 1.2153 MOps/s 1.2167 MOps/s $\color{#d91a1a}-0.12\%$
test_to 88.3320μs 56.6154μs 17.6630 KOps/s 17.7518 KOps/s $\color{#d91a1a}-0.50\%$
test_to_nonblocking 0.1137ms 51.2979μs 19.4940 KOps/s 20.4499 KOps/s $\color{#d91a1a}-4.67\%$
test_unbind_speed 0.2853ms 0.2297ms 4.3536 KOps/s 3.9943 KOps/s $\textbf{\color{#35bf28}+9.00\%}$
test_unbind_speed_stack0 0.2780ms 0.2354ms 4.2477 KOps/s 4.0004 KOps/s $\textbf{\color{#35bf28}+6.18\%}$
test_unbind_speed_stack1 94.7563ms 0.6663ms 1.5008 KOps/s 1.4566 KOps/s $\color{#35bf28}+3.03\%$
test_split 96.4302ms 1.7274ms 578.8950 Ops/s 610.1147 Ops/s $\textbf{\color{#d91a1a}-5.12\%}$
test_chunk 1.5537ms 1.4484ms 690.4182 Ops/s 608.0736 Ops/s $\textbf{\color{#35bf28}+13.54\%}$
test_consolidate[False-None] 98.9588ms 2.9460ms 339.4469 Ops/s 326.1982 Ops/s $\color{#35bf28}+4.06\%$
test_consolidate[default-None] 1.7630ms 1.6804ms 595.0906 Ops/s 569.7617 Ops/s $\color{#35bf28}+4.45\%$
test_consolidate[reduce-overhead-None] 1.8197ms 1.7445ms 573.2326 Ops/s 562.8446 Ops/s $\color{#35bf28}+1.85\%$
test_consolidate_njt[False-None] 6.7091ms 6.6173ms 151.1201 Ops/s 148.8000 Ops/s $\color{#35bf28}+1.56\%$
test_to[False-False-None] 1.8126ms 1.7161ms 582.7267 Ops/s 582.0306 Ops/s $\color{#35bf28}+0.12\%$
test_to[True-False-None] 1.5832ms 1.3658ms 732.1473 Ops/s 723.8304 Ops/s $\color{#35bf28}+1.15\%$
test_to[within-False-None] 4.4163ms 4.1696ms 239.8283 Ops/s 233.4004 Ops/s $\color{#35bf28}+2.75\%$
test_to[True-default-None] 5.7947ms 5.6008ms 178.5460 Ops/s 182.5904 Ops/s $\color{#d91a1a}-2.21\%$
test_to_njt[False-False-None] 7.4023ms 7.2560ms 137.8174 Ops/s 140.9465 Ops/s $\color{#d91a1a}-2.22\%$
test_to_njt[True-False-None] 5.9809ms 5.6305ms 177.6043 Ops/s 177.4001 Ops/s $\color{#35bf28}+0.12\%$
test_to_njt[within-False-None] 12.8470ms 12.2884ms 81.3775 Ops/s 79.8251 Ops/s $\color{#35bf28}+1.94\%$
test_creation[device0] 0.3786ms 82.9629μs 12.0536 KOps/s 12.1123 KOps/s $\color{#d91a1a}-0.49\%$
test_creation_from_tensor 0.6814ms 87.0462μs 11.4881 KOps/s 11.2524 KOps/s $\color{#35bf28}+2.10\%$
test_add_one[memmap_tensor0] 0.3137ms 6.7145μs 148.9307 KOps/s 139.4368 KOps/s $\textbf{\color{#35bf28}+6.81\%}$
test_contiguous[memmap_tensor0] 1.7335μs 0.4109μs 2.4338 MOps/s 2.3988 MOps/s $\color{#35bf28}+1.46\%$
test_stack[memmap_tensor0] 31.3410μs 4.4811μs 223.1571 KOps/s 205.8992 KOps/s $\textbf{\color{#35bf28}+8.38\%}$
test_memmaptd_index 1.5587ms 0.2528ms 3.9564 KOps/s 3.7432 KOps/s $\textbf{\color{#35bf28}+5.70\%}$
test_memmaptd_index_astensor 0.6121ms 0.3126ms 3.1993 KOps/s 3.0574 KOps/s $\color{#35bf28}+4.64\%$
test_memmaptd_index_op 0.9760ms 0.5667ms 1.7647 KOps/s 1.5297 KOps/s $\textbf{\color{#35bf28}+15.36\%}$
test_serialize_model 0.1311s 0.1305s 7.6628 Ops/s 7.6618 Ops/s $\color{#35bf28}+0.01\%$
test_serialize_model_pickle 1.3465s 1.2130s 0.8244 Ops/s 0.8220 Ops/s $\color{#35bf28}+0.30\%$
test_serialize_weights 0.1317s 0.1304s 7.6685 Ops/s 7.6722 Ops/s $\color{#d91a1a}-0.05\%$
test_serialize_weights_returnearly 0.5486s 74.5308ms 13.4173 Ops/s 13.6022 Ops/s $\color{#d91a1a}-1.36\%$
test_serialize_weights_pickle 1.3767s 1.2167s 0.8219 Ops/s 0.8213 Ops/s $\color{#35bf28}+0.08\%$
test_reshape_pytree 51.9910μs 22.2425μs 44.9590 KOps/s 42.4182 KOps/s $\textbf{\color{#35bf28}+5.99\%}$
test_reshape_td 60.7210μs 27.3423μs 36.5733 KOps/s 33.7417 KOps/s $\textbf{\color{#35bf28}+8.39\%}$
test_view_pytree 78.3020μs 22.0398μs 45.3725 KOps/s 43.9795 KOps/s $\color{#35bf28}+3.17\%$
test_view_td 67.7410μs 31.7098μs 31.5360 KOps/s 29.8990 KOps/s $\textbf{\color{#35bf28}+5.48\%}$
test_unbind_pytree 61.2910μs 27.9173μs 35.8201 KOps/s 34.8907 KOps/s $\color{#35bf28}+2.66\%$
test_unbind_td 0.8457ms 36.8378μs 27.1460 KOps/s 26.3660 KOps/s $\color{#35bf28}+2.96\%$
test_split_pytree 0.1101ms 29.5375μs 33.8553 KOps/s 32.6649 KOps/s $\color{#35bf28}+3.64\%$
test_split_td 1.0673ms 38.0796μs 26.2608 KOps/s 24.2839 KOps/s $\textbf{\color{#35bf28}+8.14\%}$
test_add_pytree 78.1320μs 34.0604μs 29.3596 KOps/s 27.4677 KOps/s $\textbf{\color{#35bf28}+6.89\%}$
test_add_td 95.9320μs 45.2812μs 22.0842 KOps/s 18.0157 KOps/s $\textbf{\color{#35bf28}+22.58\%}$
test_compile_add_one_nested[tensordict-compile] 0.1738ms 0.1202ms 8.3166 KOps/s 7.9771 KOps/s $\color{#35bf28}+4.26\%$
test_compile_add_one_nested[tensordict-eager] 0.2222ms 0.1330ms 7.5165 KOps/s 7.4779 KOps/s $\color{#35bf28}+0.52\%$
test_compile_add_one_nested[pytree-compile] 0.2387ms 94.5861μs 10.5724 KOps/s 10.0908 KOps/s $\color{#35bf28}+4.77\%$
test_compile_add_one_nested[pytree-eager] 1.9321ms 0.1484ms 6.7390 KOps/s 6.4216 KOps/s $\color{#35bf28}+4.94\%$
test_compile_copy_nested[tensordict-compile] 73.2510μs 22.7796μs 43.8989 KOps/s 42.2244 KOps/s $\color{#35bf28}+3.97\%$
test_compile_copy_nested[tensordict-eager] 77.6020μs 28.9966μs 34.4868 KOps/s 33.7056 KOps/s $\color{#35bf28}+2.32\%$
test_compile_copy_nested[pytree-compile] 0.2287ms 62.6954μs 15.9501 KOps/s 15.0957 KOps/s $\textbf{\color{#35bf28}+5.66\%}$
test_compile_copy_nested[pytree-eager] 87.0520μs 49.1228μs 20.3572 KOps/s 20.0388 KOps/s $\color{#35bf28}+1.59\%$
test_compile_add_one_flat[tensordict-compile] 0.2011ms 0.1417ms 7.0593 KOps/s 6.9659 KOps/s $\color{#35bf28}+1.34\%$
test_compile_add_one_flat[tensordict-eager] 0.3131ms 0.2194ms 4.5573 KOps/s 4.5901 KOps/s $\color{#d91a1a}-0.71\%$
test_compile_add_one_flat[tensorclass-compile] 0.1385ms 96.9870μs 10.3107 KOps/s 10.0443 KOps/s $\color{#35bf28}+2.65\%$
test_compile_add_one_flat[tensorclass-eager] 0.3512ms 56.3090μs 17.7592 KOps/s 17.6236 KOps/s $\color{#35bf28}+0.77\%$
test_compile_add_one_flat[pytree-compile] 0.1748ms 0.1352ms 7.3965 KOps/s 7.3123 KOps/s $\color{#35bf28}+1.15\%$
test_compile_add_one_flat[pytree-eager] 0.5153ms 0.4788ms 2.0888 KOps/s 1.9974 KOps/s $\color{#35bf28}+4.58\%$
test_compile_add_self_flat[tensordict-eager] 0.3838ms 0.2613ms 3.8270 KOps/s 3.8099 KOps/s $\color{#35bf28}+0.45\%$
test_compile_add_self_flat[tensordict-compile] 0.2785ms 0.1551ms 6.4493 KOps/s 7.0180 KOps/s $\textbf{\color{#d91a1a}-8.10\%}$
test_compile_add_self_flat[tensorclass-eager] 0.2109ms 73.1584μs 13.6690 KOps/s 14.1454 KOps/s $\color{#d91a1a}-3.37\%$
test_compile_add_self_flat[tensorclass-compile] 0.2463ms 0.1074ms 9.3074 KOps/s 9.9304 KOps/s $\textbf{\color{#d91a1a}-6.27\%}$
test_compile_add_self_flat[pytree-eager] 0.5259ms 0.4081ms 2.4502 KOps/s 2.4438 KOps/s $\color{#35bf28}+0.26\%$
test_compile_add_self_flat[pytree-compile] 0.2510ms 0.1430ms 6.9937 KOps/s 7.4254 KOps/s $\textbf{\color{#d91a1a}-5.81\%}$
test_compile_copy_flat[tensordict-compile] 0.1298ms 19.5838μs 51.0626 KOps/s 53.2676 KOps/s $\color{#d91a1a}-4.14\%$
test_compile_copy_flat[tensordict-eager] 0.1212ms 31.7081μs 31.5377 KOps/s 32.3901 KOps/s $\color{#d91a1a}-2.63\%$
test_compile_copy_flat[pytree-compile] 0.1793ms 71.7618μs 13.9350 KOps/s 14.2532 KOps/s $\color{#d91a1a}-2.23\%$
test_compile_copy_flat[pytree-eager] 0.1622ms 52.4102μs 19.0803 KOps/s 19.4518 KOps/s $\color{#d91a1a}-1.91\%$
test_compile_assign_and_add[tensordict-compile] 1.6850ms 0.4065ms 2.4599 KOps/s 2.2015 KOps/s $\textbf{\color{#35bf28}+11.73\%}$
test_compile_assign_and_add[tensordict-eager] 2.8758ms 2.6813ms 372.9533 Ops/s 371.6581 Ops/s $\color{#35bf28}+0.35\%$
test_compile_assign_and_add[pytree-compile] 1.6258ms 0.4369ms 2.2889 KOps/s 2.2593 KOps/s $\color{#35bf28}+1.31\%$
test_compile_assign_and_add[pytree-eager] 2.8500ms 2.6267ms 380.6993 Ops/s 364.6643 Ops/s $\color{#35bf28}+4.40\%$
test_compile_indexing[tensor-tensordict-compile] 0.1719ms 0.1136ms 8.8060 KOps/s 8.2666 KOps/s $\textbf{\color{#35bf28}+6.52\%}$
test_compile_indexing[tensor-tensordict-eager] 0.5686ms 78.0388μs 12.8141 KOps/s 11.6964 KOps/s $\textbf{\color{#35bf28}+9.56\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.4443ms 0.1107ms 9.0316 KOps/s 8.9287 KOps/s $\color{#35bf28}+1.15\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1319ms 70.4343μs 14.1976 KOps/s 13.6401 KOps/s $\color{#35bf28}+4.09\%$
test_compile_indexing[tensor-pytree-compile] 0.2046ms 0.1122ms 8.9147 KOps/s 8.7973 KOps/s $\color{#35bf28}+1.33\%$
test_compile_indexing[tensor-pytree-eager] 0.1228ms 71.1898μs 14.0469 KOps/s 13.6454 KOps/s $\color{#35bf28}+2.94\%$
test_compile_indexing[slice-tensordict-compile] 0.1555ms 0.1032ms 9.6897 KOps/s 9.7113 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_indexing[slice-tensordict-eager] 0.1481ms 17.3144μs 57.7555 KOps/s 44.8490 KOps/s $\textbf{\color{#35bf28}+28.78\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1291ms 95.6336μs 10.4566 KOps/s 9.6549 KOps/s $\textbf{\color{#35bf28}+8.30\%}$
test_compile_indexing[slice-tensorclass-eager] 62.8010μs 15.9352μs 62.7540 KOps/s 59.9834 KOps/s $\color{#35bf28}+4.62\%$
test_compile_indexing[slice-pytree-compile] 0.1710ms 95.8995μs 10.4276 KOps/s 9.6322 KOps/s $\textbf{\color{#35bf28}+8.26\%}$
test_compile_indexing[slice-pytree-eager] 0.1700ms 15.9151μs 62.8335 KOps/s 60.7321 KOps/s $\color{#35bf28}+3.46\%$
test_compile_indexing[int-tensordict-compile] 0.1572ms 0.1042ms 9.6008 KOps/s 9.4928 KOps/s $\color{#35bf28}+1.14\%$
test_compile_indexing[int-tensordict-eager] 0.6361ms 17.1517μs 58.3034 KOps/s 54.6113 KOps/s $\textbf{\color{#35bf28}+6.76\%}$
test_compile_indexing[int-tensorclass-compile] 0.1754ms 96.4181μs 10.3715 KOps/s 9.7832 KOps/s $\textbf{\color{#35bf28}+6.01\%}$
test_compile_indexing[int-tensorclass-eager] 82.8510μs 15.7856μs 63.3491 KOps/s 60.8174 KOps/s $\color{#35bf28}+4.16\%$
test_compile_indexing[int-pytree-compile] 0.2485ms 95.7865μs 10.4399 KOps/s 9.7014 KOps/s $\textbf{\color{#35bf28}+7.61\%}$
test_compile_indexing[int-pytree-eager] 80.4310μs 15.6154μs 64.0392 KOps/s 59.8874 KOps/s $\textbf{\color{#35bf28}+6.93\%}$
test_mod_add[eager] 0.1146ms 37.0823μs 26.9670 KOps/s 23.0502 KOps/s $\textbf{\color{#35bf28}+16.99\%}$
test_mod_add[compile] 0.3310ms 79.9230μs 12.5120 KOps/s 11.5759 KOps/s $\textbf{\color{#35bf28}+8.09\%}$
test_mod_add[compile-overhead] 0.3298ms 0.1678ms 5.9611 KOps/s 5.6296 KOps/s $\textbf{\color{#35bf28}+5.89\%}$
test_mod_wrap[eager] 0.3915ms 0.2511ms 3.9832 KOps/s 3.8920 KOps/s $\color{#35bf28}+2.34\%$
test_mod_wrap[compile] 0.6858ms 0.2859ms 3.4979 KOps/s 3.4120 KOps/s $\color{#35bf28}+2.52\%$
test_mod_wrap[compile-overhead] 7.1198ms 3.7329ms 267.8858 Ops/s 278.6147 Ops/s $\color{#d91a1a}-3.85\%$
test_mod_wrap_and_backward[eager] 1.4775ms 1.3680ms 730.9947 Ops/s 700.2438 Ops/s $\color{#35bf28}+4.39\%$
test_mod_wrap_and_backward[compile] 1.4219ms 1.2764ms 783.4603 Ops/s 770.9679 Ops/s $\color{#35bf28}+1.62\%$
test_mod_wrap_and_backward[compile-overhead] 1.4613ms 0.9685ms 1.0326 KOps/s 1.0405 KOps/s $\color{#d91a1a}-0.76\%$
test_seq_add[eager] 0.1821ms 0.1187ms 8.4278 KOps/s 8.0651 KOps/s $\color{#35bf28}+4.50\%$
test_seq_add[compile] 0.1357ms 89.1953μs 11.2113 KOps/s 11.0738 KOps/s $\color{#35bf28}+1.24\%$
test_seq_add[compile-overhead] 0.1747ms 0.1275ms 7.8403 KOps/s 7.6224 KOps/s $\color{#35bf28}+2.86\%$
test_seq_wrap[eager] 0.5376ms 0.4127ms 2.4231 KOps/s 2.2618 KOps/s $\textbf{\color{#35bf28}+7.13\%}$
test_seq_wrap[compile] 0.4044ms 0.3075ms 3.2522 KOps/s 3.2581 KOps/s $\color{#d91a1a}-0.18\%$
test_seq_wrap[compile-overhead] 0.3105ms 0.2298ms 4.3516 KOps/s 4.3871 KOps/s $\color{#d91a1a}-0.81\%$
test_func_call_runtime[False-eager] 0.9081ms 0.7845ms 1.2747 KOps/s 1.3216 KOps/s $\color{#d91a1a}-3.55\%$
test_func_call_runtime[False-compile] 1.1697ms 0.7535ms 1.3272 KOps/s 1.3116 KOps/s $\color{#35bf28}+1.19\%$
test_func_call_runtime[False-compile-overhead] 0.4144ms 0.3686ms 2.7126 KOps/s 2.7077 KOps/s $\color{#35bf28}+0.18\%$
test_func_call_runtime[True-eager] 1.3429ms 0.9089ms 1.1002 KOps/s 1.0829 KOps/s $\color{#35bf28}+1.60\%$
test_func_call_runtime[True-compile] 1.1926ms 0.7716ms 1.2960 KOps/s 1.2804 KOps/s $\color{#35bf28}+1.22\%$
test_func_call_runtime[True-compile-overhead] 0.8049ms 0.3874ms 2.5811 KOps/s 2.5278 KOps/s $\color{#35bf28}+2.11\%$
test_func_call_cm_runtime[False-eager] 1.2354ms 0.7904ms 1.2652 KOps/s 1.2931 KOps/s $\color{#d91a1a}-2.16\%$
test_func_call_cm_runtime[False-compile] 1.1764ms 0.7533ms 1.3275 KOps/s 1.2203 KOps/s $\textbf{\color{#35bf28}+8.78\%}$
test_func_call_cm_runtime[False-compile-overhead] 0.4247ms 0.3661ms 2.7312 KOps/s 2.6846 KOps/s $\color{#35bf28}+1.74\%$
test_func_call_cm_runtime[True-eager] 1.1739ms 1.0128ms 987.3544 Ops/s 936.5911 Ops/s $\textbf{\color{#35bf28}+5.42\%}$
test_func_call_cm_runtime[True-compile] 1.2186ms 0.8001ms 1.2498 KOps/s 1.1688 KOps/s $\textbf{\color{#35bf28}+6.94\%}$
test_func_call_cm_runtime[True-compile-overhead] 0.4814ms 0.4177ms 2.3942 KOps/s 2.3095 KOps/s $\color{#35bf28}+3.67\%$
test_vmap_func_call_cm_runtime[eager] 2.5804ms 2.0790ms 480.9908 Ops/s 472.6664 Ops/s $\color{#35bf28}+1.76\%$
test_vmap_func_call_cm_runtime[compile] 1.2219ms 0.8099ms 1.2348 KOps/s 1.1463 KOps/s $\textbf{\color{#35bf28}+7.72\%}$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4623ms 0.4146ms 2.4121 KOps/s 2.3790 KOps/s $\color{#35bf28}+1.39\%$
test_distributed 13.0008ms 0.3033ms 3.2965 KOps/s 7.2122 KOps/s $\textbf{\color{#d91a1a}-54.29\%}$
test_tdmodule 0.1151ms 20.2355μs 49.4180 KOps/s 46.4755 KOps/s $\textbf{\color{#35bf28}+6.33\%}$
test_tdmodule_dispatch 65.0010μs 36.3752μs 27.4912 KOps/s 25.6376 KOps/s $\textbf{\color{#35bf28}+7.23\%}$
test_tdseq 42.0810μs 21.5782μs 46.3431 KOps/s 44.6457 KOps/s $\color{#35bf28}+3.80\%$
test_tdseq_dispatch 64.1320μs 39.7370μs 25.1654 KOps/s 23.7234 KOps/s $\textbf{\color{#35bf28}+6.08\%}$
test_instantiation_functorch 1.6850ms 1.5702ms 636.8446 Ops/s 627.4584 Ops/s $\color{#35bf28}+1.50\%$
test_exec_functorch 0.1874ms 0.1466ms 6.8191 KOps/s 6.6331 KOps/s $\color{#35bf28}+2.80\%$
test_exec_functional_call 0.2141ms 0.1404ms 7.1222 KOps/s 7.1984 KOps/s $\color{#d91a1a}-1.06\%$
test_exec_td_decorator 0.3860ms 0.1884ms 5.3090 KOps/s 5.2549 KOps/s $\color{#35bf28}+1.03\%$
test_vmap_mlp_speed_decorator[True-True] 0.9220ms 0.7170ms 1.3947 KOps/s 1.4265 KOps/s $\color{#d91a1a}-2.23\%$
test_vmap_mlp_speed_decorator[True-False] 0.8262ms 0.7151ms 1.3983 KOps/s 1.4263 KOps/s $\color{#d91a1a}-1.96\%$
test_vmap_mlp_speed_decorator[False-True] 0.7331ms 0.6219ms 1.6080 KOps/s 1.6549 KOps/s $\color{#d91a1a}-2.84\%$
test_vmap_mlp_speed_decorator[False-False] 0.7515ms 0.6263ms 1.5967 KOps/s 1.6531 KOps/s $\color{#d91a1a}-3.41\%$
test_vmap_transformer_speed_decorator[True-True] 19.5359ms 19.2194ms 52.0308 Ops/s 51.3963 Ops/s $\color{#35bf28}+1.23\%$
test_vmap_transformer_speed_decorator[True-False] 20.0257ms 19.3468ms 51.6883 Ops/s 51.3800 Ops/s $\color{#35bf28}+0.60\%$
test_vmap_transformer_speed_decorator[False-True] 19.7811ms 19.1743ms 52.1531 Ops/s 51.8237 Ops/s $\color{#35bf28}+0.64\%$
test_vmap_transformer_speed_decorator[False-False] 20.0069ms 19.2944ms 51.8286 Ops/s 51.7648 Ops/s $\color{#35bf28}+0.12\%$
test_to_module_speed[True] 1.5176ms 0.9609ms 1.0407 KOps/s 1.0352 KOps/s $\color{#35bf28}+0.53\%$
test_to_module_speed[False] 1.5032ms 0.9441ms 1.0592 KOps/s 1.0393 KOps/s $\color{#35bf28}+1.92\%$
test_tc_init 70.1010μs 34.4960μs 28.9889 KOps/s 24.8883 KOps/s $\textbf{\color{#35bf28}+16.48\%}$
test_tc_init_nested 0.1663ms 71.5156μs 13.9830 KOps/s 12.4232 KOps/s $\textbf{\color{#35bf28}+12.56\%}$
test_tc_first_layer_tensor 5.4159μs 0.7103μs 1.4078 MOps/s 1.4432 MOps/s $\color{#d91a1a}-2.46\%$
test_tc_first_layer_nontensor 19.8210μs 2.2860μs 437.4534 KOps/s 451.1323 KOps/s $\color{#d91a1a}-3.03\%$
test_tc_second_layer_tensor 10.6425μs 1.4207μs 703.8661 KOps/s 717.2024 KOps/s $\color{#d91a1a}-1.86\%$
test_tc_second_layer_nontensor 29.8010μs 3.0248μs 330.5975 KOps/s 334.7627 KOps/s $\color{#d91a1a}-1.24\%$
test_unbind 0.2400s 10.2303ms 97.7491 Ops/s 142.5067 Ops/s $\textbf{\color{#d91a1a}-31.41\%}$
test_full_like 9.7193ms 9.4912ms 105.3611 Ops/s 103.4393 Ops/s $\color{#35bf28}+1.86\%$
test_zeros_like 5.6034ms 4.3781ms 228.4110 Ops/s 114.6767 Ops/s $\textbf{\color{#35bf28}+99.18\%}$
test_ones_like 5.0253ms 4.3783ms 228.3972 Ops/s 229.5048 Ops/s $\color{#d91a1a}-0.48\%$
test_clone 7.0618ms 6.7590ms 147.9511 Ops/s 147.8579 Ops/s $\color{#35bf28}+0.06\%$
test_squeeze 85.6220μs 10.5411μs 94.8669 KOps/s 102.7312 KOps/s $\textbf{\color{#d91a1a}-7.66\%}$
test_unsqueeze 0.1281ms 73.7514μs 13.5591 KOps/s 13.2966 KOps/s $\color{#35bf28}+1.97\%$
test_split 0.3685ms 0.1604ms 6.2327 KOps/s 6.0253 KOps/s $\color{#35bf28}+3.44\%$
test_permute 0.2275ms 0.1837ms 5.4425 KOps/s 5.6306 KOps/s $\color{#d91a1a}-3.34\%$
test_stack 53.3286ms 51.6977ms 19.3432 Ops/s 19.7840 Ops/s $\color{#d91a1a}-2.23\%$
test_cat 51.6356ms 51.3598ms 19.4705 Ops/s 19.4647 Ops/s $\color{#35bf28}+0.03\%$
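
For readers checking the Change column by hand, the sketch below assumes it is the relative difference between the Ops column and the Ops on Repo HEAD column, which is consistent with the rows above. The function name `relative_change` is hypothetical and is not taken from the benchmark tooling; both inputs must be in the same unit (for example, both in KOps/s).

```python
def relative_change(ops: float, ops_head: float) -> float:
    """Percent change in throughput relative to the repo HEAD baseline.

    Assumption: Change = (Ops / Ops on Repo HEAD - 1) * 100.
    """
    return (ops / ops_head - 1.0) * 100.0


# Values copied from the test_setitem row (both in KOps/s).
print(f"{relative_change(71.0178, 57.7474):+.2f}%")

# Values copied from the test_distributed row (both in KOps/s).
print(f"{relative_change(3.2965, 7.2122):+.2f}%")
```

A positive value means the PR branch ran more operations per second than HEAD; a negative value means fewer.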

[ghstack-poisoned]
vmoens added the BE (Better errors, logs, docs or test utils) label on Jan 9, 2025
vmoens merged commit e235ae8 into gh/vmoens/46/base on Jan 9, 2025
45 of 52 checks passed
vmoens added a commit that referenced this pull request Jan 9, 2025
ghstack-source-id: 5d85310221eb18ca6d0b1c4a4a88557f3fe8819d
Pull Request resolved: #1174
vmoens deleted the gh/vmoens/46/head branch on January 9, 2025 at 18:41