[Quality] fix c++ binaries formatting #859

Merged
vmoens merged 1 commit into main from fix-imports2
Jul 8, 2024

Conversation

vmoens (Contributor) commented Jul 8, 2024

No description provided.

@facebook-github-bot added the CLA Signed label Jul 8, 2024
@vmoens merged commit 3934fe1 into main Jul 8, 2024
21 of 33 checks passed
@vmoens deleted the fix-imports2 branch July 8, 2024 11:01

github-actions bot commented Jul 8, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 144. Improved: $\large\color{#35bf28}19$. Worsened: $\large\color{#d91a1a}3$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 39.5640μs 17.0205μs 58.7527 KOps/s 58.3969 KOps/s $\color{#35bf28}+0.61\%$
test_plain_set_stack_nested 56.4750μs 17.5121μs 57.1033 KOps/s 56.8635 KOps/s $\color{#35bf28}+0.42\%$
test_plain_set_nested_inplace 89.0660μs 19.6159μs 50.9789 KOps/s 50.5215 KOps/s $\color{#35bf28}+0.91\%$
test_plain_set_stack_nested_inplace 78.5970μs 19.5643μs 51.1135 KOps/s 51.2806 KOps/s $\color{#d91a1a}-0.33\%$
test_items 22.5620μs 2.6404μs 378.7233 KOps/s 385.2130 KOps/s $\color{#d91a1a}-1.68\%$
test_items_nested 0.6531ms 0.2719ms 3.6785 KOps/s 3.6278 KOps/s $\color{#35bf28}+1.40\%$
test_items_nested_locked 0.9178ms 0.2755ms 3.6303 KOps/s 3.6141 KOps/s $\color{#35bf28}+0.45\%$
test_items_nested_leaf 0.1649ms 78.4885μs 12.7407 KOps/s 12.4207 KOps/s $\color{#35bf28}+2.58\%$
test_items_stack_nested 0.9347ms 0.2757ms 3.6277 KOps/s 3.6010 KOps/s $\color{#35bf28}+0.74\%$
test_items_stack_nested_leaf 0.1667ms 79.8822μs 12.5184 KOps/s 12.0929 KOps/s $\color{#35bf28}+3.52\%$
test_items_stack_nested_locked 0.5901ms 0.2737ms 3.6532 KOps/s 3.6112 KOps/s $\color{#35bf28}+1.16\%$
test_keys 42.9110μs 3.8070μs 262.6757 KOps/s 261.6729 KOps/s $\color{#35bf28}+0.38\%$
test_keys_nested 0.2639ms 0.1407ms 7.1052 KOps/s 7.1611 KOps/s $\color{#d91a1a}-0.78\%$
test_keys_nested_locked 0.7261ms 0.1465ms 6.8250 KOps/s 6.9086 KOps/s $\color{#d91a1a}-1.21\%$
test_keys_nested_leaf 0.2663ms 0.1199ms 8.3392 KOps/s 8.4091 KOps/s $\color{#d91a1a}-0.83\%$
test_keys_stack_nested 0.2416ms 0.1412ms 7.0810 KOps/s 7.2433 KOps/s $\color{#d91a1a}-2.24\%$
test_keys_stack_nested_leaf 0.2022ms 0.1200ms 8.3306 KOps/s 8.3982 KOps/s $\color{#d91a1a}-0.81\%$
test_keys_stack_nested_locked 0.3042ms 0.1446ms 6.9180 KOps/s 6.9093 KOps/s $\color{#35bf28}+0.13\%$
test_values 10.7325μs 1.1313μs 883.9019 KOps/s 860.9855 KOps/s $\color{#35bf28}+2.66\%$
test_values_nested 97.7120μs 51.4996μs 19.4176 KOps/s 19.0837 KOps/s $\color{#35bf28}+1.75\%$
test_values_nested_locked 0.1033ms 51.5646μs 19.3931 KOps/s 19.0295 KOps/s $\color{#35bf28}+1.91\%$
test_values_nested_leaf 0.1154ms 46.6381μs 21.4417 KOps/s 20.6095 KOps/s $\color{#35bf28}+4.04\%$
test_values_stack_nested 0.1206ms 52.8108μs 18.9355 KOps/s 18.6659 KOps/s $\color{#35bf28}+1.44\%$
test_values_stack_nested_leaf 95.6190μs 46.5218μs 21.4953 KOps/s 20.7909 KOps/s $\color{#35bf28}+3.39\%$
test_values_stack_nested_locked 0.1344ms 51.6615μs 19.3568 KOps/s 18.9742 KOps/s $\color{#35bf28}+2.02\%$
test_membership 28.4830μs 1.3767μs 726.3781 KOps/s 734.1979 KOps/s $\color{#d91a1a}-1.07\%$
test_membership_nested 21.5100μs 3.4437μs 290.3835 KOps/s 290.1342 KOps/s $\color{#35bf28}+0.09\%$
test_membership_nested_leaf 50.9580μs 3.4171μs 292.6448 KOps/s 278.3419 KOps/s $\textbf{\color{#35bf28}+5.14\%}$
test_membership_stacked_nested 17.9240μs 3.4443μs 290.3335 KOps/s 294.1497 KOps/s $\color{#d91a1a}-1.30\%$
test_membership_stacked_nested_leaf 17.1720μs 3.4320μs 291.3745 KOps/s 291.0655 KOps/s $\color{#35bf28}+0.11\%$
test_membership_nested_last 24.2050μs 4.2568μs 234.9172 KOps/s 233.9399 KOps/s $\color{#35bf28}+0.42\%$
test_membership_nested_leaf_last 40.4760μs 4.2720μs 234.0839 KOps/s 236.5932 KOps/s $\color{#d91a1a}-1.06\%$
test_membership_stacked_nested_last 35.2040μs 4.2283μs 236.5000 KOps/s 207.9226 KOps/s $\textbf{\color{#35bf28}+13.74\%}$
test_membership_stacked_nested_leaf_last 22.7220μs 4.2606μs 234.7068 KOps/s 205.6324 KOps/s $\textbf{\color{#35bf28}+14.14\%}$
test_nested_getleaf 58.9100μs 10.7615μs 92.9235 KOps/s 94.6654 KOps/s $\color{#d91a1a}-1.84\%$
test_nested_get 62.3660μs 10.3485μs 96.6325 KOps/s 99.7669 KOps/s $\color{#d91a1a}-3.14\%$
test_stacked_getleaf 37.8710μs 10.7387μs 93.1212 KOps/s 95.4133 KOps/s $\color{#d91a1a}-2.40\%$
test_stacked_get 55.7250μs 10.0795μs 99.2111 KOps/s 99.3319 KOps/s $\color{#d91a1a}-0.12\%$
test_nested_getitemleaf 52.1580μs 11.3331μs 88.2370 KOps/s 90.0049 KOps/s $\color{#d91a1a}-1.96\%$
test_nested_getitem 33.2830μs 10.5118μs 95.1314 KOps/s 97.4317 KOps/s $\color{#d91a1a}-2.36\%$
test_stacked_getitemleaf 51.1050μs 11.1359μs 89.7999 KOps/s 90.0434 KOps/s $\color{#d91a1a}-0.27\%$
test_stacked_getitem 46.8680μs 10.2317μs 97.7357 KOps/s 97.5914 KOps/s $\color{#35bf28}+0.15\%$
test_lock_nested 50.8759ms 0.3745ms 2.6699 KOps/s 2.9405 KOps/s $\textbf{\color{#d91a1a}-9.20\%}$
test_lock_stack_nested 0.5282ms 0.2981ms 3.3543 KOps/s 3.2565 KOps/s $\color{#35bf28}+3.00\%$
test_unlock_nested 0.7525ms 0.3342ms 2.9926 KOps/s 2.9012 KOps/s $\color{#35bf28}+3.15\%$
test_unlock_stack_nested 0.3777ms 0.3080ms 3.2469 KOps/s 3.1870 KOps/s $\color{#35bf28}+1.88\%$
test_flatten_speed 0.5961ms 0.1004ms 9.9605 KOps/s 9.8803 KOps/s $\color{#35bf28}+0.81\%$
test_unflatten_speed 0.9060ms 0.4183ms 2.3908 KOps/s 2.4151 KOps/s $\color{#d91a1a}-1.00\%$
test_common_ops 3.6739ms 0.7144ms 1.3998 KOps/s 1.3414 KOps/s $\color{#35bf28}+4.35\%$
test_creation 51.9570μs 1.8711μs 534.4579 KOps/s 524.7461 KOps/s $\color{#35bf28}+1.85\%$
test_creation_empty 26.8100μs 10.8946μs 91.7887 KOps/s 89.5166 KOps/s $\color{#35bf28}+2.54\%$
test_creation_nested_1 31.3490μs 13.7957μs 72.4863 KOps/s 71.9105 KOps/s $\color{#35bf28}+0.80\%$
test_creation_nested_2 65.6530μs 16.9745μs 58.9119 KOps/s 57.7174 KOps/s $\color{#35bf28}+2.07\%$
test_clone 91.3610μs 13.1125μs 76.2629 KOps/s 75.2858 KOps/s $\color{#35bf28}+1.30\%$
test_getitem[int] 45.2240μs 10.9429μs 91.3834 KOps/s 89.9007 KOps/s $\color{#35bf28}+1.65\%$
test_getitem[slice_int] 77.1450μs 21.2500μs 47.0588 KOps/s 45.0247 KOps/s $\color{#35bf28}+4.52\%$
test_getitem[range] 76.7330μs 57.0644μs 17.5241 KOps/s 17.0237 KOps/s $\color{#35bf28}+2.94\%$
test_getitem[tuple] 0.2250ms 18.0444μs 55.4190 KOps/s 52.7793 KOps/s $\textbf{\color{#35bf28}+5.00\%}$
test_getitem[list] 0.2002ms 38.5789μs 25.9209 KOps/s 23.9336 KOps/s $\textbf{\color{#35bf28}+8.30\%}$
test_setitem_dim[int] 57.4770μs 32.9617μs 30.3383 KOps/s 29.1504 KOps/s $\color{#35bf28}+4.07\%$
test_setitem_dim[slice_int] 85.7500μs 60.2852μs 16.5878 KOps/s 16.7685 KOps/s $\color{#d91a1a}-1.08\%$
test_setitem_dim[range] 0.1491ms 81.2526μs 12.3073 KOps/s 11.7389 KOps/s $\color{#35bf28}+4.84\%$
test_setitem_dim[tuple] 92.3730μs 48.3537μs 20.6809 KOps/s 20.0295 KOps/s $\color{#35bf28}+3.25\%$
test_setitem 56.2160μs 19.9469μs 50.1331 KOps/s 50.0744 KOps/s $\color{#35bf28}+0.12\%$
test_set 78.0450μs 19.3779μs 51.6051 KOps/s 51.3818 KOps/s $\color{#35bf28}+0.43\%$
test_set_shared 1.7217ms 0.1408ms 7.1027 KOps/s 6.9537 KOps/s $\color{#35bf28}+2.14\%$
test_update 0.1390ms 22.8555μs 43.7532 KOps/s 44.0287 KOps/s $\color{#d91a1a}-0.63\%$
test_update_nested 0.1247ms 31.3296μs 31.9187 KOps/s 31.1142 KOps/s $\color{#35bf28}+2.59\%$
test_update__nested 0.1248ms 25.5172μs 39.1892 KOps/s 39.2968 KOps/s $\color{#d91a1a}-0.27\%$
test_set_nested 57.1470μs 22.0756μs 45.2988 KOps/s 47.1212 KOps/s $\color{#d91a1a}-3.87\%$
test_set_nested_new 88.0040μs 26.3020μs 38.0199 KOps/s 39.3429 KOps/s $\color{#d91a1a}-3.36\%$
test_select 0.1131ms 41.6301μs 24.0211 KOps/s 24.4263 KOps/s $\color{#d91a1a}-1.66\%$
test_select_nested 0.1141ms 58.6399μs 17.0532 KOps/s 17.1676 KOps/s $\color{#d91a1a}-0.67\%$
test_exclude_nested 0.2268ms 0.1168ms 8.5629 KOps/s 8.2950 KOps/s $\color{#35bf28}+3.23\%$
test_empty[True] 0.7287ms 0.3969ms 2.5194 KOps/s 2.4581 KOps/s $\color{#35bf28}+2.49\%$
test_empty[False] 28.5194μs 1.0705μs 934.1106 KOps/s 986.8497 KOps/s $\textbf{\color{#d91a1a}-5.34\%}$
test_unbind_speed 5.0305ms 0.2407ms 4.1551 KOps/s 3.9740 KOps/s $\color{#35bf28}+4.56\%$
test_unbind_speed_stack0 0.4385ms 0.2408ms 4.1533 KOps/s 4.0068 KOps/s $\color{#35bf28}+3.66\%$
test_unbind_speed_stack1 65.5266ms 0.6942ms 1.4405 KOps/s 1.4029 KOps/s $\color{#35bf28}+2.68\%$
test_split 73.2792ms 1.5753ms 634.8037 Ops/s 635.8062 Ops/s $\color{#d91a1a}-0.16\%$
test_chunk 72.4787ms 1.5777ms 633.8482 Ops/s 637.8691 Ops/s $\color{#d91a1a}-0.63\%$
test_creation[device0] 0.2101ms 84.5001μs 11.8343 KOps/s 11.4118 KOps/s $\color{#35bf28}+3.70\%$
test_creation_from_tensor 3.5302ms 85.5585μs 11.6879 KOps/s 11.2182 KOps/s $\color{#35bf28}+4.19\%$
test_add_one[memmap_tensor0] 89.8580μs 5.3050μs 188.5004 KOps/s 182.0020 KOps/s $\color{#35bf28}+3.57\%$
test_contiguous[memmap_tensor0] 21.9310μs 0.6434μs 1.5543 MOps/s 1.5451 MOps/s $\color{#35bf28}+0.60\%$
test_stack[memmap_tensor0] 24.2660μs 3.4984μs 285.8434 KOps/s 266.4821 KOps/s $\textbf{\color{#35bf28}+7.27\%}$
test_memmaptd_index 1.0566ms 0.2513ms 3.9795 KOps/s 3.7930 KOps/s $\color{#35bf28}+4.92\%$
test_memmaptd_index_astensor 0.7401ms 0.3223ms 3.1030 KOps/s 2.9827 KOps/s $\color{#35bf28}+4.03\%$
test_memmaptd_index_op 4.8081ms 0.6132ms 1.6309 KOps/s 1.5819 KOps/s $\color{#35bf28}+3.10\%$
test_serialize_model 0.1779s 0.1065s 9.3857 Ops/s 10.0528 Ops/s $\textbf{\color{#d91a1a}-6.64\%}$
test_serialize_model_pickle 0.4504s 0.3802s 2.6304 Ops/s 2.6451 Ops/s $\color{#d91a1a}-0.56\%$
test_serialize_weights 0.1088s 98.6906ms 10.1327 Ops/s 9.0716 Ops/s $\textbf{\color{#35bf28}+11.70\%}$
test_serialize_weights_returnearly 0.1250s 0.1208s 8.2774 Ops/s 8.2521 Ops/s $\color{#35bf28}+0.31\%$
test_serialize_weights_pickle 0.4473s 0.4233s 2.3623 Ops/s 2.2787 Ops/s $\color{#35bf28}+3.67\%$
test_serialize_weights_filesystem 96.2747ms 90.9865ms 10.9906 Ops/s 9.5234 Ops/s $\textbf{\color{#35bf28}+15.41\%}$
test_serialize_model_filesystem 0.1008s 97.0199ms 10.3072 Ops/s 10.4290 Ops/s $\color{#d91a1a}-1.17\%$
test_reshape_pytree 66.6450μs 25.5597μs 39.1240 KOps/s 37.6528 KOps/s $\color{#35bf28}+3.91\%$
test_reshape_td 0.1132ms 33.5064μs 29.8450 KOps/s 29.0441 KOps/s $\color{#35bf28}+2.76\%$
test_view_pytree 66.0340μs 25.5628μs 39.1193 KOps/s 38.5380 KOps/s $\color{#35bf28}+1.51\%$
test_view_td 0.1011ms 37.7316μs 26.5030 KOps/s 25.5242 KOps/s $\color{#35bf28}+3.83\%$
test_unbind_pytree 66.4340μs 29.2530μs 34.1846 KOps/s 33.5003 KOps/s $\color{#35bf28}+2.04\%$
test_unbind_td 0.3667ms 35.7308μs 27.9870 KOps/s 24.9954 KOps/s $\textbf{\color{#35bf28}+11.97\%}$
test_split_pytree 70.1110μs 29.3803μs 34.0364 KOps/s 33.1281 KOps/s $\color{#35bf28}+2.74\%$
test_split_td 0.1191ms 38.7483μs 25.8076 KOps/s 24.9433 KOps/s $\color{#35bf28}+3.47\%$
test_add_pytree 96.8000μs 35.5172μs 28.1554 KOps/s 28.1504 KOps/s $\color{#35bf28}+0.02\%$
test_add_td 0.1315ms 56.6298μs 17.6586 KOps/s 18.0999 KOps/s $\color{#d91a1a}-2.44\%$
test_distributed 0.2374ms 0.1004ms 9.9579 KOps/s 9.6408 KOps/s $\color{#35bf28}+3.29\%$
test_tdmodule 32.9720μs 17.8748μs 55.9446 KOps/s 54.3446 KOps/s $\color{#35bf28}+2.94\%$
test_tdmodule_dispatch 58.6990μs 35.0669μs 28.5169 KOps/s 27.3211 KOps/s $\color{#35bf28}+4.38\%$
test_tdseq 38.0710μs 20.3873μs 49.0501 KOps/s 40.1585 KOps/s $\textbf{\color{#35bf28}+22.14\%}$
test_tdseq_dispatch 58.6090μs 39.7690μs 25.1452 KOps/s 21.0237 KOps/s $\textbf{\color{#35bf28}+19.60\%}$
test_instantiation_functorch 1.6020ms 1.3092ms 763.8380 Ops/s 756.4258 Ops/s $\color{#35bf28}+0.98\%$
test_instantiation_td 1.4969ms 1.0025ms 997.5266 Ops/s 976.7366 Ops/s $\color{#35bf28}+2.13\%$
test_exec_functorch 0.2974ms 0.1621ms 6.1694 KOps/s 6.1837 KOps/s $\color{#d91a1a}-0.23\%$
test_exec_functional_call 0.2763ms 0.1470ms 6.8048 KOps/s 6.4119 KOps/s $\textbf{\color{#35bf28}+6.13\%}$
test_exec_td 0.3387ms 0.1462ms 6.8379 KOps/s 6.8216 KOps/s $\color{#35bf28}+0.24\%$
test_exec_td_decorator 0.4708ms 0.2207ms 4.5313 KOps/s 4.3729 KOps/s $\color{#35bf28}+3.62\%$
test_vmap_mlp_speed[True-True] 0.6995ms 0.4886ms 2.0465 KOps/s 1.9975 KOps/s $\color{#35bf28}+2.46\%$
test_vmap_mlp_speed[True-False] 0.7663ms 0.4871ms 2.0532 KOps/s 1.9932 KOps/s $\color{#35bf28}+3.01\%$
test_vmap_mlp_speed[False-True] 1.8857ms 0.4081ms 2.4502 KOps/s 2.4250 KOps/s $\color{#35bf28}+1.04\%$
test_vmap_mlp_speed[False-False] 0.6726ms 0.3954ms 2.5289 KOps/s 2.4444 KOps/s $\color{#35bf28}+3.45\%$
test_vmap_mlp_speed_decorator[True-True] 1.1155ms 0.5603ms 1.7846 KOps/s 1.7366 KOps/s $\color{#35bf28}+2.77\%$
test_vmap_mlp_speed_decorator[True-False] 0.9592ms 0.5612ms 1.7820 KOps/s 1.7369 KOps/s $\color{#35bf28}+2.60\%$
test_vmap_mlp_speed_decorator[False-True] 0.7226ms 0.4618ms 2.1657 KOps/s 2.1245 KOps/s $\color{#35bf28}+1.94\%$
test_vmap_mlp_speed_decorator[False-False] 0.6211ms 0.4594ms 2.1767 KOps/s 2.1201 KOps/s $\color{#35bf28}+2.67\%$
test_to_module_speed[True] 2.3372ms 1.6967ms 589.3637 Ops/s 581.1210 Ops/s $\color{#35bf28}+1.42\%$
test_to_module_speed[False] 76.7863ms 1.8076ms 553.2317 Ops/s 581.9234 Ops/s $\color{#d91a1a}-4.93\%$
test_tc_init 0.1205ms 58.9930μs 16.9512 KOps/s 16.1504 KOps/s $\color{#35bf28}+4.96\%$
test_tc_init_nested 0.2283ms 0.1210ms 8.2642 KOps/s 8.2029 KOps/s $\color{#35bf28}+0.75\%$
test_tc_first_layer_tensor 34.9960μs 8.2400μs 121.3596 KOps/s 119.6489 KOps/s $\color{#35bf28}+1.43\%$
test_tc_first_layer_nontensor 49.9530μs 8.1689μs 122.4161 KOps/s 121.4072 KOps/s $\color{#35bf28}+0.83\%$
test_tc_second_layer_tensor 38.3510μs 2.5017μs 399.7359 KOps/s 398.5926 KOps/s $\color{#35bf28}+0.29\%$
test_tc_second_layer_nontensor 64.0900μs 9.1765μs 108.9737 KOps/s 106.7474 KOps/s $\color{#35bf28}+2.09\%$
test_unbind 81.7783ms 13.4097ms 74.5727 Ops/s 70.8813 Ops/s $\textbf{\color{#35bf28}+5.21\%}$
test_full_like 8.9223ms 7.2202ms 138.5000 Ops/s 122.6028 Ops/s $\textbf{\color{#35bf28}+12.97\%}$
test_zeros_like 6.4181ms 5.8024ms 172.3432 Ops/s 158.4115 Ops/s $\textbf{\color{#35bf28}+8.79\%}$
test_ones_like 15.4731ms 6.7144ms 148.9339 Ops/s 145.8334 Ops/s $\color{#35bf28}+2.13\%$
test_clone 13.0502ms 8.2252ms 121.5770 Ops/s 119.5736 Ops/s $\color{#35bf28}+1.68\%$
test_squeeze 62.4070μs 12.3502μs 80.9705 KOps/s 76.3389 KOps/s $\textbf{\color{#35bf28}+6.07\%}$
test_unsqueeze 0.2761ms 95.7485μs 10.4440 KOps/s 9.9060 KOps/s $\textbf{\color{#35bf28}+5.43\%}$
test_split 0.5166ms 0.2692ms 3.7152 KOps/s 3.6380 KOps/s $\color{#35bf28}+2.12\%$
test_permute 0.3019ms 0.2201ms 4.5433 KOps/s 4.4340 KOps/s $\color{#35bf28}+2.46\%$
test_stack 24.6053ms 22.6470ms 44.1559 Ops/s 41.9546 Ops/s $\textbf{\color{#35bf28}+5.25\%}$
test_cat 25.1129ms 22.7535ms 43.9494 Ops/s 41.5459 Ops/s $\textbf{\color{#35bf28}+5.79\%}$
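
For readers checking the table, the Change column appears to be the relative difference between the Ops column and the Ops on Repo HEAD column. The sketch below reproduces three CPU rows under that assumption. The function names and the 5% cutoff are illustrative, not taken from the benchmark bot: the cutoff is inferred from the rows shown in bold, whose count matches the Improved and Worsened totals above.

```python
def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percent change in throughput of the PR run relative to repo HEAD."""
    return (ops_pr - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a change; the 5% threshold is an assumption inferred from the table."""
    if change_pct >= threshold:
        return "improved"
    if change_pct <= -threshold:
        return "worsened"
    return "within noise"


# (Ops, Ops on Repo HEAD) in KOps/s, copied from the CPU table above.
rows = {
    "test_plain_set_nested": (58.7527, 58.3969),
    "test_lock_nested": (2.6699, 2.9405),
    "test_tdseq": (49.0501, 40.1585),
}

for name, (ops_pr, ops_head) in rows.items():
    change = relative_change(ops_pr, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Many rows sit within a few percent of zero in both directions, so a cutoff like this is one way to separate run-to-run variation from changes worth a closer look.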


github-actions bot commented Jul 8, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 152. Improved: $\large\color{#35bf28}7$. Worsened: $\large\color{#d91a1a}28$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 28.6610μs 13.2052μs 75.7276 KOps/s 81.7734 KOps/s $\textbf{\color{#d91a1a}-7.39\%}$
test_plain_set_stack_nested 26.4310μs 13.2055μs 75.7262 KOps/s 80.0296 KOps/s $\textbf{\color{#d91a1a}-5.38\%}$
test_plain_set_nested_inplace 50.6510μs 14.4035μs 69.4276 KOps/s 73.0657 KOps/s $\color{#d91a1a}-4.98\%$
test_plain_set_stack_nested_inplace 42.7010μs 14.5897μs 68.5415 KOps/s 73.1565 KOps/s $\textbf{\color{#d91a1a}-6.31\%}$
test_items 18.1610μs 4.7375μs 211.0806 KOps/s 210.2732 KOps/s $\color{#35bf28}+0.38\%$
test_items_nested 0.3625ms 0.3354ms 2.9817 KOps/s 2.8895 KOps/s $\color{#35bf28}+3.19\%$
test_items_nested_locked 0.3671ms 0.3380ms 2.9588 KOps/s 2.9196 KOps/s $\color{#35bf28}+1.34\%$
test_items_nested_leaf 0.1003ms 82.5270μs 12.1172 KOps/s 12.0467 KOps/s $\color{#35bf28}+0.59\%$
test_items_stack_nested 0.4072ms 0.3392ms 2.9484 KOps/s 2.9463 KOps/s $\color{#35bf28}+0.07\%$
test_items_stack_nested_leaf 0.1035ms 83.9617μs 11.9102 KOps/s 11.8883 KOps/s $\color{#35bf28}+0.18\%$
test_items_stack_nested_locked 0.3803ms 0.3424ms 2.9202 KOps/s 2.9534 KOps/s $\color{#d91a1a}-1.12\%$
test_keys 23.6710μs 4.3859μs 228.0014 KOps/s 229.0676 KOps/s $\color{#d91a1a}-0.47\%$
test_keys_nested 90.4820μs 69.1377μs 14.4639 KOps/s 14.4838 KOps/s $\color{#d91a1a}-0.14\%$
test_keys_nested_locked 2.2241ms 75.2072μs 13.2966 KOps/s 13.2282 KOps/s $\color{#35bf28}+0.52\%$
test_keys_nested_leaf 76.5610μs 57.9812μs 17.2470 KOps/s 16.6369 KOps/s $\color{#35bf28}+3.67\%$
test_keys_stack_nested 0.1042ms 68.8675μs 14.5206 KOps/s 14.3474 KOps/s $\color{#35bf28}+1.21\%$
test_keys_stack_nested_leaf 78.8810μs 59.4637μs 16.8170 KOps/s 16.4989 KOps/s $\color{#35bf28}+1.93\%$
test_keys_stack_nested_locked 95.2520μs 74.6528μs 13.3953 KOps/s 13.1072 KOps/s $\color{#35bf28}+2.20\%$
test_values 8.1973μs 1.7986μs 555.9976 KOps/s 549.6638 KOps/s $\color{#35bf28}+1.15\%$
test_values_nested 66.4410μs 34.9124μs 28.6431 KOps/s 28.2906 KOps/s $\color{#35bf28}+1.25\%$
test_values_nested_locked 0.7728ms 37.1439μs 26.9224 KOps/s 26.9886 KOps/s $\color{#d91a1a}-0.25\%$
test_values_nested_leaf 59.7920μs 31.1187μs 32.1351 KOps/s 31.7493 KOps/s $\color{#35bf28}+1.21\%$
test_values_stack_nested 60.6310μs 36.0814μs 27.7151 KOps/s 27.8543 KOps/s $\color{#d91a1a}-0.50\%$
test_values_stack_nested_leaf 56.7110μs 32.1782μs 31.0769 KOps/s 31.2638 KOps/s $\color{#d91a1a}-0.60\%$
test_values_stack_nested_locked 59.5820μs 38.1598μs 26.2056 KOps/s 26.5954 KOps/s $\color{#d91a1a}-1.47\%$
test_membership 1.6021μs 0.6992μs 1.4302 MOps/s 1.2426 MOps/s $\textbf{\color{#35bf28}+15.09\%}$
test_membership_nested 15.0010μs 2.5822μs 387.2692 KOps/s 387.2402 KOps/s $+0.01\%$
test_membership_nested_leaf 18.2210μs 2.5935μs 385.5728 KOps/s 385.9864 KOps/s $\color{#d91a1a}-0.11\%$
test_membership_stacked_nested 17.4500μs 2.6173μs 382.0701 KOps/s 388.3872 KOps/s $\color{#d91a1a}-1.63\%$
test_membership_stacked_nested_leaf 22.0800μs 2.6001μs 384.6052 KOps/s 389.0550 KOps/s $\color{#d91a1a}-1.14\%$
test_membership_nested_last 24.1310μs 3.1216μs 320.3463 KOps/s 321.5082 KOps/s $\color{#d91a1a}-0.36\%$
test_membership_nested_leaf_last 24.8600μs 3.1264μs 319.8611 KOps/s 318.6495 KOps/s $\color{#35bf28}+0.38\%$
test_membership_stacked_nested_last 23.0000μs 3.5738μs 279.8126 KOps/s 319.0228 KOps/s $\textbf{\color{#d91a1a}-12.29\%}$
test_membership_stacked_nested_leaf_last 23.6010μs 3.5759μs 279.6524 KOps/s 318.2486 KOps/s $\textbf{\color{#d91a1a}-12.13\%}$
test_nested_getleaf 25.0000μs 8.3953μs 119.1146 KOps/s 119.3060 KOps/s $\color{#d91a1a}-0.16\%$
test_nested_get 20.8910μs 7.8757μs 126.9726 KOps/s 126.6729 KOps/s $\color{#35bf28}+0.24\%$
test_stacked_getleaf 30.5600μs 8.3534μs 119.7121 KOps/s 119.5696 KOps/s $\color{#35bf28}+0.12\%$
test_stacked_get 21.5800μs 7.8168μs 127.9289 KOps/s 127.5737 KOps/s $\color{#35bf28}+0.28\%$
test_nested_getitemleaf 28.6710μs 8.5893μs 116.4235 KOps/s 116.8852 KOps/s $\color{#d91a1a}-0.39\%$
test_nested_getitem 32.8300μs 8.0108μs 124.8317 KOps/s 124.6124 KOps/s $\color{#35bf28}+0.18\%$
test_stacked_getitemleaf 23.1110μs 8.5018μs 117.6222 KOps/s 117.5489 KOps/s $\color{#35bf28}+0.06\%$
test_stacked_getitem 20.3810μs 7.9529μs 125.7407 KOps/s 123.9720 KOps/s $\color{#35bf28}+1.43\%$
test_lock_nested 57.7133ms 0.4030ms 2.4812 KOps/s 2.4553 KOps/s $\color{#35bf28}+1.05\%$
test_lock_stack_nested 0.3238ms 0.2950ms 3.3901 KOps/s 3.3179 KOps/s $\color{#35bf28}+2.18\%$
test_unlock_nested 60.1607ms 0.4046ms 2.4718 KOps/s 2.4601 KOps/s $\color{#35bf28}+0.47\%$
test_unlock_stack_nested 0.3341ms 0.3050ms 3.2791 KOps/s 3.2355 KOps/s $\color{#35bf28}+1.35\%$
test_flatten_speed 0.3984ms 0.1022ms 9.7869 KOps/s 9.8915 KOps/s $\color{#d91a1a}-1.06\%$
test_unflatten_speed 0.3572ms 0.2900ms 3.4480 KOps/s 3.4545 KOps/s $\color{#d91a1a}-0.19\%$
test_common_ops 1.0703ms 0.5976ms 1.6735 KOps/s 1.7616 KOps/s $\textbf{\color{#d91a1a}-5.01\%}$
test_creation 33.7700μs 1.6304μs 613.3585 KOps/s 616.1618 KOps/s $\color{#d91a1a}-0.45\%$
test_creation_empty 22.5900μs 8.9624μs 111.5778 KOps/s 130.8499 KOps/s $\textbf{\color{#d91a1a}-14.73\%}$
test_creation_nested_1 24.5710μs 10.7523μs 93.0036 KOps/s 104.9117 KOps/s $\textbf{\color{#d91a1a}-11.35\%}$
test_creation_nested_2 34.0910μs 12.9261μs 77.3626 KOps/s 85.5695 KOps/s $\textbf{\color{#d91a1a}-9.59\%}$
test_clone 82.8920μs 11.8913μs 84.0951 KOps/s 88.9450 KOps/s $\textbf{\color{#d91a1a}-5.45\%}$
test_getitem[int] 33.9810μs 10.5791μs 94.5261 KOps/s 93.0434 KOps/s $\color{#35bf28}+1.59\%$
test_getitem[slice_int] 40.3310μs 20.9961μs 47.6280 KOps/s 48.2307 KOps/s $\color{#d91a1a}-1.25\%$
test_getitem[range] 84.2420μs 51.2212μs 19.5232 KOps/s 21.1770 KOps/s $\textbf{\color{#d91a1a}-7.81\%}$
test_getitem[tuple] 42.3500μs 18.9652μs 52.7281 KOps/s 53.9298 KOps/s $\color{#d91a1a}-2.23\%$
test_getitem[list] 0.1238ms 34.0074μs 29.4054 KOps/s 30.5657 KOps/s $\color{#d91a1a}-3.80\%$
test_setitem_dim[int] 44.2510μs 28.1571μs 35.5150 KOps/s 37.4152 KOps/s $\textbf{\color{#d91a1a}-5.08\%}$
test_setitem_dim[slice_int] 70.2310μs 49.8010μs 20.0799 KOps/s 20.7040 KOps/s $\color{#d91a1a}-3.01\%$
test_setitem_dim[range] 0.1067ms 66.4184μs 15.0561 KOps/s 15.3596 KOps/s $\color{#d91a1a}-1.98\%$
test_setitem_dim[tuple] 60.1410μs 43.5321μs 22.9716 KOps/s 24.0569 KOps/s $\color{#d91a1a}-4.51\%$
test_setitem 48.8610μs 17.2891μs 57.8400 KOps/s 65.0583 KOps/s $\textbf{\color{#d91a1a}-11.10\%}$
test_set 54.4010μs 16.8147μs 59.4716 KOps/s 66.8022 KOps/s $\textbf{\color{#d91a1a}-10.97\%}$
test_set_shared 1.7673ms 0.1041ms 9.6027 KOps/s 10.2051 KOps/s $\textbf{\color{#d91a1a}-5.90\%}$
test_update 0.1056ms 19.7506μs 50.6315 KOps/s 56.8258 KOps/s $\textbf{\color{#d91a1a}-10.90\%}$
test_update_nested 66.0310μs 25.3193μs 39.4956 KOps/s 43.8512 KOps/s $\textbf{\color{#d91a1a}-9.93\%}$
test_update__nested 52.0410μs 22.5401μs 44.3653 KOps/s 46.6791 KOps/s $\color{#d91a1a}-4.96\%$
test_set_nested 78.6020μs 17.7529μs 56.3289 KOps/s 61.5612 KOps/s $\textbf{\color{#d91a1a}-8.50\%}$
test_set_nested_new 53.8210μs 20.4613μs 48.8729 KOps/s 53.2677 KOps/s $\textbf{\color{#d91a1a}-8.25\%}$
test_select 67.9710μs 33.4052μs 29.9354 KOps/s 31.0096 KOps/s $\color{#d91a1a}-3.46\%$
test_select_nested 0.6213ms 52.5057μs 19.0456 KOps/s 19.5066 KOps/s $\color{#d91a1a}-2.36\%$
test_exclude_nested 0.1477ms 0.1075ms 9.2998 KOps/s 9.1539 KOps/s $\color{#35bf28}+1.59\%$
test_empty[True] 0.4002ms 0.3425ms 2.9201 KOps/s 2.9064 KOps/s $\color{#35bf28}+0.47\%$
test_empty[False] 2.9761μs 0.8031μs 1.2452 MOps/s 1.2386 MOps/s $\color{#35bf28}+0.53\%$
test_to 87.7820μs 58.9100μs 16.9750 KOps/s 15.4819 KOps/s $\textbf{\color{#35bf28}+9.64\%}$
test_to_nonblocking 77.7420μs 38.8215μs 25.7589 KOps/s 27.3774 KOps/s $\textbf{\color{#d91a1a}-5.91\%}$
test_unbind_speed 0.3519ms 0.2612ms 3.8290 KOps/s 3.8724 KOps/s $\color{#d91a1a}-1.12\%$
test_unbind_speed_stack0 0.3464ms 0.2575ms 3.8833 KOps/s 3.8562 KOps/s $\color{#35bf28}+0.70\%$
test_unbind_speed_stack1 74.6450ms 0.7681ms 1.3018 KOps/s 1.2727 KOps/s $\color{#35bf28}+2.29\%$
test_split 75.7636ms 1.6876ms 592.5429 Ops/s 605.7622 Ops/s $\color{#d91a1a}-2.18\%$
test_chunk 75.3546ms 1.6887ms 592.1572 Ops/s 606.8029 Ops/s $\color{#d91a1a}-2.41\%$
test_creation[device0] 0.1320ms 60.5311μs 16.5204 KOps/s 17.4833 KOps/s $\textbf{\color{#d91a1a}-5.51\%}$
test_creation_from_tensor 0.1396ms 58.2456μs 17.1687 KOps/s 18.6961 KOps/s $\textbf{\color{#d91a1a}-8.17\%}$
test_add_one[memmap_tensor0] 90.4520μs 7.1084μs 140.6790 KOps/s 145.3733 KOps/s $\color{#d91a1a}-3.23\%$
test_contiguous[memmap_tensor0] 13.9910μs 0.6836μs 1.4629 MOps/s 1.4723 MOps/s $\color{#d91a1a}-0.64\%$
test_stack[memmap_tensor0] 46.9810μs 5.1375μs 194.6458 KOps/s 202.4506 KOps/s $\color{#d91a1a}-3.86\%$
test_memmaptd_index 1.0717ms 0.2845ms 3.5155 KOps/s 3.6983 KOps/s $\color{#d91a1a}-4.94\%$
test_memmaptd_index_astensor 0.6091ms 0.3452ms 2.8971 KOps/s 3.0281 KOps/s $\color{#d91a1a}-4.33\%$
test_memmaptd_index_op 0.9333ms 0.6516ms 1.5346 KOps/s 1.6571 KOps/s $\textbf{\color{#d91a1a}-7.39\%}$
test_serialize_model 92.3388ms 89.9211ms 11.1209 Ops/s 10.6293 Ops/s $\color{#35bf28}+4.62\%$
test_serialize_model_pickle 1.3510s 1.2349s 0.8098 Ops/s 0.8089 Ops/s $\color{#35bf28}+0.10\%$
test_serialize_weights 90.7380ms 87.4543ms 11.4345 Ops/s 10.7286 Ops/s $\textbf{\color{#35bf28}+6.58\%}$
test_serialize_weights_returnearly 0.1867s 72.0415ms 13.8809 Ops/s 13.9054 Ops/s $\color{#d91a1a}-0.18\%$
test_serialize_weights_pickle 1.3969s 1.2556s 0.7964 Ops/s 0.7957 Ops/s $\color{#35bf28}+0.09\%$
test_reshape_pytree 54.8310μs 26.5439μs 37.6734 KOps/s 38.3423 KOps/s $\color{#d91a1a}-1.74\%$
test_reshape_td 56.3510μs 32.4581μs 30.8090 KOps/s 31.6951 KOps/s $\color{#d91a1a}-2.80\%$
test_view_pytree 76.9310μs 26.3285μs 37.9816 KOps/s 38.7604 KOps/s $\color{#d91a1a}-2.01\%$
test_view_td 66.3220μs 38.2001μs 26.1779 KOps/s 27.3129 KOps/s $\color{#d91a1a}-4.16\%$
test_unbind_pytree 0.1567ms 32.6155μs 30.6602 KOps/s 31.2071 KOps/s $\color{#d91a1a}-1.75\%$
test_unbind_td 0.4734ms 41.0286μs 24.3733 KOps/s 24.1827 KOps/s $\color{#35bf28}+0.79\%$
test_split_pytree 59.1310μs 35.6051μs 28.0859 KOps/s 28.2205 KOps/s $\color{#d91a1a}-0.48\%$
test_split_td 0.1229ms 39.2154μs 25.5002 KOps/s 26.2063 KOps/s $\color{#d91a1a}-2.69\%$
test_add_pytree 74.9520μs 38.0373μs 26.2900 KOps/s 27.2616 KOps/s $\color{#d91a1a}-3.56\%$
test_add_td 92.4120μs 51.9894μs 19.2347 KOps/s 20.0079 KOps/s $\color{#d91a1a}-3.86\%$
test_distributed 0.1789ms 69.7175μs 14.3436 KOps/s 12.5371 KOps/s $\textbf{\color{#35bf28}+14.41\%}$
test_tdmodule 90.9720μs 16.4426μs 60.8175 KOps/s 66.0814 KOps/s $\textbf{\color{#d91a1a}-7.97\%}$
test_tdmodule_dispatch 51.1410μs 31.1715μs 32.0805 KOps/s 34.0699 KOps/s $\textbf{\color{#d91a1a}-5.84\%}$
test_tdseq 33.9000μs 17.2431μs 57.9942 KOps/s 62.5345 KOps/s $\textbf{\color{#d91a1a}-7.26\%}$
test_tdseq_dispatch 53.5810μs 33.8782μs 29.5175 KOps/s 32.3817 KOps/s $\textbf{\color{#d91a1a}-8.85\%}$
test_instantiation_functorch 1.5394ms 1.4305ms 699.0654 Ops/s 709.1521 Ops/s $\color{#d91a1a}-1.42\%$
test_instantiation_td 77.7946ms 1.0860ms 920.8400 Ops/s 924.2722 Ops/s $\color{#d91a1a}-0.37\%$
test_exec_functorch 0.1742ms 0.1502ms 6.6581 KOps/s 6.8787 KOps/s $\color{#d91a1a}-3.21\%$
test_exec_functional_call 0.2037ms 0.1403ms 7.1298 KOps/s 7.3123 KOps/s $\color{#d91a1a}-2.50\%$
test_exec_td 0.1795ms 0.1387ms 7.2105 KOps/s 7.4750 KOps/s $\color{#d91a1a}-3.54\%$
test_exec_td_decorator 0.4635ms 0.2093ms 4.7770 KOps/s 4.7944 KOps/s $\color{#d91a1a}-0.36\%$
test_vmap_mlp_speed[True-True] 0.6691ms 0.5813ms 1.7201 KOps/s 1.7251 KOps/s $\color{#d91a1a}-0.29\%$
test_vmap_mlp_speed[True-False] 0.6478ms 0.5809ms 1.7215 KOps/s 1.5550 KOps/s $\textbf{\color{#35bf28}+10.71\%}$
test_vmap_mlp_speed[False-True] 0.5799ms 0.5120ms 1.9531 KOps/s 1.8663 KOps/s $\color{#35bf28}+4.65\%$
test_vmap_mlp_speed[False-False] 0.7457ms 0.5284ms 1.8925 KOps/s 1.8605 KOps/s $\color{#35bf28}+1.72\%$
test_vmap_mlp_speed_decorator[True-True] 0.8538ms 0.6799ms 1.4709 KOps/s 1.5370 KOps/s $\color{#d91a1a}-4.30\%$
test_vmap_mlp_speed_decorator[True-False] 0.7876ms 0.6442ms 1.5523 KOps/s 1.5478 KOps/s $\color{#35bf28}+0.29\%$
test_vmap_mlp_speed_decorator[False-True] 0.6917ms 0.5687ms 1.7585 KOps/s 1.7385 KOps/s $\color{#35bf28}+1.15\%$
test_vmap_mlp_speed_decorator[False-False] 0.6368ms 0.5690ms 1.7574 KOps/s 1.7486 KOps/s $\color{#35bf28}+0.51\%$
test_vmap_transformer_speed[True-True] 8.0991ms 7.8181ms 127.9076 Ops/s 130.1710 Ops/s $\color{#d91a1a}-1.74\%$
test_vmap_transformer_speed[True-False] 8.6003ms 8.0041ms 124.9355 Ops/s 129.9316 Ops/s $\color{#d91a1a}-3.85\%$
test_vmap_transformer_speed[False-True] 8.7017ms 7.9826ms 125.2726 Ops/s 127.7513 Ops/s $\color{#d91a1a}-1.94\%$
test_vmap_transformer_speed[False-False] 8.0378ms 7.6640ms 130.4806 Ops/s 128.8862 Ops/s $\color{#35bf28}+1.24\%$
test_vmap_transformer_speed_decorator[True-True] 19.2588ms 18.7270ms 53.3988 Ops/s 53.7079 Ops/s $\color{#d91a1a}-0.58\%$
test_vmap_transformer_speed_decorator[True-False] 19.1299ms 18.7184ms 53.4233 Ops/s 53.5322 Ops/s $\color{#d91a1a}-0.20\%$
test_vmap_transformer_speed_decorator[False-True] 19.2434ms 18.6480ms 53.6251 Ops/s 53.7404 Ops/s $\color{#d91a1a}-0.21\%$
test_vmap_transformer_speed_decorator[False-False] 18.7961ms 18.5987ms 53.7671 Ops/s 53.8800 Ops/s $\color{#d91a1a}-0.21\%$
test_to_module_speed[True] 1.5945ms 1.4970ms 668.0077 Ops/s 662.3579 Ops/s $\color{#35bf28}+0.85\%$
test_to_module_speed[False] 1.5964ms 1.4858ms 673.0418 Ops/s 664.8042 Ops/s $\color{#35bf28}+1.24\%$
test_tc_init 82.9110μs 54.6895μs 18.2850 KOps/s 19.4772 KOps/s $\textbf{\color{#d91a1a}-6.12\%}$
test_tc_init_nested 0.1408ms 0.1062ms 9.4190 KOps/s 9.7304 KOps/s $\color{#d91a1a}-3.20\%$
test_tc_first_layer_tensor 17.7410μs 3.6862μs 271.2788 KOps/s 265.6646 KOps/s $\color{#35bf28}+2.11\%$
test_tc_first_layer_nontensor 19.5700μs 3.7156μs 269.1334 KOps/s 263.6150 KOps/s $\color{#35bf28}+2.09\%$
test_tc_second_layer_tensor 13.3500μs 1.2611μs 792.9692 KOps/s 772.9381 KOps/s $\color{#35bf28}+2.59\%$
test_tc_second_layer_nontensor 85.5920μs 4.2473μs 235.4423 KOps/s 231.6104 KOps/s $\color{#35bf28}+1.65\%$
test_unbind 0.1109s 13.7905ms 72.5139 Ops/s 64.9099 Ops/s $\textbf{\color{#35bf28}+11.71\%}$
test_full_like 14.6025ms 13.8832ms 72.0297 Ops/s 71.5749 Ops/s $\color{#35bf28}+0.64\%$
test_zeros_like 8.3675ms 7.9718ms 125.4416 Ops/s 123.8435 Ops/s $\color{#35bf28}+1.29\%$
test_ones_like 8.4584ms 7.9818ms 125.2857 Ops/s 123.6332 Ops/s $\color{#35bf28}+1.34\%$
test_clone 9.7884ms 9.5072ms 105.1840 Ops/s 103.8218 Ops/s $\color{#35bf28}+1.31\%$
test_squeeze 61.6810μs 10.7627μs 92.9132 KOps/s 90.8840 KOps/s $\color{#35bf28}+2.23\%$
test_unsqueeze 0.2363ms 90.0566μs 11.1041 KOps/s 11.0989 KOps/s $\color{#35bf28}+0.05\%$
test_split 3.5336ms 3.1708ms 315.3828 Ops/s 298.6810 Ops/s $\textbf{\color{#35bf28}+5.59\%}$
test_permute 0.2781ms 0.2141ms 4.6706 KOps/s 4.6114 KOps/s $\color{#35bf28}+1.28\%$
test_stack 27.9225ms 27.4800ms 36.3901 Ops/s 36.5402 Ops/s $\color{#d91a1a}-0.41\%$
test_cat 27.4561ms 27.2316ms 36.7220 Ops/s 37.2747 Ops/s $\color{#d91a1a}-1.48\%$
