[Versioning] v0.6.1 #1072

Merged 1 commit on Nov 4, 2024

vmoens (Contributor) commented Nov 4, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 4, 2024
ghstack-source-id: a899c95c12a3b1b986ed429b6507711c4126189e
Pull Request resolved: #1072
@facebook-github-bot added the "CLA Signed" label Nov 4, 2024
@vmoens merged commit 2b117a9 into gh/vmoens/35/base Nov 4, 2024
11 of 19 checks passed
vmoens added a commit that referenced this pull request Nov 4, 2024
ghstack-source-id: a899c95c12a3b1b986ed429b6507711c4126189e
Pull Request resolved: #1072
@vmoens deleted the gh/vmoens/35/head branch November 4, 2024 13:13
vmoens added a commit that referenced this pull request Nov 4, 2024
ghstack-source-id: a899c95c12a3b1b986ed429b6507711c4126189e
Pull Request resolved: #1072

(cherry picked from commit f12d31d)

github-actions bot commented Nov 4, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}33$. Worsened: $\large\color{#d91a1a}11$.
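
The Improved and Worsened counts appear to match the rows whose Change cell is bolded in the CPU table that follows: 33 bolded gains and 11 bolded losses, each larger than 5% in magnitude, with no unbolded row reaching 5%. The report does not state this rule, so the 5% cutoff in the sketch below is an inference from the rows, and the names `THRESHOLD_PCT`, `parse_change` and `classify` are hypothetical.

```python
import re

# Assumed cutoff, inferred from which rows are bolded; not stated in the report.
THRESHOLD_PCT = 5.0

# Matches the signed percentage in a Change cell, e.g. "+10.50\%" or "-22.07\%".
CHANGE_RE = re.compile(r"([+-]\d+(?:\.\d+)?)\\%")


def parse_change(row: str) -> float:
    """Return the signed percentage found in the Change cell of a table row."""
    match = CHANGE_RE.search(row)
    if match is None:
        raise ValueError(f"no Change cell found in: {row!r}")
    return float(match.group(1))


def classify(change_pct: float, threshold: float = THRESHOLD_PCT) -> str:
    """Label a change as improved, worsened, or unchanged under the assumed cutoff."""
    if change_pct >= threshold:
        return "improved"
    if change_pct <= -threshold:
        return "worsened"
    return "unchanged"


# Three rows copied from the CPU table.
rows = [
    r"test_plain_set_nested 40.6250μs 17.1584μs 58.2806 KOps/s 52.7423 KOps/s $\textbf{\color{#35bf28}+10.50\%}$",
    r"test_items 23.6240μs 4.2024μs 237.9592 KOps/s 233.3797 KOps/s $\color{#35bf28}+1.96\%$",
    r"test_split_td 0.1216s 56.9765μs 17.5511 KOps/s 22.5205 KOps/s $\textbf{\color{#d91a1a}-22.07\%}$",
]

labels = [classify(parse_change(row)) for row in rows]
print(labels)
```

Under this reading, a row such as `test_items` (+1.96%) changes colour but is not counted in either total.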

Name | Max | Mean | Ops | Ops on Repo HEAD | Change
test_plain_set_nested 40.6250μs 17.1584μs 58.2806 KOps/s 52.7423 KOps/s $\textbf{\color{#35bf28}+10.50\%}$
test_plain_set_stack_nested 65.5120μs 17.3276μs 57.7114 KOps/s 52.4996 KOps/s $\textbf{\color{#35bf28}+9.93\%}$
test_plain_set_nested_inplace 48.1890μs 18.8747μs 52.9811 KOps/s 48.8148 KOps/s $\textbf{\color{#35bf28}+8.53\%}$
test_plain_set_stack_nested_inplace 78.9160μs 19.0455μs 52.5057 KOps/s 48.8696 KOps/s $\textbf{\color{#35bf28}+7.44\%}$
test_items 23.6240μs 4.2024μs 237.9592 KOps/s 233.3797 KOps/s $\color{#35bf28}+1.96\%$
test_items_nested 0.7047ms 0.3432ms 2.9137 KOps/s 2.9177 KOps/s $\color{#d91a1a}-0.14\%$
test_items_nested_locked 0.6391ms 0.3454ms 2.8956 KOps/s 2.9501 KOps/s $\color{#d91a1a}-1.85\%$
test_items_nested_leaf 0.1392ms 72.3845μs 13.8151 KOps/s 13.6558 KOps/s $\color{#35bf28}+1.17\%$
test_items_stack_nested 0.7142ms 0.3474ms 2.8789 KOps/s 2.9224 KOps/s $\color{#d91a1a}-1.49\%$
test_items_stack_nested_leaf 0.1351ms 76.4830μs 13.0748 KOps/s 13.2787 KOps/s $\color{#d91a1a}-1.54\%$
test_items_stack_nested_locked 0.6404ms 0.3491ms 2.8644 KOps/s 2.8989 KOps/s $\color{#d91a1a}-1.19\%$
test_keys 20.8090μs 3.5548μs 281.3118 KOps/s 277.5179 KOps/s $\color{#35bf28}+1.37\%$
test_keys_nested 0.2958ms 0.1361ms 7.3454 KOps/s 7.1907 KOps/s $\color{#35bf28}+2.15\%$
test_keys_nested_locked 2.1816ms 0.1414ms 7.0706 KOps/s 7.0014 KOps/s $\color{#35bf28}+0.99\%$
test_keys_nested_leaf 0.1982ms 0.1171ms 8.5379 KOps/s 8.6437 KOps/s $\color{#d91a1a}-1.22\%$
test_keys_stack_nested 0.2322ms 0.1348ms 7.4195 KOps/s 7.3370 KOps/s $\color{#35bf28}+1.12\%$
test_keys_stack_nested_leaf 0.1925ms 0.1157ms 8.6451 KOps/s 8.6175 KOps/s $\color{#35bf28}+0.32\%$
test_keys_stack_nested_locked 0.2383ms 0.1412ms 7.0803 KOps/s 7.0554 KOps/s $\color{#35bf28}+0.35\%$
test_values 9.6624μs 1.0337μs 967.4235 KOps/s 964.4775 KOps/s $\color{#35bf28}+0.31\%$
test_values_nested 0.1324ms 55.8405μs 17.9082 KOps/s 17.9561 KOps/s $\color{#d91a1a}-0.27\%$
test_values_nested_locked 0.1077ms 55.6803μs 17.9597 KOps/s 18.1171 KOps/s $\color{#d91a1a}-0.87\%$
test_values_nested_leaf 0.1206ms 60.7518μs 16.4604 KOps/s 15.9033 KOps/s $\color{#35bf28}+3.50\%$
test_values_stack_nested 0.1166ms 57.3568μs 17.4347 KOps/s 16.2986 KOps/s $\textbf{\color{#35bf28}+6.97\%}$
test_values_stack_nested_leaf 0.1178ms 60.1625μs 16.6216 KOps/s 16.6550 KOps/s $\color{#d91a1a}-0.20\%$
test_values_stack_nested_locked 0.1599ms 57.3552μs 17.4352 KOps/s 17.4837 KOps/s $\color{#d91a1a}-0.28\%$
test_membership 13.7960μs 0.9155μs 1.0923 MOps/s 1.1161 MOps/s $\color{#d91a1a}-2.13\%$
test_membership_nested 54.3720μs 2.8734μs 348.0249 KOps/s 365.8537 KOps/s $\color{#d91a1a}-4.87\%$
test_membership_nested_leaf 67.8160μs 2.8163μs 355.0761 KOps/s 363.0575 KOps/s $\color{#d91a1a}-2.20\%$
test_membership_stacked_nested 47.6190μs 2.7971μs 357.5142 KOps/s 367.6419 KOps/s $\color{#d91a1a}-2.75\%$
test_membership_stacked_nested_leaf 17.9730μs 2.8167μs 355.0203 KOps/s 364.3280 KOps/s $\color{#d91a1a}-2.55\%$
test_membership_nested_last 79.2980μs 4.2254μs 236.6658 KOps/s 242.2528 KOps/s $\color{#d91a1a}-2.31\%$
test_membership_nested_leaf_last 50.7950μs 4.1805μs 239.2070 KOps/s 240.8631 KOps/s $\color{#d91a1a}-0.69\%$
test_membership_stacked_nested_last 23.0330μs 4.1402μs 241.5324 KOps/s 161.8911 KOps/s $\textbf{\color{#35bf28}+49.19\%}$
test_membership_stacked_nested_leaf_last 26.9400μs 4.0979μs 244.0268 KOps/s 163.2636 KOps/s $\textbf{\color{#35bf28}+49.47\%}$
test_nested_getleaf 41.4870μs 10.9034μs 91.7142 KOps/s 94.6388 KOps/s $\color{#d91a1a}-3.09\%$
test_nested_get 54.8020μs 10.3378μs 96.7327 KOps/s 100.1605 KOps/s $\color{#d91a1a}-3.42\%$
test_stacked_getleaf 53.8300μs 10.8910μs 91.8190 KOps/s 94.5992 KOps/s $\color{#d91a1a}-2.94\%$
test_stacked_get 95.7980μs 10.3595μs 96.5293 KOps/s 100.4727 KOps/s $\color{#d91a1a}-3.92\%$
test_nested_getitemleaf 64.7920μs 11.7518μs 85.0932 KOps/s 90.5349 KOps/s $\textbf{\color{#d91a1a}-6.01\%}$
test_nested_getitem 53.0490μs 10.4579μs 95.6215 KOps/s 96.6036 KOps/s $\color{#d91a1a}-1.02\%$
test_stacked_getitemleaf 30.8580μs 11.1400μs 89.7668 KOps/s 91.2541 KOps/s $\color{#d91a1a}-1.63\%$
test_stacked_getitem 64.3390μs 10.5242μs 95.0188 KOps/s 97.6026 KOps/s $\color{#d91a1a}-2.65\%$
test_lock_nested 4.5521ms 0.4569ms 2.1888 KOps/s 2.1909 KOps/s $\color{#d91a1a}-0.10\%$
test_lock_stack_nested 0.6290ms 0.4118ms 2.4285 KOps/s 2.4140 KOps/s $\color{#35bf28}+0.60\%$
test_unlock_nested 0.9928ms 0.3764ms 2.6569 KOps/s 2.6577 KOps/s $\color{#d91a1a}-0.03\%$
test_unlock_stack_nested 0.5151ms 0.3324ms 3.0082 KOps/s 2.9950 KOps/s $\color{#35bf28}+0.44\%$
test_flatten_speed 0.1661ms 93.0623μs 10.7455 KOps/s 10.6870 KOps/s $\color{#35bf28}+0.55\%$
test_unflatten_speed 0.7848ms 0.4652ms 2.1497 KOps/s 2.1563 KOps/s $\color{#d91a1a}-0.30\%$
test_common_ops 4.1601ms 0.7780ms 1.2853 KOps/s 1.1893 KOps/s $\textbf{\color{#35bf28}+8.07\%}$
test_creation 40.1520μs 2.1016μs 475.8272 KOps/s 465.9196 KOps/s $\color{#35bf28}+2.13\%$
test_creation_empty 45.6350μs 10.1683μs 98.3449 KOps/s 73.8043 KOps/s $\textbf{\color{#35bf28}+33.25\%}$
test_creation_nested_1 50.0330μs 12.7488μs 78.4388 KOps/s 60.9607 KOps/s $\textbf{\color{#35bf28}+28.67\%}$
test_creation_nested_2 54.1710μs 17.0649μs 58.5997 KOps/s 48.0597 KOps/s $\textbf{\color{#35bf28}+21.93\%}$
test_clone 0.1666ms 13.2899μs 75.2453 KOps/s 74.3197 KOps/s $\color{#35bf28}+1.25\%$
test_getitem[int] 0.9796ms 12.7452μs 78.4612 KOps/s 77.9628 KOps/s $\color{#35bf28}+0.64\%$
test_getitem[slice_int] 0.1634ms 25.1628μs 39.7412 KOps/s 40.1837 KOps/s $\color{#d91a1a}-1.10\%$
test_getitem[range] 0.3265ms 50.1391μs 19.9445 KOps/s 20.4085 KOps/s $\color{#d91a1a}-2.27\%$
test_getitem[tuple] 0.1618ms 20.2745μs 49.3229 KOps/s 48.9412 KOps/s $\color{#35bf28}+0.78\%$
test_getitem[list] 0.4190ms 46.4827μs 21.5134 KOps/s 22.2251 KOps/s $\color{#d91a1a}-3.20\%$
test_setitem_dim[int] 54.8620μs 26.0368μs 38.4071 KOps/s 39.1376 KOps/s $\color{#d91a1a}-1.87\%$
test_setitem_dim[slice_int] 0.1376ms 54.6514μs 18.2978 KOps/s 19.4438 KOps/s $\textbf{\color{#d91a1a}-5.89\%}$
test_setitem_dim[range] 0.1287ms 76.8670μs 13.0095 KOps/s 13.3370 KOps/s $\color{#d91a1a}-2.46\%$
test_setitem_dim[tuple] 93.8050μs 41.8070μs 23.9194 KOps/s 24.5067 KOps/s $\color{#d91a1a}-2.40\%$
test_setitem 0.1742ms 19.9506μs 50.1238 KOps/s 46.1868 KOps/s $\textbf{\color{#35bf28}+8.52\%}$
test_set 0.1693ms 19.1202μs 52.3008 KOps/s 47.6706 KOps/s $\textbf{\color{#35bf28}+9.71\%}$
test_set_shared 2.4304ms 0.1779ms 5.6216 KOps/s 5.7713 KOps/s $\color{#d91a1a}-2.59\%$
test_update 0.1907ms 21.6371μs 46.2170 KOps/s 40.1972 KOps/s $\textbf{\color{#35bf28}+14.98\%}$
test_update_nested 0.2037ms 32.0851μs 31.1671 KOps/s 28.8216 KOps/s $\textbf{\color{#35bf28}+8.14\%}$
test_update__nested 1.0777ms 33.2096μs 30.1118 KOps/s 30.9739 KOps/s $\color{#d91a1a}-2.78\%$
test_set_nested 0.1950ms 21.7496μs 45.9779 KOps/s 43.5638 KOps/s $\textbf{\color{#35bf28}+5.54\%}$
test_set_nested_new 0.2045ms 26.3838μs 37.9021 KOps/s 36.2456 KOps/s $\color{#35bf28}+4.57\%$
test_select 0.2191ms 42.1569μs 23.7209 KOps/s 22.3241 KOps/s $\textbf{\color{#35bf28}+6.26\%}$
test_select_nested 0.1321ms 60.2389μs 16.6006 KOps/s 16.7526 KOps/s $\color{#d91a1a}-0.91\%$
test_exclude_nested 0.3692ms 75.0010μs 13.3331 KOps/s 13.4738 KOps/s $\color{#d91a1a}-1.04\%$
test_empty[True] 0.6963ms 0.3487ms 2.8680 KOps/s 2.8529 KOps/s $\color{#35bf28}+0.53\%$
test_empty[False] 12.1025μs 1.2943μs 772.6311 KOps/s 828.5754 KOps/s $\textbf{\color{#d91a1a}-6.75\%}$
test_unbind_speed 0.5738ms 0.2679ms 3.7326 KOps/s 3.8242 KOps/s $\color{#d91a1a}-2.39\%$
test_unbind_speed_stack0 0.4032ms 0.2575ms 3.8840 KOps/s 3.8936 KOps/s $\color{#d91a1a}-0.25\%$
test_unbind_speed_stack1 0.1156s 0.7802ms 1.2816 KOps/s 1.4130 KOps/s $\textbf{\color{#d91a1a}-9.30\%}$
test_split 0.1111s 1.7632ms 567.1527 Ops/s 552.4067 Ops/s $\color{#35bf28}+2.67\%$
test_chunk 1.7568ms 1.6016ms 624.3837 Ops/s 552.8162 Ops/s $\textbf{\color{#35bf28}+12.95\%}$
test_consolidate_njt[False-None] 0.1243s 9.2964ms 107.5680 Ops/s 114.8861 Ops/s $\textbf{\color{#d91a1a}-6.37\%}$
test_creation[device0] 0.2862ms 92.6147μs 10.7974 KOps/s 10.7795 KOps/s $\color{#35bf28}+0.17\%$
test_creation_from_tensor 4.8996ms 96.6267μs 10.3491 KOps/s 10.2410 KOps/s $\color{#35bf28}+1.06\%$
test_add_one[memmap_tensor0] 0.2568ms 4.9067μs 203.8023 KOps/s 207.0933 KOps/s $\color{#d91a1a}-1.59\%$
test_contiguous[memmap_tensor0] 12.8040μs 0.5011μs 1.9955 MOps/s 1.9848 MOps/s $\color{#35bf28}+0.54\%$
test_stack[memmap_tensor0] 73.9080μs 3.4097μs 293.2849 KOps/s 288.1371 KOps/s $\color{#35bf28}+1.79\%$
test_memmaptd_index 1.2916ms 0.2445ms 4.0900 KOps/s 4.1464 KOps/s $\color{#d91a1a}-1.36\%$
test_memmaptd_index_astensor 0.6889ms 0.3225ms 3.1011 KOps/s 3.1202 KOps/s $\color{#d91a1a}-0.61\%$
test_memmaptd_index_op 1.1671ms 0.5781ms 1.7299 KOps/s 1.5815 KOps/s $\textbf{\color{#35bf28}+9.38\%}$
test_serialize_model 0.1244s 0.1201s 8.3248 Ops/s 8.4042 Ops/s $\color{#d91a1a}-0.94\%$
test_serialize_model_pickle 0.4460s 0.3986s 2.5089 Ops/s 2.4888 Ops/s $\color{#35bf28}+0.81\%$
test_serialize_weights 0.2371s 0.1348s 7.4207 Ops/s 7.2356 Ops/s $\color{#35bf28}+2.56\%$
test_serialize_weights_returnearly 0.3144s 0.1799s 5.5579 Ops/s 6.2258 Ops/s $\textbf{\color{#d91a1a}-10.73\%}$
test_serialize_weights_pickle 0.4568s 0.4048s 2.4705 Ops/s 2.4051 Ops/s $\color{#35bf28}+2.72\%$
test_serialize_weights_filesystem 0.1497s 0.1467s 6.8185 Ops/s 6.6930 Ops/s $\color{#35bf28}+1.87\%$
test_serialize_model_filesystem 0.1624s 0.1533s 6.5236 Ops/s 5.6051 Ops/s $\textbf{\color{#35bf28}+16.39\%}$
test_reshape_pytree 67.4750μs 26.5126μs 37.7179 KOps/s 36.8825 KOps/s $\color{#35bf28}+2.27\%$
test_reshape_td 89.0060μs 32.2845μs 30.9746 KOps/s 30.6930 KOps/s $\color{#35bf28}+0.92\%$
test_view_pytree 69.0090μs 26.4607μs 37.7919 KOps/s 36.8940 KOps/s $\color{#35bf28}+2.43\%$
test_view_td 0.1091ms 37.4728μs 26.6860 KOps/s 26.8158 KOps/s $\color{#d91a1a}-0.48\%$
test_unbind_pytree 75.8510μs 29.9130μs 33.4302 KOps/s 33.1698 KOps/s $\color{#35bf28}+0.79\%$
test_unbind_td 0.3455ms 38.5605μs 25.9333 KOps/s 26.0113 KOps/s $\color{#d91a1a}-0.30\%$
test_split_pytree 0.1061ms 29.7535μs 33.6095 KOps/s 33.8923 KOps/s $\color{#d91a1a}-0.83\%$
test_split_td 0.1216s 56.9765μs 17.5511 KOps/s 22.5205 KOps/s $\textbf{\color{#d91a1a}-22.07\%}$
test_add_pytree 0.1100ms 36.4329μs 27.4477 KOps/s 27.3969 KOps/s $\color{#35bf28}+0.19\%$
test_add_td 0.1704ms 52.8631μs 18.9168 KOps/s 15.9509 KOps/s $\textbf{\color{#35bf28}+18.59\%}$
test_compile_add_one_nested[tensordict-compile] 0.1501ms 68.3839μs 14.6233 KOps/s 15.5390 KOps/s $\textbf{\color{#d91a1a}-5.89\%}$
test_compile_add_one_nested[tensordict-eager] 0.4191ms 0.1634ms 6.1212 KOps/s 6.1853 KOps/s $\color{#d91a1a}-1.04\%$
test_compile_add_one_nested[pytree-compile] 0.1307ms 47.5273μs 21.0406 KOps/s 21.0838 KOps/s $\color{#d91a1a}-0.21\%$
test_compile_add_one_nested[pytree-eager] 0.2315ms 0.1188ms 8.4183 KOps/s 8.3454 KOps/s $\color{#35bf28}+0.87\%$
test_compile_copy_nested[tensordict-compile] 88.1330μs 26.6512μs 37.5217 KOps/s 37.7653 KOps/s $\color{#d91a1a}-0.65\%$
test_compile_copy_nested[tensordict-eager] 0.1220ms 54.0378μs 18.5056 KOps/s 18.4063 KOps/s $\color{#35bf28}+0.54\%$
test_compile_copy_nested[pytree-compile] 0.1486ms 78.8938μs 12.6753 KOps/s 12.5503 KOps/s $\color{#35bf28}+1.00\%$
test_compile_copy_nested[pytree-eager] 0.1553ms 68.4044μs 14.6190 KOps/s 14.5761 KOps/s $\color{#35bf28}+0.29\%$
test_compile_add_one_flat[tensordict-compile] 0.1949ms 0.1079ms 9.2703 KOps/s 9.3163 KOps/s $\color{#d91a1a}-0.49\%$
test_compile_add_one_flat[tensordict-eager] 0.4169ms 0.2017ms 4.9580 KOps/s 5.0601 KOps/s $\color{#d91a1a}-2.02\%$
test_compile_add_one_flat[tensorclass-compile] 0.1304ms 47.4954μs 21.0547 KOps/s 21.8301 KOps/s $\color{#d91a1a}-3.55\%$
test_compile_add_one_flat[tensorclass-eager] 0.5040ms 63.4874μs 15.7512 KOps/s 16.1348 KOps/s $\color{#d91a1a}-2.38\%$
test_compile_add_one_flat[pytree-compile] 0.2084ms 0.1061ms 9.4276 KOps/s 9.5870 KOps/s $\color{#d91a1a}-1.66\%$
test_compile_add_one_flat[pytree-eager] 0.3590ms 0.1981ms 5.0482 KOps/s 4.8217 KOps/s $\color{#35bf28}+4.70\%$
test_compile_add_self_flat[tensordict-eager] 0.4241ms 0.2134ms 4.6871 KOps/s 4.7379 KOps/s $\color{#d91a1a}-1.07\%$
test_compile_add_self_flat[tensordict-compile] 0.2283ms 0.1090ms 9.1771 KOps/s 9.3286 KOps/s $\color{#d91a1a}-1.62\%$
test_compile_add_self_flat[tensorclass-eager] 0.2071ms 56.0511μs 17.8409 KOps/s 18.3696 KOps/s $\color{#d91a1a}-2.88\%$
test_compile_add_self_flat[tensorclass-compile] 0.1221ms 49.3955μs 20.2448 KOps/s 21.5297 KOps/s $\textbf{\color{#d91a1a}-5.97\%}$
test_compile_add_self_flat[pytree-eager] 0.3196ms 0.1612ms 6.2044 KOps/s 6.1515 KOps/s $\color{#35bf28}+0.86\%$
test_compile_add_self_flat[pytree-compile] 6.2805ms 0.1083ms 9.2355 KOps/s 9.6845 KOps/s $\color{#d91a1a}-4.64\%$
test_compile_copy_flat[tensordict-compile] 72.4950μs 21.3822μs 46.7679 KOps/s 46.8468 KOps/s $\color{#d91a1a}-0.17\%$
test_compile_copy_flat[tensordict-eager] 0.1245ms 58.0931μs 17.2137 KOps/s 16.4221 KOps/s $\color{#35bf28}+4.82\%$
test_compile_copy_flat[pytree-compile] 0.1601ms 81.3600μs 12.2910 KOps/s 11.8786 KOps/s $\color{#35bf28}+3.47\%$
test_compile_copy_flat[pytree-eager] 0.1779ms 68.5043μs 14.5976 KOps/s 14.1987 KOps/s $\color{#35bf28}+2.81\%$
test_compile_assign_and_add[tensordict-compile] 0.3435ms 0.2123ms 4.7093 KOps/s 4.7466 KOps/s $\color{#d91a1a}-0.79\%$
test_compile_assign_and_add[tensordict-eager] 1.6355ms 1.3037ms 767.0606 Ops/s 755.3999 Ops/s $\color{#35bf28}+1.54\%$
test_compile_assign_and_add[pytree-compile] 0.3171ms 0.2094ms 4.7763 KOps/s 4.8708 KOps/s $\color{#d91a1a}-1.94\%$
test_compile_assign_and_add[pytree-eager] 1.2232ms 0.7847ms 1.2743 KOps/s 1.2552 KOps/s $\color{#35bf28}+1.52\%$
test_compile_assign_and_add_stack[compile] 0.7043ms 0.4770ms 2.0964 KOps/s 2.1765 KOps/s $\color{#d91a1a}-3.68\%$
test_compile_assign_and_add_stack[eager] 3.3333ms 2.6155ms 382.3375 Ops/s 253.0705 Ops/s $\textbf{\color{#35bf28}+51.08\%}$
test_compile_indexing[tensor-tensordict-compile] 99.4250μs 37.8404μs 26.4268 KOps/s 26.5267 KOps/s $\color{#d91a1a}-0.38\%$
test_compile_indexing[tensor-tensordict-eager] 0.6294ms 32.1587μs 31.0958 KOps/s 29.0060 KOps/s $\textbf{\color{#35bf28}+7.20\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.1079ms 30.3884μs 32.9073 KOps/s 32.4289 KOps/s $\color{#35bf28}+1.48\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1843ms 23.5482μs 42.4661 KOps/s 43.2588 KOps/s $\color{#d91a1a}-1.83\%$
test_compile_indexing[tensor-pytree-compile] 76.6920μs 30.9327μs 32.3283 KOps/s 31.5490 KOps/s $\color{#35bf28}+2.47\%$
test_compile_indexing[tensor-pytree-eager] 73.8280μs 23.0981μs 43.2936 KOps/s 42.5197 KOps/s $\color{#35bf28}+1.82\%$
test_compile_indexing[slice-tensordict-compile] 0.2458ms 54.0521μs 18.5007 KOps/s 18.7416 KOps/s $\color{#d91a1a}-1.29\%$
test_compile_indexing[slice-tensordict-eager] 0.4639ms 19.4130μs 51.5119 KOps/s 47.9488 KOps/s $\textbf{\color{#35bf28}+7.43\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1188ms 46.0924μs 21.6955 KOps/s 21.8231 KOps/s $\color{#d91a1a}-0.58\%$
test_compile_indexing[slice-tensorclass-eager] 74.8690μs 18.4250μs 54.2742 KOps/s 51.8297 KOps/s $\color{#35bf28}+4.72\%$
test_compile_indexing[slice-pytree-compile] 0.1260ms 46.5914μs 21.4632 KOps/s 21.3253 KOps/s $\color{#35bf28}+0.65\%$
test_compile_indexing[slice-pytree-eager] 60.6230μs 18.5246μs 53.9823 KOps/s 52.6876 KOps/s $\color{#35bf28}+2.46\%$
test_compile_indexing[int-tensordict-compile] 0.2626ms 55.6508μs 17.9692 KOps/s 18.3535 KOps/s $\color{#d91a1a}-2.09\%$
test_compile_indexing[int-tensordict-eager] 0.8659ms 19.4148μs 51.5070 KOps/s 50.2521 KOps/s $\color{#35bf28}+2.50\%$
test_compile_indexing[int-tensorclass-compile] 0.1936ms 46.6390μs 21.4413 KOps/s 21.5567 KOps/s $\color{#d91a1a}-0.54\%$
test_compile_indexing[int-tensorclass-eager] 62.5160μs 18.6534μs 53.6096 KOps/s 53.2360 KOps/s $\color{#35bf28}+0.70\%$
test_compile_indexing[int-pytree-compile] 0.1341ms 46.7815μs 21.3760 KOps/s 21.5175 KOps/s $\color{#d91a1a}-0.66\%$
test_compile_indexing[int-pytree-eager] 82.6140μs 18.5584μs 53.8841 KOps/s 53.3345 KOps/s $\color{#35bf28}+1.03\%$
test_mod_add[eager] 0.1031ms 25.7915μs 38.7725 KOps/s 35.4727 KOps/s $\textbf{\color{#35bf28}+9.30\%}$
test_mod_add[compile] 0.1921ms 47.5040μs 21.0508 KOps/s 21.6099 KOps/s $\color{#d91a1a}-2.59\%$
test_mod_add[compile-overhead] 0.1156ms 47.9819μs 20.8412 KOps/s 21.4610 KOps/s $\color{#d91a1a}-2.89\%$
test_mod_wrap[eager] 0.3990ms 0.2263ms 4.4181 KOps/s 4.3700 KOps/s $\color{#35bf28}+1.10\%$
test_mod_wrap[compile] 2.0455ms 0.2106ms 4.7476 KOps/s 4.7182 KOps/s $\color{#35bf28}+0.62\%$
test_mod_wrap[compile-overhead] 2.3272ms 0.2096ms 4.7710 KOps/s 4.7823 KOps/s $\color{#d91a1a}-0.24\%$
test_mod_wrap_and_backward[eager] 21.8281ms 12.5535ms 79.6588 Ops/s 82.6943 Ops/s $\color{#d91a1a}-3.67\%$
test_mod_wrap_and_backward[compile] 16.9114ms 13.2396ms 75.5308 Ops/s 77.5451 Ops/s $\color{#d91a1a}-2.60\%$
test_mod_wrap_and_backward[compile-overhead] 17.0945ms 13.5140ms 73.9976 Ops/s 73.7779 Ops/s $\color{#35bf28}+0.30\%$
test_seq_add[eager] 0.1843ms 91.7857μs 10.8949 KOps/s 10.1461 KOps/s $\textbf{\color{#35bf28}+7.38\%}$
test_seq_add[compile] 0.1291ms 64.9900μs 15.3870 KOps/s 16.0402 KOps/s $\color{#d91a1a}-4.07\%$
test_seq_add[compile-overhead] 0.1448ms 62.2994μs 16.0515 KOps/s 16.7222 KOps/s $\color{#d91a1a}-4.01\%$
test_seq_wrap[eager] 0.6058ms 0.4029ms 2.4819 KOps/s 2.3884 KOps/s $\color{#35bf28}+3.92\%$
test_seq_wrap[compile] 0.4566ms 0.2357ms 4.2436 KOps/s 4.2388 KOps/s $\color{#35bf28}+0.11\%$
test_seq_wrap[compile-overhead] 0.4703ms 0.2365ms 4.2276 KOps/s 4.2927 KOps/s $\color{#d91a1a}-1.52\%$
test_func_call_runtime[False-eager] 1.1101ms 0.5941ms 1.6834 KOps/s 1.7177 KOps/s $\color{#d91a1a}-2.00\%$
test_func_call_runtime[False-compile] 0.7037ms 0.4371ms 2.2876 KOps/s 2.2660 KOps/s $\color{#35bf28}+0.95\%$
test_func_call_runtime[False-compile-overhead] 0.8793ms 0.4398ms 2.2738 KOps/s 2.3026 KOps/s $\color{#d91a1a}-1.25\%$
test_func_call_runtime[True-eager] 1.1932ms 0.8033ms 1.2449 KOps/s 1.2610 KOps/s $\color{#d91a1a}-1.28\%$
test_func_call_runtime[True-compile] 0.6779ms 0.4788ms 2.0884 KOps/s 2.1045 KOps/s $\color{#d91a1a}-0.76\%$
test_func_call_runtime[True-compile-overhead] 0.6820ms 0.4800ms 2.0832 KOps/s 2.1143 KOps/s $\color{#d91a1a}-1.47\%$
test_func_call_cm_runtime[False-eager] 1.0650ms 0.5815ms 1.7196 KOps/s 1.7456 KOps/s $\color{#d91a1a}-1.49\%$
test_func_call_cm_runtime[False-compile] 0.8856ms 0.4355ms 2.2960 KOps/s 2.3124 KOps/s $\color{#d91a1a}-0.71\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5889ms 0.4376ms 2.2854 KOps/s 2.2878 KOps/s $\color{#d91a1a}-0.11\%$
test_func_call_cm_runtime[True-eager] 1.4969ms 0.9417ms 1.0619 KOps/s 1.0790 KOps/s $\color{#d91a1a}-1.58\%$
test_func_call_cm_runtime[True-compile] 0.7875ms 0.5084ms 1.9668 KOps/s 1.9948 KOps/s $\color{#d91a1a}-1.41\%$
test_func_call_cm_runtime[True-compile-overhead] 0.7033ms 0.5032ms 1.9874 KOps/s 1.9979 KOps/s $\color{#d91a1a}-0.53\%$
test_vmap_func_call_cm_runtime[eager] 2.6863ms 1.9845ms 503.9135 Ops/s 505.2186 Ops/s $\color{#d91a1a}-0.26\%$
test_vmap_func_call_cm_runtime[compile] 1.0814ms 0.5390ms 1.8551 KOps/s 1.8856 KOps/s $\color{#d91a1a}-1.62\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.7173ms 0.5360ms 1.8655 KOps/s 1.8482 KOps/s $\color{#35bf28}+0.94\%$
test_distributed 0.4918ms 0.1268ms 7.8890 KOps/s 7.6506 KOps/s $\color{#35bf28}+3.12\%$
test_tdmodule 72.1840μs 20.0631μs 49.8427 KOps/s 47.6101 KOps/s $\color{#35bf28}+4.69\%$
test_tdmodule_dispatch 0.1061ms 35.4651μs 28.1967 KOps/s 24.1775 KOps/s $\textbf{\color{#35bf28}+16.62\%}$
test_tdseq 41.6580μs 21.2308μs 47.1013 KOps/s 41.5570 KOps/s $\textbf{\color{#35bf28}+13.34\%}$
test_tdseq_dispatch 86.1810μs 42.6418μs 23.4512 KOps/s 21.2705 KOps/s $\textbf{\color{#35bf28}+10.25\%}$
test_instantiation_functorch 1.9008ms 1.5620ms 640.2108 Ops/s 639.6578 Ops/s $\color{#35bf28}+0.09\%$
test_exec_functorch 0.3375ms 0.1861ms 5.3739 KOps/s 5.4799 KOps/s $\color{#d91a1a}-1.94\%$
test_exec_functional_call 0.4538ms 0.1844ms 5.4240 KOps/s 5.8038 KOps/s $\textbf{\color{#d91a1a}-6.54\%}$
test_exec_td_decorator 0.6317ms 0.2402ms 4.1635 KOps/s 4.4160 KOps/s $\textbf{\color{#d91a1a}-5.72\%}$
test_vmap_mlp_speed_decorator[True-True] 1.1257ms 0.6594ms 1.5166 KOps/s 1.5479 KOps/s $\color{#d91a1a}-2.02\%$
test_vmap_mlp_speed_decorator[True-False] 0.9760ms 0.6598ms 1.5155 KOps/s 1.5447 KOps/s $\color{#d91a1a}-1.89\%$
test_vmap_mlp_speed_decorator[False-True] 0.7972ms 0.5392ms 1.8546 KOps/s 1.9011 KOps/s $\color{#d91a1a}-2.45\%$
test_vmap_mlp_speed_decorator[False-False] 0.7777ms 0.5368ms 1.8628 KOps/s 1.9011 KOps/s $\color{#d91a1a}-2.01\%$
test_to_module_speed[True] 2.1121ms 1.3104ms 763.1264 Ops/s 761.2349 Ops/s $\color{#35bf28}+0.25\%$
test_to_module_speed[False] 2.0077ms 1.2643ms 790.9234 Ops/s 793.2066 Ops/s $\color{#d91a1a}-0.29\%$
test_tc_init 0.1241ms 43.0278μs 23.2408 KOps/s 20.4240 KOps/s $\textbf{\color{#35bf28}+13.79\%}$
test_tc_init_nested 0.1394ms 85.2975μs 11.7237 KOps/s 10.0357 KOps/s $\textbf{\color{#35bf28}+16.82\%}$
test_tc_first_layer_tensor 18.2340μs 1.5240μs 656.1827 KOps/s 658.4102 KOps/s $\color{#d91a1a}-0.34\%$
test_tc_first_layer_nontensor 33.5120μs 4.7058μs 212.5046 KOps/s 206.1823 KOps/s $\color{#35bf28}+3.07\%$
test_tc_second_layer_tensor 28.1020μs 2.8122μs 355.5945 KOps/s 361.3598 KOps/s $\color{#d91a1a}-1.60\%$
test_tc_second_layer_nontensor 79.3390μs 5.9992μs 166.6896 KOps/s 162.8665 KOps/s $\color{#35bf28}+2.35\%$
test_unbind 0.2523s 14.3922ms 69.4820 Ops/s 65.9153 Ops/s $\textbf{\color{#35bf28}+5.41\%}$
test_full_like 11.3466ms 9.1762ms 108.9776 Ops/s 103.3152 Ops/s $\textbf{\color{#35bf28}+5.48\%}$
test_zeros_like 5.6351ms 3.5556ms 281.2472 Ops/s 280.7879 Ops/s $\color{#35bf28}+0.16\%$
test_ones_like 4.9040ms 4.0107ms 249.3299 Ops/s 243.1663 Ops/s $\color{#35bf28}+2.53\%$
test_clone 9.0144ms 6.3598ms 157.2378 Ops/s 155.2748 Ops/s $\color{#35bf28}+1.26\%$
test_squeeze 69.5900μs 11.7043μs 85.4390 KOps/s 84.7411 KOps/s $\color{#35bf28}+0.82\%$
test_unsqueeze 0.1641ms 88.6437μs 11.2811 KOps/s 11.4003 KOps/s $\color{#d91a1a}-1.05\%$
test_split 0.5320ms 0.1929ms 5.1845 KOps/s 5.0797 KOps/s $\color{#35bf28}+2.06\%$
test_permute 0.4649ms 0.2299ms 4.3498 KOps/s 4.4731 KOps/s $\color{#d91a1a}-2.76\%$
test_stack 31.4410ms 27.7327ms 36.0585 Ops/s 35.7753 Ops/s $\color{#35bf28}+0.79\%$
test_cat 29.6466ms 27.3724ms 36.5331 Ops/s 35.7598 Ops/s $\color{#35bf28}+2.16\%$
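
The columns of the table above look related in a simple way: Ops is consistent with the reciprocal of Mean, and Change is consistent with the relative difference between Ops and Ops on Repo HEAD. The report does not spell out either formula, so the following is a minimal sketch under that assumption, checked against the `test_plain_set_nested` row; the function names are hypothetical.

```python
def ops_per_second(mean_seconds: float) -> float:
    """Throughput implied by a mean runtime (assumed: Ops = 1 / Mean)."""
    return 1.0 / mean_seconds


def percent_change(ops_pr: float, ops_head: float) -> float:
    """Relative change of the PR throughput against repo HEAD, in percent (assumed formula)."""
    return (ops_pr - ops_head) / ops_head * 100.0


# Values from the test_plain_set_nested row of the CPU table.
mean_s = 17.1584e-6      # Mean: 17.1584 microseconds
ops_pr_kops = 58.2806    # Ops, in KOps/s
ops_head_kops = 52.7423  # Ops on Repo HEAD, in KOps/s

ops = ops_per_second(mean_s)
change = percent_change(ops_pr_kops, ops_head_kops)

print(f"{ops / 1e3:.2f} KOps/s, {change:+.2f}%")
```

Because Change is a ratio of two separately measured throughputs, small values are likely within run-to-run noise, which is one plausible reason only larger changes are bolded.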


github-actions bot commented Nov 4, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}7$. Worsened: $\large\color{#d91a1a}18$.

Name | Max | Mean | Ops | Ops on Repo HEAD | Change
test_plain_set_nested 36.5210μs 11.4003μs 87.7172 KOps/s 88.8863 KOps/s $\color{#d91a1a}-1.32\%$
test_plain_set_stack_nested 35.2510μs 11.5148μs 86.8444 KOps/s 88.7211 KOps/s $\color{#d91a1a}-2.12\%$
test_plain_set_nested_inplace 63.7910μs 12.2496μs 81.6350 KOps/s 82.5517 KOps/s $\color{#d91a1a}-1.11\%$
test_plain_set_stack_nested_inplace 0.3978ms 12.3076μs 81.2507 KOps/s 82.4431 KOps/s $\color{#d91a1a}-1.45\%$
test_items 25.7900μs 2.8640μs 349.1650 KOps/s 338.9081 KOps/s $\color{#35bf28}+3.03\%$
test_items_nested 0.6874ms 0.3206ms 3.1194 KOps/s 3.1229 KOps/s $\color{#d91a1a}-0.11\%$
test_items_nested_locked 0.6959ms 0.3246ms 3.0811 KOps/s 3.0871 KOps/s $\color{#d91a1a}-0.19\%$
test_items_nested_leaf 0.4341ms 58.5068μs 17.0920 KOps/s 17.1363 KOps/s $\color{#d91a1a}-0.26\%$
test_items_stack_nested 0.7034ms 0.3217ms 3.1085 KOps/s 3.1356 KOps/s $\color{#d91a1a}-0.86\%$
test_items_stack_nested_leaf 91.5820μs 58.3266μs 17.1448 KOps/s 17.1899 KOps/s $\color{#d91a1a}-0.26\%$
test_items_stack_nested_locked 0.6995ms 0.3222ms 3.1036 KOps/s 3.0902 KOps/s $\color{#35bf28}+0.43\%$
test_keys 0.3757ms 3.4712μs 288.0824 KOps/s 288.1452 KOps/s $\color{#d91a1a}-0.02\%$
test_keys_nested 0.4454ms 70.8322μs 14.1179 KOps/s 14.0773 KOps/s $\color{#35bf28}+0.29\%$
test_keys_nested_locked 2.5264ms 76.0468μs 13.1498 KOps/s 13.0434 KOps/s $\color{#35bf28}+0.82\%$
test_keys_nested_leaf 0.4306ms 62.0045μs 16.1279 KOps/s 16.0068 KOps/s $\color{#35bf28}+0.76\%$
test_keys_stack_nested 0.1073ms 70.9416μs 14.0961 KOps/s 13.9562 KOps/s $\color{#35bf28}+1.00\%$
test_keys_stack_nested_leaf 0.4317ms 61.6166μs 16.2294 KOps/s 16.0843 KOps/s $\color{#35bf28}+0.90\%$
test_keys_stack_nested_locked 0.4499ms 75.7353μs 13.2039 KOps/s 12.9058 KOps/s $\color{#35bf28}+2.31\%$
test_values 62.7178μs 0.8408μs 1.1894 MOps/s 1.1733 MOps/s $\color{#35bf28}+1.37\%$
test_values_nested 0.4191ms 31.1509μs 32.1018 KOps/s 31.8012 KOps/s $\color{#35bf28}+0.95\%$
test_values_nested_locked 0.4069ms 33.8426μs 29.5486 KOps/s 30.4959 KOps/s $\color{#d91a1a}-3.11\%$
test_values_nested_leaf 0.4251ms 33.8485μs 29.5434 KOps/s 29.4742 KOps/s $\color{#35bf28}+0.23\%$
test_values_stack_nested 64.2310μs 31.3530μs 31.8949 KOps/s 31.7744 KOps/s $\color{#35bf28}+0.38\%$
test_values_stack_nested_leaf 0.4221ms 34.0596μs 29.3603 KOps/s 29.1096 KOps/s $\color{#35bf28}+0.86\%$
test_values_stack_nested_locked 0.4076ms 34.1371μs 29.2936 KOps/s 30.2508 KOps/s $\color{#d91a1a}-3.16\%$
test_membership 19.0554μs 0.5115μs 1.9552 MOps/s 1.9640 MOps/s $\color{#d91a1a}-0.45\%$
test_membership_nested 0.1945ms 1.8853μs 530.4245 KOps/s 536.6182 KOps/s $\color{#d91a1a}-1.15\%$
test_membership_nested_leaf 16.2505μs 1.8829μs 531.0844 KOps/s 537.5625 KOps/s $\color{#d91a1a}-1.21\%$
test_membership_stacked_nested 32.6200μs 1.9561μs 511.2146 KOps/s 512.7883 KOps/s $\color{#d91a1a}-0.31\%$
test_membership_stacked_nested_leaf 28.5300μs 1.9600μs 510.2084 KOps/s 507.5000 KOps/s $\color{#35bf28}+0.53\%$
test_membership_nested_last 0.4005ms 2.7916μs 358.2169 KOps/s 360.0025 KOps/s $\color{#d91a1a}-0.50\%$
test_membership_nested_leaf_last 34.7310μs 2.7942μs 357.8849 KOps/s 357.9707 KOps/s $\color{#d91a1a}-0.02\%$
test_membership_stacked_nested_last 25.5010μs 2.8185μs 354.7966 KOps/s 359.0390 KOps/s $\color{#d91a1a}-1.18\%$
test_membership_stacked_nested_leaf_last 0.4034ms 2.7618μs 362.0853 KOps/s 358.2217 KOps/s $\color{#35bf28}+1.08\%$
test_nested_getleaf 38.8910μs 5.9779μs 167.2828 KOps/s 166.8331 KOps/s $\color{#35bf28}+0.27\%$
test_nested_get 46.4310μs 5.6920μs 175.6843 KOps/s 175.7490 KOps/s $\color{#d91a1a}-0.04\%$
test_stacked_getleaf 41.8300μs 5.9973μs 166.7414 KOps/s 167.3772 KOps/s $\color{#d91a1a}-0.38\%$
test_stacked_get 57.0310μs 5.6486μs 177.0352 KOps/s 175.5337 KOps/s $\color{#35bf28}+0.86\%$
test_nested_getitemleaf 30.4810μs 6.1045μs 163.8129 KOps/s 164.1673 KOps/s $\color{#d91a1a}-0.22\%$
test_nested_getitem 57.3810μs 5.7435μs 174.1087 KOps/s 172.4887 KOps/s $\color{#35bf28}+0.94\%$
test_stacked_getitemleaf 64.9510μs 6.0697μs 164.7540 KOps/s 164.2418 KOps/s $\color{#35bf28}+0.31\%$
test_stacked_getitem 32.0910μs 5.7492μs 173.9387 KOps/s 171.8004 KOps/s $\color{#35bf28}+1.24\%$
test_lock_nested 6.8105ms 0.3740ms 2.6740 KOps/s 2.7449 KOps/s $\color{#d91a1a}-2.59\%$
test_lock_stack_nested 0.4830ms 0.3365ms 2.9716 KOps/s 2.9971 KOps/s $\color{#d91a1a}-0.85\%$
test_unlock_nested 0.6676ms 0.3113ms 3.2125 KOps/s 3.3462 KOps/s $\color{#d91a1a}-4.00\%$
test_unlock_stack_nested 0.3361ms 0.2777ms 3.6008 KOps/s 3.6944 KOps/s $\color{#d91a1a}-2.54\%$
test_flatten_speed 0.4514ms 73.5657μs 13.5933 KOps/s 13.6404 KOps/s $\color{#d91a1a}-0.35\%$
test_unflatten_speed 0.6687ms 0.2942ms 3.3989 KOps/s 3.4419 KOps/s $\color{#d91a1a}-1.25\%$
test_common_ops 1.7210ms 0.6073ms 1.6467 KOps/s 1.6938 KOps/s $\color{#d91a1a}-2.78\%$
test_creation 0.1412ms 1.4795μs 675.8919 KOps/s 689.1809 KOps/s $\color{#d91a1a}-1.93\%$
test_creation_empty 41.7010μs 9.0186μs 110.8815 KOps/s 117.5131 KOps/s $\textbf{\color{#d91a1a}-5.64\%}$
test_creation_nested_1 39.7700μs 10.5829μs 94.4921 KOps/s 99.1936 KOps/s $\color{#d91a1a}-4.74\%$
test_creation_nested_2 0.3930ms 13.3258μs 75.0423 KOps/s 79.3159 KOps/s $\textbf{\color{#d91a1a}-5.39\%}$
test_clone 49.0110μs 9.9921μs 100.0787 KOps/s 99.5146 KOps/s $\color{#35bf28}+0.57\%$
test_getitem[int] 1.4992ms 10.4704μs 95.5074 KOps/s 95.8626 KOps/s $\color{#d91a1a}-0.37\%$
test_getitem[slice_int] 93.0076ms 30.4645μs 32.8250 KOps/s 47.5370 KOps/s $\textbf{\color{#d91a1a}-30.95\%}$
test_getitem[range] 0.1342ms 37.7214μs 26.5102 KOps/s 26.2012 KOps/s $\color{#35bf28}+1.18\%$
test_getitem[tuple] 0.1122ms 17.9184μs 55.8085 KOps/s 54.5995 KOps/s $\color{#35bf28}+2.21\%$
test_getitem[list] 0.4203ms 33.7833μs 29.6004 KOps/s 29.6793 KOps/s $\color{#d91a1a}-0.27\%$
test_setitem_dim[int] 39.3910μs 18.5588μs 53.8828 KOps/s 53.4682 KOps/s $\color{#35bf28}+0.78\%$
test_setitem_dim[slice_int] 64.7210μs 37.7845μs 26.4659 KOps/s 26.4479 KOps/s $\color{#35bf28}+0.07\%$
test_setitem_dim[range] 78.5510μs 53.9796μs 18.5255 KOps/s 18.5316 KOps/s $\color{#d91a1a}-0.03\%$
test_setitem_dim[tuple] 54.9320μs 32.1305μs 31.1230 KOps/s 31.1684 KOps/s $\color{#d91a1a}-0.15\%$
test_setitem 81.2410μs 15.4345μs 64.7901 KOps/s 68.5058 KOps/s $\textbf{\color{#d91a1a}-5.42\%}$
test_set 0.4017ms 14.8829μs 67.1912 KOps/s 70.9847 KOps/s $\textbf{\color{#d91a1a}-5.34\%}$
test_set_shared 1.6201ms 0.1462ms 6.8390 KOps/s 6.8211 KOps/s $\color{#35bf28}+0.26\%$
test_update 0.2505ms 17.9801μs 55.6169 KOps/s 56.8144 KOps/s $\color{#d91a1a}-2.11\%$
test_update_nested 0.4132ms 22.4025μs 44.6378 KOps/s 45.2886 KOps/s $\color{#d91a1a}-1.44\%$
test_update__nested 1.1490ms 24.3304μs 41.1008 KOps/s 43.7552 KOps/s $\textbf{\color{#d91a1a}-6.07\%}$
test_set_nested 85.3120μs 15.4645μs 64.6640 KOps/s 65.1606 KOps/s $\color{#d91a1a}-0.76\%$
test_set_nested_new 0.3987ms 17.8393μs 56.0560 KOps/s 55.7556 KOps/s $\color{#35bf28}+0.54\%$
test_select 91.1220μs 29.7398μs 33.6250 KOps/s 33.4284 KOps/s $\color{#35bf28}+0.59\%$
test_select_nested 0.4199ms 41.9564μs 23.8343 KOps/s 23.9190 KOps/s $\color{#d91a1a}-0.35\%$
test_exclude_nested 0.4371ms 59.9344μs 16.6849 KOps/s 16.8598 KOps/s $\color{#d91a1a}-1.04\%$
test_empty[True] 0.6347ms 0.2580ms 3.8767 KOps/s 3.9055 KOps/s $\color{#d91a1a}-0.74\%$
test_empty[False] 38.4587μs 0.7474μs 1.3380 MOps/s 1.3214 MOps/s $\color{#35bf28}+1.26\%$
test_to 85.7810μs 59.1624μs 16.9026 KOps/s 16.7493 KOps/s $\color{#35bf28}+0.92\%$
test_to_nonblocking 97.1820μs 53.8961μs 18.5542 KOps/s 19.2731 KOps/s $\color{#d91a1a}-3.73\%$
test_unbind_speed 0.6202ms 0.2328ms 4.2959 KOps/s 4.4062 KOps/s $\color{#d91a1a}-2.50\%$
test_unbind_speed_stack0 0.6199ms 0.2341ms 4.2715 KOps/s 4.4449 KOps/s $\color{#d91a1a}-3.90\%$
test_unbind_speed_stack1 93.6649ms 0.6554ms 1.5258 KOps/s 1.6866 KOps/s $\textbf{\color{#d91a1a}-9.53\%}$
test_split 94.2928ms 1.7478ms 572.1594 Ops/s 627.8630 Ops/s $\textbf{\color{#d91a1a}-8.87\%}$
test_chunk 1.8531ms 1.4823ms 674.6465 Ops/s 627.4971 Ops/s $\textbf{\color{#35bf28}+7.51\%}$
test_consolidate[False-None] 97.4791ms 2.9050ms 344.2297 Ops/s 343.0403 Ops/s $\color{#35bf28}+0.35\%$
test_consolidate[default-None] 1.7627ms 1.6660ms 600.2418 Ops/s 599.3785 Ops/s $\color{#35bf28}+0.14\%$
test_consolidate[reduce-overhead-None] 1.7661ms 1.6974ms 589.1345 Ops/s 589.7977 Ops/s $\color{#d91a1a}-0.11\%$
test_consolidate_njt[False-None] 6.8196ms 6.7035ms 149.1749 Ops/s 149.5605 Ops/s $\color{#d91a1a}-0.26\%$
test_to[False-False-None] 2.2710ms 2.1771ms 459.3249 Ops/s 426.3046 Ops/s $\textbf{\color{#35bf28}+7.75\%}$
test_to[True-False-None] 1.4123ms 1.3089ms 763.9938 Ops/s 763.7473 Ops/s $\color{#35bf28}+0.03\%$
test_to[within-False-None] 4.3429ms 4.0856ms 244.7626 Ops/s 245.1965 Ops/s $\color{#d91a1a}-0.18\%$
test_to[True-default-None] 5.3231ms 5.1152ms 195.4960 Ops/s 191.9982 Ops/s $\color{#35bf28}+1.82\%$
test_to_njt[False-False-None] 7.9012ms 7.7057ms 129.7733 Ops/s 130.3640 Ops/s $\color{#d91a1a}-0.45\%$
test_to_njt[True-False-None] 5.8083ms 5.6789ms 176.0906 Ops/s 181.0595 Ops/s $\color{#d91a1a}-2.74\%$
test_to_njt[within-False-None] 12.6167ms 12.3350ms 81.0703 Ops/s 80.5804 Ops/s $\color{#35bf28}+0.61\%$
test_creation[device0] 0.3736ms 78.5096μs 12.7373 KOps/s 12.5485 KOps/s $\color{#35bf28}+1.50\%$
test_creation_from_tensor 0.4947ms 82.9810μs 12.0509 KOps/s 11.9139 KOps/s $\color{#35bf28}+1.15\%$
test_add_one[memmap_tensor0] 0.2322ms 6.4945μs 153.9763 KOps/s 148.0090 KOps/s $\color{#35bf28}+4.03\%$
test_contiguous[memmap_tensor0] 1.8381μs 0.4268μs 2.3430 MOps/s 2.3423 MOps/s $\color{#35bf28}+0.03\%$
test_stack[memmap_tensor0] 26.9900μs 4.5408μs 220.2247 KOps/s 221.2852 KOps/s $\color{#d91a1a}-0.48\%$
test_memmaptd_index 1.9596ms 0.2485ms 4.0246 KOps/s 3.9147 KOps/s $\color{#35bf28}+2.81\%$
test_memmaptd_index_astensor 0.5946ms 0.3081ms 3.2457 KOps/s 3.1816 KOps/s $\color{#35bf28}+2.01\%$
test_memmaptd_index_op 1.0076ms 0.5914ms 1.6909 KOps/s 1.6711 KOps/s $\color{#35bf28}+1.19\%$
test_serialize_model 0.1321s 0.1309s 7.6411 Ops/s 5.3702 Ops/s $\textbf{\color{#35bf28}+42.29\%}$
test_serialize_model_pickle 1.3661s 1.1933s 0.8380 Ops/s 0.8217 Ops/s $\color{#35bf28}+1.99\%$
test_serialize_weights 0.1315s 0.1303s 7.6767 Ops/s 7.6887 Ops/s $\color{#d91a1a}-0.16\%$
test_serialize_weights_returnearly 0.3773s 57.0011ms 17.5435 Ops/s 23.1474 Ops/s $\textbf{\color{#d91a1a}-24.21\%}$
test_serialize_weights_pickle 1.3769s 1.2199s 0.8197 Ops/s 0.8196 Ops/s $\color{#35bf28}+0.01\%$
test_reshape_pytree 53.1510μs 22.3570μs 44.7286 KOps/s 43.6599 KOps/s $\color{#35bf28}+2.45\%$
test_reshape_td 53.4910μs 26.7930μs 37.3233 KOps/s 37.0005 KOps/s $\color{#35bf28}+0.87\%$
test_view_pytree 47.0700μs 22.2026μs 45.0398 KOps/s 44.5480 KOps/s $\color{#35bf28}+1.10\%$
test_view_td 59.1710μs 30.8367μs 32.4289 KOps/s 32.3507 KOps/s $\color{#35bf28}+0.24\%$
test_unbind_pytree 65.1210μs 27.9410μs 35.7897 KOps/s 35.7776 KOps/s $\color{#35bf28}+0.03\%$
test_unbind_td 0.6196ms 35.7583μs 27.9655 KOps/s 28.4131 KOps/s $\color{#d91a1a}-1.58\%$
test_split_pytree 62.1720μs 30.5602μs 32.7223 KOps/s 32.6248 KOps/s $\color{#35bf28}+0.30\%$
test_split_td 0.7825ms 38.6669μs 25.8619 KOps/s 25.4350 KOps/s $\color{#35bf28}+1.68\%$
test_add_pytree 79.0910μs 34.0931μs 29.3314 KOps/s 28.9907 KOps/s $\color{#35bf28}+1.18\%$
test_add_td 87.1120μs 48.1745μs 20.7579 KOps/s 20.3721 KOps/s $\color{#35bf28}+1.89\%$
test_compile_add_one_nested[tensordict-compile] 0.1821ms 0.1187ms 8.4215 KOps/s 8.1319 KOps/s $\color{#35bf28}+3.56\%$
test_compile_add_one_nested[tensordict-eager] 0.2229ms 0.1237ms 8.0818 KOps/s 7.9654 KOps/s $\color{#35bf28}+1.46\%$
test_compile_add_one_nested[pytree-compile] 0.2591ms 0.1027ms 9.7355 KOps/s 10.0681 KOps/s $\color{#d91a1a}-3.30\%$
test_compile_add_one_nested[pytree-eager] 1.2536ms 0.1511ms 6.6167 KOps/s 6.4666 KOps/s $\color{#35bf28}+2.32\%$
test_compile_copy_nested[tensordict-compile] 97.7010μs 23.8667μs 41.8993 KOps/s 44.3439 KOps/s $\textbf{\color{#d91a1a}-5.51\%}$
test_compile_copy_nested[tensordict-eager] 57.2710μs 27.0348μs 36.9894 KOps/s 36.7568 KOps/s $\color{#35bf28}+0.63\%$
test_compile_copy_nested[pytree-compile] 0.1009ms 65.2364μs 15.3289 KOps/s 15.4281 KOps/s $\color{#d91a1a}-0.64\%$
test_compile_copy_nested[pytree-eager] 0.2901ms 49.8206μs 20.0720 KOps/s 20.5467 KOps/s $\color{#d91a1a}-2.31\%$
test_compile_add_one_flat[tensordict-compile] 0.1989ms 0.1441ms 6.9414 KOps/s 6.9325 KOps/s $\color{#35bf28}+0.13\%$
test_compile_add_one_flat[tensordict-eager] 0.2921ms 0.2085ms 4.7959 KOps/s 4.8360 KOps/s $\color{#d91a1a}-0.83\%$
test_compile_add_one_flat[tensorclass-compile] 0.1341ms 97.7087μs 10.2345 KOps/s 9.7875 KOps/s $\color{#35bf28}+4.57\%$
test_compile_add_one_flat[tensorclass-eager] 0.1104ms 52.0391μs 19.2163 KOps/s 18.2900 KOps/s $\textbf{\color{#35bf28}+5.06\%}$
test_compile_add_one_flat[pytree-compile] 0.2312ms 0.1454ms 6.8797 KOps/s 6.8637 KOps/s $\color{#35bf28}+0.23\%$
test_compile_add_one_flat[pytree-eager] 0.5496ms 0.4886ms 2.0467 KOps/s 2.0116 KOps/s $\color{#35bf28}+1.74\%$
test_compile_add_self_flat[tensordict-eager] 0.3606ms 0.2458ms 4.0678 KOps/s 4.0489 KOps/s $\color{#35bf28}+0.47\%$
test_compile_add_self_flat[tensordict-compile] 0.1994ms 0.1442ms 6.9352 KOps/s 6.9000 KOps/s $\color{#35bf28}+0.51\%$
test_compile_add_self_flat[tensorclass-eager] 0.1431ms 62.8119μs 15.9205 KOps/s 15.8699 KOps/s $\color{#35bf28}+0.32\%$
test_compile_add_self_flat[tensorclass-compile] 0.1539ms 0.1018ms 9.8236 KOps/s 10.2239 KOps/s $\color{#d91a1a}-3.91\%$
test_compile_add_self_flat[pytree-eager] 0.4616ms 0.4062ms 2.4621 KOps/s 2.3967 KOps/s $\color{#35bf28}+2.73\%$
test_compile_add_self_flat[pytree-compile] 0.2066ms 0.1431ms 6.9874 KOps/s 7.1782 KOps/s $\color{#d91a1a}-2.66\%$
test_compile_copy_flat[tensordict-compile] 56.9510μs 18.8315μs 53.1024 KOps/s 53.3782 KOps/s $\color{#d91a1a}-0.52\%$
test_compile_copy_flat[tensordict-eager] 54.7810μs 28.2366μs 35.4151 KOps/s 36.9522 KOps/s $\color{#d91a1a}-4.16\%$
test_compile_copy_flat[pytree-compile] 0.1922ms 69.7919μs 14.3283 KOps/s 14.2423 KOps/s $\color{#35bf28}+0.60\%$
test_compile_copy_flat[pytree-eager] 94.9220μs 51.6583μs 19.3580 KOps/s 19.2467 KOps/s $\color{#35bf28}+0.58\%$
test_compile_assign_and_add[tensordict-compile] 1.6758ms 0.4548ms 2.1985 KOps/s 2.2104 KOps/s $\color{#d91a1a}-0.54\%$
test_compile_assign_and_add[tensordict-eager] 2.7276ms 2.6458ms 377.9582 Ops/s 375.2026 Ops/s $\color{#35bf28}+0.73\%$
test_compile_assign_and_add[pytree-compile] 1.6392ms 0.4438ms 2.2535 KOps/s 2.2437 KOps/s $\color{#35bf28}+0.44\%$
test_compile_assign_and_add[pytree-eager] 2.8184ms 2.6770ms 373.5482 Ops/s 366.8481 Ops/s $\color{#35bf28}+1.83\%$
test_compile_indexing[tensor-tensordict-compile] 0.5357ms 0.1128ms 8.8672 KOps/s 8.7468 KOps/s $\color{#35bf28}+1.38\%$
test_compile_indexing[tensor-tensordict-eager] 0.5674ms 80.8592μs 12.3672 KOps/s 12.2902 KOps/s $\color{#35bf28}+0.63\%$
test_compile_indexing[tensor-tensorclass-compile] 0.2698ms 0.1060ms 9.4376 KOps/s 9.2579 KOps/s $\color{#35bf28}+1.94\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1116ms 68.3143μs 14.6382 KOps/s 14.6622 KOps/s $\color{#d91a1a}-0.16\%$
test_compile_indexing[tensor-pytree-compile] 0.1608ms 0.1068ms 9.3635 KOps/s 9.3701 KOps/s $\color{#d91a1a}-0.07\%$
test_compile_indexing[tensor-pytree-eager] 0.1970ms 68.3021μs 14.6408 KOps/s 14.7161 KOps/s $\color{#d91a1a}-0.51\%$
test_compile_indexing[slice-tensordict-compile] 0.1509ms 99.9882μs 10.0012 KOps/s 9.9962 KOps/s $\color{#35bf28}+0.05\%$
test_compile_indexing[slice-tensordict-eager] 0.1376ms 17.4049μs 57.4552 KOps/s 56.0181 KOps/s $\color{#35bf28}+2.57\%$
test_compile_indexing[slice-tensorclass-compile] 0.1520ms 96.0379μs 10.4126 KOps/s 10.3160 KOps/s $\color{#35bf28}+0.94\%$
test_compile_indexing[slice-tensorclass-eager] 53.0710μs 15.9249μs 62.7947 KOps/s 63.6695 KOps/s $\color{#d91a1a}-1.37\%$
test_compile_indexing[slice-pytree-compile] 0.1496ms 97.2742μs 10.2802 KOps/s 10.3312 KOps/s $\color{#d91a1a}-0.49\%$
test_compile_indexing[slice-pytree-eager] 65.0020μs 15.8450μs 63.1115 KOps/s 63.0814 KOps/s $\color{#35bf28}+0.05\%$
test_compile_indexing[int-tensordict-compile] 0.1604ms 0.1017ms 9.8319 KOps/s 9.8554 KOps/s $\color{#d91a1a}-0.24\%$
test_compile_indexing[int-tensordict-eager] 0.5583ms 17.2350μs 58.0216 KOps/s 58.2966 KOps/s $\color{#d91a1a}-0.47\%$
test_compile_indexing[int-tensorclass-compile] 0.1469ms 97.3659μs 10.2705 KOps/s 10.2432 KOps/s $\color{#35bf28}+0.27\%$
test_compile_indexing[int-tensorclass-eager] 54.5810μs 16.0206μs 62.4195 KOps/s 63.7949 KOps/s $\color{#d91a1a}-2.16\%$
test_compile_indexing[int-pytree-compile] 0.1469ms 97.4044μs 10.2665 KOps/s 10.3064 KOps/s $\color{#d91a1a}-0.39\%$
test_compile_indexing[int-pytree-eager] 0.1545ms 19.2051μs 52.0696 KOps/s 63.9339 KOps/s $\textbf{\color{#d91a1a}-18.56\%}$
test_mod_add[eager] 74.3320μs 32.2893μs 30.9700 KOps/s 31.1328 KOps/s $\color{#d91a1a}-0.52\%$
test_mod_add[compile] 0.2620ms 80.5249μs 12.4185 KOps/s 12.6275 KOps/s $\color{#d91a1a}-1.65\%$
test_mod_add[compile-overhead] 0.3158ms 0.1639ms 6.1012 KOps/s 5.7740 KOps/s $\textbf{\color{#35bf28}+5.67\%}$
test_mod_wrap[eager] 0.3870ms 0.2504ms 3.9935 KOps/s 3.9670 KOps/s $\color{#35bf28}+0.67\%$
test_mod_wrap[compile] 0.3745ms 0.2872ms 3.4820 KOps/s 3.4595 KOps/s $\color{#35bf28}+0.65\%$
test_mod_wrap[compile-overhead] 7.9367ms 4.0192ms 248.8046 Ops/s 247.2261 Ops/s $\color{#35bf28}+0.64\%$
test_mod_wrap_and_backward[eager] 1.6819ms 1.4756ms 677.7090 Ops/s 685.3810 Ops/s $\color{#d91a1a}-1.12\%$
test_mod_wrap_and_backward[compile] 1.5043ms 1.3833ms 722.9260 Ops/s 718.2080 Ops/s $\color{#35bf28}+0.66\%$
test_mod_wrap_and_backward[compile-overhead] 1.5547ms 1.0349ms 966.2945 Ops/s 953.4488 Ops/s $\color{#35bf28}+1.35\%$
test_seq_add[eager] 0.1393ms 0.1027ms 9.7390 KOps/s 10.3276 KOps/s $\textbf{\color{#d91a1a}-5.70\%}$
test_seq_add[compile] 0.1556ms 88.0453μs 11.3578 KOps/s 11.3594 KOps/s $\color{#d91a1a}-0.01\%$
test_seq_add[compile-overhead] 0.1831ms 0.1291ms 7.7476 KOps/s 7.7411 KOps/s $\color{#35bf28}+0.08\%$
test_seq_wrap[eager] 0.4657ms 0.3884ms 2.5749 KOps/s 2.5343 KOps/s $\color{#35bf28}+1.60\%$
test_seq_wrap[compile] 0.3683ms 0.3018ms 3.3139 KOps/s 3.2874 KOps/s $\color{#35bf28}+0.81\%$
test_seq_wrap[compile-overhead] 0.3046ms 0.2313ms 4.3242 KOps/s 4.4660 KOps/s $\color{#d91a1a}-3.18\%$
test_func_call_runtime[False-eager] 0.8769ms 0.8055ms 1.2414 KOps/s 1.3100 KOps/s $\textbf{\color{#d91a1a}-5.24\%}$
test_func_call_runtime[False-compile] 0.9361ms 0.7986ms 1.2522 KOps/s 1.3222 KOps/s $\textbf{\color{#d91a1a}-5.30\%}$
test_func_call_runtime[False-compile-overhead] 0.4376ms 0.3700ms 2.7025 KOps/s 2.7387 KOps/s $\color{#d91a1a}-1.32\%$
test_func_call_runtime[True-eager] 1.0151ms 0.9154ms 1.0924 KOps/s 1.0639 KOps/s $\color{#35bf28}+2.69\%$
test_func_call_runtime[True-compile] 0.8929ms 0.8258ms 1.2110 KOps/s 1.2827 KOps/s $\textbf{\color{#d91a1a}-5.59\%}$
test_func_call_runtime[True-compile-overhead] 0.4558ms 0.3865ms 2.5876 KOps/s 2.5674 KOps/s $\color{#35bf28}+0.79\%$
test_func_call_cm_runtime[False-eager] 0.9250ms 0.7952ms 1.2576 KOps/s 1.2378 KOps/s $\color{#35bf28}+1.60\%$
test_func_call_cm_runtime[False-compile] 0.8286ms 0.7513ms 1.3309 KOps/s 1.3231 KOps/s $\color{#35bf28}+0.59\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4224ms 0.3673ms 2.7222 KOps/s 2.7215 KOps/s $\color{#35bf28}+0.03\%$
test_func_call_cm_runtime[True-eager] 1.1365ms 1.0114ms 988.7030 Ops/s 974.9448 Ops/s $\color{#35bf28}+1.41\%$
test_func_call_cm_runtime[True-compile] 0.9158ms 0.8228ms 1.2154 KOps/s 1.2283 KOps/s $\color{#d91a1a}-1.05\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4687ms 0.4137ms 2.4174 KOps/s 2.3986 KOps/s $\color{#35bf28}+0.79\%$
test_vmap_func_call_cm_runtime[eager] 2.5268ms 2.0955ms 477.2112 Ops/s 469.2278 Ops/s $\color{#35bf28}+1.70\%$
test_vmap_func_call_cm_runtime[compile] 1.0008ms 0.8662ms 1.1544 KOps/s 1.2147 KOps/s $\color{#d91a1a}-4.96\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4770ms 0.4129ms 2.4220 KOps/s 2.3952 KOps/s $\color{#35bf28}+1.12\%$
test_distributed 0.7311ms 0.1229ms 8.1395 KOps/s 8.8847 KOps/s $\textbf{\color{#d91a1a}-8.39\%}$
test_tdmodule 0.5811ms 14.8169μs 67.4905 KOps/s 71.3256 KOps/s $\textbf{\color{#d91a1a}-5.38\%}$
test_tdmodule_dispatch 47.7310μs 28.5914μs 34.9756 KOps/s 34.8897 KOps/s $\color{#35bf28}+0.25\%$
test_tdseq 36.9310μs 15.7977μs 63.3002 KOps/s 64.1435 KOps/s $\color{#d91a1a}-1.31\%$
test_tdseq_dispatch 54.4610μs 31.9125μs 31.3357 KOps/s 31.7148 KOps/s $\color{#d91a1a}-1.20\%$
test_instantiation_functorch 1.7591ms 1.5710ms 636.5536 Ops/s 635.3100 Ops/s $\color{#35bf28}+0.20\%$
test_exec_functorch 0.2068ms 0.1467ms 6.8168 KOps/s 6.6727 KOps/s $\color{#35bf28}+2.16\%$
test_exec_functional_call 0.2113ms 0.1418ms 7.0508 KOps/s 6.9412 KOps/s $\color{#35bf28}+1.58\%$
test_exec_td_decorator 0.3860ms 0.1908ms 5.2422 KOps/s 5.2862 KOps/s $\color{#d91a1a}-0.83\%$
test_vmap_mlp_speed_decorator[True-True] 0.7562ms 0.6804ms 1.4697 KOps/s 1.4546 KOps/s $\color{#35bf28}+1.04\%$
test_vmap_mlp_speed_decorator[True-False] 0.8568ms 0.6845ms 1.4610 KOps/s 1.4540 KOps/s $\color{#35bf28}+0.48\%$
test_vmap_mlp_speed_decorator[False-True] 0.7134ms 0.6082ms 1.6443 KOps/s 1.6596 KOps/s $\color{#d91a1a}-0.92\%$
test_vmap_mlp_speed_decorator[False-False] 0.7862ms 0.6179ms 1.6183 KOps/s 1.6550 KOps/s $\color{#d91a1a}-2.22\%$
test_vmap_transformer_speed_decorator[True-True] 19.8708ms 19.5362ms 51.1870 Ops/s 50.9081 Ops/s $\color{#35bf28}+0.55\%$
test_vmap_transformer_speed_decorator[True-False] 20.2563ms 19.6062ms 51.0043 Ops/s 50.8213 Ops/s $\color{#35bf28}+0.36\%$
test_vmap_transformer_speed_decorator[False-True] 20.3657ms 19.5295ms 51.2045 Ops/s 51.2125 Ops/s $\color{#d91a1a}-0.02\%$
test_vmap_transformer_speed_decorator[False-False] 20.2404ms 19.5785ms 51.0765 Ops/s 51.1461 Ops/s $\color{#d91a1a}-0.14\%$
test_to_module_speed[True] 1.0870ms 0.9415ms 1.0621 KOps/s 1.0376 KOps/s $\color{#35bf28}+2.36\%$
test_to_module_speed[False] 1.3252ms 0.9345ms 1.0701 KOps/s 1.0662 KOps/s $\color{#35bf28}+0.36\%$
test_tc_init 0.1677ms 34.5911μs 28.9091 KOps/s 27.1921 KOps/s $\textbf{\color{#35bf28}+6.31\%}$
test_tc_init_nested 0.1591ms 69.9367μs 14.2986 KOps/s 13.3493 KOps/s $\textbf{\color{#35bf28}+7.11\%}$
test_tc_first_layer_tensor 4.5987μs 0.7034μs 1.4217 MOps/s 1.4391 MOps/s $\color{#d91a1a}-1.21\%$
test_tc_first_layer_nontensor 21.7310μs 2.3045μs 433.9289 KOps/s 427.7962 KOps/s $\color{#35bf28}+1.43\%$
test_tc_second_layer_tensor 17.9002μs 1.4124μs 708.0391 KOps/s 699.1994 KOps/s $\color{#35bf28}+1.26\%$
test_tc_second_layer_nontensor 31.5110μs 3.0366μs 329.3179 KOps/s 326.5097 KOps/s $\color{#35bf28}+0.86\%$
test_unbind 0.2383s 10.0503ms 99.4996 Ops/s 149.9449 Ops/s $\textbf{\color{#d91a1a}-33.64\%}$
test_full_like 12.0698ms 9.1016ms 109.8706 Ops/s 108.6651 Ops/s $\color{#35bf28}+1.11\%$
test_zeros_like 9.1259ms 7.1341ms 140.1711 Ops/s 137.9878 Ops/s $\color{#35bf28}+1.58\%$
test_ones_like 5.2220ms 4.3115ms 231.9386 Ops/s 232.4602 Ops/s $\color{#d91a1a}-0.22\%$
test_clone 6.6039ms 6.2992ms 158.7496 Ops/s 158.9083 Ops/s $\color{#d91a1a}-0.10\%$
test_squeeze 66.9510μs 9.4838μs 105.4427 KOps/s 106.9402 KOps/s $\color{#d91a1a}-1.40\%$
test_unsqueeze 0.1533ms 70.9662μs 14.0912 KOps/s 14.0220 KOps/s $\color{#35bf28}+0.49\%$
test_split 0.3918ms 0.1566ms 6.3869 KOps/s 6.3047 KOps/s $\color{#35bf28}+1.30\%$
test_permute 0.2252ms 0.1796ms 5.5677 KOps/s 5.6246 KOps/s $\color{#d91a1a}-1.01\%$
test_stack 51.8107ms 50.5693ms 19.7749 Ops/s 19.8849 Ops/s $\color{#d91a1a}-0.55\%$
test_cat 50.5184ms 50.2491ms 19.9009 Ops/s 19.9632 Ops/s $\color{#d91a1a}-0.31\%$
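
The Change column above appears to be the relative difference between the "Ops" value for this PR and the "Ops on Repo HEAD" value. Below is a minimal sketch of that arithmetic, assuming the formula `(ops - ops_head) / ops_head`. The helper names `parse_ops` and `percent_change` are hypothetical and are not part of the benchmark tooling. The sample cells are copied from the `test_getitem[slice_int]` row.

```python
# Hypothetical helpers for re-deriving the "Change" column of the table.
# Assumption: Change = (Ops - Ops on Repo HEAD) / Ops on Repo HEAD.

# Scale factors for the throughput units that appear in the table.
_UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '32.8250 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * _UNIT_SCALE[unit]


def percent_change(ops_pr: float, ops_head: float) -> float:
    """Relative throughput change of the PR versus repo HEAD, in percent."""
    return (ops_pr - ops_head) / ops_head * 100.0


# Values taken from the test_getitem[slice_int] row.
change = percent_change(parse_ops("32.8250 KOps/s"), parse_ops("47.5370 KOps/s"))
print(f"{change:+.2f}%")  # prints "-30.95%"
```

Converting both cells to plain operations per second first keeps the comparison valid if the two columns of a row ever use different unit prefixes.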

Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
2 participants