Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Add aarch64-linux wheels #2434

Merged
merged 1 commit into from
Sep 12, 2024
Merged

[CI] Add aarch64-linux wheels #2434

merged 1 commit into from
Sep 12, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Sep 12, 2024

No description provided.

Copy link

pytorch-bot bot commented Sep 12, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2434

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 10 Unrelated Failures

As of commit 65bbf27 with merge base 361b763 (image):

NEW FAILURE - The following job has failed:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 12, 2024
@vmoens vmoens linked an issue Sep 12, 2024 that may be closed by this pull request
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 91. Improved: $\large\color{#35bf28}2$. Worsened: $\large\color{#d91a1a}15$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_single 61.1451ms 59.4131ms 16.8313 Ops/s 16.7993 Ops/s $\color{#35bf28}+0.19\%$
test_sync 49.3935ms 35.5928ms 28.0956 Ops/s 30.6855 Ops/s $\textbf{\color{#d91a1a}-8.44\%}$
test_async 83.5989ms 31.8616ms 31.3858 Ops/s 31.4781 Ops/s $\color{#d91a1a}-0.29\%$
test_simple 0.5160s 0.4374s 2.2860 Ops/s 2.4926 Ops/s $\textbf{\color{#d91a1a}-8.29\%}$
test_transformed 0.6819s 0.6019s 1.6614 Ops/s 1.7794 Ops/s $\textbf{\color{#d91a1a}-6.63\%}$
test_serial 1.4014s 1.3186s 0.7584 Ops/s 0.7823 Ops/s $\color{#d91a1a}-3.06\%$
test_parallel 1.2351s 1.1499s 0.8697 Ops/s 0.8901 Ops/s $\color{#d91a1a}-2.30\%$
test_step_mdp_speed[True-True-True-True-True] 0.1603ms 27.4382μs 36.4456 KOps/s 36.5506 KOps/s $\color{#d91a1a}-0.29\%$
test_step_mdp_speed[True-True-True-True-False] 50.0540μs 16.2689μs 61.4668 KOps/s 62.7281 KOps/s $\color{#d91a1a}-2.01\%$
test_step_mdp_speed[True-True-True-False-True] 43.5020μs 15.8268μs 63.1841 KOps/s 63.2945 KOps/s $\color{#d91a1a}-0.17\%$
test_step_mdp_speed[True-True-True-False-False] 51.6270μs 9.3081μs 107.4334 KOps/s 105.7076 KOps/s $\color{#35bf28}+1.63\%$
test_step_mdp_speed[True-True-False-True-True] 0.1421ms 29.9298μs 33.4115 KOps/s 33.7732 KOps/s $\color{#d91a1a}-1.07\%$
test_step_mdp_speed[True-True-False-True-False] 78.1760μs 18.2414μs 54.8203 KOps/s 56.2476 KOps/s $\color{#d91a1a}-2.54\%$
test_step_mdp_speed[True-True-False-False-True] 49.4830μs 17.7463μs 56.3496 KOps/s 56.5634 KOps/s $\color{#d91a1a}-0.38\%$
test_step_mdp_speed[True-True-False-False-False] 37.9510μs 11.2152μs 89.1646 KOps/s 90.9511 KOps/s $\color{#d91a1a}-1.96\%$
test_step_mdp_speed[True-False-True-True-True] 73.7080μs 31.2257μs 32.0249 KOps/s 31.9346 KOps/s $\color{#35bf28}+0.28\%$
test_step_mdp_speed[True-False-True-True-False] 52.2480μs 19.9096μs 50.2270 KOps/s 51.1221 KOps/s $\color{#d91a1a}-1.75\%$
test_step_mdp_speed[True-False-True-False-True] 55.1840μs 17.5451μs 56.9958 KOps/s 57.0543 KOps/s $\color{#d91a1a}-0.10\%$
test_step_mdp_speed[True-False-True-False-False] 43.6810μs 11.0085μs 90.8388 KOps/s 91.1595 KOps/s $\color{#d91a1a}-0.35\%$
test_step_mdp_speed[True-False-False-True-True] 64.9610μs 32.5321μs 30.7388 KOps/s 30.5222 KOps/s $\color{#35bf28}+0.71\%$
test_step_mdp_speed[True-False-False-True-False] 58.6090μs 21.1261μs 47.3349 KOps/s 47.8365 KOps/s $\color{#d91a1a}-1.05\%$
test_step_mdp_speed[True-False-False-False-True] 66.1040μs 18.8019μs 53.1860 KOps/s 51.6545 KOps/s $\color{#35bf28}+2.96\%$
test_step_mdp_speed[True-False-False-False-False] 49.6120μs 12.7111μs 78.6711 KOps/s 80.1792 KOps/s $\color{#d91a1a}-1.88\%$
test_step_mdp_speed[False-True-True-True-True] 0.1500ms 31.0435μs 32.2129 KOps/s 32.0840 KOps/s $\color{#35bf28}+0.40\%$
test_step_mdp_speed[False-True-True-True-False] 0.1018ms 21.0234μs 47.5661 KOps/s 51.4480 KOps/s $\textbf{\color{#d91a1a}-7.55\%}$
test_step_mdp_speed[False-True-True-False-True] 69.4100μs 19.4343μs 51.4554 KOps/s 49.2245 KOps/s $\color{#35bf28}+4.53\%$
test_step_mdp_speed[False-True-True-False-False] 43.1410μs 12.2134μs 81.8776 KOps/s 81.6883 KOps/s $\color{#35bf28}+0.23\%$
test_step_mdp_speed[False-True-False-True-True] 90.2690μs 31.6876μs 31.5581 KOps/s 30.9605 KOps/s $\color{#35bf28}+1.93\%$
test_step_mdp_speed[False-True-False-True-False] 48.5110μs 21.1056μs 47.3808 KOps/s 47.9976 KOps/s $\color{#d91a1a}-1.29\%$
test_step_mdp_speed[False-True-False-False-True] 2.8891ms 21.3310μs 46.8801 KOps/s 45.7363 KOps/s $\color{#35bf28}+2.50\%$
test_step_mdp_speed[False-True-False-False-False] 61.0240μs 13.6440μs 73.2924 KOps/s 73.6640 KOps/s $\color{#d91a1a}-0.50\%$
test_step_mdp_speed[False-False-True-True-True] 0.1081ms 33.6470μs 29.7203 KOps/s 28.8297 KOps/s $\color{#35bf28}+3.09\%$
test_step_mdp_speed[False-False-True-True-False] 78.3660μs 22.8064μs 43.8473 KOps/s 44.2436 KOps/s $\color{#d91a1a}-0.90\%$
test_step_mdp_speed[False-False-True-False-True] 53.6500μs 21.2213μs 47.1225 KOps/s 45.9401 KOps/s $\color{#35bf28}+2.57\%$
test_step_mdp_speed[False-False-True-False-False] 44.3730μs 13.7989μs 72.4696 KOps/s 71.9133 KOps/s $\color{#35bf28}+0.77\%$
test_step_mdp_speed[False-False-False-True-True] 70.2110μs 35.4159μs 28.2359 KOps/s 27.2999 KOps/s $\color{#35bf28}+3.43\%$
test_step_mdp_speed[False-False-False-True-False] 76.4930μs 24.2181μs 41.2914 KOps/s 42.1629 KOps/s $\color{#d91a1a}-2.07\%$
test_step_mdp_speed[False-False-False-False-True] 55.7740μs 22.4844μs 44.4753 KOps/s 43.8728 KOps/s $\color{#35bf28}+1.37\%$
test_step_mdp_speed[False-False-False-False-False] 65.9840μs 15.0955μs 66.2449 KOps/s 65.3586 KOps/s $\color{#35bf28}+1.36\%$
test_values[generalized_advantage_estimate-True-True] 13.5304ms 10.1140ms 98.8726 Ops/s 102.7823 Ops/s $\color{#d91a1a}-3.80\%$
test_values[vec_generalized_advantage_estimate-True-True] 40.9827ms 36.6498ms 27.2852 Ops/s 29.7108 Ops/s $\textbf{\color{#d91a1a}-8.16\%}$
test_values[td0_return_estimate-False-False] 0.2158ms 0.1862ms 5.3694 KOps/s 5.0754 KOps/s $\textbf{\color{#35bf28}+5.79\%}$
test_values[td1_return_estimate-False-False] 27.7657ms 24.0724ms 41.5414 Ops/s 42.2273 Ops/s $\color{#d91a1a}-1.62\%$
test_values[vec_td1_return_estimate-False-False] 38.2608ms 36.5290ms 27.3755 Ops/s 29.7505 Ops/s $\textbf{\color{#d91a1a}-7.98\%}$
test_values[td_lambda_return_estimate-True-False] 42.4087ms 35.4240ms 28.2295 Ops/s 29.0497 Ops/s $\color{#d91a1a}-2.82\%$
test_values[vec_td_lambda_return_estimate-True-False] 38.5357ms 36.5309ms 27.3741 Ops/s 29.6324 Ops/s $\textbf{\color{#d91a1a}-7.62\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.4326ms 8.2192ms 121.6660 Ops/s 122.0102 Ops/s $\color{#d91a1a}-0.28\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.4248ms 2.0313ms 492.3052 Ops/s 499.8792 Ops/s $\color{#d91a1a}-1.52\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.5078ms 0.3588ms 2.7870 KOps/s 2.7983 KOps/s $\color{#d91a1a}-0.41\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 46.2032ms 44.5436ms 22.4499 Ops/s 25.7372 Ops/s $\textbf{\color{#d91a1a}-12.77\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.1997ms 3.1009ms 322.4913 Ops/s 327.2164 Ops/s $\color{#d91a1a}-1.44\%$
test_dqn_speed 6.3821ms 1.3407ms 745.8972 Ops/s 749.3886 Ops/s $\color{#d91a1a}-0.47\%$
test_ddpg_speed 3.4336ms 2.7731ms 360.6062 Ops/s 361.6606 Ops/s $\color{#d91a1a}-0.29\%$
test_sac_speed 9.9815ms 8.4427ms 118.4454 Ops/s 121.6287 Ops/s $\color{#d91a1a}-2.62\%$
test_redq_speed 14.1130ms 13.2379ms 75.5408 Ops/s 75.6750 Ops/s $\color{#d91a1a}-0.18\%$
test_redq_deprec_speed 15.4731ms 13.4495ms 74.3522 Ops/s 73.4336 Ops/s $\color{#35bf28}+1.25\%$
test_td3_speed 9.6638ms 8.4332ms 118.5785 Ops/s 118.4947 Ops/s $\color{#35bf28}+0.07\%$
test_cql_speed 38.1471ms 36.6512ms 27.2842 Ops/s 27.9627 Ops/s $\color{#d91a1a}-2.43\%$
test_a2c_speed 9.7661ms 7.6299ms 131.0629 Ops/s 136.1263 Ops/s $\color{#d91a1a}-3.72\%$
test_ppo_speed 8.7944ms 7.9194ms 126.2729 Ops/s 129.8418 Ops/s $\color{#d91a1a}-2.75\%$
test_reinforce_speed 7.9980ms 6.8300ms 146.4134 Ops/s 150.5307 Ops/s $\color{#d91a1a}-2.74\%$
test_iql_speed 33.7725ms 32.9038ms 30.3916 Ops/s 31.1671 Ops/s $\color{#d91a1a}-2.49\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.8075ms 5.1434ms 194.4245 Ops/s 205.2133 Ops/s $\textbf{\color{#d91a1a}-5.26\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.9697ms 0.4955ms 2.0182 KOps/s 2.0965 KOps/s $\color{#d91a1a}-3.73\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7628ms 0.4629ms 2.1605 KOps/s 2.2233 KOps/s $\color{#d91a1a}-2.82\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.7670ms 5.0239ms 199.0477 Ops/s 205.8198 Ops/s $\color{#d91a1a}-3.29\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.3964ms 0.4804ms 2.0817 KOps/s 2.0730 KOps/s $\color{#35bf28}+0.42\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.8685ms 0.4642ms 2.1543 KOps/s 2.2368 KOps/s $\color{#d91a1a}-3.69\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.8122ms 1.5941ms 627.2973 Ops/s 636.3494 Ops/s $\color{#d91a1a}-1.42\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.2304ms 1.5178ms 658.8540 Ops/s 671.9801 Ops/s $\color{#d91a1a}-1.95\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 8.0071ms 5.2175ms 191.6629 Ops/s 203.5432 Ops/s $\textbf{\color{#d91a1a}-5.84\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.0881ms 0.6203ms 1.6121 KOps/s 1.6514 KOps/s $\color{#d91a1a}-2.38\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.0444ms 0.5961ms 1.6776 KOps/s 1.7329 KOps/s $\color{#d91a1a}-3.19\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 8.3741ms 5.1298ms 194.9398 Ops/s 204.2595 Ops/s $\color{#d91a1a}-4.56\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.4110ms 0.4906ms 2.0382 KOps/s 2.1374 KOps/s $\color{#d91a1a}-4.64\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8173ms 0.4758ms 2.1017 KOps/s 2.1959 KOps/s $\color{#d91a1a}-4.29\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.4581ms 5.0338ms 198.6567 Ops/s 212.0195 Ops/s $\textbf{\color{#d91a1a}-6.30\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9137ms 0.4872ms 2.0525 KOps/s 2.1473 KOps/s $\color{#d91a1a}-4.41\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7580ms 0.4706ms 2.1249 KOps/s 2.2826 KOps/s $\textbf{\color{#d91a1a}-6.91\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 8.4236ms 5.4896ms 182.1615 Ops/s 202.2885 Ops/s $\textbf{\color{#d91a1a}-9.95\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.4393ms 0.6277ms 1.5931 KOps/s 1.6196 KOps/s $\color{#d91a1a}-1.63\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9040ms 0.6005ms 1.6654 KOps/s 1.7280 KOps/s $\color{#d91a1a}-3.62\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.1561s 7.1324ms 140.2062 Ops/s 155.8046 Ops/s $\textbf{\color{#d91a1a}-10.01\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 18.8109ms 13.4253ms 74.4863 Ops/s 77.2037 Ops/s $\color{#d91a1a}-3.52\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 1.9525ms 1.2568ms 795.6852 Ops/s 812.0927 Ops/s $\color{#d91a1a}-2.02\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.1334s 9.1773ms 108.9645 Ops/s 161.9890 Ops/s $\textbf{\color{#d91a1a}-32.73\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 17.5879ms 13.1542ms 76.0211 Ops/s 74.0743 Ops/s $\color{#35bf28}+2.63\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 1.7336ms 1.2119ms 825.1842 Ops/s 837.3169 Ops/s $\color{#d91a1a}-1.45\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.1302s 6.6915ms 149.4434 Ops/s 117.2665 Ops/s $\textbf{\color{#35bf28}+27.44\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 17.3947ms 13.2424ms 75.5149 Ops/s 76.2768 Ops/s $\color{#d91a1a}-1.00\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 5.1550ms 1.4499ms 689.7156 Ops/s 673.7170 Ops/s $\color{#35bf28}+2.37\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 94. Improved: $\large\color{#35bf28}1$. Worsened: $\large\color{#d91a1a}14$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_single 0.1011s 0.1010s 9.9044 Ops/s 9.0269 Ops/s $\textbf{\color{#35bf28}+9.72\%}$
test_sync 90.0614ms 88.6142ms 11.2849 Ops/s 11.3485 Ops/s $\color{#d91a1a}-0.56\%$
test_async 0.2002s 82.4280ms 12.1318 Ops/s 11.8762 Ops/s $\color{#35bf28}+2.15\%$
test_single_pixels 0.1094s 0.1075s 9.3025 Ops/s 9.3031 Ops/s $-0.01\%$
test_sync_pixels 71.9351ms 70.5579ms 14.1728 Ops/s 13.8465 Ops/s $\color{#35bf28}+2.36\%$
test_async_pixels 0.1343s 66.6617ms 15.0011 Ops/s 15.0021 Ops/s $-0.01\%$
test_simple 0.7241s 0.7220s 1.3850 Ops/s 1.3315 Ops/s $\color{#35bf28}+4.02\%$
test_transformed 0.9541s 0.9490s 1.0538 Ops/s 1.0551 Ops/s $\color{#d91a1a}-0.12\%$
test_serial 2.0484s 2.0449s 0.4890 Ops/s 0.4898 Ops/s $\color{#d91a1a}-0.15\%$
test_parallel 1.8975s 1.8517s 0.5401 Ops/s 0.5380 Ops/s $\color{#35bf28}+0.37\%$
test_step_mdp_speed[True-True-True-True-True] 0.2341ms 37.1829μs 26.8941 KOps/s 26.5678 KOps/s $\color{#35bf28}+1.23\%$
test_step_mdp_speed[True-True-True-True-False] 0.1774ms 21.2045μs 47.1599 KOps/s 46.7330 KOps/s $\color{#35bf28}+0.91\%$
test_step_mdp_speed[True-True-True-False-True] 56.9740μs 20.8668μs 47.9231 KOps/s 46.6095 KOps/s $\color{#35bf28}+2.82\%$
test_step_mdp_speed[True-True-True-False-False] 99.0860μs 12.0339μs 83.0989 KOps/s 82.7307 KOps/s $\color{#35bf28}+0.45\%$
test_step_mdp_speed[True-True-False-True-True] 0.1138ms 39.5213μs 25.3028 KOps/s 24.9515 KOps/s $\color{#35bf28}+1.41\%$
test_step_mdp_speed[True-True-False-True-False] 54.3830μs 23.2785μs 42.9580 KOps/s 42.2993 KOps/s $\color{#35bf28}+1.56\%$
test_step_mdp_speed[True-True-False-False-True] 66.8530μs 23.0416μs 43.3998 KOps/s 42.3822 KOps/s $\color{#35bf28}+2.40\%$
test_step_mdp_speed[True-True-False-False-False] 37.8320μs 14.3236μs 69.8147 KOps/s 71.1244 KOps/s $\color{#d91a1a}-1.84\%$
test_step_mdp_speed[True-False-True-True-True] 72.4240μs 41.3880μs 24.1616 KOps/s 23.9114 KOps/s $\color{#35bf28}+1.05\%$
test_step_mdp_speed[True-False-True-True-False] 54.8530μs 25.4531μs 39.2879 KOps/s 38.7930 KOps/s $\color{#35bf28}+1.28\%$
test_step_mdp_speed[True-False-True-False-True] 66.7040μs 22.8208μs 43.8198 KOps/s 42.9561 KOps/s $\color{#35bf28}+2.01\%$
test_step_mdp_speed[True-False-True-False-False] 37.1920μs 14.2462μs 70.1940 KOps/s 71.7835 KOps/s $\color{#d91a1a}-2.21\%$
test_step_mdp_speed[True-False-False-True-True] 70.0140μs 43.8006μs 22.8308 KOps/s 22.8867 KOps/s $\color{#d91a1a}-0.24\%$
test_step_mdp_speed[True-False-False-True-False] 56.3740μs 27.3559μs 36.5552 KOps/s 36.1454 KOps/s $\color{#35bf28}+1.13\%$
test_step_mdp_speed[True-False-False-False-True] 53.9430μs 25.1677μs 39.7334 KOps/s 39.3654 KOps/s $\color{#35bf28}+0.93\%$
test_step_mdp_speed[True-False-False-False-False] 44.0130μs 16.1086μs 62.0786 KOps/s 61.9553 KOps/s $\color{#35bf28}+0.20\%$
test_step_mdp_speed[False-True-True-True-True] 72.4740μs 41.6555μs 24.0065 KOps/s 23.9589 KOps/s $\color{#35bf28}+0.20\%$
test_step_mdp_speed[False-True-True-True-False] 49.6720μs 25.4656μs 39.2686 KOps/s 38.7273 KOps/s $\color{#35bf28}+1.40\%$
test_step_mdp_speed[False-True-True-False-True] 55.9730μs 26.4592μs 37.7941 KOps/s 37.7231 KOps/s $\color{#35bf28}+0.19\%$
test_step_mdp_speed[False-True-True-False-False] 49.7630μs 15.8951μs 62.9125 KOps/s 62.9927 KOps/s $\color{#d91a1a}-0.13\%$
test_step_mdp_speed[False-True-False-True-True] 69.6940μs 43.6546μs 22.9071 KOps/s 22.8410 KOps/s $\color{#35bf28}+0.29\%$
test_step_mdp_speed[False-True-False-True-False] 53.0230μs 27.4708μs 36.4023 KOps/s 35.8274 KOps/s $\color{#35bf28}+1.60\%$
test_step_mdp_speed[False-True-False-False-True] 3.3653ms 28.4514μs 35.1476 KOps/s 34.7657 KOps/s $\color{#35bf28}+1.10\%$
test_step_mdp_speed[False-True-False-False-False] 45.9220μs 18.0529μs 55.3927 KOps/s 55.4032 KOps/s $\color{#d91a1a}-0.02\%$
test_step_mdp_speed[False-False-True-True-True] 74.0640μs 45.5806μs 21.9392 KOps/s 21.8350 KOps/s $\color{#35bf28}+0.48\%$
test_step_mdp_speed[False-False-True-True-False] 63.9240μs 29.4803μs 33.9209 KOps/s 33.1458 KOps/s $\color{#35bf28}+2.34\%$
test_step_mdp_speed[False-False-True-False-True] 84.0850μs 28.2613μs 35.3841 KOps/s 35.1641 KOps/s $\color{#35bf28}+0.63\%$
test_step_mdp_speed[False-False-True-False-False] 0.1419ms 17.6098μs 56.7866 KOps/s 55.8712 KOps/s $\color{#35bf28}+1.64\%$
test_step_mdp_speed[False-False-False-True-True] 83.4850μs 46.7742μs 21.3793 KOps/s 21.0199 KOps/s $\color{#35bf28}+1.71\%$
test_step_mdp_speed[False-False-False-True-False] 68.7040μs 31.3404μs 31.9077 KOps/s 31.3949 KOps/s $\color{#35bf28}+1.63\%$
test_step_mdp_speed[False-False-False-False-True] 97.3260μs 29.5904μs 33.7947 KOps/s 33.5732 KOps/s $\color{#35bf28}+0.66\%$
test_step_mdp_speed[False-False-False-False-False] 72.1740μs 19.6464μs 50.9000 KOps/s 50.6518 KOps/s $\color{#35bf28}+0.49\%$
test_values[generalized_advantage_estimate-True-True] 24.6670ms 24.1266ms 41.4481 Ops/s 42.3743 Ops/s $\color{#d91a1a}-2.19\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1155s 3.1891ms 313.5701 Ops/s 328.3213 Ops/s $\color{#d91a1a}-4.49\%$
test_values[td0_return_estimate-False-False] 91.6350μs 64.1037μs 15.5997 KOps/s 15.4794 KOps/s $\color{#35bf28}+0.78\%$
test_values[td1_return_estimate-False-False] 56.8659ms 55.1683ms 18.1264 Ops/s 18.6904 Ops/s $\color{#d91a1a}-3.02\%$
test_values[vec_td1_return_estimate-False-False] 1.5216ms 1.0636ms 940.2006 Ops/s 944.6883 Ops/s $\color{#d91a1a}-0.48\%$
test_values[td_lambda_return_estimate-True-False] 89.9775ms 87.5264ms 11.4251 Ops/s 11.7987 Ops/s $\color{#d91a1a}-3.17\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.2704ms 1.0595ms 943.8613 Ops/s 950.8500 Ops/s $\color{#d91a1a}-0.74\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 24.1855ms 23.8573ms 41.9159 Ops/s 42.5189 Ops/s $\color{#d91a1a}-1.42\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 0.9268ms 0.6948ms 1.4392 KOps/s 1.4434 KOps/s $\color{#d91a1a}-0.29\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7748ms 0.6471ms 1.5454 KOps/s 1.5550 KOps/s $\color{#d91a1a}-0.62\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.6227ms 1.4454ms 691.8438 Ops/s 692.9446 Ops/s $\color{#d91a1a}-0.16\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.8232ms 0.6633ms 1.5077 KOps/s 1.4861 KOps/s $\color{#35bf28}+1.45\%$
test_dqn_speed 6.9039ms 1.3038ms 766.9876 Ops/s 770.0196 Ops/s $\color{#d91a1a}-0.39\%$
test_ddpg_speed 2.9822ms 2.6160ms 382.2644 Ops/s 381.9581 Ops/s $\color{#35bf28}+0.08\%$
test_sac_speed 7.9833ms 7.5334ms 132.7419 Ops/s 133.1283 Ops/s $\color{#d91a1a}-0.29\%$
test_redq_speed 15.6084ms 10.0570ms 99.4334 Ops/s 101.4599 Ops/s $\color{#d91a1a}-2.00\%$
test_redq_deprec_speed 10.8413ms 10.4471ms 95.7203 Ops/s 97.1193 Ops/s $\color{#d91a1a}-1.44\%$
test_td3_speed 7.6928ms 7.5752ms 132.0097 Ops/s 132.3081 Ops/s $\color{#d91a1a}-0.23\%$
test_cql_speed 28.1607ms 24.6534ms 40.5624 Ops/s 40.9023 Ops/s $\color{#d91a1a}-0.83\%$
test_a2c_speed 5.7040ms 5.3806ms 185.8545 Ops/s 186.8436 Ops/s $\color{#d91a1a}-0.53\%$
test_ppo_speed 6.2075ms 5.6966ms 175.5434 Ops/s 176.6837 Ops/s $\color{#d91a1a}-0.65\%$
test_reinforce_speed 5.3759ms 4.4103ms 226.7413 Ops/s 228.9345 Ops/s $\color{#d91a1a}-0.96\%$
test_iql_speed 19.3734ms 18.8338ms 53.0959 Ops/s 53.6568 Ops/s $\color{#d91a1a}-1.05\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.7523ms 6.2898ms 158.9880 Ops/s 159.1282 Ops/s $\color{#d91a1a}-0.09\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.9278ms 0.3547ms 2.8195 KOps/s 3.6119 KOps/s $\textbf{\color{#d91a1a}-21.94\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6726ms 0.3321ms 3.0116 KOps/s 4.7649 KOps/s $\textbf{\color{#d91a1a}-36.80\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.8011ms 6.2581ms 159.7922 Ops/s 159.6985 Ops/s $\color{#35bf28}+0.06\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.6568ms 0.2960ms 3.3782 KOps/s 4.3375 KOps/s $\textbf{\color{#d91a1a}-22.12\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4878ms 0.2802ms 3.5689 KOps/s 4.8004 KOps/s $\textbf{\color{#d91a1a}-25.65\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.4440ms 1.2703ms 787.2414 Ops/s 833.5552 Ops/s $\textbf{\color{#d91a1a}-5.56\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.3599ms 1.1826ms 845.6230 Ops/s 892.1801 Ops/s $\textbf{\color{#d91a1a}-5.22\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.6849ms 6.4122ms 155.9534 Ops/s 155.4232 Ops/s $\color{#35bf28}+0.34\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.7690ms 0.4148ms 2.4108 KOps/s 2.6305 KOps/s $\textbf{\color{#d91a1a}-8.35\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6604ms 0.3940ms 2.5381 KOps/s 2.8219 KOps/s $\textbf{\color{#d91a1a}-10.06\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.6230ms 6.3329ms 157.9059 Ops/s 157.1713 Ops/s $\color{#35bf28}+0.47\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.1429ms 0.3032ms 3.2987 KOps/s 3.8820 KOps/s $\textbf{\color{#d91a1a}-15.03\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.4771ms 0.2849ms 3.5102 KOps/s 4.7741 KOps/s $\textbf{\color{#d91a1a}-26.47\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.7275ms 6.2425ms 160.1928 Ops/s 159.9218 Ops/s $\color{#35bf28}+0.17\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.1428s 0.4317ms 2.3164 KOps/s 2.7605 KOps/s $\textbf{\color{#d91a1a}-16.09\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4378ms 0.2815ms 3.5519 KOps/s 4.8067 KOps/s $\textbf{\color{#d91a1a}-26.11\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.8942ms 6.4403ms 155.2732 Ops/s 154.3443 Ops/s $\color{#35bf28}+0.60\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.3668ms 0.4152ms 2.4083 KOps/s 2.6568 KOps/s $\textbf{\color{#d91a1a}-9.35\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6330ms 0.3992ms 2.5049 KOps/s 2.7422 KOps/s $\textbf{\color{#d91a1a}-8.66\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.1422s 7.8818ms 126.8752 Ops/s 124.8614 Ops/s $\color{#35bf28}+1.61\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 20.8937ms 15.8189ms 63.2154 Ops/s 64.0059 Ops/s $\color{#d91a1a}-1.23\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 2.0272ms 1.0156ms 984.6591 Ops/s 1.0140 KOps/s $\color{#d91a1a}-2.89\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.1291s 10.1575ms 98.4492 Ops/s 98.9535 Ops/s $\color{#d91a1a}-0.51\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 20.8476ms 15.8283ms 63.1780 Ops/s 63.6610 Ops/s $\color{#d91a1a}-0.76\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 2.0921ms 1.0211ms 979.3582 Ops/s 1.0062 KOps/s $\color{#d91a1a}-2.67\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.1298s 7.7894ms 128.3793 Ops/s 129.7042 Ops/s $\color{#d91a1a}-1.02\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 21.4007ms 16.0240ms 62.4063 Ops/s 62.8303 Ops/s $\color{#d91a1a}-0.67\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.2751ms 1.1663ms 857.4198 Ops/s 881.5703 Ops/s $\color{#d91a1a}-2.74\%$

@vmoens vmoens merged commit d40fa4f into main Sep 12, 2024
64 of 71 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Feature Request] aarch64-linux wheels
2 participants