Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix, BE] Document and fix fps passing in recorder and loggers #2694

Merged
merged 1 commit into from
Jan 13, 2025

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Jan 13, 2025

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Jan 13, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2694

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens added a commit that referenced this pull request Jan 13, 2025
ghstack-source-id: b3996a9a27643eb5da8a78135f6b9fcef3685f17
Pull Request resolved: #2694
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 13, 2025
@vmoens vmoens merged commit 920d89e into gh/vmoens/67/base Jan 13, 2025
30 of 48 checks passed
vmoens added a commit that referenced this pull request Jan 13, 2025
ghstack-source-id: b3996a9a27643eb5da8a78135f6b9fcef3685f17
Pull Request resolved: #2694
@vmoens vmoens deleted the gh/vmoens/67/head branch January 13, 2025 16:47
@vmoens vmoens added bug Something isn't working BE Better errors, logs, docs or test utils labels Jan 13, 2025
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}4$. Worsened: $\large\color{#d91a1a}4$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.5217s 0.4423s 2.2607 Ops/s 2.2522 Ops/s $\color{#35bf28}+0.38\%$
test_transformed 0.7037s 0.6258s 1.5979 Ops/s 1.5805 Ops/s $\color{#35bf28}+1.10\%$
test_serial 1.4643s 1.3728s 0.7284 Ops/s 0.7282 Ops/s $\color{#35bf28}+0.02\%$
test_parallel 1.2958s 1.2097s 0.8266 Ops/s 0.8213 Ops/s $\color{#35bf28}+0.64\%$
test_step_mdp_speed[True-True-True-True-True] 0.1609ms 30.0662μs 33.2600 KOps/s 33.0146 KOps/s $\color{#35bf28}+0.74\%$
test_step_mdp_speed[True-True-True-True-False] 49.8440μs 17.8101μs 56.1480 KOps/s 55.9853 KOps/s $\color{#35bf28}+0.29\%$
test_step_mdp_speed[True-True-True-False-True] 44.3540μs 17.1732μs 58.2302 KOps/s 58.6950 KOps/s $\color{#d91a1a}-0.79\%$
test_step_mdp_speed[True-True-True-False-False] 50.2240μs 10.0684μs 99.3207 KOps/s 99.2000 KOps/s $\color{#35bf28}+0.12\%$
test_step_mdp_speed[True-True-False-True-True] 81.3030μs 32.1475μs 31.1066 KOps/s 31.0294 KOps/s $\color{#35bf28}+0.25\%$
test_step_mdp_speed[True-True-False-True-False] 50.8660μs 19.7726μs 50.5750 KOps/s 51.2755 KOps/s $\color{#d91a1a}-1.37\%$
test_step_mdp_speed[True-True-False-False-True] 55.9150μs 19.0856μs 52.3956 KOps/s 52.6898 KOps/s $\color{#d91a1a}-0.56\%$
test_step_mdp_speed[True-True-False-False-False] 45.8460μs 12.0124μs 83.2472 KOps/s 83.9593 KOps/s $\color{#d91a1a}-0.85\%$
test_step_mdp_speed[True-False-True-True-True] 93.4360μs 33.7763μs 29.6065 KOps/s 29.0769 KOps/s $\color{#35bf28}+1.82\%$
test_step_mdp_speed[True-False-True-True-False] 60.8740μs 21.8635μs 45.7384 KOps/s 46.3014 KOps/s $\color{#d91a1a}-1.22\%$
test_step_mdp_speed[True-False-True-False-True] 44.1520μs 19.0284μs 52.5531 KOps/s 52.6102 KOps/s $\color{#d91a1a}-0.11\%$
test_step_mdp_speed[True-False-True-False-False] 33.5130μs 12.0361μs 83.0834 KOps/s 85.5357 KOps/s $\color{#d91a1a}-2.87\%$
test_step_mdp_speed[True-False-False-True-True] 91.8920μs 35.7604μs 27.9639 KOps/s 28.0414 KOps/s $\color{#d91a1a}-0.28\%$
test_step_mdp_speed[True-False-False-True-False] 61.3050μs 23.5732μs 42.4210 KOps/s 43.0229 KOps/s $\color{#d91a1a}-1.40\%$
test_step_mdp_speed[True-False-False-False-True] 58.1490μs 20.6882μs 48.3367 KOps/s 48.2326 KOps/s $\color{#35bf28}+0.22\%$
test_step_mdp_speed[True-False-False-False-False] 41.2270μs 13.5885μs 73.5917 KOps/s 73.6929 KOps/s $\color{#d91a1a}-0.14\%$
test_step_mdp_speed[False-True-True-True-True] 85.0890μs 34.1776μs 29.2589 KOps/s 28.8877 KOps/s $\color{#35bf28}+1.28\%$
test_step_mdp_speed[False-True-True-True-False] 54.7330μs 21.5412μs 46.4226 KOps/s 46.8103 KOps/s $\color{#d91a1a}-0.83\%$
test_step_mdp_speed[False-True-True-False-True] 61.6450μs 21.8796μs 45.7046 KOps/s 46.6673 KOps/s $\color{#d91a1a}-2.06\%$
test_step_mdp_speed[False-True-True-False-False] 40.7960μs 13.5272μs 73.9253 KOps/s 75.8249 KOps/s $\color{#d91a1a}-2.51\%$
test_step_mdp_speed[False-True-False-True-True] 79.3190μs 36.0315μs 27.7535 KOps/s 28.3441 KOps/s $\color{#d91a1a}-2.08\%$
test_step_mdp_speed[False-True-False-True-False] 55.7540μs 23.7016μs 42.1913 KOps/s 43.4513 KOps/s $\color{#d91a1a}-2.90\%$
test_step_mdp_speed[False-True-False-False-True] 2.4201ms 23.3798μs 42.7719 KOps/s 42.9532 KOps/s $\color{#d91a1a}-0.42\%$
test_step_mdp_speed[False-True-False-False-False] 47.8190μs 15.1751μs 65.8976 KOps/s 66.6416 KOps/s $\color{#d91a1a}-1.12\%$
test_step_mdp_speed[False-False-True-True-True] 79.9800μs 37.4772μs 26.6829 KOps/s 26.5677 KOps/s $\color{#35bf28}+0.43\%$
test_step_mdp_speed[False-False-True-True-False] 53.3000μs 25.3103μs 39.5097 KOps/s 39.6287 KOps/s $\color{#d91a1a}-0.30\%$
test_step_mdp_speed[False-False-True-False-True] 54.0910μs 23.3905μs 42.7525 KOps/s 42.8529 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[False-False-True-False-False] 50.0240μs 15.1441μs 66.0323 KOps/s 66.0736 KOps/s $\color{#d91a1a}-0.06\%$
test_step_mdp_speed[False-False-False-True-True] 94.4770μs 39.2533μs 25.4755 KOps/s 25.5073 KOps/s $\color{#d91a1a}-0.12\%$
test_step_mdp_speed[False-False-False-True-False] 69.1290μs 26.9434μs 37.1148 KOps/s 37.2069 KOps/s $\color{#d91a1a}-0.25\%$
test_step_mdp_speed[False-False-False-False-True] 59.2410μs 24.8039μs 40.3162 KOps/s 39.9755 KOps/s $\color{#35bf28}+0.85\%$
test_step_mdp_speed[False-False-False-False-False] 48.2500μs 16.6595μs 60.0258 KOps/s 59.9101 KOps/s $\color{#35bf28}+0.19\%$
test_values[generalized_advantage_estimate-True-True] 9.9863ms 9.5524ms 104.6862 Ops/s 103.1828 Ops/s $\color{#35bf28}+1.46\%$
test_values[vec_generalized_advantage_estimate-True-True] 35.8773ms 33.3705ms 29.9666 Ops/s 29.8071 Ops/s $\color{#35bf28}+0.54\%$
test_values[td0_return_estimate-False-False] 0.2332ms 0.1757ms 5.6911 KOps/s 5.7526 KOps/s $\color{#d91a1a}-1.07\%$
test_values[td1_return_estimate-False-False] 35.5781ms 24.3090ms 41.1370 Ops/s 41.4675 Ops/s $\color{#d91a1a}-0.80\%$
test_values[vec_td1_return_estimate-False-False] 35.8924ms 33.3338ms 29.9996 Ops/s 29.7881 Ops/s $\color{#35bf28}+0.71\%$
test_values[td_lambda_return_estimate-True-False] 36.3216ms 34.3989ms 29.0707 Ops/s 28.9055 Ops/s $\color{#35bf28}+0.57\%$
test_values[vec_td_lambda_return_estimate-True-False] 35.7648ms 33.4579ms 29.8883 Ops/s 29.8628 Ops/s $\color{#35bf28}+0.09\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.5023ms 8.2845ms 120.7073 Ops/s 118.3121 Ops/s $\color{#35bf28}+2.02\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.1862ms 1.9299ms 518.1523 Ops/s 512.7711 Ops/s $\color{#35bf28}+1.05\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.6032ms 0.3643ms 2.7448 KOps/s 2.7259 KOps/s $\color{#35bf28}+0.69\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 47.0273ms 45.2152ms 22.1165 Ops/s 22.2152 Ops/s $\color{#d91a1a}-0.44\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.7463ms 3.0373ms 329.2353 Ops/s 328.9674 Ops/s $\color{#35bf28}+0.08\%$
test_dqn_speed[False-None] 2.0061ms 1.4149ms 706.7571 Ops/s 688.8253 Ops/s $\color{#35bf28}+2.60\%$
test_dqn_speed[False-backward] 1.9709ms 1.8867ms 530.0138 Ops/s 528.2342 Ops/s $\color{#35bf28}+0.34\%$
test_dqn_speed[True-None] 0.6528ms 0.4831ms 2.0698 KOps/s 2.0568 KOps/s $\color{#35bf28}+0.63\%$
test_dqn_speed[True-backward] 0.9542ms 0.9019ms 1.1087 KOps/s 1.0895 KOps/s $\color{#35bf28}+1.76\%$
test_dqn_speed[reduce-overhead-None] 0.8442ms 0.4913ms 2.0353 KOps/s 2.0647 KOps/s $\color{#d91a1a}-1.43\%$
test_dqn_speed[reduce-overhead-backward] 0.9329ms 0.8982ms 1.1133 KOps/s 1.1008 KOps/s $\color{#35bf28}+1.13\%$
test_ddpg_speed[False-None] 3.2522ms 2.8969ms 345.2022 Ops/s 340.9023 Ops/s $\color{#35bf28}+1.26\%$
test_ddpg_speed[False-backward] 4.1402ms 4.0296ms 248.1609 Ops/s 245.0112 Ops/s $\color{#35bf28}+1.29\%$
test_ddpg_speed[True-None] 1.2044ms 1.0272ms 973.4883 Ops/s 975.5499 Ops/s $\color{#d91a1a}-0.21\%$
test_ddpg_speed[True-backward] 2.0290ms 1.9354ms 516.6813 Ops/s 521.1996 Ops/s $\color{#d91a1a}-0.87\%$
test_ddpg_speed[reduce-overhead-None] 1.2593ms 1.0298ms 971.0196 Ops/s 972.5792 Ops/s $\color{#d91a1a}-0.16\%$
test_ddpg_speed[reduce-overhead-backward] 1.9879ms 1.9163ms 521.8466 Ops/s 523.3955 Ops/s $\color{#d91a1a}-0.30\%$
test_sac_speed[False-None] 9.4083ms 8.0366ms 124.4300 Ops/s 122.3255 Ops/s $\color{#35bf28}+1.72\%$
test_sac_speed[False-backward] 13.3712ms 10.7896ms 92.6817 Ops/s 91.4998 Ops/s $\color{#35bf28}+1.29\%$
test_sac_speed[True-None] 2.4317ms 1.8433ms 542.5036 Ops/s 539.4457 Ops/s $\color{#35bf28}+0.57\%$
test_sac_speed[True-backward] 3.6590ms 3.5507ms 281.6351 Ops/s 284.5423 Ops/s $\color{#d91a1a}-1.02\%$
test_sac_speed[reduce-overhead-None] 2.4404ms 1.9032ms 525.4403 Ops/s 538.2585 Ops/s $\color{#d91a1a}-2.38\%$
test_sac_speed[reduce-overhead-backward] 4.5311ms 3.5876ms 278.7400 Ops/s 284.0694 Ops/s $\color{#d91a1a}-1.88\%$
test_redq_speed[False-None] 14.4099ms 12.8995ms 77.5224 Ops/s 77.9787 Ops/s $\color{#d91a1a}-0.59\%$
test_redq_speed[False-backward] 0.2508s 27.0062ms 37.0285 Ops/s 44.8126 Ops/s $\textbf{\color{#d91a1a}-17.37\%}$
test_redq_speed[True-None] 5.0290ms 4.5252ms 220.9870 Ops/s 220.5261 Ops/s $\color{#35bf28}+0.21\%$
test_redq_speed[True-backward] 12.8044ms 12.2232ms 81.8118 Ops/s 84.8905 Ops/s $\color{#d91a1a}-3.63\%$
test_redq_speed[reduce-overhead-None] 5.3432ms 4.5108ms 221.6920 Ops/s 219.5672 Ops/s $\color{#35bf28}+0.97\%$
test_redq_speed[reduce-overhead-backward] 13.6338ms 12.3575ms 80.9225 Ops/s 80.9037 Ops/s $\color{#35bf28}+0.02\%$
test_redq_deprec_speed[False-None] 14.2107ms 12.7388ms 78.5003 Ops/s 76.2738 Ops/s $\color{#35bf28}+2.92\%$
test_redq_deprec_speed[False-backward] 19.7177ms 18.3852ms 54.3916 Ops/s 53.6429 Ops/s $\color{#35bf28}+1.40\%$
test_redq_deprec_speed[True-None] 4.0228ms 3.5891ms 278.6228 Ops/s 278.9668 Ops/s $\color{#d91a1a}-0.12\%$
test_redq_deprec_speed[True-backward] 8.3613ms 7.9864ms 125.2125 Ops/s 121.8314 Ops/s $\color{#35bf28}+2.78\%$
test_redq_deprec_speed[reduce-overhead-None] 4.6648ms 3.7939ms 263.5818 Ops/s 275.7555 Ops/s $\color{#d91a1a}-4.41\%$
test_redq_deprec_speed[reduce-overhead-backward] 8.8798ms 7.9808ms 125.3007 Ops/s 124.3763 Ops/s $\color{#35bf28}+0.74\%$
test_td3_speed[False-None] 8.3969ms 8.0369ms 124.4258 Ops/s 122.0254 Ops/s $\color{#35bf28}+1.97\%$
test_td3_speed[False-backward] 10.8078ms 10.4250ms 95.9234 Ops/s 93.2783 Ops/s $\color{#35bf28}+2.84\%$
test_td3_speed[True-None] 2.0407ms 1.7663ms 566.1510 Ops/s 572.3336 Ops/s $\color{#d91a1a}-1.08\%$
test_td3_speed[True-backward] 3.4530ms 3.3529ms 298.2511 Ops/s 298.4671 Ops/s $\color{#d91a1a}-0.07\%$
test_td3_speed[reduce-overhead-None] 1.9816ms 1.7495ms 571.6002 Ops/s 563.0267 Ops/s $\color{#35bf28}+1.52\%$
test_td3_speed[reduce-overhead-backward] 5.1163ms 3.4538ms 289.5325 Ops/s 295.3349 Ops/s $\color{#d91a1a}-1.96\%$
test_cql_speed[False-None] 43.4817ms 37.8683ms 26.4073 Ops/s 27.0086 Ops/s $\color{#d91a1a}-2.23\%$
test_cql_speed[False-backward] 51.5377ms 48.1000ms 20.7900 Ops/s 21.1433 Ops/s $\color{#d91a1a}-1.67\%$
test_cql_speed[True-None] 16.5804ms 15.5068ms 64.4880 Ops/s 63.1562 Ops/s $\color{#35bf28}+2.11\%$
test_cql_speed[True-backward] 23.5205ms 22.5199ms 44.4052 Ops/s 43.7700 Ops/s $\color{#35bf28}+1.45\%$
test_cql_speed[reduce-overhead-None] 20.7509ms 16.1507ms 61.9169 Ops/s 64.0089 Ops/s $\color{#d91a1a}-3.27\%$
test_cql_speed[reduce-overhead-backward] 24.2498ms 22.6618ms 44.1272 Ops/s 44.7185 Ops/s $\color{#d91a1a}-1.32\%$
test_a2c_speed[False-None] 8.0939ms 7.2602ms 137.7374 Ops/s 137.0720 Ops/s $\color{#35bf28}+0.49\%$
test_a2c_speed[False-backward] 14.7382ms 14.3753ms 69.5636 Ops/s 69.6077 Ops/s $\color{#d91a1a}-0.06\%$
test_a2c_speed[True-None] 5.0173ms 4.2195ms 236.9968 Ops/s 236.1551 Ops/s $\color{#35bf28}+0.36\%$
test_a2c_speed[True-backward] 11.5734ms 11.1264ms 89.8766 Ops/s 93.2456 Ops/s $\color{#d91a1a}-3.61\%$
test_a2c_speed[reduce-overhead-None] 4.6018ms 4.2348ms 236.1391 Ops/s 232.0526 Ops/s $\color{#35bf28}+1.76\%$
test_a2c_speed[reduce-overhead-backward] 11.5454ms 11.0308ms 90.6549 Ops/s 93.6741 Ops/s $\color{#d91a1a}-3.22\%$
test_ppo_speed[False-None] 7.7749ms 7.5256ms 132.8799 Ops/s 133.1738 Ops/s $\color{#d91a1a}-0.22\%$
test_ppo_speed[False-backward] 16.0167ms 15.3702ms 65.0609 Ops/s 66.5012 Ops/s $\color{#d91a1a}-2.17\%$
test_ppo_speed[True-None] 4.3987ms 3.7519ms 266.5285 Ops/s 264.3819 Ops/s $\color{#35bf28}+0.81\%$
test_ppo_speed[True-backward] 10.7763ms 9.9096ms 100.9127 Ops/s 104.0033 Ops/s $\color{#d91a1a}-2.97\%$
test_ppo_speed[reduce-overhead-None] 4.2777ms 3.7478ms 266.8234 Ops/s 266.9521 Ops/s $\color{#d91a1a}-0.05\%$
test_ppo_speed[reduce-overhead-backward] 10.1040ms 9.7380ms 102.6903 Ops/s 104.1199 Ops/s $\color{#d91a1a}-1.37\%$
test_reinforce_speed[False-None] 7.9505ms 6.6240ms 150.9673 Ops/s 147.1825 Ops/s $\color{#35bf28}+2.57\%$
test_reinforce_speed[False-backward] 12.4457ms 10.1562ms 98.4618 Ops/s 100.5797 Ops/s $\color{#d91a1a}-2.11\%$
test_reinforce_speed[True-None] 3.2345ms 2.6863ms 372.2538 Ops/s 371.4825 Ops/s $\color{#35bf28}+0.21\%$
test_reinforce_speed[True-backward] 9.7062ms 8.9094ms 112.2411 Ops/s 115.6485 Ops/s $\color{#d91a1a}-2.95\%$
test_reinforce_speed[reduce-overhead-None] 3.1648ms 2.7084ms 369.2220 Ops/s 372.6503 Ops/s $\color{#d91a1a}-0.92\%$
test_reinforce_speed[reduce-overhead-backward] 9.7118ms 8.7428ms 114.3793 Ops/s 116.3271 Ops/s $\color{#d91a1a}-1.67\%$
test_iql_speed[False-None] 39.1077ms 33.3280ms 30.0048 Ops/s 30.4458 Ops/s $\color{#d91a1a}-1.45\%$
test_iql_speed[False-backward] 54.0161ms 46.7224ms 21.4030 Ops/s 15.7089 Ops/s $\textbf{\color{#35bf28}+36.25\%}$
test_iql_speed[True-None] 12.1662ms 11.0873ms 90.1930 Ops/s 91.5028 Ops/s $\color{#d91a1a}-1.43\%$
test_iql_speed[True-backward] 23.5557ms 22.4015ms 44.6398 Ops/s 45.7017 Ops/s $\color{#d91a1a}-2.32\%$
test_iql_speed[reduce-overhead-None] 15.8357ms 11.3358ms 88.2164 Ops/s 93.0557 Ops/s $\textbf{\color{#d91a1a}-5.20\%}$
test_iql_speed[reduce-overhead-backward] 23.7192ms 21.7957ms 45.8807 Ops/s 46.4272 Ops/s $\color{#d91a1a}-1.18\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.2930ms 4.8562ms 205.9238 Ops/s 204.8345 Ops/s $\color{#35bf28}+0.53\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.9365ms 0.5187ms 1.9279 KOps/s 1.8930 KOps/s $\color{#35bf28}+1.85\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.9047ms 0.5035ms 1.9860 KOps/s 1.8169 KOps/s $\textbf{\color{#35bf28}+9.30\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.2616ms 4.5853ms 218.0862 Ops/s 218.8591 Ops/s $\color{#d91a1a}-0.35\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.2909ms 0.5102ms 1.9598 KOps/s 1.9714 KOps/s $\color{#d91a1a}-0.59\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6746ms 0.4787ms 2.0889 KOps/s 2.0506 KOps/s $\color{#35bf28}+1.87\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.9850ms 1.6791ms 595.5715 Ops/s 593.2502 Ops/s $\color{#35bf28}+0.39\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.0838ms 1.5834ms 631.5380 Ops/s 620.1055 Ops/s $\color{#35bf28}+1.84\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 4.8966ms 4.6809ms 213.6332 Ops/s 211.8579 Ops/s $\color{#35bf28}+0.84\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.0689ms 0.6496ms 1.5394 KOps/s 1.5019 KOps/s $\color{#35bf28}+2.50\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.4932ms 0.6256ms 1.5985 KOps/s 1.5788 KOps/s $\color{#35bf28}+1.24\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 4.7312ms 4.5393ms 220.3005 Ops/s 216.1092 Ops/s $\color{#35bf28}+1.94\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.1956ms 0.5235ms 1.9103 KOps/s 1.9344 KOps/s $\color{#d91a1a}-1.25\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.3989s 1.0102ms 989.8553 Ops/s 1.9591 KOps/s $\textbf{\color{#d91a1a}-49.48\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.2702ms 4.6331ms 215.8368 Ops/s 221.2114 Ops/s $\color{#d91a1a}-2.43\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.0281ms 0.5006ms 1.9975 KOps/s 500.5578 Ops/s $\textbf{\color{#35bf28}+299.06\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6872ms 0.4863ms 2.0565 KOps/s 2.0469 KOps/s $\color{#35bf28}+0.47\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 4.9728ms 4.7578ms 210.1825 Ops/s 208.0593 Ops/s $\color{#35bf28}+1.02\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.3608ms 0.6587ms 1.5182 KOps/s 1.5268 KOps/s $\color{#d91a1a}-0.56\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.1060ms 0.6260ms 1.5975 KOps/s 1.5550 KOps/s $\color{#35bf28}+2.73\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 6.3530ms 4.2609ms 234.6902 Ops/s 231.5213 Ops/s $\color{#35bf28}+1.37\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 8.1070ms 2.2923ms 436.2390 Ops/s 423.7087 Ops/s $\color{#35bf28}+2.96\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 3.7832ms 1.3422ms 745.0643 Ops/s 776.8289 Ops/s $\color{#d91a1a}-4.09\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.3882s 11.8785ms 84.1860 Ops/s 235.8745 Ops/s $\textbf{\color{#d91a1a}-64.31\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 5.1238ms 2.2443ms 445.5747 Ops/s 427.9230 Ops/s $\color{#35bf28}+4.12\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 6.9519ms 1.4611ms 684.4152 Ops/s 696.3846 Ops/s $\color{#d91a1a}-1.72\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 6.4658ms 4.3292ms 230.9896 Ops/s 236.7050 Ops/s $\color{#d91a1a}-2.41\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 7.7859ms 2.5022ms 399.6406 Ops/s 401.1481 Ops/s $\color{#d91a1a}-0.38\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.9784ms 1.5508ms 644.8220 Ops/s 636.0924 Ops/s $\color{#35bf28}+1.37\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 15.3431ms 13.1953ms 75.7848 Ops/s 71.9405 Ops/s $\textbf{\color{#35bf28}+5.34\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 19.1704ms 16.2838ms 61.4109 Ops/s 61.2030 Ops/s $\color{#35bf28}+0.34\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 23.9396ms 21.8496ms 45.7675 Ops/s 44.0723 Ops/s $\color{#35bf28}+3.85\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 17.3945ms 16.0180ms 62.4298 Ops/s 60.3512 Ops/s $\color{#35bf28}+3.44\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 23.1442ms 21.7806ms 45.9125 Ops/s 44.0745 Ops/s $\color{#35bf28}+4.17\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 19.5025ms 17.5261ms 57.0577 Ops/s 55.7439 Ops/s $\color{#35bf28}+2.36\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}21$. Worsened: $\large\color{#d91a1a}14$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.8479s 0.7572s 1.3207 Ops/s 1.3381 Ops/s $\color{#d91a1a}-1.30\%$
test_transformed 0.9983s 0.9757s 1.0249 Ops/s 1.0032 Ops/s $\color{#35bf28}+2.17\%$
test_serial 2.1734s 2.1555s 0.4639 Ops/s 0.4587 Ops/s $\color{#35bf28}+1.14\%$
test_parallel 1.8341s 1.8124s 0.5517 Ops/s 0.5417 Ops/s $\color{#35bf28}+1.85\%$
test_step_mdp_speed[True-True-True-True-True] 0.1919ms 39.4277μs 25.3629 KOps/s 25.2654 KOps/s $\color{#35bf28}+0.39\%$
test_step_mdp_speed[True-True-True-True-False] 61.8610μs 23.6287μs 42.3215 KOps/s 42.5458 KOps/s $\color{#d91a1a}-0.53\%$
test_step_mdp_speed[True-True-True-False-True] 57.5010μs 22.3511μs 44.7406 KOps/s 44.4432 KOps/s $\color{#35bf28}+0.67\%$
test_step_mdp_speed[True-True-True-False-False] 54.3310μs 12.9612μs 77.1534 KOps/s 76.5495 KOps/s $\color{#35bf28}+0.79\%$
test_step_mdp_speed[True-True-False-True-True] 0.1157ms 42.7910μs 23.3694 KOps/s 23.2380 KOps/s $\color{#35bf28}+0.57\%$
test_step_mdp_speed[True-True-False-True-False] 71.5920μs 25.5064μs 39.2058 KOps/s 38.5846 KOps/s $\color{#35bf28}+1.61\%$
test_step_mdp_speed[True-True-False-False-True] 61.0220μs 24.4879μs 40.8365 KOps/s 39.8266 KOps/s $\color{#35bf28}+2.54\%$
test_step_mdp_speed[True-True-False-False-False] 53.1410μs 15.4117μs 64.8857 KOps/s 66.0900 KOps/s $\color{#d91a1a}-1.82\%$
test_step_mdp_speed[True-False-True-True-True] 76.9610μs 45.3133μs 22.0686 KOps/s 22.3180 KOps/s $\color{#d91a1a}-1.12\%$
test_step_mdp_speed[True-False-True-True-False] 59.1910μs 28.0999μs 35.5873 KOps/s 35.8138 KOps/s $\color{#d91a1a}-0.63\%$
test_step_mdp_speed[True-False-True-False-True] 65.4410μs 24.7488μs 40.4059 KOps/s 40.0142 KOps/s $\color{#35bf28}+0.98\%$
test_step_mdp_speed[True-False-True-False-False] 65.2320μs 15.3091μs 65.3207 KOps/s 64.2861 KOps/s $\color{#35bf28}+1.61\%$
test_step_mdp_speed[True-False-False-True-True] 97.6620μs 47.4120μs 21.0917 KOps/s 21.1458 KOps/s $\color{#d91a1a}-0.26\%$
test_step_mdp_speed[True-False-False-True-False] 79.6120μs 29.9532μs 33.3855 KOps/s 33.1257 KOps/s $\color{#35bf28}+0.78\%$
test_step_mdp_speed[True-False-False-False-True] 74.6320μs 26.5290μs 37.6946 KOps/s 37.3578 KOps/s $\color{#35bf28}+0.90\%$
test_step_mdp_speed[True-False-False-False-False] 61.6320μs 17.3720μs 57.5640 KOps/s 57.0003 KOps/s $\color{#35bf28}+0.99\%$
test_step_mdp_speed[False-True-True-True-True] 88.9420μs 44.4587μs 22.4928 KOps/s 22.4469 KOps/s $\color{#35bf28}+0.20\%$
test_step_mdp_speed[False-True-True-True-False] 69.2010μs 28.5170μs 35.0668 KOps/s 35.6867 KOps/s $\color{#d91a1a}-1.74\%$
test_step_mdp_speed[False-True-True-False-True] 69.0810μs 28.8855μs 34.6195 KOps/s 35.9344 KOps/s $\color{#d91a1a}-3.66\%$
test_step_mdp_speed[False-True-True-False-False] 51.1910μs 17.1483μs 58.3147 KOps/s 58.1578 KOps/s $\color{#35bf28}+0.27\%$
test_step_mdp_speed[False-True-False-True-True] 0.1052ms 47.5822μs 21.0163 KOps/s 21.2553 KOps/s $\color{#d91a1a}-1.12\%$
test_step_mdp_speed[False-True-False-True-False] 57.6610μs 30.3921μs 32.9033 KOps/s 32.7558 KOps/s $\color{#35bf28}+0.45\%$
test_step_mdp_speed[False-True-False-False-True] 3.2145ms 31.4218μs 31.8250 KOps/s 31.9585 KOps/s $\color{#d91a1a}-0.42\%$
test_step_mdp_speed[False-True-False-False-False] 48.4410μs 19.5897μs 51.0473 KOps/s 51.4729 KOps/s $\color{#d91a1a}-0.83\%$
test_step_mdp_speed[False-False-True-True-True] 83.0220μs 49.6309μs 20.1487 KOps/s 19.9398 KOps/s $\color{#35bf28}+1.05\%$
test_step_mdp_speed[False-False-True-True-False] 68.8510μs 33.4177μs 29.9242 KOps/s 29.9603 KOps/s $\color{#d91a1a}-0.12\%$
test_step_mdp_speed[False-False-True-False-True] 58.1710μs 31.0052μs 32.2527 KOps/s 32.2646 KOps/s $\color{#d91a1a}-0.04\%$
test_step_mdp_speed[False-False-True-False-False] 50.2610μs 19.8810μs 50.2993 KOps/s 51.0375 KOps/s $\color{#d91a1a}-1.45\%$
test_step_mdp_speed[False-False-False-True-True] 89.5520μs 52.8790μs 18.9111 KOps/s 19.1244 KOps/s $\color{#d91a1a}-1.12\%$
test_step_mdp_speed[False-False-False-True-False] 65.3710μs 35.1978μs 28.4109 KOps/s 28.5012 KOps/s $\color{#d91a1a}-0.32\%$
test_step_mdp_speed[False-False-False-False-True] 60.0310μs 32.9400μs 30.3582 KOps/s 31.1823 KOps/s $\color{#d91a1a}-2.64\%$
test_step_mdp_speed[False-False-False-False-False] 47.7910μs 21.4755μs 46.5647 KOps/s 48.5836 KOps/s $\color{#d91a1a}-4.16\%$
test_values[generalized_advantage_estimate-True-True] 25.7047ms 25.3445ms 39.4563 Ops/s 36.7614 Ops/s $\textbf{\color{#35bf28}+7.33\%}$
test_values[vec_generalized_advantage_estimate-True-True] 99.8641ms 2.9060ms 344.1134 Ops/s 338.5913 Ops/s $\color{#35bf28}+1.63\%$
test_values[td0_return_estimate-False-False] 0.1064ms 81.8075μs 12.2238 KOps/s 11.6657 KOps/s $\color{#35bf28}+4.78\%$
test_values[td1_return_estimate-False-False] 56.8640ms 56.3078ms 17.7595 Ops/s 17.0432 Ops/s $\color{#35bf28}+4.20\%$
test_values[vec_td1_return_estimate-False-False] 1.2979ms 1.0958ms 912.6051 Ops/s 913.1441 Ops/s $\color{#d91a1a}-0.06\%$
test_values[td_lambda_return_estimate-True-False] 89.6526ms 89.1254ms 11.2201 Ops/s 11.0984 Ops/s $\color{#35bf28}+1.10\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.2845ms 1.0893ms 918.0216 Ops/s 916.0245 Ops/s $\color{#35bf28}+0.22\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 25.4895ms 25.2579ms 39.5916 Ops/s 37.9253 Ops/s $\color{#35bf28}+4.39\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0458ms 0.7653ms 1.3067 KOps/s 1.3059 KOps/s $\color{#35bf28}+0.06\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7900ms 0.6832ms 1.4637 KOps/s 1.4694 KOps/s $\color{#d91a1a}-0.39\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5315ms 1.4882ms 671.9446 Ops/s 671.8540 Ops/s $\color{#35bf28}+0.01\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.8308ms 0.6994ms 1.4299 KOps/s 1.4358 KOps/s $\color{#d91a1a}-0.41\%$
test_dqn_speed[False-None] 6.9187ms 1.5331ms 652.2770 Ops/s 652.4451 Ops/s $\color{#d91a1a}-0.03\%$
test_dqn_speed[False-backward] 2.3643ms 2.1529ms 464.4907 Ops/s 465.1302 Ops/s $\color{#d91a1a}-0.14\%$
test_dqn_speed[True-None] 0.6178ms 0.5460ms 1.8315 KOps/s 1.7950 KOps/s $\color{#35bf28}+2.04\%$
test_dqn_speed[True-backward] 1.2559ms 1.2108ms 825.8905 Ops/s 817.4037 Ops/s $\color{#35bf28}+1.04\%$
test_dqn_speed[reduce-overhead-None] 0.6503ms 0.5641ms 1.7726 KOps/s 1.7129 KOps/s $\color{#35bf28}+3.48\%$
test_dqn_speed[reduce-overhead-backward] 1.1162ms 1.0739ms 931.2006 Ops/s 922.7621 Ops/s $\color{#35bf28}+0.91\%$
test_ddpg_speed[False-None] 3.2267ms 2.9075ms 343.9378 Ops/s 342.9314 Ops/s $\color{#35bf28}+0.29\%$
test_ddpg_speed[False-backward] 4.7189ms 4.2856ms 233.3401 Ops/s 232.5975 Ops/s $\color{#35bf28}+0.32\%$
test_ddpg_speed[True-None] 1.1944ms 1.0756ms 929.7191 Ops/s 922.5401 Ops/s $\color{#35bf28}+0.78\%$
test_ddpg_speed[True-backward] 2.3461ms 2.2937ms 435.9815 Ops/s 430.1370 Ops/s $\color{#35bf28}+1.36\%$
test_ddpg_speed[reduce-overhead-None] 1.1934ms 1.0952ms 913.0572 Ops/s 907.8900 Ops/s $\color{#35bf28}+0.57\%$
test_ddpg_speed[reduce-overhead-backward] 1.8700ms 1.7767ms 562.8457 Ops/s 556.5477 Ops/s $\color{#35bf28}+1.13\%$
test_sac_speed[False-None] 8.5867ms 8.1447ms 122.7798 Ops/s 122.7454 Ops/s $\color{#35bf28}+0.03\%$
test_sac_speed[False-backward] 11.8722ms 11.3976ms 87.7381 Ops/s 87.7446 Ops/s $-0.01\%$
test_sac_speed[True-None] 1.5949ms 1.5289ms 654.0573 Ops/s 638.0947 Ops/s $\color{#35bf28}+2.50\%$
test_sac_speed[True-backward] 3.8104ms 3.4312ms 291.4396 Ops/s 309.1559 Ops/s $\textbf{\color{#d91a1a}-5.73\%}$
test_sac_speed[reduce-overhead-None] 25.5874ms 12.8293ms 77.9468 Ops/s 77.4983 Ops/s $\color{#35bf28}+0.58\%$
test_sac_speed[reduce-overhead-backward] 1.6284ms 1.5304ms 653.4145 Ops/s 729.7902 Ops/s $\textbf{\color{#d91a1a}-10.47\%}$
test_redq_speed[False-None] 8.3064ms 7.5756ms 132.0034 Ops/s 129.7675 Ops/s $\color{#35bf28}+1.72\%$
test_redq_speed[False-backward] 12.5708ms 11.7835ms 84.8647 Ops/s 86.5305 Ops/s $\color{#d91a1a}-1.93\%$
test_redq_speed[True-None] 2.0628ms 1.9836ms 504.1436 Ops/s 500.7066 Ops/s $\color{#35bf28}+0.69\%$
test_redq_speed[True-backward] 4.3685ms 3.8630ms 258.8687 Ops/s 258.2771 Ops/s $\color{#35bf28}+0.23\%$
test_redq_speed[reduce-overhead-None] 2.0823ms 1.9949ms 501.2865 Ops/s 482.1025 Ops/s $\color{#35bf28}+3.98\%$
test_redq_speed[reduce-overhead-backward] 4.3009ms 3.8836ms 257.4961 Ops/s 259.4930 Ops/s $\color{#d91a1a}-0.77\%$
test_redq_deprec_speed[False-None] 9.5933ms 9.1526ms 109.2591 Ops/s 107.3053 Ops/s $\color{#35bf28}+1.82\%$
test_redq_deprec_speed[False-backward] 12.8982ms 12.4643ms 80.2293 Ops/s 79.5490 Ops/s $\color{#35bf28}+0.86\%$
test_redq_deprec_speed[True-None] 2.4293ms 2.3365ms 427.9885 Ops/s 423.9893 Ops/s $\color{#35bf28}+0.94\%$
test_redq_deprec_speed[True-backward] 4.7088ms 4.1983ms 238.1911 Ops/s 250.5856 Ops/s $\color{#d91a1a}-4.95\%$
test_redq_deprec_speed[reduce-overhead-None] 2.4314ms 2.3453ms 426.3807 Ops/s 422.6680 Ops/s $\color{#35bf28}+0.88\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.2268ms 4.1698ms 239.8199 Ops/s 248.5693 Ops/s $\color{#d91a1a}-3.52\%$
test_td3_speed[False-None] 34.8886ms 8.2904ms 120.6209 Ops/s 122.4609 Ops/s $\color{#d91a1a}-1.50\%$
test_td3_speed[False-backward] 11.1328ms 10.6334ms 94.0429 Ops/s 92.4730 Ops/s $\color{#35bf28}+1.70\%$
test_td3_speed[True-None] 1.6863ms 1.6145ms 619.3768 Ops/s 633.0115 Ops/s $\color{#d91a1a}-2.15\%$
test_td3_speed[True-backward] 3.3547ms 3.2963ms 303.3705 Ops/s 302.5082 Ops/s $\color{#35bf28}+0.29\%$
test_td3_speed[reduce-overhead-None] 59.1232ms 26.2707ms 38.0653 Ops/s 37.8697 Ops/s $\color{#35bf28}+0.52\%$
test_td3_speed[reduce-overhead-backward] 1.8895ms 1.4764ms 677.3284 Ops/s 674.5983 Ops/s $\color{#35bf28}+0.40\%$
test_cql_speed[False-None] 17.5610ms 16.9796ms 58.8943 Ops/s 58.3159 Ops/s $\color{#35bf28}+0.99\%$
test_cql_speed[False-backward] 23.6218ms 22.6091ms 44.2300 Ops/s 44.0179 Ops/s $\color{#35bf28}+0.48\%$
test_cql_speed[True-None] 3.1273ms 2.9714ms 336.5381 Ops/s 342.5337 Ops/s $\color{#d91a1a}-1.75\%$
test_cql_speed[True-backward] 5.4523ms 5.0889ms 196.5061 Ops/s 189.3425 Ops/s $\color{#35bf28}+3.78\%$
test_cql_speed[reduce-overhead-None] 0.3592s 15.0936ms 66.2533 Ops/s 74.4577 Ops/s $\textbf{\color{#d91a1a}-11.02\%}$
test_cql_speed[reduce-overhead-backward] 1.5928ms 1.5346ms 651.6456 Ops/s 581.6970 Ops/s $\textbf{\color{#35bf28}+12.02\%}$
test_a2c_speed[False-None] 3.3354ms 3.2390ms 308.7334 Ops/s 302.4818 Ops/s $\color{#35bf28}+2.07\%$
test_a2c_speed[False-backward] 6.5792ms 6.1406ms 162.8516 Ops/s 153.7651 Ops/s $\textbf{\color{#35bf28}+5.91\%}$
test_a2c_speed[True-None] 1.1028ms 1.0149ms 985.3219 Ops/s 980.0306 Ops/s $\color{#35bf28}+0.54\%$
test_a2c_speed[True-backward] 2.6514ms 2.5890ms 386.2475 Ops/s 360.8936 Ops/s $\textbf{\color{#35bf28}+7.03\%}$
test_a2c_speed[reduce-overhead-None] 21.6678ms 11.8092ms 84.6797 Ops/s 85.1029 Ops/s $\color{#d91a1a}-0.50\%$
test_a2c_speed[reduce-overhead-backward] 1.0323ms 0.9747ms 1.0259 KOps/s 863.0462 Ops/s $\textbf{\color{#35bf28}+18.87\%}$
test_ppo_speed[False-None] 3.9305ms 3.7285ms 268.2029 Ops/s 266.4913 Ops/s $\color{#35bf28}+0.64\%$
test_ppo_speed[False-backward] 7.3389ms 6.8536ms 145.9095 Ops/s 139.5290 Ops/s $\color{#35bf28}+4.57\%$
test_ppo_speed[True-None] 1.1097ms 0.9597ms 1.0420 KOps/s 1.0243 KOps/s $\color{#35bf28}+1.73\%$
test_ppo_speed[True-backward] 2.6262ms 2.5440ms 393.0881 Ops/s 364.6856 Ops/s $\textbf{\color{#35bf28}+7.79\%}$
test_ppo_speed[reduce-overhead-None] 0.6008ms 0.5327ms 1.8773 KOps/s 68.2743 Ops/s $\textbf{\color{#35bf28}+2649.62\%}$
test_ppo_speed[reduce-overhead-backward] 1.0267ms 0.9698ms 1.0311 KOps/s 979.4315 Ops/s $\textbf{\color{#35bf28}+5.27\%}$
test_reinforce_speed[False-None] 2.4645ms 2.3069ms 433.4794 Ops/s 425.8982 Ops/s $\color{#35bf28}+1.78\%$
test_reinforce_speed[False-backward] 3.7488ms 3.3202ms 301.1824 Ops/s 286.4972 Ops/s $\textbf{\color{#35bf28}+5.13\%}$
test_reinforce_speed[True-None] 0.9043ms 0.8436ms 1.1855 KOps/s 1.1407 KOps/s $\color{#35bf28}+3.92\%$
test_reinforce_speed[True-backward] 2.4506ms 2.4054ms 415.7382 Ops/s 381.3170 Ops/s $\textbf{\color{#35bf28}+9.03\%}$
test_reinforce_speed[reduce-overhead-None] 0.2916s 12.2233ms 81.8109 Ops/s 89.1410 Ops/s $\textbf{\color{#d91a1a}-8.22\%}$
test_reinforce_speed[reduce-overhead-backward] 1.0835ms 1.0371ms 964.2079 Ops/s 838.3242 Ops/s $\textbf{\color{#35bf28}+15.02\%}$
test_iql_speed[False-None] 9.9965ms 9.4309ms 106.0346 Ops/s 105.6033 Ops/s $\color{#35bf28}+0.41\%$
test_iql_speed[False-backward] 14.0587ms 13.2173ms 75.6581 Ops/s 72.7842 Ops/s $\color{#35bf28}+3.95\%$
test_iql_speed[True-None] 1.9279ms 1.7691ms 565.2675 Ops/s 530.4781 Ops/s $\textbf{\color{#35bf28}+6.56\%}$
test_iql_speed[True-backward] 4.5080ms 4.4066ms 226.9329 Ops/s 227.2945 Ops/s $\color{#d91a1a}-0.16\%$
test_iql_speed[reduce-overhead-None] 20.3347ms 11.6722ms 85.6735 Ops/s 85.9843 Ops/s $\color{#d91a1a}-0.36\%$
test_iql_speed[reduce-overhead-backward] 1.6641ms 1.6112ms 620.6485 Ops/s 607.6762 Ops/s $\color{#35bf28}+2.13\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 8.0932ms 6.5223ms 153.3209 Ops/s 152.0016 Ops/s $\color{#35bf28}+0.87\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.6446ms 0.3195ms 3.1304 KOps/s 3.5327 KOps/s $\textbf{\color{#d91a1a}-11.39\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8229ms 0.3205ms 3.1203 KOps/s 3.6362 KOps/s $\textbf{\color{#d91a1a}-14.19\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.7725ms 6.2212ms 160.7395 Ops/s 159.2476 Ops/s $\color{#35bf28}+0.94\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.8216ms 0.3382ms 2.9570 KOps/s 3.3966 KOps/s $\textbf{\color{#d91a1a}-12.94\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5378ms 0.3297ms 3.0334 KOps/s 3.3341 KOps/s $\textbf{\color{#d91a1a}-9.02\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.7990ms 1.4199ms 704.2976 Ops/s 754.1904 Ops/s $\textbf{\color{#d91a1a}-6.62\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.7677ms 1.3303ms 751.7209 Ops/s 826.1960 Ops/s $\textbf{\color{#d91a1a}-9.01\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.8886ms 6.4278ms 155.5749 Ops/s 154.5367 Ops/s $\color{#35bf28}+0.67\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.1118ms 0.4456ms 2.2440 KOps/s 2.3850 KOps/s $\textbf{\color{#d91a1a}-5.91\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8463ms 0.4016ms 2.4901 KOps/s 2.2315 KOps/s $\textbf{\color{#35bf28}+11.59\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.5743ms 6.2034ms 161.2024 Ops/s 158.6548 Ops/s $\color{#35bf28}+1.61\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.5501ms 0.3034ms 3.2961 KOps/s 3.2256 KOps/s $\color{#35bf28}+2.18\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7718ms 0.2722ms 3.6739 KOps/s 3.7885 KOps/s $\color{#d91a1a}-3.02\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.6585ms 6.0997ms 163.9419 Ops/s 160.2594 Ops/s $\color{#35bf28}+2.30\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.5624ms 0.2652ms 3.7710 KOps/s 3.2139 KOps/s $\textbf{\color{#35bf28}+17.33\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4589ms 0.2454ms 4.0742 KOps/s 3.5362 KOps/s $\textbf{\color{#35bf28}+15.21\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.4703ms 6.2961ms 158.8286 Ops/s 154.5433 Ops/s $\color{#35bf28}+2.77\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.1821ms 0.4351ms 2.2981 KOps/s 2.1882 KOps/s $\textbf{\color{#35bf28}+5.02\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8447ms 0.3940ms 2.5382 KOps/s 2.3355 KOps/s $\textbf{\color{#35bf28}+8.68\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.2495ms 5.5643ms 179.7183 Ops/s 183.6631 Ops/s $\color{#d91a1a}-2.15\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 10.6142ms 2.1801ms 458.7027 Ops/s 489.0744 Ops/s $\textbf{\color{#d91a1a}-6.21\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 2.2048ms 1.0947ms 913.5126 Ops/s 828.4485 Ops/s $\textbf{\color{#35bf28}+10.27\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 8.3103ms 5.5028ms 181.7263 Ops/s 183.4145 Ops/s $\color{#d91a1a}-0.92\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 6.8807ms 2.0083ms 497.9339 Ops/s 435.6204 Ops/s $\textbf{\color{#35bf28}+14.30\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 8.2027ms 1.2774ms 782.8299 Ops/s 830.9151 Ops/s $\textbf{\color{#d91a1a}-5.79\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.4895s 15.4723ms 64.6316 Ops/s 32.8687 Ops/s $\textbf{\color{#35bf28}+96.64\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 9.8143ms 2.2211ms 450.2373 Ops/s 424.0057 Ops/s $\textbf{\color{#35bf28}+6.19\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 7.4175ms 1.3824ms 723.3694 Ops/s 781.6193 Ops/s $\textbf{\color{#d91a1a}-7.45\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 16.2865ms 15.5879ms 64.1525 Ops/s 63.0189 Ops/s $\color{#35bf28}+1.80\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 20.3360ms 18.1401ms 55.1265 Ops/s 56.8990 Ops/s $\color{#d91a1a}-3.12\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 20.5844ms 19.7535ms 50.6241 Ops/s 49.7274 Ops/s $\color{#35bf28}+1.80\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 20.6748ms 18.2865ms 54.6850 Ops/s 54.7049 Ops/s $\color{#d91a1a}-0.04\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 19.8529ms 19.6073ms 51.0013 Ops/s 49.9270 Ops/s $\color{#35bf28}+2.15\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 21.7694ms 19.9484ms 50.1293 Ops/s 52.4823 Ops/s $\color{#d91a1a}-4.48\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BE Better errors, logs, docs or test utils bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants