A repo where I look into the details and practice on how to properly utilize reinforcement learning. I try out different AI models and evaluate their performance to test out which does best. Please note this is a project i do in my free time and will update periodically when i get the time to complete it!
Please note - running current model on cpu will be mind numbingly slow. I highly recommend either reducing network size (i.e parameters/layers/number of hidden layers) and reducing batch size(if running on cpu). Running on cuda cores is highly recommended
- Classic control(1/3): pendulum(solved), mountain car, mountain car continuous.
- Mujoco (0/10)
- Robotics (0/10)
Episodes trained | #- | #- |
---|---|---|
Average/Overall reward(per 200 eps) | - | - |
Episodes trained | #- | #- |
---|---|---|
Average/Overall reward(per 200 eps) | - | - |
Episodes trained | #300 | #- |
---|---|---|
Average/Overall reward(last 200 eps) | - | - |
Episodes trained | #- | #- |
---|---|---|
Average/Overall reward(per 200 eps) | - | - |
Episodes trained | #- | #- |
---|---|---|
Average/Overall reward(per 200 eps) | - | - |
Episodes trained | #- | #- |
---|---|---|
Average/Overall reward(per 200 eps) | - | - |
Episodes trained | #- | #- |
---|---|---|
Average/Overall reward(per 200 eps) | - | - |
All credits goes to deeplizard. Visit their yt channel here: https://www.youtube.com/channel/UC4UJ26WkceqONNF5S26OiVw
- https://arxiv.org/abs/2005.14419
- https://d3c33hcgiwev3.cloudfront.net/Ph9QFZnEEemRfw7JJ0OZYA_808e8e7d9a544e1eb31ad11069d45dc4_RLbook2018.pdf?Expires=1624492800&Signature=KJBvWJSHe5fr1OvTQWuLDGKdaTUFcmQe9EwvOYhIhfmPfQaQzPJZr4NWGDKgGEjKrHSrBMbmlabY2-AcuORcoUSBB4522vSZipuZb3LXlg552LU4NUI50-G55oKxgX2mGe3Z5X2pQA1QDV1ijBdz1bCZNvnlpPgTz6Oz8iOU4WQ_&Key-Pair-Id=APKAJLTNE6QMUY6HBC5A
- https://arxiv.org/abs/1509.02971