Analyzing optimization landscape of recent policy optimization methods in deep RL
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.
Main Authors: | Khan, Mahir Asaf; Ashraf, Adib; Amin, Tahmid Adib |
---|---|
Other Authors: | Rashid, Warida |
Format: | Thesis |
Language: | English |
Published: | Brac University, 2023 |
Subjects: | |
Online Access: | http://hdl.handle.net/10361/18306 |
Similar Items
- Implementation of reinforcement learning architecture to augment an AI that can self-learn to play video games
  by: Mahmud, Aqil, et al.
  Published: (2023)
- ROBB: recurrent proximal policy optimization reinforcement learning for optimal block formation in bitcoin blockchain network
  by: Dutta, Amit
  Published: (2024)
- Combinatorial optimization: algorithms and complexity
  by: Papadimitriou, Christos H.
  Published: (1998)
- Convex optimization
  by: Boyd, Stephen P.
  Published: (2004)
- Self-learning game bot using deep reinforcement learning
  by: Ananto, Azizul Haque
  Published: (2018)