Analyzing optimization landscape of recent policy optimization methods in deep RL
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.
Κύριοι συγγραφείς: | Khan, Mahir Asaf, Ashraf, Adib, Amin, Tahmid Adib |
---|---|
Άλλοι συγγραφείς: | Rashid, Warida |
Μορφή: | Thesis |
Γλώσσα: | English |
Έκδοση: |
Brac University
2023
|
Θέματα: | |
Διαθέσιμο Online: | http://hdl.handle.net/10361/18306 |
Παρόμοια τεκμήρια
-
Implementation of reinforcement learning architecture to augment an AI that can self-learn to play video games
ανά: Mahmud, Aqil, κ.ά.
Έκδοση: (2023) -
ROBB: recurrent proximal policy optimization reinforcement learning for optimal block formation in bitcoin blockchain network
ανά: Dutta, Amit
Έκδοση: (2024) -
Combinatorial optimization : algorithms and complexity /
ανά: Papadimitriou, Christos H.
Έκδοση: (1998) -
Convex optimization /
ανά: Boyd, Stephen P.
Έκδοση: (2004) -
Self-learning game bot using deep reinforcement learning
ανά: Ananto, Azizul Haque
Έκδοση: (2018)