Analyzing optimization landscape of recent policy optimization methods in deep RL

Analyzing optimization landscape of recent policy optimization methods in deep RL

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.

Λεπτομέρειες βιβλιογραφικής εγγραφής
Κύριοι συγγραφείς:	Khan, Mahir Asaf, Ashraf, Adib, Amin, Tahmid Adib
Άλλοι συγγραφείς:	Rashid, Warida
Μορφή:	Thesis
Γλώσσα:	English
Έκδοση:	Brac University 2023
Θέματα:	Optimization landscape Policy optimization Deep reinforcement learning Variance reduction Control variates Cognitive learning theory Machine learning
Διαθέσιμο Online:	http://hdl.handle.net/10361/18306

Παρόμοια τεκμήρια

Implementation of reinforcement learning architecture to augment an AI that can self-learn to play video games
ανά: Mahmud, Aqil, κ.ά.
Έκδοση: (2023)

ROBB: recurrent proximal policy optimization reinforcement learning for optimal block formation in bitcoin blockchain network
ανά: Dutta, Amit
Έκδοση: (2024)

Combinatorial optimization : algorithms and complexity /
ανά: Papadimitriou, Christos H.
Έκδοση: (1998)

Convex optimization /
ανά: Boyd, Stephen P.
Έκδοση: (2004)

Self-learning game bot using deep reinforcement learning
ανά: Ananto, Azizul Haque
Έκδοση: (2018)

Optimal energy rendering approach from lightning return stroke
ανά: Chowdhury, A.S.M. Mishkat Hussain, κ.ά.
Έκδοση: (2016)

Elements of dynamic optimization /
ανά: Chiang, Alpha C., 1927-
Έκδοση: (1992)

Elements of dynamic optimization /
ανά: Chiang, Alpha C., 1927-
Έκδοση: (2012)

Convex optimization /
ανά: Boyd, Stephen P.
Έκδοση: (1994)

Optimizing compilers for modern architectures : a dependence-based approach /
ανά: Allen, Randy
Έκδοση: (2001)

Implementation of real-time learning on homomorphically encrypted visual inputs
ανά: Bhuiyan, Emtiaz MD Tafsir, κ.ά.
Έκδοση: (2021)

Optimal capacitor placement in radial distribution system for loss minimization using particle swarm optimization
ανά: Ismail, Abdiwahab Mohamed
Έκδοση: (2024)

Early stage detection and classification of colon cancer using deep learning and explainable AI on histopathological images
ανά: Hossain, Mainul, κ.ά.
Έκδοση: (2022)

Character animation using reinforcement learning and imitation learning algorithms
ανά: Tahmid, Tokey, κ.ά.
Έκδοση: (2021)

Traﬃc congestion reduction in SUMO using reinforcement learning method
ανά: Mouly, Radia Rahman, κ.ά.
Έκδοση: (2021)

Skin cancer detection and classification using multiple optimized deep convolutional neural network
ανά: Sakir, Adnan, κ.ά.
Έκδοση: (2023)

Applied shape optimization for fluids
ανά: Mohammadi, B.

Dynamic power management by reinforcement learning
ανά: Hossain, Safayet, κ.ά.
Έκδοση: (2016)

Accelerating ant colony optimization by using local search
ανά: Tabassum, Nabila, κ.ά.
Έκδοση: (2015)

Iterative Methods in Combinatorial Optimization
ανά: Lap Chi Lau, R. Ravi, Mohit Singh
Έκδοση: (2012)

Importance of educational data mining for optimized operations in Brac University
ανά: Saad, Mohammad Alif Hossain
Έκδοση: (2021)

How we learn and why we don't : student survival guide using the cognitive profile inventory /
ανά: Krause, Lois Breur
Έκδοση: (2008)

An efficient deep learning approach to detect skin Cancer
ανά: Islam, Ashfaqul, κ.ά.
Έκδοση: (2022)

Mechanism Design
ανά: Rakesh V. Vohra
Έκδοση: (2013)

Yoga posture recognition using the deep learning process
ανά: Islam, Abidul, κ.ά.
Έκδοση: (2023)

ShopUp: transforming business through product optimization
ανά: Tamim, Farhad Hassan
Έκδοση: (2018)

Reinforcement learning : an introduction /
ανά: Sutton, Richard S., κ.ά.
Έκδοση: (2018)

Resource optimization in cloud computing using dynamic load balancing technique
ανά: Rafid, Mutasim, κ.ά.
Έκδοση: (2021)

Reinforcement learning based electricity price forecasting in Blockchain based smart grid environment
ανά: Moti, Md Mahraj Murshalin Al, κ.ά.
Έκδοση: (2021)

Real-time mastitis detection in livestock using deep learning and machine learning leveraging edge devices
ανά: Ghosh, Kawshik Kumar, κ.ά.
Έκδοση: (2023)

A conventional & deep learning strategy for analyzing & detecting Bengali fake news in online medium
ανά: Ahmed, Istiak, κ.ά.
Έκδοση: (2023)

Classification of peripheral blood cell images using deep learning
ανά: Aadi, Oyshik Ahmed, κ.ά.
Έκδοση: (2024)

Importance of Search Engine Optimization (SEO) for businesses in Bangladesh
ανά: Chishty, Wadud
Έκδοση: (2018)

Essentials of learning : the new cognitive learning for students of education /
ανά: Travers, Robert Morris William, 1913-

Resource-aware task scheduling by an adversarial bandit solver method in wireless sensor networks
ανά: Khan, Muhidul Islam
Έκδοση: (2016)

A modern technique to detect potholes by Computer Vision and Deep Learning
ανά: Saif, Muntasir Mahmud, κ.ά.
Έκδοση: (2023)

Corn leaf disease detection using deep convolution neural network
ανά: Rabbi, Rawhatur, κ.ά.
Έκδοση: (2023)

Prospect Theory
ανά: Peter P. Wakker
Έκδοση: (2012)

Method optimization for isolation of Klebsiella Bacteriophage from soil samples
ανά: Khan, Zumana Hayat
Έκδοση: (2021)

Reinforcement learning based autonomous vehicle for exploration and exploitation of undiscovered track
ανά: Issa, Razin Bin, κ.ά.
Έκδοση: (2020)