Analyzing optimization landscape of recent policy optimization methods in deep RL

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.

Bibliographic details
Main authors:	Khan, Mahir Asaf; Ashraf, Adib; Amin, Tahmid Adib
Other authors:	Rashid, Warida
Format:	Thesis
Language:	English
Published:	Brac University 2023
Subjects:	Optimization landscape; Policy optimization; Deep reinforcement learning; Variance reduction; Control variates; Cognitive learning theory; Machine learning
Online access:	http://hdl.handle.net/10361/18306

Similar items

Implementation of reinforcement learning architecture to augment an AI that can self-learn to play video games
by: Mahmud, Aqil, et al.
Published: (2023)

ROBB: recurrent proximal policy optimization reinforcement learning for optimal block formation in bitcoin blockchain network
by: Dutta, Amit
Published: (2024)

Combinatorial optimization : algorithms and complexity /
by: Papadimitriou, Christos H.
Published: (1998)

Convex optimization /
by: Boyd, Stephen P.
Published: (2004)

Self-learning game bot using deep reinforcement learning
by: Ananto, Azizul Haque
Published: (2018)

Optimal energy rendering approach from lightning return stroke
by: Chowdhury, A.S.M. Mishkat Hussain, et al.
Published: (2016)

Elements of dynamic optimization /
by: Chiang, Alpha C., 1927-
Published: (1992)

Elements of dynamic optimization /
by: Chiang, Alpha C., 1927-
Published: (2012)

Convex optimization /
by: Boyd, Stephen P.
Published: (1994)

Optimizing compilers for modern architectures : a dependence-based approach /
by: Allen, Randy
Published: (2001)

Implementation of real-time learning on homomorphically encrypted visual inputs
by: Bhuiyan, Emtiaz MD Tafsir, et al.
Published: (2021)

Optimal capacitor placement in radial distribution system for loss minimization using particle swarm optimization
by: Ismail, Abdiwahab Mohamed
Published: (2024)

Early stage detection and classification of colon cancer using deep learning and explainable AI on histopathological images
by: Hossain, Mainul, et al.
Published: (2022)

Character animation using reinforcement learning and imitation learning algorithms
by: Tahmid, Tokey, et al.
Published: (2021)

Traffic congestion reduction in SUMO using reinforcement learning method
by: Mouly, Radia Rahman, et al.
Published: (2021)

Skin cancer detection and classification using multiple optimized deep convolutional neural network
by: Sakir, Adnan, et al.
Published: (2023)

Applied shape optimization for fluids
by: Mohammadi, B.

Dynamic power management by reinforcement learning
by: Hossain, Safayet, et al.
Published: (2016)

Accelerating ant colony optimization by using local search
by: Tabassum, Nabila, et al.
Published: (2015)

Iterative Methods in Combinatorial Optimization
by: Lap Chi Lau, R. Ravi, Mohit Singh
Published: (2012)

Importance of educational data mining for optimized operations in Brac University
by: Saad, Mohammad Alif Hossain
Published: (2021)

How we learn and why we don't : student survival guide using the cognitive profile inventory /
by: Krause, Lois Breur
Published: (2008)

An efficient deep learning approach to detect skin Cancer
by: Islam, Ashfaqul, et al.
Published: (2022)

Mechanism Design
by: Rakesh V. Vohra
Published: (2013)

Yoga posture recognition using the deep learning process
by: Islam, Abidul, et al.
Published: (2023)

ShopUp: transforming business through product optimization
by: Tamim, Farhad Hassan
Published: (2018)

Reinforcement learning : an introduction /
by: Sutton, Richard S., et al.
Published: (2018)

Resource optimization in cloud computing using dynamic load balancing technique
by: Rafid, Mutasim, et al.
Published: (2021)

Reinforcement learning based electricity price forecasting in Blockchain based smart grid environment
by: Moti, Md Mahraj Murshalin Al, et al.
Published: (2021)

Real-time mastitis detection in livestock using deep learning and machine learning leveraging edge devices
by: Ghosh, Kawshik Kumar, et al.
Published: (2023)

A conventional & deep learning strategy for analyzing & detecting Bengali fake news in online medium
by: Ahmed, Istiak, et al.
Published: (2023)

Classification of peripheral blood cell images using deep learning
by: Aadi, Oyshik Ahmed, et al.
Published: (2024)

Importance of Search Engine Optimization (SEO) for businesses in Bangladesh
by: Chishty, Wadud
Published: (2018)

Essentials of learning : the new cognitive learning for students of education /
by: Travers, Robert Morris William, 1913-

Resource-aware task scheduling by an adversarial bandit solver method in wireless sensor networks
by: Khan, Muhidul Islam
Published: (2016)

A modern technique to detect potholes by Computer Vision and Deep Learning
by: Saif, Muntasir Mahmud, et al.
Published: (2023)

Corn leaf disease detection using deep convolution neural network
by: Rabbi, Rawhatur, et al.
Published: (2023)

Prospect Theory
by: Peter P. Wakker
Published: (2012)

Method optimization for isolation of Klebsiella Bacteriophage from soil samples
by: Khan, Zumana Hayat
Published: (2021)

Reinforcement learning based autonomous vehicle for exploration and exploitation of undiscovered track
by: Issa, Razin Bin, et al.
Published: (2020)