Analyzing optimization landscape of recent policy optimization methods in deep RL

Analyzing optimization landscape of recent policy optimization methods in deep RL

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.

Xehetasun bibliografikoak
Egile Nagusiak:	Khan, Mahir Asaf, Ashraf, Adib, Amin, Tahmid Adib
Beste egile batzuk:	Rashid, Warida
Formatua:	Thesis
Hizkuntza:	English
Argitaratua:	Brac University 2023
Gaiak:	Optimization landscape Policy optimization Deep reinforcement learning Variance reduction Control variates Cognitive learning theory Machine learning
Sarrera elektronikoa:	http://hdl.handle.net/10361/18306

Antzeko izenburuak

Implementation of reinforcement learning architecture to augment an AI that can self-learn to play video games
nork: Mahmud, Aqil, et al.
Argitaratua: (2023)

ROBB: recurrent proximal policy optimization reinforcement learning for optimal block formation in bitcoin blockchain network
nork: Dutta, Amit
Argitaratua: (2024)

Combinatorial optimization : algorithms and complexity /
nork: Papadimitriou, Christos H.
Argitaratua: (1998)

Convex optimization /
nork: Boyd, Stephen P.
Argitaratua: (2004)

Self-learning game bot using deep reinforcement learning
nork: Ananto, Azizul Haque
Argitaratua: (2018)

Optimal energy rendering approach from lightning return stroke
nork: Chowdhury, A.S.M. Mishkat Hussain, et al.
Argitaratua: (2016)

Elements of dynamic optimization /
nork: Chiang, Alpha C., 1927-
Argitaratua: (1992)

Elements of dynamic optimization /
nork: Chiang, Alpha C., 1927-
Argitaratua: (2012)

Convex optimization /
nork: Boyd, Stephen P.
Argitaratua: (1994)

Optimizing compilers for modern architectures : a dependence-based approach /
nork: Allen, Randy
Argitaratua: (2001)

Implementation of real-time learning on homomorphically encrypted visual inputs
nork: Bhuiyan, Emtiaz MD Tafsir, et al.
Argitaratua: (2021)

Optimal capacitor placement in radial distribution system for loss minimization using particle swarm optimization
nork: Ismail, Abdiwahab Mohamed
Argitaratua: (2024)

Early stage detection and classification of colon cancer using deep learning and explainable AI on histopathological images
nork: Hossain, Mainul, et al.
Argitaratua: (2022)

Character animation using reinforcement learning and imitation learning algorithms
nork: Tahmid, Tokey, et al.
Argitaratua: (2021)

Traﬃc congestion reduction in SUMO using reinforcement learning method
nork: Mouly, Radia Rahman, et al.
Argitaratua: (2021)

Skin cancer detection and classification using multiple optimized deep convolutional neural network
nork: Sakir, Adnan, et al.
Argitaratua: (2023)

Applied shape optimization for fluids
nork: Mohammadi, B.

Dynamic power management by reinforcement learning
nork: Hossain, Safayet, et al.
Argitaratua: (2016)

Accelerating ant colony optimization by using local search
nork: Tabassum, Nabila, et al.
Argitaratua: (2015)

Iterative Methods in Combinatorial Optimization
nork: Lap Chi Lau, R. Ravi, Mohit Singh
Argitaratua: (2012)

Importance of educational data mining for optimized operations in Brac University
nork: Saad, Mohammad Alif Hossain
Argitaratua: (2021)

How we learn and why we don't : student survival guide using the cognitive profile inventory /
nork: Krause, Lois Breur
Argitaratua: (2008)

An efficient deep learning approach to detect skin Cancer
nork: Islam, Ashfaqul, et al.
Argitaratua: (2022)

Mechanism Design
nork: Rakesh V. Vohra
Argitaratua: (2013)

Yoga posture recognition using the deep learning process
nork: Islam, Abidul, et al.
Argitaratua: (2023)

ShopUp: transforming business through product optimization
nork: Tamim, Farhad Hassan
Argitaratua: (2018)

Reinforcement learning : an introduction /
nork: Sutton, Richard S., et al.
Argitaratua: (2018)

Resource optimization in cloud computing using dynamic load balancing technique
nork: Rafid, Mutasim, et al.
Argitaratua: (2021)

Reinforcement learning based electricity price forecasting in Blockchain based smart grid environment
nork: Moti, Md Mahraj Murshalin Al, et al.
Argitaratua: (2021)

Real-time mastitis detection in livestock using deep learning and machine learning leveraging edge devices
nork: Ghosh, Kawshik Kumar, et al.
Argitaratua: (2023)

A conventional & deep learning strategy for analyzing & detecting Bengali fake news in online medium
nork: Ahmed, Istiak, et al.
Argitaratua: (2023)

Classification of peripheral blood cell images using deep learning
nork: Aadi, Oyshik Ahmed, et al.
Argitaratua: (2024)

Importance of Search Engine Optimization (SEO) for businesses in Bangladesh
nork: Chishty, Wadud
Argitaratua: (2018)

Essentials of learning : the new cognitive learning for students of education /
nork: Travers, Robert Morris William, 1913-

Resource-aware task scheduling by an adversarial bandit solver method in wireless sensor networks
nork: Khan, Muhidul Islam
Argitaratua: (2016)

A modern technique to detect potholes by Computer Vision and Deep Learning
nork: Saif, Muntasir Mahmud, et al.
Argitaratua: (2023)

Corn leaf disease detection using deep convolution neural network
nork: Rabbi, Rawhatur, et al.
Argitaratua: (2023)

Prospect Theory
nork: Peter P. Wakker
Argitaratua: (2012)

Method optimization for isolation of Klebsiella Bacteriophage from soil samples
nork: Khan, Zumana Hayat
Argitaratua: (2021)

Reinforcement learning based autonomous vehicle for exploration and exploitation of undiscovered track
nork: Issa, Razin Bin, et al.
Argitaratua: (2020)