Analyzing optimization landscape of recent policy optimization methods in deep RL

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.

Bibliographic Details
Main Authors:	Khan, Mahir Asaf; Ashraf, Adib; Amin, Tahmid Adib
Other Authors:	Rashid, Warida
Format:	Thesis
Language:	English
Published:	Brac University 2023
Subjects:	Optimization landscape; Policy optimization; Deep reinforcement learning; Variance reduction; Control variates; Cognitive learning theory; Machine learning
Online Access:	http://hdl.handle.net/10361/18306

Similar Items

Implementation of reinforcement learning architecture to augment an AI that can self-learn to play video games
by: Mahmud, Aqil, et al.
Published: (2023)

ROBB: recurrent proximal policy optimization reinforcement learning for optimal block formation in bitcoin blockchain network
by: Dutta, Amit
Published: (2024)

Combinatorial optimization : algorithms and complexity /
by: Papadimitriou, Christos H.
Published: (1998)

Convex optimization /
by: Boyd, Stephen P.
Published: (2004)

Self-learning game bot using deep reinforcement learning
by: Ananto, Azizul Haque
Published: (2018)

Optimal energy rendering approach from lightning return stroke
by: Chowdhury, A.S.M. Mishkat Hussain, et al.
Published: (2016)

Elements of dynamic optimization /
by: Chiang, Alpha C., 1927-
Published: (1992)

Elements of dynamic optimization /
by: Chiang, Alpha C., 1927-
Published: (2012)

Convex optimization /
by: Boyd, Stephen P.
Published: (1994)

Optimizing compilers for modern architectures : a dependence-based approach /
by: Allen, Randy
Published: (2001)

Implementation of real-time learning on homomorphically encrypted visual inputs
by: Bhuiyan, Emtiaz MD Tafsir, et al.
Published: (2021)

Optimal capacitor placement in radial distribution system for loss minimization using particle swarm optimization
by: Ismail, Abdiwahab Mohamed
Published: (2024)

Early stage detection and classification of colon cancer using deep learning and explainable AI on histopathological images
by: Hossain, Mainul, et al.
Published: (2022)

Character animation using reinforcement learning and imitation learning algorithms
by: Tahmid, Tokey, et al.
Published: (2021)

Traffic congestion reduction in SUMO using reinforcement learning method
by: Mouly, Radia Rahman, et al.
Published: (2021)

Skin cancer detection and classification using multiple optimized deep convolutional neural network
by: Sakir, Adnan, et al.
Published: (2023)

Applied shape optimization for fluids
by: Mohammadi, B.

Dynamic power management by reinforcement learning
by: Hossain, Safayet, et al.
Published: (2016)

Accelerating ant colony optimization by using local search
by: Tabassum, Nabila, et al.
Published: (2015)

Iterative Methods in Combinatorial Optimization
by: Lap Chi Lau, R. Ravi, Mohit Singh
Published: (2012)

Importance of educational data mining for optimized operations in Brac University
by: Saad, Mohammad Alif Hossain
Published: (2021)

How we learn and why we don't : student survival guide using the cognitive profile inventory /
by: Krause, Lois Breur
Published: (2008)

An efficient deep learning approach to detect skin Cancer
by: Islam, Ashfaqul, et al.
Published: (2022)

Mechanism Design
by: Rakesh V. Vohra
Published: (2013)

Yoga posture recognition using the deep learning process
by: Islam, Abidul, et al.
Published: (2023)

ShopUp: transforming business through product optimization
by: Tamim, Farhad Hassan
Published: (2018)

Reinforcement learning : an introduction /
by: Sutton, Richard S., et al.
Published: (2018)

Resource optimization in cloud computing using dynamic load balancing technique
by: Rafid, Mutasim, et al.
Published: (2021)

Reinforcement learning based electricity price forecasting in Blockchain based smart grid environment
by: Moti, Md Mahraj Murshalin Al, et al.
Published: (2021)

Real-time mastitis detection in livestock using deep learning and machine learning leveraging edge devices
by: Ghosh, Kawshik Kumar, et al.
Published: (2023)

A conventional & deep learning strategy for analyzing & detecting Bengali fake news in online medium
by: Ahmed, Istiak, et al.
Published: (2023)

Classification of peripheral blood cell images using deep learning
by: Aadi, Oyshik Ahmed, et al.
Published: (2024)

Importance of Search Engine Optimization (SEO) for businesses in Bangladesh
by: Chishty, Wadud
Published: (2018)

Essentials of learning : the new cognitive learning for students of education /
by: Travers, Robert Morris William, 1913-

Resource-aware task scheduling by an adversarial bandit solver method in wireless sensor networks
by: Khan, Muhidul Islam
Published: (2016)

A modern technique to detect potholes by Computer Vision and Deep Learning
by: Saif, Muntasir Mahmud, et al.
Published: (2023)

Corn leaf disease detection using deep convolution neural network
by: Rabbi, Rawhatur, et al.
Published: (2023)

Prospect Theory
by: Peter P. Wakker
Published: (2012)

Method optimization for isolation of Klebsiella Bacteriophage from soil samples
by: Khan, Zumana Hayat
Published: (2021)

Reinforcement learning based autonomous vehicle for exploration and exploitation of undiscovered track
by: Issa, Razin Bin, et al.
Published: (2020)