Analyzing optimization landscape of recent policy optimization methods in deep RL

Analyzing optimization landscape of recent policy optimization methods in deep RL

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.

Sonraí bibleagrafaíochta
Príomhchruthaitheoirí:	Khan, Mahir Asaf, Ashraf, Adib, Amin, Tahmid Adib
Rannpháirtithe:	Rashid, Warida
Formáid:	Tráchtas
Teanga:	English
Foilsithe / Cruthaithe:	Brac University 2023
Ábhair:	Optimization landscape Policy optimization Deep reinforcement learning Variance reduction Control variates Cognitive learning theory Machine learning
Rochtain ar líne:	http://hdl.handle.net/10361/18306

Míreanna comhchosúla

Implementation of reinforcement learning architecture to augment an AI that can self-learn to play video games
de réir: Mahmud, Aqil, et al.
Foilsithe / Cruthaithe: (2023)

ROBB: recurrent proximal policy optimization reinforcement learning for optimal block formation in bitcoin blockchain network
de réir: Dutta, Amit
Foilsithe / Cruthaithe: (2024)

Combinatorial optimization : algorithms and complexity /
de réir: Papadimitriou, Christos H.
Foilsithe / Cruthaithe: (1998)

Convex optimization /
de réir: Boyd, Stephen P.
Foilsithe / Cruthaithe: (2004)

Self-learning game bot using deep reinforcement learning
de réir: Ananto, Azizul Haque
Foilsithe / Cruthaithe: (2018)

Optimal energy rendering approach from lightning return stroke
de réir: Chowdhury, A.S.M. Mishkat Hussain, et al.
Foilsithe / Cruthaithe: (2016)

Elements of dynamic optimization /
de réir: Chiang, Alpha C., 1927-
Foilsithe / Cruthaithe: (1992)

Elements of dynamic optimization /
de réir: Chiang, Alpha C., 1927-
Foilsithe / Cruthaithe: (2012)

Convex optimization /
de réir: Boyd, Stephen P.
Foilsithe / Cruthaithe: (1994)

Optimizing compilers for modern architectures : a dependence-based approach /
de réir: Allen, Randy
Foilsithe / Cruthaithe: (2001)

Implementation of real-time learning on homomorphically encrypted visual inputs
de réir: Bhuiyan, Emtiaz MD Tafsir, et al.
Foilsithe / Cruthaithe: (2021)

Optimal capacitor placement in radial distribution system for loss minimization using particle swarm optimization
de réir: Ismail, Abdiwahab Mohamed
Foilsithe / Cruthaithe: (2024)

Early stage detection and classification of colon cancer using deep learning and explainable AI on histopathological images
de réir: Hossain, Mainul, et al.
Foilsithe / Cruthaithe: (2022)

Character animation using reinforcement learning and imitation learning algorithms
de réir: Tahmid, Tokey, et al.
Foilsithe / Cruthaithe: (2021)

Traﬃc congestion reduction in SUMO using reinforcement learning method
de réir: Mouly, Radia Rahman, et al.
Foilsithe / Cruthaithe: (2021)

Skin cancer detection and classification using multiple optimized deep convolutional neural network
de réir: Sakir, Adnan, et al.
Foilsithe / Cruthaithe: (2023)

Applied shape optimization for fluids
de réir: Mohammadi, B.

Dynamic power management by reinforcement learning
de réir: Hossain, Safayet, et al.
Foilsithe / Cruthaithe: (2016)

Accelerating ant colony optimization by using local search
de réir: Tabassum, Nabila, et al.
Foilsithe / Cruthaithe: (2015)

Iterative Methods in Combinatorial Optimization
de réir: Lap Chi Lau, R. Ravi, Mohit Singh
Foilsithe / Cruthaithe: (2012)

Importance of educational data mining for optimized operations in Brac University
de réir: Saad, Mohammad Alif Hossain
Foilsithe / Cruthaithe: (2021)

How we learn and why we don't : student survival guide using the cognitive profile inventory /
de réir: Krause, Lois Breur
Foilsithe / Cruthaithe: (2008)

An efficient deep learning approach to detect skin Cancer
de réir: Islam, Ashfaqul, et al.
Foilsithe / Cruthaithe: (2022)

Mechanism Design
de réir: Rakesh V. Vohra
Foilsithe / Cruthaithe: (2013)

Yoga posture recognition using the deep learning process
de réir: Islam, Abidul, et al.
Foilsithe / Cruthaithe: (2023)

ShopUp: transforming business through product optimization
de réir: Tamim, Farhad Hassan
Foilsithe / Cruthaithe: (2018)

Reinforcement learning : an introduction /
de réir: Sutton, Richard S., et al.
Foilsithe / Cruthaithe: (2018)

Resource optimization in cloud computing using dynamic load balancing technique
de réir: Rafid, Mutasim, et al.
Foilsithe / Cruthaithe: (2021)

Reinforcement learning based electricity price forecasting in Blockchain based smart grid environment
de réir: Moti, Md Mahraj Murshalin Al, et al.
Foilsithe / Cruthaithe: (2021)

Real-time mastitis detection in livestock using deep learning and machine learning leveraging edge devices
de réir: Ghosh, Kawshik Kumar, et al.
Foilsithe / Cruthaithe: (2023)

A conventional & deep learning strategy for analyzing & detecting Bengali fake news in online medium
de réir: Ahmed, Istiak, et al.
Foilsithe / Cruthaithe: (2023)

Classification of peripheral blood cell images using deep learning
de réir: Aadi, Oyshik Ahmed, et al.
Foilsithe / Cruthaithe: (2024)

Importance of Search Engine Optimization (SEO) for businesses in Bangladesh
de réir: Chishty, Wadud
Foilsithe / Cruthaithe: (2018)

Essentials of learning : the new cognitive learning for students of education /
de réir: Travers, Robert Morris William, 1913-

Resource-aware task scheduling by an adversarial bandit solver method in wireless sensor networks
de réir: Khan, Muhidul Islam
Foilsithe / Cruthaithe: (2016)

A modern technique to detect potholes by Computer Vision and Deep Learning
de réir: Saif, Muntasir Mahmud, et al.
Foilsithe / Cruthaithe: (2023)

Corn leaf disease detection using deep convolution neural network
de réir: Rabbi, Rawhatur, et al.
Foilsithe / Cruthaithe: (2023)

Prospect Theory
de réir: Peter P. Wakker
Foilsithe / Cruthaithe: (2012)

Method optimization for isolation of Klebsiella Bacteriophage from soil samples
de réir: Khan, Zumana Hayat
Foilsithe / Cruthaithe: (2021)

Reinforcement learning based autonomous vehicle for exploration and exploitation of undiscovered track
de réir: Issa, Razin Bin, et al.
Foilsithe / Cruthaithe: (2020)