Analyzing optimization landscape of recent policy optimization methods in deep RL

Analyzing optimization landscape of recent policy optimization methods in deep RL

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.

Bibliografske podrobnosti
Main Authors:	Khan, Mahir Asaf, Ashraf, Adib, Amin, Tahmid Adib
Drugi avtorji:	Rashid, Warida
Format:	Thesis
Jezik:	English
Izdano:	Brac University 2023
Teme:	Optimization landscape Policy optimization Deep reinforcement learning Variance reduction Control variates Cognitive learning theory Machine learning
Online dostop:	http://hdl.handle.net/10361/18306

Podobne knjige/članki

Implementation of reinforcement learning architecture to augment an AI that can self-learn to play video games
od: Mahmud, Aqil, et al.
Izdano: (2023)

ROBB: recurrent proximal policy optimization reinforcement learning for optimal block formation in bitcoin blockchain network
od: Dutta, Amit
Izdano: (2024)

Combinatorial optimization : algorithms and complexity /
od: Papadimitriou, Christos H.
Izdano: (1998)

Convex optimization /
od: Boyd, Stephen P.
Izdano: (2004)

Self-learning game bot using deep reinforcement learning
od: Ananto, Azizul Haque
Izdano: (2018)

Optimal energy rendering approach from lightning return stroke
od: Chowdhury, A.S.M. Mishkat Hussain, et al.
Izdano: (2016)

Elements of dynamic optimization /
od: Chiang, Alpha C., 1927-
Izdano: (1992)

Elements of dynamic optimization /
od: Chiang, Alpha C., 1927-
Izdano: (2012)

Convex optimization /
od: Boyd, Stephen P.
Izdano: (1994)

Optimizing compilers for modern architectures : a dependence-based approach /
od: Allen, Randy
Izdano: (2001)

Implementation of real-time learning on homomorphically encrypted visual inputs
od: Bhuiyan, Emtiaz MD Tafsir, et al.
Izdano: (2021)

Optimal capacitor placement in radial distribution system for loss minimization using particle swarm optimization
od: Ismail, Abdiwahab Mohamed
Izdano: (2024)

Early stage detection and classification of colon cancer using deep learning and explainable AI on histopathological images
od: Hossain, Mainul, et al.
Izdano: (2022)

Character animation using reinforcement learning and imitation learning algorithms
od: Tahmid, Tokey, et al.
Izdano: (2021)

Traﬃc congestion reduction in SUMO using reinforcement learning method
od: Mouly, Radia Rahman, et al.
Izdano: (2021)

Skin cancer detection and classification using multiple optimized deep convolutional neural network
od: Sakir, Adnan, et al.
Izdano: (2023)

Applied shape optimization for fluids
od: Mohammadi, B.

Dynamic power management by reinforcement learning
od: Hossain, Safayet, et al.
Izdano: (2016)

Accelerating ant colony optimization by using local search
od: Tabassum, Nabila, et al.
Izdano: (2015)

Iterative Methods in Combinatorial Optimization
od: Lap Chi Lau, R. Ravi, Mohit Singh
Izdano: (2012)

Importance of educational data mining for optimized operations in Brac University
od: Saad, Mohammad Alif Hossain
Izdano: (2021)

How we learn and why we don't : student survival guide using the cognitive profile inventory /
od: Krause, Lois Breur
Izdano: (2008)

An efficient deep learning approach to detect skin Cancer
od: Islam, Ashfaqul, et al.
Izdano: (2022)

Mechanism Design
od: Rakesh V. Vohra
Izdano: (2013)

Yoga posture recognition using the deep learning process
od: Islam, Abidul, et al.
Izdano: (2023)

ShopUp: transforming business through product optimization
od: Tamim, Farhad Hassan
Izdano: (2018)

Reinforcement learning : an introduction /
od: Sutton, Richard S., et al.
Izdano: (2018)

Resource optimization in cloud computing using dynamic load balancing technique
od: Rafid, Mutasim, et al.
Izdano: (2021)

Reinforcement learning based electricity price forecasting in Blockchain based smart grid environment
od: Moti, Md Mahraj Murshalin Al, et al.
Izdano: (2021)

Real-time mastitis detection in livestock using deep learning and machine learning leveraging edge devices
od: Ghosh, Kawshik Kumar, et al.
Izdano: (2023)

A conventional & deep learning strategy for analyzing & detecting Bengali fake news in online medium
od: Ahmed, Istiak, et al.
Izdano: (2023)

Classification of peripheral blood cell images using deep learning
od: Aadi, Oyshik Ahmed, et al.
Izdano: (2024)

Importance of Search Engine Optimization (SEO) for businesses in Bangladesh
od: Chishty, Wadud
Izdano: (2018)

Essentials of learning : the new cognitive learning for students of education /
od: Travers, Robert Morris William, 1913-

Resource-aware task scheduling by an adversarial bandit solver method in wireless sensor networks
od: Khan, Muhidul Islam
Izdano: (2016)

A modern technique to detect potholes by Computer Vision and Deep Learning
od: Saif, Muntasir Mahmud, et al.
Izdano: (2023)

Corn leaf disease detection using deep convolution neural network
od: Rabbi, Rawhatur, et al.
Izdano: (2023)

Prospect Theory
od: Peter P. Wakker
Izdano: (2012)

Method optimization for isolation of Klebsiella Bacteriophage from soil samples
od: Khan, Zumana Hayat
Izdano: (2021)

Reinforcement learning based autonomous vehicle for exploration and exploitation of undiscovered track
od: Issa, Razin Bin, et al.
Izdano: (2020)