Analyzing optimization landscape of recent policy optimization methods in deep RL

Analyzing optimization landscape of recent policy optimization methods in deep RL

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.

Detalles Bibliográficos
Autores principales:	Khan, Mahir Asaf, Ashraf, Adib, Amin, Tahmid Adib
Otros Autores:	Rashid, Warida
Formato:	Tesis
Lenguaje:	English
Publicado:	Brac University 2023
Materias:	Optimization landscape Policy optimization Deep reinforcement learning Variance reduction Control variates Cognitive learning theory Machine learning
Acceso en línea:	http://hdl.handle.net/10361/18306

Ejemplares similares

Implementation of reinforcement learning architecture to augment an AI that can self-learn to play video games
por: Mahmud, Aqil, et al.
Publicado: (2023)

ROBB: recurrent proximal policy optimization reinforcement learning for optimal block formation in bitcoin blockchain network
por: Dutta, Amit
Publicado: (2024)

Combinatorial optimization : algorithms and complexity /
por: Papadimitriou, Christos H.
Publicado: (1998)

Convex optimization /
por: Boyd, Stephen P.
Publicado: (2004)

Self-learning game bot using deep reinforcement learning
por: Ananto, Azizul Haque
Publicado: (2018)

Optimal energy rendering approach from lightning return stroke
por: Chowdhury, A.S.M. Mishkat Hussain, et al.
Publicado: (2016)

Elements of dynamic optimization /
por: Chiang, Alpha C., 1927-
Publicado: (1992)

Elements of dynamic optimization /
por: Chiang, Alpha C., 1927-
Publicado: (2012)

Convex optimization /
por: Boyd, Stephen P.
Publicado: (1994)

Optimizing compilers for modern architectures : a dependence-based approach /
por: Allen, Randy
Publicado: (2001)

Implementation of real-time learning on homomorphically encrypted visual inputs
por: Bhuiyan, Emtiaz MD Tafsir, et al.
Publicado: (2021)

Optimal capacitor placement in radial distribution system for loss minimization using particle swarm optimization
por: Ismail, Abdiwahab Mohamed
Publicado: (2024)

Early stage detection and classification of colon cancer using deep learning and explainable AI on histopathological images
por: Hossain, Mainul, et al.
Publicado: (2022)

Character animation using reinforcement learning and imitation learning algorithms
por: Tahmid, Tokey, et al.
Publicado: (2021)

Traﬃc congestion reduction in SUMO using reinforcement learning method
por: Mouly, Radia Rahman, et al.
Publicado: (2021)

Skin cancer detection and classification using multiple optimized deep convolutional neural network
por: Sakir, Adnan, et al.
Publicado: (2023)

Applied shape optimization for fluids
por: Mohammadi, B.

Dynamic power management by reinforcement learning
por: Hossain, Safayet, et al.
Publicado: (2016)

Accelerating ant colony optimization by using local search
por: Tabassum, Nabila, et al.
Publicado: (2015)

Iterative Methods in Combinatorial Optimization
por: Lap Chi Lau, R. Ravi, Mohit Singh
Publicado: (2012)

Importance of educational data mining for optimized operations in Brac University
por: Saad, Mohammad Alif Hossain
Publicado: (2021)

How we learn and why we don't : student survival guide using the cognitive profile inventory /
por: Krause, Lois Breur
Publicado: (2008)

An efficient deep learning approach to detect skin Cancer
por: Islam, Ashfaqul, et al.
Publicado: (2022)

Mechanism Design
por: Rakesh V. Vohra
Publicado: (2013)

Yoga posture recognition using the deep learning process
por: Islam, Abidul, et al.
Publicado: (2023)

ShopUp: transforming business through product optimization
por: Tamim, Farhad Hassan
Publicado: (2018)

Reinforcement learning : an introduction /
por: Sutton, Richard S., et al.
Publicado: (2018)

Resource optimization in cloud computing using dynamic load balancing technique
por: Rafid, Mutasim, et al.
Publicado: (2021)

Reinforcement learning based electricity price forecasting in Blockchain based smart grid environment
por: Moti, Md Mahraj Murshalin Al, et al.
Publicado: (2021)

Real-time mastitis detection in livestock using deep learning and machine learning leveraging edge devices
por: Ghosh, Kawshik Kumar, et al.
Publicado: (2023)

A conventional & deep learning strategy for analyzing & detecting Bengali fake news in online medium
por: Ahmed, Istiak, et al.
Publicado: (2023)

Classification of peripheral blood cell images using deep learning
por: Aadi, Oyshik Ahmed, et al.
Publicado: (2024)

Importance of Search Engine Optimization (SEO) for businesses in Bangladesh
por: Chishty, Wadud
Publicado: (2018)

Essentials of learning : the new cognitive learning for students of education /
por: Travers, Robert Morris William, 1913-

Resource-aware task scheduling by an adversarial bandit solver method in wireless sensor networks
por: Khan, Muhidul Islam
Publicado: (2016)

A modern technique to detect potholes by Computer Vision and Deep Learning
por: Saif, Muntasir Mahmud, et al.
Publicado: (2023)

Corn leaf disease detection using deep convolution neural network
por: Rabbi, Rawhatur, et al.
Publicado: (2023)

Prospect Theory
por: Peter P. Wakker
Publicado: (2012)

Method optimization for isolation of Klebsiella Bacteriophage from soil samples
por: Khan, Zumana Hayat
Publicado: (2021)

Reinforcement learning based autonomous vehicle for exploration and exploitation of undiscovered track
por: Issa, Razin Bin, et al.
Publicado: (2020)