Analyzing optimization landscape of recent policy optimization methods in deep RL

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.

Bibliographic Details
Main Authors:	Khan, Mahir Asaf; Ashraf, Adib; Amin, Tahmid Adib
Other Authors:	Rashid, Warida
Format:	Thesis
Language:	English
Published:	Brac University 2023
Subjects:	Optimization landscape; Policy optimization; Deep reinforcement learning; Variance reduction; Control variates; Cognitive learning theory; Machine learning
Online Access:	http://hdl.handle.net/10361/18306

Similar Items

Implementation of reinforcement learning architecture to augment an AI that can self-learn to play video games
By: Mahmud, Aqil, et al.
Published: (2023)

ROBB: recurrent proximal policy optimization reinforcement learning for optimal block formation in bitcoin blockchain network
By: Dutta, Amit
Published: (2024)

Combinatorial optimization : algorithms and complexity /
By: Papadimitriou, Christos H.
Published: (1998)

Convex optimization /
By: Boyd, Stephen P.
Published: (2004)

Self-learning game bot using deep reinforcement learning
By: Ananto, Azizul Haque
Published: (2018)

Optimal energy rendering approach from lightning return stroke
By: Chowdhury, A.S.M. Mishkat Hussain, et al.
Published: (2016)

Elements of dynamic optimization /
By: Chiang, Alpha C., 1927-
Published: (1992)

Elements of dynamic optimization /
By: Chiang, Alpha C., 1927-
Published: (2012)

Convex optimization /
By: Boyd, Stephen P.
Published: (1994)

Optimizing compilers for modern architectures : a dependence-based approach /
By: Allen, Randy
Published: (2001)

Implementation of real-time learning on homomorphically encrypted visual inputs
By: Bhuiyan, Emtiaz MD Tafsir, et al.
Published: (2021)

Optimal capacitor placement in radial distribution system for loss minimization using particle swarm optimization
By: Ismail, Abdiwahab Mohamed
Published: (2024)

Early stage detection and classification of colon cancer using deep learning and explainable AI on histopathological images
By: Hossain, Mainul, et al.
Published: (2022)

Character animation using reinforcement learning and imitation learning algorithms
By: Tahmid, Tokey, et al.
Published: (2021)

Traffic congestion reduction in SUMO using reinforcement learning method
By: Mouly, Radia Rahman, et al.
Published: (2021)

Skin cancer detection and classification using multiple optimized deep convolutional neural network
By: Sakir, Adnan, et al.
Published: (2023)

Applied shape optimization for fluids
By: Mohammadi, B.

Dynamic power management by reinforcement learning
By: Hossain, Safayet, et al.
Published: (2016)

Accelerating ant colony optimization by using local search
By: Tabassum, Nabila, et al.
Published: (2015)

Iterative Methods in Combinatorial Optimization
By: Lap Chi Lau, R. Ravi, Mohit Singh
Published: (2012)

Importance of educational data mining for optimized operations in Brac University
By: Saad, Mohammad Alif Hossain
Published: (2021)

How we learn and why we don't : student survival guide using the cognitive profile inventory /
By: Krause, Lois Breur
Published: (2008)

An efficient deep learning approach to detect skin cancer
By: Islam, Ashfaqul, et al.
Published: (2022)

Mechanism Design
By: Rakesh V. Vohra
Published: (2013)

Yoga posture recognition using the deep learning process
By: Islam, Abidul, et al.
Published: (2023)

ShopUp: transforming business through product optimization
By: Tamim, Farhad Hassan
Published: (2018)

Reinforcement learning : an introduction /
By: Sutton, Richard S., et al.
Published: (2018)

Resource optimization in cloud computing using dynamic load balancing technique
By: Rafid, Mutasim, et al.
Published: (2021)

Reinforcement learning based electricity price forecasting in Blockchain based smart grid environment
By: Moti, Md Mahraj Murshalin Al, et al.
Published: (2021)

Real-time mastitis detection in livestock using deep learning and machine learning leveraging edge devices
By: Ghosh, Kawshik Kumar, et al.
Published: (2023)

A conventional & deep learning strategy for analyzing & detecting Bengali fake news in online medium
By: Ahmed, Istiak, et al.
Published: (2023)

Classification of peripheral blood cell images using deep learning
By: Aadi, Oyshik Ahmed, et al.
Published: (2024)

Importance of Search Engine Optimization (SEO) for businesses in Bangladesh
By: Chishty, Wadud
Published: (2018)

Essentials of learning : the new cognitive learning for students of education /
By: Travers, Robert Morris William, 1913-

Resource-aware task scheduling by an adversarial bandit solver method in wireless sensor networks
By: Khan, Muhidul Islam
Published: (2016)

A modern technique to detect potholes by Computer Vision and Deep Learning
By: Saif, Muntasir Mahmud, et al.
Published: (2023)

Corn leaf disease detection using deep convolution neural network
By: Rabbi, Rawhatur, et al.
Published: (2023)

Prospect Theory
By: Peter P. Wakker
Published: (2012)

Method optimization for isolation of Klebsiella Bacteriophage from soil samples
By: Khan, Zumana Hayat
Published: (2021)

Reinforcement learning based autonomous vehicle for exploration and exploitation of undiscovered track
By: Issa, Razin Bin, et al.
Published: (2020)