Analyzing optimization landscape of recent policy optimization methods in deep RL

Analyzing optimization landscape of recent policy optimization methods in deep RL

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.

Chi tiết về thư mục
Những tác giả chính:	Khan, Mahir Asaf, Ashraf, Adib, Amin, Tahmid Adib
Tác giả khác:	Rashid, Warida
Định dạng:	Luận văn
Ngôn ngữ:	English
Được phát hành:	Brac University 2023
Những chủ đề:	Optimization landscape Policy optimization Deep reinforcement learning Variance reduction Control variates Cognitive learning theory Machine learning
Truy cập trực tuyến:	http://hdl.handle.net/10361/18306

Những quyển sách tương tự

Implementation of reinforcement learning architecture to augment an AI that can self-learn to play video games
Bằng: Mahmud, Aqil, et al.
Được phát hành: (2023)

ROBB: recurrent proximal policy optimization reinforcement learning for optimal block formation in bitcoin blockchain network
Bằng: Dutta, Amit
Được phát hành: (2024)

Combinatorial optimization : algorithms and complexity /
Bằng: Papadimitriou, Christos H.
Được phát hành: (1998)

Convex optimization /
Bằng: Boyd, Stephen P.
Được phát hành: (2004)

Self-learning game bot using deep reinforcement learning
Bằng: Ananto, Azizul Haque
Được phát hành: (2018)

Optimal energy rendering approach from lightning return stroke
Bằng: Chowdhury, A.S.M. Mishkat Hussain, et al.
Được phát hành: (2016)

Elements of dynamic optimization /
Bằng: Chiang, Alpha C., 1927-
Được phát hành: (1992)

Elements of dynamic optimization /
Bằng: Chiang, Alpha C., 1927-
Được phát hành: (2012)

Convex optimization /
Bằng: Boyd, Stephen P.
Được phát hành: (1994)

Optimizing compilers for modern architectures : a dependence-based approach /
Bằng: Allen, Randy
Được phát hành: (2001)

Implementation of real-time learning on homomorphically encrypted visual inputs
Bằng: Bhuiyan, Emtiaz MD Tafsir, et al.
Được phát hành: (2021)

Optimal capacitor placement in radial distribution system for loss minimization using particle swarm optimization
Bằng: Ismail, Abdiwahab Mohamed
Được phát hành: (2024)

Early stage detection and classification of colon cancer using deep learning and explainable AI on histopathological images
Bằng: Hossain, Mainul, et al.
Được phát hành: (2022)

Character animation using reinforcement learning and imitation learning algorithms
Bằng: Tahmid, Tokey, et al.
Được phát hành: (2021)

Traﬃc congestion reduction in SUMO using reinforcement learning method
Bằng: Mouly, Radia Rahman, et al.
Được phát hành: (2021)

Skin cancer detection and classification using multiple optimized deep convolutional neural network
Bằng: Sakir, Adnan, et al.
Được phát hành: (2023)

Applied shape optimization for fluids
Bằng: Mohammadi, B.

Dynamic power management by reinforcement learning
Bằng: Hossain, Safayet, et al.
Được phát hành: (2016)

Accelerating ant colony optimization by using local search
Bằng: Tabassum, Nabila, et al.
Được phát hành: (2015)

Iterative Methods in Combinatorial Optimization
Bằng: Lap Chi Lau, R. Ravi, Mohit Singh
Được phát hành: (2012)

Importance of educational data mining for optimized operations in Brac University
Bằng: Saad, Mohammad Alif Hossain
Được phát hành: (2021)

How we learn and why we don't : student survival guide using the cognitive profile inventory /
Bằng: Krause, Lois Breur
Được phát hành: (2008)

An efficient deep learning approach to detect skin Cancer
Bằng: Islam, Ashfaqul, et al.
Được phát hành: (2022)

Mechanism Design
Bằng: Rakesh V. Vohra
Được phát hành: (2013)

Yoga posture recognition using the deep learning process
Bằng: Islam, Abidul, et al.
Được phát hành: (2023)

ShopUp: transforming business through product optimization
Bằng: Tamim, Farhad Hassan
Được phát hành: (2018)

Reinforcement learning : an introduction /
Bằng: Sutton, Richard S., et al.
Được phát hành: (2018)

Resource optimization in cloud computing using dynamic load balancing technique
Bằng: Rafid, Mutasim, et al.
Được phát hành: (2021)

Reinforcement learning based electricity price forecasting in Blockchain based smart grid environment
Bằng: Moti, Md Mahraj Murshalin Al, et al.
Được phát hành: (2021)

Real-time mastitis detection in livestock using deep learning and machine learning leveraging edge devices
Bằng: Ghosh, Kawshik Kumar, et al.
Được phát hành: (2023)

A conventional & deep learning strategy for analyzing & detecting Bengali fake news in online medium
Bằng: Ahmed, Istiak, et al.
Được phát hành: (2023)

Classification of peripheral blood cell images using deep learning
Bằng: Aadi, Oyshik Ahmed, et al.
Được phát hành: (2024)

Importance of Search Engine Optimization (SEO) for businesses in Bangladesh
Bằng: Chishty, Wadud
Được phát hành: (2018)

Essentials of learning : the new cognitive learning for students of education /
Bằng: Travers, Robert Morris William, 1913-

Resource-aware task scheduling by an adversarial bandit solver method in wireless sensor networks
Bằng: Khan, Muhidul Islam
Được phát hành: (2016)

A modern technique to detect potholes by Computer Vision and Deep Learning
Bằng: Saif, Muntasir Mahmud, et al.
Được phát hành: (2023)

Corn leaf disease detection using deep convolution neural network
Bằng: Rabbi, Rawhatur, et al.
Được phát hành: (2023)

Prospect Theory
Bằng: Peter P. Wakker
Được phát hành: (2012)

Method optimization for isolation of Klebsiella Bacteriophage from soil samples
Bằng: Khan, Zumana Hayat
Được phát hành: (2021)

Reinforcement learning based autonomous vehicle for exploration and exploitation of undiscovered track
Bằng: Issa, Razin Bin, et al.
Được phát hành: (2020)