Prediction on large scale data using extreme gradient boosting
This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2016.
Prif Awduron: | , |
---|---|
Awduron Eraill: | |
Fformat: | Traethawd Ymchwil |
Iaith: | English |
Cyhoeddwyd: |
BRAC University
2016
|
Pynciau: | |
Mynediad Ar-lein: | http://hdl.handle.net/10361/6391 |
id |
10361-6391 |
---|---|
record_format |
dspace |
spelling |
10361-63912022-01-26T10:08:24Z Prediction on large scale data using extreme gradient boosting Sawon, Md.Tariq Hasan Hosen, Md. Shazzed Mostakim, Moin Department of Computer Science and Engineering, BRAC University Extreme gradient boost Prediction modelling Sales prediction Linear regression Time series Gradient boosting This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2016. Cataloged from PDF version of thesis report. Includes bibliographical references (page 42-45). This paper presents a use case of data mining for sales forecasting in retail demand and sales prediction. In particular, the Extreme Gradient Boosting algorithm is used to design a prediction model to accurately estimate probable sales for retail outlets of a major European Pharmacy retailing company. The forecast of potential sales is based on a mixture of temporal and economical features including prior sales data, store promotions, retail competitors, school and state holidays, location and accessibility of the store as well as the time of year. The model building process was guided by common sense reasoning and by analytic knowledge discovered during data analysis and definitive conclusions were drawn. The performances of the XGBoost predictor were compared with those of more traditional regression algorithms like Linear Regression and Random Forest Regression. Findings not only reveal that the XGBoost algorithm outperforms the traditional modeling approaches with regard to prediction accuracy, but it also uncovers new knowledge that is hidden in data which help in building a more robust feature set and strengthen the sales prediction model. Md.Tariq Hasan Sawon Md. Shazzed Hosen B. Computer Science and Engineering 2016-09-08T04:45:31Z 2016-09-08T04:45:31Z 2016 2016-08 Thesis ID 11201030 ID 11221039 http://hdl.handle.net/10361/6391 en BRAC University thesis are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 45 pages application/pdf BRAC University |
institution |
Brac University |
collection |
Institutional Repository |
language |
English |
topic |
Extreme gradient boost Prediction modelling Sales prediction Linear regression Time series Gradient boosting |
spellingShingle |
Extreme gradient boost Prediction modelling Sales prediction Linear regression Time series Gradient boosting Sawon, Md.Tariq Hasan Hosen, Md. Shazzed Prediction on large scale data using extreme gradient boosting |
description |
This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2016. |
author2 |
Mostakim, Moin |
author_facet |
Mostakim, Moin Sawon, Md.Tariq Hasan Hosen, Md. Shazzed |
format |
Thesis |
author |
Sawon, Md.Tariq Hasan Hosen, Md. Shazzed |
author_sort |
Sawon, Md.Tariq Hasan |
title |
Prediction on large scale data using extreme gradient boosting |
title_short |
Prediction on large scale data using extreme gradient boosting |
title_full |
Prediction on large scale data using extreme gradient boosting |
title_fullStr |
Prediction on large scale data using extreme gradient boosting |
title_full_unstemmed |
Prediction on large scale data using extreme gradient boosting |
title_sort |
prediction on large scale data using extreme gradient boosting |
publisher |
BRAC University |
publishDate |
2016 |
url |
http://hdl.handle.net/10361/6391 |
work_keys_str_mv |
AT sawonmdtariqhasan predictiononlargescaledatausingextremegradientboosting AT hosenmdshazzed predictiononlargescaledatausingextremegradientboosting |
_version_ |
1814307611884912640 |