Analyzing CV/resume using natural language processing and machine learning

This thesis report is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017.

Bibliografski detalji
Glavni autori: Reza, Md. Tanzim, Zaman, Md. Sakib
Daljnji autori: Uddin, Dr. Jia
Format: Disertacija
Jezik:English
Izdano: BRAC University 2018
Teme:
Online pristup:http://hdl.handle.net/10361/9480
id 10361-9480
record_format dspace
spelling 10361-94802022-01-26T10:21:46Z Analyzing CV/resume using natural language processing and machine learning Reza, Md. Tanzim Zaman, Md. Sakib Uddin, Dr. Jia Department of Computer Science and Engineering, BRAC University Machine learning CV Resume Natural language NLP JSON ID3 This thesis report is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017. Cataloged from PDF version of thesis report. Includes bibliographical references. This paper proposes a model of extracting important information from the semi-structured text format in a curriculum vitae or resume and ranking it according to the preference of the associated company and requirements. In order to achieve the desired goal, the entire process has been divided into 3 basic segments. The first segment consists of segmenting the entire CV / Resume based on the topic of each part, the second segment consists of extracting data in structured form from the unstructured data and the final segment consists of evaluating the structured data by decision tree algorithm and training the system. The structured data extraction process is done by segmenting the entire CV / Resume by converting it to HTML. After the conversion to structured data, decision tree algorithm techniques are used to classify the input into different categories based on qualifications and then the data with positive weight is used to train the system for future benefit. Finally, classifier algorithm apart from decision tree such as logistic regression is used to compare the classification result. Md. Tanzim Reza Md. Sakib Zaman B. Computer Science and Engineering 2018-02-15T07:49:31Z 2018-02-15T07:49:31Z 2017 2017 Thesis ID 14101061 ID 14101171 http://hdl.handle.net/10361/9480 en BRAC University thesis reports are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. application/pdf BRAC University
institution Brac University
collection Institutional Repository
language English
topic Machine learning
CV
Resume
Natural language
NLP
JSON
ID3
spellingShingle Machine learning
CV
Resume
Natural language
NLP
JSON
ID3
Reza, Md. Tanzim
Zaman, Md. Sakib
Analyzing CV/resume using natural language processing and machine learning
description This thesis report is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017.
author2 Uddin, Dr. Jia
author_facet Uddin, Dr. Jia
Reza, Md. Tanzim
Zaman, Md. Sakib
format Thesis
author Reza, Md. Tanzim
Zaman, Md. Sakib
author_sort Reza, Md. Tanzim
title Analyzing CV/resume using natural language processing and machine learning
title_short Analyzing CV/resume using natural language processing and machine learning
title_full Analyzing CV/resume using natural language processing and machine learning
title_fullStr Analyzing CV/resume using natural language processing and machine learning
title_full_unstemmed Analyzing CV/resume using natural language processing and machine learning
title_sort analyzing cv/resume using natural language processing and machine learning
publisher BRAC University
publishDate 2018
url http://hdl.handle.net/10361/9480
work_keys_str_mv AT rezamdtanzim analyzingcvresumeusingnaturallanguageprocessingandmachinelearning
AT zamanmdsakib analyzingcvresumeusingnaturallanguageprocessingandmachinelearning
_version_ 1814309481841950720