dc.contributor.advisor | Chakrabarty, Amitabha | |
dc.contributor.author | Ghosh, Gitanjali | |
dc.contributor.author | Tabassum, Hridita | |
dc.contributor.author | Atika, Afra | |
dc.contributor.author | Kutubuddi, Zainab | |
dc.date.accessioned | 2021-10-07T06:44:36Z | |
dc.date.available | 2021-10-07T06:44:36Z | |
dc.date.copyright | 2021 | |
dc.date.issued | 2021-01 | |
dc.identifier.other | ID 17101228 | |
dc.identifier.other | ID 17101446 | |
dc.identifier.other | ID 17101206 | |
dc.identifier.other | ID 17101198 | |
dc.identifier.uri | http://hdl.handle.net/10361/15174 | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021. | en_US |
dc.description | Cataloged from PDF version of thesis. | |
dc.description | Includes bibliographical references (page 41-43). | |
dc.description.abstract | Online Recruitment fraud (ORF) is becoming an important issue in the cyber-crime
region. Companies find it easier to hire people with the help of the internet rather
than the old traditional way. But it has greatly attracted the scammers to deceive
people and exploit their information. There have been lots of incidents where innocent people have fallen for this malicious fraud and lost millions of money. Even it
causes harm to business and the economy. Unlike other cyber-security problems, like
email spam, phishing, opinion fraud, detecting Online Recruitment Fraud(ORF) did
not get that much of recognition. So, this matter needed to be highlighted more. In
this paper, we have proposed a solution on how to detect ORF. We have presented
our results based on the previous model and also presented the methodologies which
we are going to use to create the ORF detection model where we are using our own
dataset. We are going to use a publicly accessible dataset from fake job postings.csv,
license-CC0: Public Domain, as a reference for the dataset that we have created.
Furthermore, we have collected 4000 data from different job sites in Bangladesh,
among which 301 of them are fraudulent. We have used many common and latest
classification models to detect which algorithm works best for our model. Logistic
Regression, AdaBoost, Decision Tree Classifier, Random Forest, Voting Classifier,
LightGBM, Gradient Boosting are the algorithms that have been used. From our
observations we have found that the accuracy of different prediction models are:
Logistic Regression(94.67%), AdaBoost(95%), Decision Tree Classifier(95%), Random Forest(95%), Voting Classifier(95.34%), LightGBM(95.17%), Gradient Boosting(95.17%). Through this report, we tried to create a precise way for detecting the
fraudulent hiring posts. | en_US |
dc.description.statementofresponsibility | Gitanjali Ghosh | |
dc.description.statementofresponsibility | Hridita Tabassum | |
dc.description.statementofresponsibility | Afra Atika | |
dc.description.statementofresponsibility | Zainab Kutubuddin | |
dc.format.extent | 43 pages | |
dc.language.iso | en | en_US |
dc.publisher | Brac University | en_US |
dc.rights | Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Machine Learning | en_US |
dc.subject | Fraud Detection | en_US |
dc.subject | Prediction | en_US |
dc.subject | Decision Tree Classifier | en_US |
dc.subject | Logistic Regression algorithm | en_US |
dc.subject | Adaptive Boosting | en_US |
dc.subject | Random Forest Classifier | en_US |
dc.subject | Decision trees | en_US |
dc.subject | Gradient Boost | en_US |
dc.subject | LightGBM | en_US |
dc.subject.lcsh | Machine learning | |
dc.title | Detecting online recruitment fraud by using machine learning | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, Brac University | |
dc.description.degree | B. Computer Science | |