Automated reference validation for scholarly publications using NLP

Khan, A S M Nasim; Khan, Mohammad Nasif Sadique; Howlader, MD. Adnan; Roy, Ayan

dc.contributor.advisor	Alam, Md. Golam Rabiul
dc.contributor.advisor	Sadeque, Farig Yousuf
dc.contributor.advisor	Rahman, Rafeed
dc.contributor.author	Khan, A S M Nasim
dc.contributor.author	Khan, Mohammad Nasif Sadique
dc.contributor.author	Howlader, MD. Adnan
dc.contributor.author	Roy, Ayan
dc.date.accessioned	2024-05-19T05:49:46Z
dc.date.available	2024-05-19T05:49:46Z
dc.date.copyright	©2024
dc.date.issued	2024-01
dc.identifier.other	ID: 19101623
dc.identifier.other	ID: 19201084
dc.identifier.other	ID: 19201076
dc.identifier.other	ID: 19201043
dc.identifier.uri	http://hdl.handle.net/10361/22863
dc.description	This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2024.	en_US
dc.description	Cataloged from PDF version of thesis.
dc.description	Includes bibliographical references (pages 41-43).
dc.description.abstract	Accurate references are a crucial aspect of scholarly writing, yet validating them manually is time-consuming and error-prone. This research introduces an updated version of an automated reference validation model intended to make the peer review process more efficient. The proposed model uses Natural Language Processing to generate sentence embeddings with an efficient algorithm. It first breaks the scholarly article down into sections and uses topic modeling to group the sections according to their context. It then generates sentence embeddings for each section and collects them into sets, which are used to calculate the semantic similarity between the query and the referred article. The methodology also handles references that are valid in non-contextual scenarios, such as those that share common named entities. Strategic feature engineering is applied to improve performance. We have created a dataset of scholarly papers with manually verified references to evaluate the efficiency and accuracy of our model. This improved version of the reference validation model aims to outperform traditional models such as Document-BERT, BERT, and SBERT in efficiency and accuracy. The model can be used in interactive real-time systems, providing quick and reliable feedback to peer reviewers. This study aims to contribute to the field of automated reference validation in scholarly publications by offering a solution to the limitations of manual validation, making the model a valuable tool for peer reviewers and researchers.	en_US
dc.description.statementofresponsibility	A S M Nasim Khan
dc.description.statementofresponsibility	Mohammad Nasif Sadique Khan
dc.description.statementofresponsibility	MD. Adnan Howlader
dc.description.statementofresponsibility	Ayan Roy
dc.format.extent	51 pages
dc.language.iso	en	en_US
dc.publisher	Brac University	en_US
dc.rights	Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Automated referencing validation	en_US
dc.subject	Natural language processing	en_US
dc.subject	Context similarity	en_US
dc.subject	Scholarly publications	en_US
dc.subject	XLNet	en_US
dc.subject	NER	en_US
dc.subject.lcsh	Natural language processing (Computer science)
dc.title	Automated reference validation for scholarly publications using NLP	en_US
dc.type	Thesis	en_US
dc.contributor.department	Department of Computer Science and Engineering, Brac University
dc.description.degree	B.Sc in Computer Science and Engineering
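
The matching step described in the abstract (sentence embeddings per section, then semantic similarity between a query and the referred article) can be illustrated with a minimal sketch. This is not the thesis's implementation: the `embed` function below is a toy hashed bag-of-words stand-in for a real sentence encoder, and the names `best_section_match`, `is_supported`, `DIM`, and the 0.5 threshold are all hypothetical choices made for illustration. Topic modeling, named-entity handling, and feature engineering are omitted.

```python
import hashlib
import math
import re

DIM = 1024  # hypothetical vector size for the toy encoder below


def embed(sentence):
    """Toy stand-in for a sentence encoder: hashed bag-of-words, L2-normalised."""
    vec = [0.0] * DIM
    for token in re.findall(r"[a-z0-9]+", sentence.lower()):
        # Map each token to a fixed bucket so the result is deterministic.
        idx = int(hashlib.md5(token.encode("utf-8")).hexdigest(), 16) % DIM
        vec[idx] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def cosine(a, b):
    """Cosine similarity; inputs are unit length (or all zero), so a dot product suffices."""
    return sum(x * y for x, y in zip(a, b))


def split_sentences(text):
    """Naive sentence splitter on terminal punctuation followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def best_section_match(query, sections):
    """Return (section_name, score) for the section whose best sentence matches the query."""
    q = embed(query)
    best = ("", 0.0)
    for name, text in sections.items():
        # Each section is represented by the set of its sentence embeddings.
        score = max((cosine(q, embed(s)) for s in split_sentences(text)), default=0.0)
        if score > best[1]:
            best = (name, score)
    return best


def is_supported(query, sections, threshold=0.5):
    """Treat the citation as supported if some section is similar enough (assumed cutoff)."""
    return best_section_match(query, sections)[1] >= threshold


# Hypothetical referred article, already split into named sections.
referred = {
    "methods": "We generate sentence embeddings for each section. "
               "Cosine similarity compares the embeddings.",
    "results": "The model reached high accuracy on the dataset.",
}
claim = "Sentence embeddings are generated for each section."
section, score = best_section_match(claim, referred)
print(section, round(score, 2))
```

In a realistic setting the toy `embed` would be replaced by a trained sentence encoder, and the threshold would be tuned on manually verified references such as the dataset the abstract mentions.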

Files in this item

Name: 19101623, 19201084, 19201076, ...
Size: 836.1Kb
Format: PDF

This item appears in the following Collection(s)

Thesis & Report, BSc (Computer Science and Engineering) [1480]