dc.contributor.advisor | Alam, Md. Golam Rabiul | |
dc.contributor.advisor | Sadeque, Farig Yousuf | |
dc.contributor.advisor | Rahman, Rafeed | |
dc.contributor.author | Khan, A S M Nasim | |
dc.contributor.author | Khan, Mohammad Nasif Sadique | |
dc.contributor.author | Howlader, MD. Adnan | |
dc.contributor.author | Roy, Ayan | |
dc.date.accessioned | 2024-05-19T05:49:46Z | |
dc.date.available | 2024-05-19T05:49:46Z | |
dc.date.copyright | ©2024 | |
dc.date.issued | 2024-01 | |
dc.identifier.other | ID: 19101623 | |
dc.identifier.other | ID: 19201084 | |
dc.identifier.other | ID: 19201076 | |
dc.identifier.other | ID: 19201043 | |
dc.identifier.uri | http://hdl.handle.net/10361/22863 | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2024. | en_US |
dc.description | Cataloged from PDF version of thesis. | |
dc.description | Includes bibliographical references (pages 41-43). | |
dc.description.abstract | Accurate references in scholarly publications are a crucial aspect of scientific writing.
The manual validation of references can be a time-consuming and error-prone
process. This research introduces an updated version of the automated referencing
validation model that makes the peer review process efficient. The proposed
model uses Natural Language Processing to generate sentence
embeddings with an efficient algorithm. Our model first breaks down the
scholarly article into sections and uses topic modeling to group the sections according
to their context. After that, it generates sentence embeddings for
each section. Sets of these embeddings are then used to calculate the semantic
similarity between the query and the referred article. Additionally, this methodology
addresses references that are valid in non-contextual scenarios, such as sharing common
named entities. Lastly, strategic feature engineering is also used for better
performance. We have created a dataset of scholarly papers with manually verified
references to evaluate the efficiency and accuracy of our model. This improved version
of the referencing validation model aims to outperform traditional models such
as Document-BERT, BERT, and SBERT regarding efficiency and accuracy. The
model can be used in interactive real-time systems, providing quick and reliable
feedback to peer reviewers. This study aims to contribute to the field
of automated referencing validation in scholarly publications. The model offers a
solution to the limitations of manual validation, which makes it a valuable tool for
peer reviewers and researchers. | en_US |
dc.description.statementofresponsibility | A S M Nasim Khan | |
dc.description.statementofresponsibility | Mohammad Nasif Sadique Khan | |
dc.description.statementofresponsibility | MD. Adnan Howlader | |
dc.description.statementofresponsibility | Ayan Roy | |
dc.format.extent | 51 pages | |
dc.language.iso | en | en_US |
dc.publisher | Brac University | en_US |
dc.rights | Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Automated referencing validation | en_US |
dc.subject | Natural language processing | en_US |
dc.subject | Context similarity | en_US |
dc.subject | Scholarly publications | en_US |
dc.subject | XLNet | en_US |
dc.subject | NER | en_US |
dc.subject.lcsh | Natural language processing (Computer science) | |
dc.title | Automated reference validation for scholarly publications using NLP | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, Brac University | |
dc.description.degree | B.Sc in Computer Science and Engineering | |