References validation in scholarly articles using RoBERTa

Nasib, Abdullah Umar

dc.contributor.advisor	Alam, Md. Golam Rabiul
dc.contributor.author	Nasib, Abdullah Umar
dc.date.accessioned	2023-09-25T06:25:45Z
dc.date.available	2023-09-25T06:25:45Z
dc.date.copyright	2023
dc.date.issued	2023-05
dc.identifier.other	ID 19166009
dc.identifier.uri	http://hdl.handle.net/10361/21226
dc.description	This project report is submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science and Engineering, 2023.	en_US
dc.description	Cataloged from the PDF version of the project report.
dc.description	Includes bibliographical references (pages 36-37).
dc.description.abstract	In the recent era of technological advancement, evaluating references and assignments and validating those are some of the primary processes to justify the authenticity of a research work in the academic sector. The purpose of referencing is to ensure ethical writing and to make the search easier in a particular area for the reader when it is accurate. The traditional way to cross-check the validations is to check them manually one by one, which is above argument, equivalent to an- other work, and sometimes so tiring that the reader loses interest in reading the original paper. The target of this research paper is to introduce a semi-automatic digital system that enables researchers to justify the references used in a research paper without doing it manually. In this model, we have prepared a sentence trans- former named RoBERTa for generating embedding. We sanitize and preprocess an entire research paper and cross-match that against a reference query using the proposed model in terms of finding semantic and contextual similarity. The result shows mostly similar contexts based on the similarity check. We have compared the embedding of query and user input articles with the help of K similar search function. Our model outperformed the existing BERT and SBERT models’ output in accuracy with a F1 score of 0.777, which establishes the fact that the model can be used in real life with a simple query of text from research articles.	en_US
dc.description.statementofresponsibility	Abdullah Umar Nasib
dc.format.extent	47 pages
dc.language.iso	en	en_US
dc.publisher	Brac University	en_US
dc.rights	Brac University project reports are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Reference validation	en_US
dc.subject	Semantic similarity analysis	en_US
dc.subject	Contextual similarity analysis	en_US
dc.subject	RoBERTa	en_US
dc.subject	K similar search	en_US
dc.subject	Natural language processing	en_US
dc.subject.lcsh	Natural language processing (Computer science)
dc.title	References validation in scholarly articles using RoBERTa	en_US
dc.type	Project report	en_US
dc.contributor.department	Department of Computer Science and Engineering, Brac University
dc.description.degree	M. Computer Science and Engineering

Files in this item

Name:: 19166009_CSE.pdf
Size:: 655.2Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Thesis & Report, MSc (Computer Science and Engineering) [74]

Show simple item record