dc.contributor.advisor | Hossain, Muhammad Iqbal | |
dc.contributor.advisor | Mukta, Jannatun Noor | |
dc.contributor.advisor | Ahmed, Md. Faisal | |
dc.contributor.author | Zilane, Md. Shahanur | |
dc.contributor.author | Rahman, Mohammad Mushfiqur | |
dc.contributor.author | Jisa, Aniqa Ibnat | |
dc.contributor.author | Elma, Qurratul Ayen | |
dc.contributor.author | Rifat, Asaduzzaman | |
dc.date.accessioned | 2025-02-05T06:38:50Z | |
dc.date.available | 2025-02-05T06:38:50Z | |
dc.date.copyright | ©2024 | |
dc.date.issued | 2024-10 | |
dc.identifier.other | ID 20301225 | |
dc.identifier.other | ID 20301022 | |
dc.identifier.other | ID 20201136 | |
dc.identifier.other | ID 20201121 | |
dc.identifier.other | ID 20301003 | |
dc.identifier.uri | http://hdl.handle.net/10361/25322 | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2024. | en_US |
dc.description | Cataloged from PDF version of thesis. | |
dc.description | Includes bibliographical references (pages 56-59). | |
dc.description.abstract | The evolving complexity and frequency of cyber threat incidents demand the development
of robust, user-friendly systems that can educate and assist users, and
help them understand and mitigate them as much as possible. This thesis describes
Risk Radar, a query-based information retrieval and response system that uses some
advanced Natural Language Processing (NLP) methods to provide precise, contextaware
responses to cybersecurity data.
The system employs a multi-module architecture, with each module tailored to
a specific task, such as query correction, semantic sequence analysis, information
retrieval, and response generation. The core NLP models used are BERT for semantic
similarity, BM25 for effective retrieval of relevant content, and a distilled
BART model for summarization and context-based response generation. A unique
rule-based mechanism improves query understanding and maintains contextual continuity
across user interactions, addressing the challenges of multi-turn dialogue in
technical. The proposed system not only provides detailed responses, but it also includes
relevant articles to help users better understand specific incidents or trends.
The system’s performance is measured by its ability to retain the context of user
queries, retrieve and rank relevant content accurately, and generate coherent, informative
responses.
The system’s real-time implementation dynamically updates the dataset based on
daily scraping of cybersecurity articles, ensuring that responses are timely and relevant.
To address computational constraints, the model architecture prefers efficient
methods like sequence-based rule application and DistilBART over more computationally
intensive models like GPT-Neo. This trade-off balances accuracy and
resource availability, resulting in a solution that is both practical and efficient. This
thesis aims to contribute a scalable, efficient solution for tackling the growing need
for real-time, user-oriented cybersecurity information systems. | en_US |
dc.description.statementofresponsibility | Md. Shahanur Zilane | |
dc.description.statementofresponsibility | Mohammad Mushfiqur Rahman | |
dc.description.statementofresponsibility | Aniqa Ibnat Jisa | |
dc.description.statementofresponsibility | Qurratul Ayen Elma | |
dc.description.statementofresponsibility | Asaduzzaman Rifat | |
dc.format.extent | 70 pages | |
dc.language.iso | en | en_US |
dc.publisher | BRAC University | en_US |
dc.rights | BRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | NLP | en_US |
dc.subject | BERT | en_US |
dc.subject | BART | en_US |
dc.subject | K-means clustering | en_US |
dc.subject | Information retrieval | en_US |
dc.subject | Query classification | en_US |
dc.subject | BM25 | en_US |
dc.subject | Autoencoders | en_US |
dc.subject | Semantic analysis | en_US |
dc.subject.lcsh | Natural language processing (Computer science). | |
dc.subject.lcsh | Information retrieval. | |
dc.subject.lcsh | Computer security. | |
dc.title | RiskRadar: an NLP-driven summarization system for query-based security insights | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, BRAC University | |
dc.description.degree | B.Sc. in Computer Science and Engineering | |