Show simple item record

dc.contributor.advisorSadeque, Farig Yousuf
dc.contributor.authorAhmed, Saib
dc.date.accessioned2025-02-18T06:53:25Z
dc.date.available2025-02-18T06:53:25Z
dc.date.copyright2024
dc.date.issued2024-10
dc.identifier.otherID 22166032
dc.identifier.urihttp://hdl.handle.net/10361/25436
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science, 2024.en_US
dc.descriptionCataloged from the PDF version of the thesis.
dc.descriptionIncludes bibliographical references (pages 49-52).
dc.description.abstractDocumenting clinical notes is a vital but time-consuming task in healthcare. Even in this modern era medical doctors spend considerable time documenting clinical notes from encounters with patients. While there have been significant advancements in general text summarization, research in clinical conversation summarization remains sparse due to the scarcity of open-source datasets available to the NLP community. Accurate summarization is paramount in clinical note generation, given its implications for human health. Our research demonstrates the efficacy of decoder-only models over traditional encoder-decoder models in generating more precise clinical notes from doctor-patient conversations. The study also tackles key challenges such as ensuring medical accuracy and complying with healthcare privacy standards. We utilized the MTS-DIALOG dataset [28], including 1, 700 such dialogues and corresponding clinical notes. This dataset was featured in the 2023 MEDIQAChat challenge, where the leading team, WangLab achieved a state-ofthe- art (SOTA) Rouge-1 score of 0.4466 and BERTScore of 0.7307 [27]. Our study surpasses these benchmarks by fine-tuning the ”metallama/Meta-Llama-3-8B” model enhanced with Qlora 8-bit quantization. We assessed our models using Rouge scores and BERT Scores to validate their superiority in performance. By evaluating the system on real-world clinical conversations, we show that the decoder-only LLM-generated notes closely match human-written ones in terms of completeness and clinical relevance. This research highlights the potential for decoder-only LLMs to revolutionize clinical workflows, making medical documentation more efficient while allowing doctors to focus more on patient care.en_US
dc.description.statementofresponsibilitySaib Ahmed
dc.format.extent64 pages
dc.language.isoenen_US
dc.publisherBRAC Universityen_US
dc.rightsBRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectClinicalNLPen_US
dc.subjectDialouge2Noteen_US
dc.subjectBERTen_US
dc.subjectData documentationen_US
dc.subjectMedical documentationen_US
dc.subjectText summarizationen_US
dc.subjectNote generation
dc.subject.lcshInformation storage and retrieval systems--Technology.
dc.subject.lcshMedical records--Management.
dc.subject.lcshNatural language processing (Computer science).
dc.subject.lcshMedical records--Data processing.
dc.titleClinical note generation from doctor-patient conversations using decoder-only large language modelsen_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, BRAC University
dc.description.degreeM.Sc. in Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record