Show simple item record

dc.contributor.advisorArif, Hossain
dc.contributor.authorHaider, Mofiz Mojib
dc.contributor.authorHossin, Md. Arman
dc.contributor.authorMahi, Hasibur Rashid
dc.date.accessioned2021-05-29T15:48:36Z
dc.date.available2021-05-29T15:48:36Z
dc.date.copyright2020
dc.date.issued2020-04
dc.identifier.otherID: 16301038
dc.identifier.otherID: 17301214
dc.identifier.otherID: 16301035
dc.identifier.urihttp://dspace.bracu.ac.bd/xmlui/handle/10361/14446
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2020.en_US
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 28-29).
dc.description.abstractThe significance of text summarization in the Natural Language Processing (NLP) community has now expanded because of the staggering increase in virtual textual materials. Text summary is the process created from one or multiple texts which convey important insight in a little form of the main text. Multiple text summarization technique assists to pick indispensable points of the original texts reducing time and effort require reading the whole document. The question was approached from a different point of view, in a different domain by using different concepts. Extractive and abstractive are the two main methods of summing up text. Though extractive summary is primarily concerned with what summary content the frequency of words, phrases, and sentences from the original document should be used. This research proposes a sentence based clustering algorithm (K-Means) for a single document. For feature extraction, we have used Gensim word2vec which is intended to automatically extract semantic topics from documents in the most efficient way possible.en_US
dc.description.statementofresponsibilityMofiz Mojib Haider
dc.description.statementofresponsibilityMd. Arman Hossin
dc.description.statementofresponsibilityHasibur Rashid Mahi
dc.format.extent29 pages
dc.language.isoen_USen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectText summarizationen_US
dc.subjectExtractiveen_US
dc.subjectSingle Documenten_US
dc.subjectNLPen_US
dc.subjectGensimen_US
dc.subjectWord2Vecen_US
dc.subjectK-Meansen_US
dc.titleAutomatic text summarization using Gensim Word2Vec and K-Means Clustering Algorithmen_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB. Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record