Application of data mining identifying topics at the document level
Abstract
Data mining techniques are very popular in modern days and are used in NLP (Natural Language Processing).
One of the techniques like clustering items to groups has been used way back. This technique is applied to find
different topics for natural documents. In our thesis we aim to replicate some of these results and empirically
verify this measure to identify hypothetical topic boundaries.