Evaluating user influence in Twitter based on hashtags using data mining
Abstract
As ever more data is generated every day, processing and analyzing it has become a tremendous challenge. Big data analysis is the process of collecting, organizing and analyzing large data sets to discover patterns and other useful information. Using specialized tools and applications for predictive analysis, data mining, text mining, forecasting and data optimization, it helps reveal the information contained within the data. We work with data from Twitter, the most popular microblogging platform, to study social issues concerning various forms of harassment. Twitter users categorize status messages (Tweets) using hashtags, which are also used to search for specific topics or events. By analyzing the hashtags that represent different forms of social attacks, incidents of oppression, discrimination and cultural persecution, we can determine trends in Twitter-documented bullying among different demographics. Of the several tools used worldwide to interpret datasets, we use an advanced data mining tool called STATISTICA.
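To illustrate what hashtag-based analysis involves at its simplest, the following Python sketch extracts hashtags from tweet text and counts how often each occurs. It is not the STATISTICA workflow used in this work; the function names, the regular expression, and the sample tweets are all hypothetical and chosen only for illustration.

```python
import re
from collections import Counter

# Assumption: a hashtag is '#' followed by letters, digits or underscores.
HASHTAG_RE = re.compile(r"#(\w+)")


def extract_hashtags(tweet):
    """Return the hashtags in a tweet, lowercased and without the '#'."""
    return [tag.lower() for tag in HASHTAG_RE.findall(tweet)]


def count_hashtags(tweets):
    """Count how often each hashtag occurs across a collection of tweets."""
    counts = Counter()
    for tweet in tweets:
        counts.update(extract_hashtags(tweet))
    return counts


# Hypothetical sample tweets, invented for this example only.
sample_tweets = [
    "Speak up against #Bullying in schools #StopHate",
    "No one deserves this. #bullying",
    "Lovely weather today",
]

hashtag_counts = count_hashtags(sample_tweets)
print(hashtag_counts.most_common(2))
```

Lowercasing the tags treats #Bullying and #bullying as the same topic, which is one plausible normalization choice among several.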