Sentiment analysis of feature ranking methods for classification accuracy
Date
2017-01-01
Journal Title
Journal ISSN
Volume Title
Publisher
IOP Conference Series: Materials Science and Engineering.
Abstract
Text pre-processing and feature selection are important and critical steps in text
mining. Text pre-processing of large volumes of datasets is a difficult task as unstructured
raw data is converted into structured format. Traditional methods of processing and
weighing took much time and were less accurate. To overcome this challenge, feature
ranking techniques have been devised. A feature set from text preprocessing is fed as input
for feature selection. Feature selection helps improve text classification accuracy. Of the
three feature selection categories available, the filter category will be the focus. Five feature
ranking methods namely: document frequency, standard deviation information gain, CHISQUARE, and weighted-log likelihood –ratio is analyzed.
Description
Sentiment analysis of feature ranking methods for classification accuracy
Keywords
datasets, text pre-processing, feature selection, classification accuracy, feature ranking
Citation
: Shashank Joseph et al 2017. IOP Conf. Series: Materials Science and Engineering 263 (2017) 042011 doi:10.1088/1757-899X/263/4/042011