Sentiment analysis of feature ranking methods for classification accuracy

Thumbnail Image

Date

2017-01-01

Journal Title

Journal ISSN

Volume Title

Publisher

IOP Conference Series: Materials Science and Engineering.

Abstract

Text pre-processing and feature selection are important and critical steps in text mining. Text pre-processing of large volumes of datasets is a difficult task as unstructured raw data is converted into structured format. Traditional methods of processing and weighing took much time and were less accurate. To overcome this challenge, feature ranking techniques have been devised. A feature set from text preprocessing is fed as input for feature selection. Feature selection helps improve text classification accuracy. Of the three feature selection categories available, the filter category will be the focus. Five feature ranking methods namely: document frequency, standard deviation information gain, CHISQUARE, and weighted-log likelihood –ratio is analyzed.

Description

Sentiment analysis of feature ranking methods for classification accuracy

Keywords

datasets, text pre-processing, feature selection, classification accuracy, feature ranking

Citation

: Shashank Joseph et al 2017. IOP Conf. Series: Materials Science and Engineering 263 (2017) 042011 doi:10.1088/1757-899X/263/4/042011

Endorsement

Review

Supplemented By

Referenced By