CALL FOR PAPERS : DEC-2018

Submission Last Date	:	30-Dec-2018
Acceptance Notification	:	in 15 days
Publication Date	:	in 5 days

Submit Manuscript Online

FOR AUTHORS

FOR REVIEWERS

IJRET^® PUBLICATIONS

DOWNLOADS

CONTACT US

NEWS & UPDATES

Call for Paper Vol-7 Iss-02 Feb-2018

IJRET invites papers from various engineering disciplines for Volume-07 Issue-02, Feb-2018.

Submit Manuscript

Published Vol-07 Iss-01 Jan-18

IJRET Volume-07 Issue-01, Jan-2018 is published now.

Browse Papers

CLASSIFICATION OF TEXT DATA USING FEATURE CLUSTERING ALGORITHM

Avinash Guru, Asma Parveen

Abstract: Feature clustering is a powerful method to reduce the dimensionality of feature vectors for text classification. Generally clustering means the collection of similar objects or data in groups. In this paper, we propose a feature clustering algorithm for classifying the text data. The document set contains number of words; these words are grouped into clusters based on the similarity. Words that are similar to each other are grouped into the same cluster, and the words that are not similar are grouped in another cluster. Each cluster is characterized by a membership function with statistical mean and deviation. When all the words are fed in the document then the clusters are formed automatically. Then the extracted feature starts functioning as it is based on the weighted combination of the words. By this algorithm, the derived membership functions match closely with and describe properly the real distribution of the training data. Earlier, the user has to specify the extracted feature in advance but now it is not required as the clusters are formed automatically and the trial and error method can be avoided. The experimental results show that our method can run faster and obtain better extracted features than other methods.

Keywords: Feature clustering, feature extraction, feature reduction, text classification.

DOI: https://doi.org/10.15623/ijret.2014.0315062