FFTM: A Fuzzy Feature Transformation Method for Medical Documents

Date

2014-06

Department

Program

Citation of Original Publication

Karami, Amir; Gangopadhyay, Aryya; FFTM: A Fuzzy Feature Transformation Method for Medical Documents; Proceedings of BioNLP 2014, pages 128-133, June, 2014; http://dx.doi.org/10.3115/v1/W14-3419

Rights

This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)

Subjects

Abstract

The vast array of medical text data represents a valuable resource that can be analyzed to advance the state of the art in medicine. Currently, text mining methods are being used to analyze medical research and clinical text data. Some of the main challenges in text analysis are high dimensionality and noisy data. There is a need to develop novel feature transformation methods that help reduce the dimensionality of data and improve the performance of machine learning algorithms. In this paper we present a feature transformation method named FFTM. We illustrate the efficacy of our method using local term weighting, global term weighting, and Fuzzy clustering methods and show that the quality of text analysis in medical text documents can be improved. We compare FFTM with Latent Dirichlet Allocation (LDA) by using two different datasets and statistical tests show that FFTM outperforms LDA.