[Back]


Publications in Scientific Journals:

I. Feinerer, K. Hornik, D. Meyer:
"Text Mining Infrastructure in R";
Journal of Statistical Software, 25 (2008), 5; 54 pages.



English abstract:
During the last decade text mining has become a widely used discipline utilizing statistical and machine learning methods. We present the tm package which provides a framework for text mining applications within R. We give a survey on text mining facilities in R and explain how typical application tasks can be carried out using our framework. We present techniques for count-based analysis methods, text clustering, text classification and string kernels.

Keywords:
text mining, R, count-based evaluation, text clustering, text classification, string kernels


Electronic version of the publication:
http://www.jstatsoft.org/v25/i05


Created from the Publication Database of the Vienna University of Technology.