TEXT MINING IN-R UTILIZING TM-PACKAGE FOR TAMIL DOCUMENT USING THIRUKKURAL
BALAJI K 1 B.MAGESH 2Journal Title | : | Asian Journal of Applied Research |
---|---|---|
DOI | : | |
Page No | : | 1-6 |
Volume | : | 1 |
Issue | : | 1 |
Month/Year | : | 1/2015 |
Keywords
Data import, count-based analysis, text analysis, text classification, corpus handling, term-document matrices,
Abstract
Tamil contains a huge amount of online text documents, it is nearly impossible to manually organize such vast data. The necessity to extract useful and relevant information such large data sets has managed to an important need to develop computationally efficient text mining techniques. The competence of text analytics applies analytic tools to acquire from collection of text documents using automated process can learn from massive amounts of texts, much more than human can. The tm package provides a structure for text mining applications facilities within R to be carried out a typical applications.