Content on this page requires a newer version of Adobe Flash Player.

Get Adobe Flash player


Call Center
Back Office
Market Research
Business Consultant
Customer Care 24x7

 

The purpose of Text Mining is to process unstructured (textual) information, extract meaningful numeric indices from the text, and, thus, make the information contained in the text accessible to the various data mining (statistical and machine learning) algorithms. Information can be extracted to derive summaries for the words contained in the documents or to compute summaries for the documents based on the words contained in them.

Hence, you can analyze words, clusters of words used in documents, etc., or you could analyze documents and determine similarities between them or how they are related to other variables of interest in the data mining project. In the most general terms, text mining will "turn text into numbers" (meaningful indices), which can then be incorporated in other analyses such as predictive data mining projects, the application of unsupervised learning methods (clustering), etc.

Typical Applications for Text Mining

Analyzing open-ended survey responses
Automatic processing of messages, emails, etc.
Analyzing warranty or insurance claims, diagnostic interviews, etc.
Investigating competitors by crawling their web sites.

Issues and Considerations for "Numericizing" Text

Large numbers of small documents vs. small numbers of large documents
Excluding certain characters, short words, numbers, etc.
Include lists, exclude lists (stop-words).
Synonyms and phrases
Stemming algorithms.
Support for different languages

Special Offer !!

10% Off on all websitedesign

10% Off on Bulk Sms Packages (Selective)

Domain name Starting from Rs 150/-

 
Quick Reach

Content on this page requires a newer version of Adobe Flash Player.

Get Adobe Flash player