Interactive Text Mining Suite – Version 2 Release

ITMS – Interactive Text Mining Suite ITMS is a web application for text analysis. This application offers the computational and statistical power of R and the Shiny web application interactivity.

The new release includes the following features:

  • Import Zotero rdf files, Google Book API, json and xml
  • Dynamic preprocessing  steps
  • Stemming in multiple languages
  • Tuning parameters for cluster classification
  • Word cloud comparison
  • Word cloud customization
  • Metadata extraction

screen-shot-2016-12-19-at-4-03-25-pm

Contributors: Jefferson Davis, Irina Trapido and Jay Lee

As always, please do not hesitate to contact if you have any issues or to request new features!

Advertisements

Word and Sentence Analysis

We just added a new feature to Interactive Text Mining Tool: Word and Sentence Length.

In Data Visualization Panel, select Word Frequency Tab and click on Length. You can select a specific document and explore its content.

sentence          word

If you are interested in Punctuation Visualization (for more information, read Adam Calhoun’s Blog), select  Punctuation Analysis Tab and click Punctuation.punct

Language Variation Suite and Interactive Text Mining Tool – QR Code

Our Language Variation Suite and Interactive Text-Mining Tool are now accessible via SmartPhone and iPad.

Use QR Scanner App to scan the following QR codes, open them in your browser on Smart Phone or iPad. Make sure you have your files (csv or text files) in your Dropbox. Navigate to the Descriptive Statistics in LVS or Data Preparation in ITMS, select choose files and upload them.

code                                   code (1)

We always welcome any suggestions, feedback and bug reports!

Workshop: LVS and ITMS Introduction

    Visual Analytics – “The science of analytical reasoning facilitated by visual interactive interfaces” (Thomas et al. 2005)

Materials for workshop:

  1. Categorical Data (csv): Labov’s New York study 1966 (for more information, visit http://www.ello.uos.de/field.php/Sociolinguistics/Exemplarystudylabov)
  2. Continuous Data (csv): Corpus of Caracas (Bentivoglio & Sedano 1993)
  3.  Dante Translation (txt): dante 1, dante 2, dante 3
  4. Slides for workshop (pdf): presentation

“Mastery of quantitative methods is increasingly becoming a vital component of linguistic training” (Johnson, 2008)

Links to LVS and ITMS

 

 

Interactive Text Mining Suite ITMS

ITMS integrates visual and statistical R with an interactive Shiny application to examine unstructured data (aka text documents). At present, ITMS provides several text-mining analyses for scholarly articles and literary texts (e.g. topic, frequency and cluster analyses).
Screen shot 2016-03-18 at 12.35.39 AM

https://languagevariationsuite.shinyapps.io/TextMining/

ITMS is an ongoing project by interdisciplinary team of researchers from Indiana University (Olga Scrivner and Jefferson Davis). We are also developing an NEH proposal to advance this research.

Screen shot 2016-03-18 at 12.34.33 AM

Your feedback and suggestions as well as bug reports will be very appreciated (obscrivn AT indiana PERIOD edu).