Shiny Web Application – Workshop

“The impact of data scientists’ work depends on how well others can understand their insights to take further actions” (blog)

In this workshop, I will introduce declarative reactive web frameworks, which allow for interactive, user-friendly data visualization and analytics, with a focus on Shiny. Shiny is an R package for building interactive web applications for data visualization.

You will learn some Shiny basics: how to build a reactive app and deploy it to a server.
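
To make the idea of reactivity concrete, here is a minimal sketch written in Python rather than R. It is not Shiny's API: the ReactiveValue class and the render function are hypothetical names invented for illustration. It only shows the pattern that a reactive framework automates, where an output is declared once and re-runs whenever the input it depends on changes.

```python
class ReactiveValue:
    """Hypothetical stand-in for a reactive input (not part of Shiny)."""

    def __init__(self, value):
        self._value = value
        self._observers = []

    def get(self):
        return self._value

    def set(self, value):
        # Store the new value, then re-run every registered output.
        self._value = value
        for observer in list(self._observers):
            observer()

    def observe(self, func):
        # Register an output and run it once to produce the initial result.
        self._observers.append(func)
        func()


rendered = []
bins = ReactiveValue(10)  # plays the role of a slider input


def render():
    # Plays the role of a plot output that depends on `bins`.
    rendered.append(f"histogram with {bins.get()} bins")


bins.observe(render)
bins.set(25)  # changing the input triggers a re-render
print(rendered)
```

In Shiny the dependency between inputs and outputs is tracked automatically, so the explicit observe call in this sketch has no direct counterpart in an app's code.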

Workshop materials:

  1. R installation instructions – slides
  2. CSV file – download
  3. Workshop slides – slides
  4. Shiny workshop files – zip file
  5. Video of the workshop – YouTube

Credits: Some ideas are based on the great tutorial by Dean Attali.

Seminar on ITMS and LVS: Quantitative Methods and Text Mining

The main objective of this workshop is to introduce researchers to user-friendly analytical tools. The Interactive Text Mining Suite (ITMS) and the Language Variation Suite (LVS) are two web-based tools for visualization and quantitative analysis. In contrast to existing software programs (e.g., SAS, SPSS, and Tableau), these two applications are built in R and require no installation or programming skills.

This hands-on workshop will provide an overview of the statistical and text-mining techniques available in these tools. You will learn how to import CSV, text, and PDF files, create plots, and run statistical analyses, including conditional trees and random forests. You will also learn about natural language pre-processing techniques, such as stopword removal and stemming. Finally, you will be able to perform topic modeling and cluster analysis.
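
As a rough illustration of the two pre-processing steps named above, here is a small Python sketch. It is not the code ITMS runs (ITMS is built in R): the stopword list is a toy list, and the suffix-stripping rule is a deliberately crude stand-in for a real stemming algorithm such as Porter's. All names in it are assumptions made for this example.

```python
import re

# Toy stopword list, assumed for illustration only.
STOPWORDS = {"the", "a", "an", "of", "and", "is", "are", "in", "to"}

# Crude suffix rule, not a real stemmer.
SUFFIXES = ("ing", "ed", "es", "s")


def tokenize(text):
    """Lowercase the text and keep runs of ASCII letters."""
    return re.findall(r"[a-z]+", text.lower())


def remove_stopwords(tokens):
    """Drop tokens that appear in the stopword list."""
    return [t for t in tokens if t not in STOPWORDS]


def crude_stem(token):
    """Strip the first matching suffix if at least 3 letters remain."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


tokens = tokenize("The speakers are talking in the streets")
content = remove_stopwords(tokens)
stems = [crude_stem(t) for t in content]
print(stems)
```

A rule this simple will mis-stem many words, which is why real tools rely on language-specific stemmers.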

Part 1: Quantitative Methods – Language Variation Suite LVS slides

Part 2: Text Mining Methods – Interactive Text Mining Suite ITMS slides

Workshop exercise materials: zip file

Language Variation Suite – v.2 release

Language Variation Suite has been released with more customizable features for language variation analysis.

New features include:

  1. Plot customization – titles, labels, colors
  2. Redesigned user-friendly interface
  3. Random slope
  4. Tuning parameters for cluster analysis
  5. Custom URL: www.languagevariationsuite.com
  6. R code snippets

Do not hesitate to contact us if you have any issues or would like to request new features.

LVS team

Interactive Text Mining Suite – Version 2 Release

The Interactive Text Mining Suite (ITMS) is a web application for text analysis. It combines the computational and statistical power of R with the interactivity of a Shiny web application.

The new release includes the following features:

  • Import from Zotero RDF files, the Google Books API, JSON, and XML
  • Dynamic preprocessing steps
  • Stemming in multiple languages
  • Tuning parameters for cluster classification
  • Word cloud comparison
  • Word cloud customization
  • Metadata extraction

Contributors: Jefferson Davis, Irina Trapido and Jay Lee

As always, please do not hesitate to contact us if you have any issues or would like to request new features!

Stepwise Regressions in Language Variation Suite – LVS

LVS provides stepwise model comparison under three criteria (LRT, AIC, and BIC) using the R package MASS. The stepwise regression searches in both directions (adding and dropping predictors) and selects the best model, that is, the best set of predictors.
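
The two-directional search can be pictured with the following Python sketch. It is not the code used by LVS or by MASS: the stepwise function, the toy_score criterion, and the GAIN numbers are all invented for illustration. A real criterion would come from fitting a regression model for each candidate set of predictors.

```python
def stepwise(candidates, score):
    """Bidirectional stepwise search; lower `score` means a better model."""
    selected = frozenset()
    best = score(selected)
    while True:
        # Neighbouring models: add one unused predictor (step up)
        # or drop one predictor already in the model (step down).
        moves = [selected | {c} for c in candidates if c not in selected]
        moves += [selected - {c} for c in selected]
        if not moves:
            return selected, best
        challenger = min(moves, key=score)
        if score(challenger) >= best:
            return selected, best  # no neighbouring model improves the criterion
        selected, best = challenger, score(challenger)


# Invented numbers: how much each predictor improves fit, and a penalty per predictor.
GAIN = {"age": 12.0, "gender": 7.5, "style": 1.0}
PENALTY = 3.84


def toy_score(model):
    """Made-up criterion: a baseline minus fit gains plus a complexity penalty."""
    return 100.0 - sum(GAIN[p] for p in model) + PENALTY * len(model)


best_model, best_score = stepwise(["age", "gender", "style"], toy_score)
print(sorted(best_model), round(best_score, 2))
```

In this toy run the predictor "style" is left out because its gain (1.0) is smaller than the penalty for adding it.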

All three criteria assess model fit. The likelihood ratio test (LRT) is based on the log-likelihood ratio, with threshold k = qchisq(1-p, df=1); for p = 0.05, k = 3.84. For more information on AIC (Akaike Information Criterion) and BIC (Bayesian Information Criterion), see http://www.jmp.com/support/help/Likelihood_AICc_and_BIC.shtml.
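
The value k = 3.84 can be checked with a short Python sketch that uses only the standard library. It relies on the fact that a chi-square variable with one degree of freedom is the square of a standard normal variable. The sketch also assumes that k acts as a per-parameter penalty, as the k argument of stepAIC in MASS does, where k = 2 gives AIC and k = log(n) gives BIC. The function names, the sample size of 200, and the log-likelihood values are made up for illustration.

```python
import math
from statistics import NormalDist


def chi2_critical_df1(p):
    """Upper-tail critical value of chi-square with 1 df,
    the same quantity as R's qchisq(1 - p, df = 1)."""
    z = NormalDist().inv_cdf(1 - p / 2)
    return z * z


def penalized_score(log_likelihood, n_params, k):
    """-2 * log-likelihood plus a penalty of k per parameter."""
    return -2.0 * log_likelihood + k * n_params


k_lrt = chi2_critical_df1(0.05)  # LRT-style penalty at p = 0.05
k_aic = 2.0                      # AIC penalty
k_bic = math.log(200)            # BIC penalty, assuming n = 200 observations

# Two made-up models: the larger one has one extra parameter.
small = penalized_score(log_likelihood=-120.0, n_params=3, k=k_lrt)
large = penalized_score(log_likelihood=-118.5, n_params=4, k=k_lrt)

print(round(k_lrt, 2))
print("keep extra predictor" if large < small else "drop extra predictor")
```

With these invented numbers the extra parameter improves -2 log-likelihood by only 3.0, which is less than k, so the smaller model is preferred.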

Steps to perform stepwise regression in LVS:

  1. Upload a CSV or Excel file in the DATA panel.
  2. Go to the INFERENTIAL STATISTICS panel and open the MODELING tab.
  3. Select your regression model (dependent and independent factors) and the type of regression (see the REGRESSION tab), then click RUN regression.
  4. Go to the STEPWISE REGRESSION tab and click RUN stepwise model.
  5. Return to Modeling and Regression and update your selection with the best-fitting model.

As always, your feedback and suggestions are greatly appreciated! (LVS Team)

Language Variation Suite – New Version Release

LVS (an interactive toolkit for sociolinguists) has been enhanced with new features: 1) support for Excel and CSV formats, 2) recoding and factor modification, and 3) token frequency extraction from text files.

The new Adjust Data menu (Data panel) allows interactive data modification, such as excluding or recoding certain factors or applying a logarithmic transformation to continuous data.
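
The kinds of adjustment described here can be sketched in Python as follows. This is not LVS code: the column names, factor levels, and recoding table are invented, and the natural logarithm is an assumption, since the base of the logarithm is not stated here.

```python
import math

# Invented example rows; in LVS the data would come from an uploaded file.
rows = [
    {"speaker": "s1", "style": "casual", "duration_ms": 100.0},
    {"speaker": "s2", "style": "reading", "duration_ms": 250.0},
    {"speaker": "s3", "style": "careful", "duration_ms": 180.0},
]

RECODE = {"casual": "informal", "careful": "formal"}  # old level -> new level
EXCLUDE = {"reading"}                                 # levels to drop


def adjust(data):
    """Exclude rows, recode a factor, and add a log-transformed column."""
    adjusted = []
    for row in data:
        if row["style"] in EXCLUDE:
            continue
        new_row = dict(row)  # leave the original row untouched
        new_row["style"] = RECODE.get(row["style"], row["style"])
        new_row["log_duration"] = math.log(row["duration_ms"])  # natural log assumed
        adjusted.append(new_row)
    return adjusted


adjusted_rows = adjust(rows)
for row in adjusted_rows:
    print(row["speaker"], row["style"], round(row["log_duration"], 3))
```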

The new Frequency menu (Data panel) extracts word frequencies from text files. This feature makes it possible to add a frequency column to your dataset. The dataset should have a column named token containing words (a single word per cell).
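
The idea of adding a frequency column can be sketched in Python like this. It is an illustration, not LVS code: the text, the rows, and the choice to lowercase everything before counting are assumptions. Only the column name token comes from the description above.

```python
import re
from collections import Counter

# Invented text and dataset for illustration.
corpus_text = "The cat saw the dog. The dog ran."
dataset = [{"token": "the"}, {"token": "dog"}, {"token": "bird"}]

# Count every word in the text (runs of letters, case-insensitive).
counts = Counter(re.findall(r"[^\W\d_]+", corpus_text.lower()))

# Add a frequency column; words absent from the text get 0.
for row in dataset:
    row["frequency"] = counts[row["token"].lower()]

print(dataset)
```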

Feel free to explore LVS. It also works on smartphones and iPads. Please send us your feedback and suggestions.

LVS Team

Word and Sentence Analysis

We just added a new feature to the Interactive Text Mining Tool: Word and Sentence Length.

In the Data Visualization panel, select the Word Frequency tab and click Length. You can select a specific document and explore its content.
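
For readers curious about what such a length analysis involves, here is a minimal Python sketch. It is not the tool's own code: splitting sentences on ., ! and ? and counting only letters as word characters are simplifying assumptions, and the sample text is invented.

```python
import re


def sentence_lengths(text):
    """Number of words in each sentence, splitting on '.', '!' and '?'."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    return [len(s.split()) for s in sentences]


def word_lengths(text):
    """Number of letters in each word."""
    return [len(w) for w in re.findall(r"[^\W\d_]+", text)]


sample = "I came. I saw the city. Then I left!"
print(sentence_lengths(sample))
print(word_lengths(sample))
```

A naive split like this miscounts abbreviations such as "e.g.", so it should be read as a starting point only.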

[Figures: sentence length and word length]

If you are interested in punctuation visualization (for more information, read Adam Calhoun’s blog), select the Punctuation Analysis tab and click Punctuation.
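
One simple way to look at punctuation is to remove everything else from a text and keep only the punctuation marks. The Python sketch below shows that idea. It is an assumption about the general approach, not the tool's implementation, and it covers only the ASCII punctuation marks in Python's string.punctuation.

```python
import string
from collections import Counter


def punctuation_only(text):
    """Keep only ASCII punctuation characters, in their original order."""
    return "".join(ch for ch in text if ch in string.punctuation)


sample = 'Well, "yes" - I think so; do you?'
marks = punctuation_only(sample)
mark_counts = Counter(marks)

print(marks)
print(mark_counts.most_common(1))
```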

Language Variation Suite and Interactive Text Mining Tool – QR Code

Our Language Variation Suite and Interactive Text Mining Tool are now accessible via smartphone and iPad.

Use a QR scanner app to scan the following QR codes and open them in the browser on your smartphone or iPad. Make sure you have your files (CSV or text files) in your Dropbox. Navigate to Descriptive Statistics in LVS or Data Preparation in ITMS, select Choose files, and upload them.

[QR code images]

We always welcome any suggestions, feedback and bug reports!

Workshop: LVS and ITMS Introduction

    Visual Analytics – “The science of analytical reasoning facilitated by visual interactive interfaces” (Thomas et al. 2005)

Materials for workshop:

  1. Categorical Data (csv): Labov’s New York study 1966 (for more information, visit http://www.ello.uos.de/field.php/Sociolinguistics/Exemplarystudylabov)
  2. Continuous Data (csv): Corpus of Caracas (Bentivoglio & Sedano 1993)
  3. Dante Translation (txt): dante 1, dante 2, dante 3
  4. Slides for workshop (pdf): presentation

“Mastery of quantitative methods is increasingly becoming a vital component of linguistic training” (Johnson, 2008)

Links to LVS and ITMS