Leam: An Interactive System for In-situ Visual Text Analysis

by   Sajjadur Rahman, et al.

With the increase in scale and availability of digital text generated on the web, enterprises such as online retailers and aggregators often use text analytics to mine and analyze the data to improve their services and products alike. Text data analysis is an iterative, non-linear process with diverse workflows spanning multiple stages, from data cleaning to visualization. Existing text analytics systems usually accommodate a subset of these stages and often fail to address challenges related to data heterogeneity, provenance, workflow reusability and reproducibility, and compatibility with established practices. Based on a set of design considerations we derive from these challenges, we propose Leam, a system that treats the text analysis process as a single continuum by combining advantages of computational notebooks, spreadsheets, and visualization tools. Leam features an interactive user interface for running text analysis workflows, a new data model for managing multiple atomic and composite data types, and an expressive algebra that captures diverse sets of operations representing various stages of text analysis and enables coordination among different components of the system, including data, code, and visualizations. We report our current progress in Leam development while demonstrating its usefulness with usage examples. Finally, we outline a number of enhancements to Leam and identify several research directions for developing an interactive visual text analysis system.


SuperNOVA: Design Strategies and Opportunities for Interactive Visualization in Computational Notebooks

Computational notebooks such as Jupyter Notebook have become data scient...

ComputableViz: Mathematical Operators as a Formalism for Visualization Processing and Analysis

Data visualizations are created and shared on the web at an unprecedente...

The State of the Art in Integrating Machine Learning into Visual Analytics

Visual analytics systems combine machine learning or other analytic tech...

TBSSvis: Visual Analytics for Temporal Blind Source Separation

Temporal Blind Source Separation (TBSS) is used to obtain the true, unde...

NOVA: A Practical Method for Creating Notebook-Ready Visual Analytics

How can we develop visual analytics (VA) tools that can be easily adopte...

Visualization of Mined Pattern and Its Human Aspects

Researchers got success in mining the Web usage data effectively and eff...

An Interdisciplinary Perspective on Evaluation and Experimental Design for Visual Text Analytics: Position Paper

Appropriate evaluation and experimental design are fundamental for empir...

Please sign up or login with your details

Forgot password? Click here to reset