Doris: A tool for interactive exploration of historic corpora (Extended Version)

10/31/2017
by   Sreya Guha, et al.
0

Insights into social phenomenon can be gleaned from trends and patterns in corpora of documents associated with that phenomenon. Recent years have witnessed the use of computational techniques, mostly based on keywords, to analyze large corpora for these purposes. In this paper, we extend these techniques to incorporate semantic features. We introduce Doris, an interactive exploration tool that combines semantic features with information retrieval techniques to enable exploration of document corpora corresponding to the social phenomenon. We discuss the semantic techniques and describe an implementation on a corpus of United States (US) presidential speeches. We illustrate, with examples, how the ability to combine syntactic and semantic features in a visualization helps researchers more easily gain insights into the underlying phenomenon.

READ FULL TEXT
research
03/19/2021

TextEssence: A Tool for Interactive Analysis of Semantic Shifts Between Corpora

Embeddings of words and concepts capture syntactic and semantic regulari...
research
06/04/2018

History Playground: A Tool for Discovering Temporal Trends in Massive Textual Corpora

Recent studies have shown that macroscopic patterns of continuity and ch...
research
04/24/2018

Characterizing Allegheny County Opioid Overdoses with an Interactive Data Explorer and Synthetic Prediction Tool

The United States has an opioid epidemic, and Pennsylvania's Allegheny C...
research
01/18/2018

Unsupervised Hashtag Retrieval and Visualization for Crisis Informatics

In social media like Twitter, hashtags carry a lot of semantic informati...
research
04/13/2023

Computational modeling of semantic change

In this chapter we provide an overview of computational modeling for sem...
research
04/10/2021

Avocado Buying Trends in the United States Using SAC

The purpose of our paper is to analyze the dataset from Hass Avocado Boa...
research
03/23/2022

Multi-Mosaics: Corpus Summarizing and Exploration using multiple Concordance Mosaic Visualisations

Researchers working in areas such as lexicography, translation studies, ...

Please sign up or login with your details

Forgot password? Click here to reset