Exploring the Daschle Collection using Text Mining

04/23/2019
by Damon Bayer, et al.

A U.S. Senator from South Dakota donated the documents accumulated during his service as a House representative and senator to be housed at the Briggs Library at South Dakota State University. This project investigated the utility of quantitative statistical methods for exploring portions of this vast document collection. The available scanned documents and emails from constituents are analyzed using natural language processing methods, including the Latent Dirichlet Allocation (LDA) model. This model identifies the major topics discussed in a given collection of documents. Important events and popular issues from Senator Daschle's career are reflected in the changing topics from the model. These quantitative statistical methods provide a summary of the massive amount of text without requiring significant human effort or time and can be applied to similar collections.


