COVID-19 Multidimensional Kaggle Literature Organization

07/17/2021
by   Maksim E. Eren, et al.
0

The unprecedented outbreak of Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2), or COVID-19, continues to be a significant worldwide problem. As a result, a surge of new COVID-19 related research has followed suit. The growing number of publications requires document organization methods to identify relevant information. In this paper, we expand upon our previous work with clustering the CORD-19 dataset by applying multi-dimensional analysis methods. Tensor factorization is a powerful unsupervised learning method capable of discovering hidden patterns in a document corpus. We show that a higher-order representation of the corpus allows for the simultaneous grouping of similar articles, relevant journals, authors with similar research interests, and topic keywords. These groupings are identified within and among the latent components extracted via tensor decomposition. We further demonstrate the application of this method with a publicly available interactive visualization of the dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/30/2022

Covid-19 Analysis Using Tensor Methods

In this paper, we use tensor models to analyze Covid-19 pandemic data. F...
research
06/03/2020

Automatic Text Summarization of COVID-19 Medical Research Articles using BERT and GPT-2

With the COVID-19 pandemic, there is a growing urgency for medical commu...
research
05/29/2022

COVID-19 Literature Mining and Retrieval using Text Mining Approaches

The novel coronavirus disease (COVID-19) began in Wuhan, China, in late ...
research
08/04/2020

COVID-19 Kaggle Literature Organization

The world has faced the devastating outbreak of Severe Acute Respiratory...
research
08/07/2020

Navigating the landscape of COVID-19 research through literature analysis: A bird's eye view

Timely access to accurate scientific literature in the battle with the o...
research
09/19/2020

Can questions summarize a corpus? Using question generation for characterizing COVID-19 research

What are the latent questions on some textual data? In this work, we inv...
research
09/05/2018

Measures of Cluster Informativeness for Medical Evidence Aggregation and Dissemination

The largest collection of medical evidence in the world is PubMed. Howev...

Please sign up or login with your details

Forgot password? Click here to reset