Triage and diagnosis of COVID-19 from medical social media

03/22/2021
by   Abul Hasan, et al.
0

Objective: This study aims to develop an end-to-end natural language processing pipeline for triage and diagnosis of COVID-19 from patient-authored social media posts. Materials and Methods: The text processing pipeline first extracts COVID-19 symptoms and related concepts such as severity, duration, negations, and body parts from patients posts using conditional random fields. An unsupervised rule-based algorithm is then applied to establish relations between concepts in the next step of the pipeline. The extracted concepts and relations are subsequently used to construct two different vector representations of each post. These vectors are applied separately to build support vector machine learning models to triage patients into three categories and diagnose them for COVID-19. Results: We report that Macro- and Micro-averaged F_1 scores in the range of 71-96 the triage and diagnosis of COVID-19, when the models are trained on ground truth labelled data. Our experimental results indicate that similar performance can be achieved when the models are trained using predicted labels from concept extraction and rule-based classifiers, thus yielding end-to-end machine learning. Discussion: We highlight important features uncovered by our diagnostic machine learning models and compare them with the most frequent symptoms revealed in another COVID-19 dataset. In particular, we found that the most important features are not always the most frequent ones. Conclusions: Our preliminary results show that it is possible to automatically triage and diagnose patients for COVID-19 from natural language narratives using a machine learning pipeline.

READ FULL TEXT

page 8

page 10

research
09/05/2023

Incorporating Dictionaries into a Neural Network Architecture to Extract COVID-19 Medical Concepts From Social Media

We investigate the potential benefit of incorporating dictionary informa...
research
05/17/2021

The State of Infodemic on Twitter

Following the wave of misinterpreted, manipulated and malicious informat...
research
06/12/2022

"COVID-19 was a FIFA conspiracy #curropt": An Investigation into the Viral Spread of COVID-19 Misinformation

The outbreak of the infectious and fatal disease COVID-19 has revealed t...
research
02/28/2023

Interpretable and Intervenable Ultrasonography-based Machine Learning Models for Pediatric Appendicitis

Appendicitis is among the most frequent reasons for pediatric abdominal ...
research
03/21/2022

Healthy Twitter discussions? Time will tell

Studying misinformation and how to deal with unhealthy behaviours within...

Please sign up or login with your details

Forgot password? Click here to reset