Knowledge synthesis from 100 million biomedical documents augments the deep expression profiling of coronavirus receptors

by   AJ Venkatakrishnan, et al.

The COVID-19 pandemic demands assimilation of all available biomedical knowledge to decode its mechanisms of pathogenicity and transmission. Despite the recent renaissance in unsupervised neural networks for decoding unstructured natural languages, a platform for the real-time synthesis of the exponentially growing biomedical literature and its comprehensive triangulation with deep omic insights is not available. Here, we present the nferX platform for dynamic inference from over 45 quadrillion possible conceptual associations extracted from unstructured biomedical text, and their triangulation with Single Cell RNA-sequencing based insights from over 25 tissues. Using this platform, we identify intersections between the pathologic manifestations of COVID-19 and the comprehensive expression profile of the SARS-CoV-2 receptor ACE2. We find that tongue keratinocytes and olfactory epithelial cells are likely under-appreciated targets of SARS-CoV-2 infection, correlating with reported loss of sense of taste and smell as early indicators of COVID-19 infection, including in otherwise asymptomatic patients. Airway club cells, ciliated cells and type II pneumocytes in the lung, and enterocytes of the gut also express ACE2. This study demonstrates how a holistic data science platform can leverage unprecedented quantities of structured and unstructured publicly available data to accelerate the generation of impactful biological insights and hypotheses.


page 1

page 2

page 15

page 16

page 17

page 18

page 21

page 23


Machine Learning the Phenomenology of COVID-19 From Early Infection Dynamics

We present a data-driven machine learning analysis of COVID-19 from its ...

COVID-Datathon: Biomarker identification for COVID-19 severity based on BALF scRNA-seq data

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emergen...

Coronavirus Knowledge Graph: A Case Study

The emergence of the novel COVID-19 pandemic has had a significant impac...

The Skellam Distribution revisited -Estimating the unobserved incoming and outgoing ICU COVID-19 patients on a regional level in Germany

With the beginning of the COVID-19 pandemic, we became aware of the need...

Boundary Guided Semantic Learning for Real-time COVID-19 Lung Infection Segmentation System

The coronavirus disease 2019 (COVID-19) continues to have a negative imp...

Please sign up or login with your details

Forgot password? Click here to reset