Zero-Shot and Few-Shot Classification of Biomedical Articles in Context of the COVID-19 Pandemic

01/09/2022
by   Simon Lupart, et al.
0

MeSH (Medical Subject Headings) is a large thesaurus created by the National Library of Medicine and used for fine-grained indexing of publications in the biomedical domain. In the context of the COVID-19 pandemic, MeSH descriptors have emerged in relation to articles published on the corresponding topic. Zero-shot classification is an adequate response for timely labeling of the stream of papers with MeSH categories. In this work, we hypothesise that rich semantic information available in MeSH has potential to improve BioBERT representations and make them more suitable for zero-shot/few-shot tasks. We frame the problem as determining if MeSH term definitions, concatenated with paper abstracts are valid instances or not, and leverage multi-task learning to induce the MeSH hierarchy in the representations thanks to a seq2seq task. Results establish a baseline on the MedLine and LitCovid datasets, and probing shows that the resulting representations convey the hierarchical relations present in MeSH.

READ FULL TEXT

page 1

page 6

research
05/13/2020

MeSH descriptors indicate the knowledge growth in the SARS-CoV-2/COVID-19 pandemic

The scientific papers dealing with the novel betacoronavirus SARS-CoV-2 ...
research
08/27/2019

Time evolution of the hierarchical networks between PubMed MeSH terms

Hierarchical organisation is a prevalent feature of many complex network...
research
01/20/2021

What is all this new MeSH about? Exploring the semantic provenance of new descriptors in the MeSH thesaurus

The Medical Subject Headings (MeSH) thesaurus is a controlled vocabulary...
research
05/15/2020

Beyond MeSH: Fine-Grained Semantic Indexing of Biomedical Literature based on Weak Supervision

In this work, we propose a method for the automated refinement of subjec...
research
09/04/2023

An Empirical Analysis for Zero-Shot Multi-Label Classification on COVID-19 CT Scans and Uncurated Reports

The pandemic resulted in vast repositories of unstructured data, includi...
research
10/07/2020

A Self-supervised Approach for Semantic Indexing in the Context of COVID-19 Pandemic

The pandemic has accelerated the pace at which COVID-19 scientific paper...
research
01/23/2023

Large-scale fine-grained semantic indexing of biomedical literature based on weakly-supervised deep learning

Semantic indexing of biomedical literature is usually done at the level ...

Please sign up or login with your details

Forgot password? Click here to reset