Zero-Shot Clinical Acronym Expansion with a Hierarchical Metadata-Based Latent Variable Model

09/29/2020
by   Griffin Adams, et al.
0

We introduce Latent Meaning Cells, a deep latent variable model which learns contextualized representations of words by combining local lexical context and metadata. Metadata can refer to granular context, such as section type, or to more global context, such as unique document ids. Reliance on metadata for contextualized representation learning is apropos in the clinical domain where text is semi-structured and expresses high variation in topics. We evaluate the LMC model on the task of clinical acronym expansion across three datasets. The LMC significantly outperforms a diverse set of baselines at a fraction of the pre-training cost and learns clinically coherent representations.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset