De-identification of Unstructured Clinical Texts from Sequence to Sequence Perspective

08/18/2021
by   Md. Monowar Anjum, et al.
In this work, we propose a novel problem formulation for de-identification of unstructured clinical text. We formulate the de-identification problem as a sequence-to-sequence learning problem instead of a token classification problem. Our approach is inspired by the recent state-of-the-art performance of sequence-to-sequence learning models for named entity recognition. Early experimentation with our proposed approach achieved 98.91 dataset. This performance is comparable to current state-of-the-art models for unstructured clinical text de-identification.
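
To make the difference between the two formulations concrete, the following minimal sketch builds both kinds of training example from the same annotated sentence. It is an illustration only: the abstract does not specify the target format, so the placeholder tags (`<NAME>`, `<DATE>`), the BIO labeling scheme, the helper names, and the example sentence are all assumptions rather than the authors' actual setup.

```python
from typing import List, Tuple

# A PHI span is (start_token, end_token_exclusive, category).
# The categories used here are hypothetical.
Span = Tuple[int, int, str]


def to_token_classification(tokens: List[str], spans: List[Span]) -> List[Tuple[str, str]]:
    """Token classification view: one BIO label per input token."""
    labels = ["O"] * len(tokens)
    for start, end, kind in spans:
        labels[start] = f"B-{kind}"
        for i in range(start + 1, end):
            labels[i] = f"I-{kind}"
    return list(zip(tokens, labels))


def to_seq2seq_pair(tokens: List[str], spans: List[Span]) -> Tuple[str, str]:
    """Sequence-to-sequence view: (source text, target text).

    Assumed target format: each PHI span collapses to one placeholder tag.
    """
    span_at = {start: (end, kind) for start, end, kind in spans}
    target: List[str] = []
    i = 0
    while i < len(tokens):
        if i in span_at:
            end, kind = span_at[i]
            target.append(f"<{kind}>")
            i = end  # skip the whole PHI span
        else:
            target.append(tokens[i])
            i += 1
    return " ".join(tokens), " ".join(target)


# Synthetic example sentence, not taken from any clinical dataset.
tokens = "Mr. John Smith was admitted on 2021-01-05 .".split()
spans = [(1, 3, "NAME"), (6, 7, "DATE")]

tagged = to_token_classification(tokens, spans)
source, target = to_seq2seq_pair(tokens, spans)
print(tagged)
print(source)
print(target)
```

In the token classification view the model predicts a label for every input position, whereas in the sequence-to-sequence view it generates the de-identified text directly, so the output length may differ from the input length.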
