Multiple Instance Neural Networks Based on Sparse Attention for Cancer Detection using T-cell Receptor Sequences

by   Younghoon Kim, et al.

Early detection of cancers has been much explored due to its paramount importance in biomedical fields. Among different types of data used to answer this biological question, studies based on T cell receptors (TCRs) are under recent spotlight due to the growing appreciation of the roles of the host immunity system in tumor biology. However, the one-to-many correspondence between a patient and multiple TCR sequences hinders researchers from simply adopting classical statistical/machine learning methods. There were recent attempts to model this type of data in the context of multiple instance learning (MIL). Despite the novel application of MIL to cancer detection using TCR sequences and the demonstrated adequate performance in several tumor types, there is still room for improvement, especially for certain cancer types. Furthermore, explainable neural network models are not fully investigated for this application. In this article, we propose multiple instance neural networks based on sparse attention (MINN-SA) to enhance the performance in cancer detection and explainability. The sparse attention structure drops out uninformative instances in each bag, achieving both interpretability and better predictive performance in combination with the skip connection. Our experiments show that MINN-SA yields the highest area under the ROC curve (AUC) scores on average measured across 10 different types of cancers, compared to existing MIL approaches. Moreover, we observe from the estimated attentions that MINN-SA can identify the TCRs that are specific for tumor antigens in the same T cell repertoire.


page 12

page 13


Transfer Learning for Oral Cancer Detection using Microscopic Images

Oral cancer has more than 83 however, only 29 techniques can detect patt...

EfficientNet Algorithm for Classification of Different Types of Cancer

Accurate and efficient classification of different types of cancer is cr...

Sparsity Likelihood for Sparse Signal and Change-point Detection

We propose here sparsity likelihood scores for sparse signal detection a...

Statistical approaches using longitudinal biomarkers for disease early detection: A comparison of methodologies

Early detection of clinical outcomes such as cancer may be predicted bas...

Cross-attention-based saliency inference for predicting cancer metastasis on whole slide images

Although multiple instance learning (MIL) methods are widely used for au...

Segmentation of the veterinary cytological images for fast neoplastic tumors diagnosis

This paper shows the machine learning system which performs instance seg...

TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classication

Multiple instance learning (MIL) is a powerful tool to solve the weakly ...

Please sign up or login with your details

Forgot password? Click here to reset