Interpretable Medical Diagnostics with Structured Data Extraction by Large Language Models

06/08/2023
by Aleksa Bisercic, et al.

Tabular data is often hidden in text, particularly in medical diagnostic reports. Traditional machine learning (ML) models designed to work with tabular data cannot effectively process information in such form. On the other hand, large language models (LLMs), which excel at textual tasks, are probably not the best tool for modeling tabular data. Therefore, we propose a novel, simple, and effective methodology for extracting structured tabular data from textual medical reports, called TEMED-LLM. Drawing upon the reasoning capabilities of LLMs, TEMED-LLM goes beyond traditional extraction techniques, accurately inferring tabular features even when their names are not explicitly mentioned in the text. This is achieved by combining domain-specific reasoning guidelines with a proposed data validation and reasoning correction feedback loop. By applying interpretable ML models such as decision trees and logistic regression to the extracted and validated data, we obtain end-to-end interpretable predictions. We demonstrate that our approach significantly outperforms state-of-the-art text classification models in medical diagnostics. Given its predictive performance, simplicity, and interpretability, TEMED-LLM underscores the potential of leveraging LLMs to improve the performance and trustworthiness of ML models in medical applications.
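
The validation and correction feedback loop mentioned in the abstract can be pictured roughly as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: the schema, the feature names (`age`, `resting_bp`), their ranges, the `max_rounds` limit, and the `fake_llm` stand-in are all hypothetical and chosen only for illustration. A real system would replace `fake_llm` with an actual LLM call that receives the report together with the validation errors from the previous attempt.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical schema: feature name -> (type, min, max). Not taken from the paper.
SCHEMA = {
    "age": (int, 0, 120),
    "resting_bp": (int, 50, 250),
}

Row = Dict[str, object]


def validate(row: Row) -> List[str]:
    """Return a list of human-readable problems; an empty list means the row is valid."""
    errors = []
    for name, (typ, lo, hi) in SCHEMA.items():
        if name not in row:
            errors.append(f"missing feature '{name}'")
            continue
        value = row[name]
        # bool is a subclass of int in Python, so reject it explicitly.
        if not isinstance(value, typ) or isinstance(value, bool):
            errors.append(f"'{name}' should be {typ.__name__}, got {value!r}")
        elif not lo <= value <= hi:
            errors.append(f"'{name}'={value} outside [{lo}, {hi}]")
    return errors


def extract_with_feedback(
    report: str,
    llm: Callable[[str, List[str]], Row],
    max_rounds: int = 3,
) -> Optional[Row]:
    """Ask the extractor for a row, validate it, and feed errors back until it passes."""
    feedback: List[str] = []
    for _ in range(max_rounds):
        row = llm(report, feedback)
        feedback = validate(row)
        if not feedback:
            return row
    return None  # give up after max_rounds failed attempts


def fake_llm(report: str, feedback: List[str]) -> Row:
    """Stand-in for a real LLM call: the first answer is out of range, the retry is corrected."""
    if not feedback:
        return {"age": 54, "resting_bp": 1400}
    return {"age": 54, "resting_bp": 140}


if __name__ == "__main__":
    print(extract_with_feedback("54-year-old patient, resting BP 140 mmHg.", fake_llm))
```

Rows that pass validation could then be collected into a table and handed to an interpretable model such as a decision tree or logistic regression, which is the step the abstract describes as yielding end-to-end interpretable predictions.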


