Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems have significantly advanced data modelling capabilities and improved opportunities for extracting knowledge from vast and heterogeneous biomedical datasets. Recent research has increasingly focused on integrating LLMs with custom-designed RAGs to create systems capable of handling complex biomedical challenges, with a growing demand for more reliable and precise prediction mechanisms in health-related contexts. This study introduces CardioTRAP, an architecture specifically designed to manage biomedical data, with a primary focus on cardiology. The system employs advanced indexing techniques to enable efficient storage and retrieval by integrating deep learning models that generate contextual and clinically relevant insights. By adopting a hybrid approach that combines supervised and unsupervised learning methods, CardioTRAP ensures both high accuracy and scalability, supporting predictive analytics, patient risk stratification, and the discovery of novel biomarkers. Benchmarks and practical applications, evaluated through state-of-the-art metrics, underscore its ability to enhance the identification of critical clinical features. Finally, CardioTRAP demonstrates how the integration of data management and RAG systems can serve as a bridge between biomedical research and clinical practice.

Design of a RAG framework for cardiology EHR analysis

Sorrentino, Sabato;Vizza, Patrizia;Veltri, Pierangelo;
2026-01-01

Abstract

Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems have significantly advanced data modelling capabilities and improved opportunities for extracting knowledge from vast and heterogeneous biomedical datasets. Recent research has increasingly focused on integrating LLMs with custom-designed RAGs to create systems capable of handling complex biomedical challenges, with a growing demand for more reliable and precise prediction mechanisms in health-related contexts. This study introduces CardioTRAP, an architecture specifically designed to manage biomedical data, with a primary focus on cardiology. The system employs advanced indexing techniques to enable efficient storage and retrieval by integrating deep learning models that generate contextual and clinically relevant insights. By adopting a hybrid approach that combines supervised and unsupervised learning methods, CardioTRAP ensures both high accuracy and scalability, supporting predictive analytics, patient risk stratification, and the discovery of novel biomarkers. Benchmarks and practical applications, evaluated through state-of-the-art metrics, underscore its ability to enhance the identification of critical clinical features. Finally, CardioTRAP demonstrates how the integration of data management and RAG systems can serve as a bridge between biomedical research and clinical practice.
2026
Clinical Data
Electronic Health Records
Large Language Models
Retrieval-Augmented Generation systems
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11770/405417
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact