Automated Machine Learning to Enhance Knowledge Retrieval in Retrieval-Augmented Generation Pipelines

Montagna, Sara
2026

Abstract

The emergence of Large Language Models (LLMs) has opened up exciting possibilities for applications in the digital health domain. However, their unpredictable behavior calls for trustworthy strategies that prevent the generation of hallucinations. A common approach to this challenge is Retrieval-Augmented Generation (RAG), in which text generation is supported by controlled knowledge injected into the prompts. Even with RAG, ensuring that the generated information is reliable and authoritative requires further research. In previous work, we presented an enhanced version of the classic RAG pipeline that adds an initial step in which the LLM generates an enhanced query to support retrieval. The results showed that performance is highly sensitive to the techniques adopted for embedding queries and retrieving documents. Accordingly, in this paper we experiment with a novel automated machine-learning approach that tests a wide range of configurations of the retrieval module. Our findings highlight that the embedder and, especially, the retrieval strategy strongly affect the overall performance of the RAG pipeline.
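To make the idea of testing many retrieval configurations concrete, the following is a minimal sketch of an exhaustive search over embedder, scoring function, and top-k. It is not the method used in the paper: the toy corpus, the labelled queries, the two bag-of-features "embedders", the two scoring functions, and the hit-rate metric are all assumptions introduced purely for illustration, and a real study would plug in actual embedding models and retrieval strategies.

```python
import itertools
import math
from collections import Counter

# Hypothetical toy corpus and labelled queries (query, id of the relevant document).
DOCS = {
    "d1": "insulin lowers blood glucose in patients with diabetes",
    "d2": "regular exercise reduces high blood pressure",
    "d3": "vaccines train the immune system against infection",
}
QUERIES = [
    ("what lowers blood glucose", "d1"),
    ("how to reduce blood pressure", "d2"),
    ("how do vaccines protect against infection", "d3"),
]


def embed_words(text):
    """Stand-in embedder 1: bag of lowercase words."""
    return Counter(text.lower().split())


def embed_char_trigrams(text):
    """Stand-in embedder 2: bag of character trigrams."""
    t = text.lower()
    return Counter(t[i:i + 3] for i in range(len(t) - 2))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[key] * b[key] for key in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def overlap(a, b):
    """Number of features shared by the two vectors."""
    return float(len(a.keys() & b.keys()))


def retrieve(query, embed, score, k):
    """Return the ids of the k documents scoring highest against the query."""
    q_vec = embed(query)
    ranked = sorted(DOCS, key=lambda d: score(q_vec, embed(DOCS[d])), reverse=True)
    return ranked[:k]


def hit_rate(embed, score, k):
    """Fraction of queries whose relevant document appears in the top k."""
    hits = sum(gold in retrieve(q, embed, score, k) for q, gold in QUERIES)
    return hits / len(QUERIES)


def search_configurations():
    """Evaluate every (embedder, scorer, k) combination and collect its hit rate."""
    embedders = {"words": embed_words, "char_trigrams": embed_char_trigrams}
    scorers = {"cosine": cosine, "overlap": overlap}
    results = {}
    for (e_name, e), (s_name, s), k in itertools.product(
        embedders.items(), scorers.items(), [1, 2]
    ):
        results[(e_name, s_name, k)] = hit_rate(e, s, k)
    return results


if __name__ == "__main__":
    ranked = sorted(search_configurations().items(), key=lambda kv: kv[1], reverse=True)
    for config, score in ranked:
        print(config, round(score, 2))
```

An exhaustive grid is used here only because the toy search space has eight points; an automated machine-learning setup would typically replace the loop with a smarter search over a much larger space.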
ISBN: 978-3-032-16708-8
Type: Publisher's version
License: Copyright (all rights reserved)

Use this identifier to cite or link to this document: https://hdl.handle.net/11576/2774151