Automated Machine Learning to Enhance Knowledge Retrieval in Retrieval-Augmented Generation Pipelines

Magnini, Matteo; Aguzzi, Gianluca; Sanna, Leonardo; Magnolini, Simone; Bellan, Patrizio; Dragoni, Mauro; Montagna, Sara

The emergence of Large Language Models (LLMs) has opened new possibilities for applications in the digital health domain. However, their unpredictable nature calls for trustworthy strategies that prevent hallucinations. A common approach is Retrieval-Augmented Generation (RAG), in which text generation is supported by controlled knowledge injected into the prompt. Even with RAG, ensuring that the generated information is reliable and authoritative requires further research. In previous work, we extended the classic RAG pipeline with an initial step in which the LLM generates an enhanced query to support retrieval. The results showed that performance is highly sensitive to the techniques used for embedding queries and retrieving documents. Accordingly, in this paper we experiment with a novel automated machine learning approach that tests a wide range of configurations of the retrieval module. Our findings show that the embedder and, especially, the retrieval strategy strongly affect the overall performance of the RAG pipeline.
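The idea of testing many retrieval configurations automatically can be illustrated with a minimal sketch. It is not the authors' method: the corpus, the queries, the two toy embedders, the two scoring functions, and every function name below are hypothetical stand-ins chosen so that the example runs with the Python standard library alone. It enumerates every (embedder, scorer, k) combination and records how often the expected document is among the top-k results.

```python
from collections import Counter
from itertools import product
from math import sqrt

# Toy corpus and labelled queries (hypothetical data, for illustration only).
DOCS = {
    "d1": "aspirin reduces fever and mild pain",
    "d2": "insulin regulates blood glucose levels",
    "d3": "regular exercise lowers blood pressure",
}
QUERIES = [
    ("what lowers blood pressure", "d3"),
    ("which drug reduces fever", "d1"),
    ("how is blood glucose regulated", "d2"),
]

def embed_words(text):
    """Bag-of-words stand-in for an embedder: token -> count."""
    return Counter(text.lower().split())

def embed_trigrams(text):
    """Character-trigram stand-in for an embedder: trigram -> count."""
    t = text.lower()
    return Counter(t[i:i + 3] for i in range(len(t) - 2))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[key] * b[key] for key in a if key in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def jaccard(a, b):
    """Jaccard overlap between the feature sets of two vectors."""
    union = set(a) | set(b)
    return len(set(a) & set(b)) / len(union) if union else 0.0

EMBEDDERS = {"words": embed_words, "trigrams": embed_trigrams}
SCORERS = {"cosine": cosine, "jaccard": jaccard}

def retrieve(query, embed, score, k):
    """Return the ids of the k documents that score highest for the query."""
    q = embed(query)
    ranked = sorted(DOCS, key=lambda d: score(q, embed(DOCS[d])), reverse=True)
    return ranked[:k]

def hit_rate(embed, score, k):
    """Fraction of queries whose expected document appears in the top k."""
    hits = sum(gold in retrieve(q, embed, score, k) for q, gold in QUERIES)
    return hits / len(QUERIES)

def search_configurations(ks=(1, 2)):
    """Exhaustively evaluate every (embedder, scorer, k) combination."""
    results = {}
    for (e_name, embed), (s_name, score), k in product(
        EMBEDDERS.items(), SCORERS.items(), ks
    ):
        results[(e_name, s_name, k)] = hit_rate(embed, score, k)
    return results

if __name__ == "__main__":
    ranked = sorted(search_configurations().items(), key=lambda kv: -kv[1])
    for config, value in ranked:
        print(config, round(value, 2))
```

An exhaustive grid is used here only because the toy search space is tiny. A realistic setup would swap in real embedding models and retrievers and would likely rely on a dedicated hyperparameter-optimization tool, with a metric suited to the task.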

Year: 2026
ISBN: 978-3-032-16708-8
Appears in types: 4.1 Conference Proceedings Contribution (Proceeding)

Files in this item:

File	Access	Type	License	Size	Format
[email protected]	authorized users only	Publisher's version	Copyright (all rights reserved)	643.77 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11576/2774151
