Knowledge-driven active learning

IRIS

The deployment of Deep Learning (DL) models is still precluded in contexts where the amount of supervised data is limited. To address this issue, active learning strategies aim to minimize the amount of labelled data required to train a DL model. Most active learning strategies select uncertain samples, and are often restricted to samples lying close to the decision boundary. These techniques are theoretically sound, but it is not straightforward to understand from their content why particular samples were selected, which further drives non-experts to regard DL as a black box. For the first time, here we propose to take common domain knowledge into consideration and enable non-expert users to train a model with fewer samples. In our Knowledge-driven Active Learning (KAL) framework, rule-based knowledge is converted into logic constraints, and their violation is checked as a natural guide for sample selection. We show that even simple relationships among data and output classes offer a way to spot predictions for which the model needs supervision. We empirically show that KAL (i) outperforms many active learning strategies, particularly in contexts where domain knowledge is rich, (ii) discovers data distributions lying far from the initial training data, (iii) assures domain experts that the provided knowledge is acquired by the model, (iv) is suitable for regression and object recognition tasks, unlike uncertainty-based strategies, and (v) has a low computational demand.
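The selection idea described in the abstract (turn rules into logic constraints, then query the samples whose predictions violate them most) can be illustrated with a minimal sketch. This is not the authors' implementation: the product t-norm relaxation of an implication, the example rule "dog implies animal", and the names `implies`, `total_violation`, and `select_for_labelling` are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence

# A prediction is a sequence of per-class probabilities in [0, 1].
Prediction = Sequence[float]
Constraint = Callable[[Prediction], float]


def implies(premise: int, conclusion: int) -> Constraint:
    """Violation degree of the rule 'class[premise] -> class[conclusion]'.

    Assumption: the rule is relaxed with a product t-norm, so it is
    violated to degree p_premise * (1 - p_conclusion).
    """
    def violation(pred: Prediction) -> float:
        return pred[premise] * (1.0 - pred[conclusion])
    return violation


def total_violation(pred: Prediction, constraints: Sequence[Constraint]) -> float:
    """Sum the violation degrees of all constraints for one prediction."""
    return sum(c(pred) for c in constraints)


def select_for_labelling(
    preds: Sequence[Prediction],
    constraints: Sequence[Constraint],
    budget: int,
) -> List[int]:
    """Return indices of the `budget` samples that violate the rules most."""
    scores = [total_violation(p, constraints) for p in preds]
    ranked = sorted(range(len(preds)), key=lambda i: scores[i], reverse=True)
    return ranked[:budget]


# Hypothetical classes: index 0 = "dog", index 1 = "animal".
rules = [implies(0, 1)]
unlabelled_predictions = [
    [0.9, 0.95],  # dog and animal: nearly consistent with the rule
    [0.8, 0.10],  # dog but not animal: strong violation
    [0.1, 0.20],  # neither: rule barely applies
    [0.6, 0.50],  # moderate violation
]
selected = select_for_labelling(unlabelled_predictions, rules, budget=2)
print(selected)  # [1, 3]
```

In this toy setting the two samples sent for annotation are those whose predictions contradict the rule most strongly, which also makes the reason for each selection readable in terms of the rule itself.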

Knowledge-driven active learning / Ciravegna, G., Precioso, F., Betti, A., Mottin, K., Gori, M. - 14169 (2023), pp. 38-54. (ECML PKDD 2023 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Turin, Italy, 18-22/09/2023) [10.1007/978-3-031-43412-9_3].

Ciravegna, Gabriele; Precioso, Frédéric; Betti, Alessandro; Mottin, Kevin; Gori, Marco

2023

Year	2023
ISBN	9783031434112; 9783031434129
OpenAlex ID	W4225909448
Keywords	Active learning; Knowledge-aided learning; Neurosymbolic learning
Appears in types	4.1 Conference proceedings contribution

Files in this record:

File	Access	Description	Type	License	Size	Format
KDAL.pdf	Not available	Knowledge-Driven Active Learning	Publisher's version (PDF)	Publisher's copyright	1.2 MB	Adobe PDF
2110.08265v4.pdf	Open access	Preprint - Knowledge-Driven Active Learning	Preprint	Creative Commons	8.12 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.11771/34884
