Comment générer les meilleurs échantillons a faible dispersion pour l'apprentissage actif en classification ? - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

How to generate the best low dispersion samples for active learning in classification ?

Comment générer les meilleurs échantillons a faible dispersion pour l'apprentissage actif en classification ?

Résumé

We consider a problem of active learning classification: we suppose we can determine, with an oracle, the label of any point in a given compact set, and we want to generate a sample of a given size which will allow us to get the best approximation of the oracle function. It's well known that the more numerous the data are, the best quality the modelling is. However obtaining data can be expensive or destructive in consequence we want to get the best value from this investment. We have to choose the best learning set. The first contribution of this paper is to state that dispersion is the most relevant criterion for generating samples in active classification leanring whereas discrepance is the relevant criterion for active regression learning. However low dispersion samples are not easy to generate. The second contribution consists then in making a study of different ways to proceed and in proposing a new algorithm.
Fichier non déposé

Dates et versions

hal-02594341 , version 1 (15-05-2020)

Identifiants

Citer

Benoît Gandar, G. Loosli, Guillaume Deffuant. Comment générer les meilleurs échantillons a faible dispersion pour l'apprentissage actif en classification ?. Active Learning and Experimental Design Workshop AISTATS, May 2010, Sardaigne, France. pp.16. ⟨hal-02594341⟩
12 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More