Strategy Overview#


Figure: overview of the query strategies implemented in scikit-activeml.

This is an overview of all implemented pool- and stream-based active learning strategies, which are often divided into three main categories based on the utilities they compute for sample selection:

  1. Informativeness-based strategies mostly select samples for which the model is most uncertain (e.g., via information-theoretic measures).

  2. Representativeness-based strategies select samples that capture the overall data distribution (e.g., via clustering or density estimation).

  3. Hybrid strategies combine both criteria to select samples that are informative and representative.
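The three kinds of utility listed above can be illustrated with a minimal, standard-library-only sketch. It is not the scikit-activeml implementation: the function names, the Gaussian kernel density on one-dimensional features, and the product used for the hybrid score are all assumptions chosen for illustration.

```python
import math


def entropy(probs):
    """Informativeness: Shannon entropy of a class-probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def least_confidence(probs):
    """Informativeness: one minus the probability of the most likely class."""
    return 1.0 - max(probs)


def margin(probs):
    """Informativeness: one minus the gap between the two most likely classes."""
    top, second = sorted(probs, reverse=True)[:2]
    return 1.0 - (top - second)


def density(x, pool, bandwidth=1.0):
    """Representativeness: mean Gaussian kernel value between x and the pool.

    Assumes one-dimensional features; `bandwidth` is an illustrative parameter.
    """
    return sum(
        math.exp(-((x - z) ** 2) / (2 * bandwidth ** 2)) for z in pool
    ) / len(pool)


def hybrid_utility(probs, x, pool, bandwidth=1.0):
    """Hybrid: uncertainty weighted by density (one possible combination)."""
    return entropy(probs) * density(x, pool, bandwidth)


# Hypothetical pool: three clustered samples and one outlier.
pool = [0.0, 0.1, 0.2, 5.0]
uncertain = [0.5, 0.5]
# The outlier is as uncertain as the clustered sample but less representative,
# so its hybrid utility is lower.
clustered_score = hybrid_utility(uncertain, 0.1, pool)
outlier_score = hybrid_utility(uncertain, 5.0, pool)
```

Under these assumptions, a purely informativeness-based strategy would rate both samples equally, while the hybrid score prefers the sample in the dense region.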

Furthermore, we distinguish between regression and classification as supervised learning tasks, where labels can be provided by a single annotator or multiple annotators. Finally, a strategy builds a batch of samples either by including the samples with the top-k utilities or by including diverse samples with high utility scores.
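The difference between the two batch-building modes can be sketched as follows. This is an illustrative sketch only: the greedy rule in `diverse_batch` (utility multiplied by the distance to the nearest already selected sample, on one-dimensional features) is an assumption made for the example and is not how any particular strategy listed here constructs its batches.

```python
def top_k_batch(utilities, batch_size):
    """Return the indices of the `batch_size` highest utilities."""
    order = sorted(range(len(utilities)), key=lambda i: utilities[i], reverse=True)
    return order[:max(batch_size, 0)]


def diverse_batch(utilities, points, batch_size):
    """Greedily build a batch that trades off utility against redundancy.

    Starts with the highest-utility sample, then repeatedly adds the sample
    that maximizes utility * distance to its nearest already selected sample.
    Assumes one-dimensional `points`; the scoring rule is hypothetical.
    """
    if batch_size <= 0 or not utilities:
        return []
    selected = [max(range(len(utilities)), key=lambda i: utilities[i])]
    while len(selected) < min(batch_size, len(utilities)):
        def score(i):
            nearest = min(abs(points[i] - points[j]) for j in selected)
            return utilities[i] * nearest

        candidates = [i for i in range(len(utilities)) if i not in selected]
        selected.append(max(candidates, key=score))
    return selected


# Hypothetical data: three near-duplicates with high utility and one distant sample.
points = [0.0, 0.05, 0.1, 3.0]
utilities = [0.9, 0.85, 0.8, 0.6]

top_k = top_k_batch(utilities, 2)             # picks two near-duplicates
diverse = diverse_batch(utilities, points, 2)  # picks the best sample plus the distant one
```

With this data, top-k selection spends the whole batch on nearly identical samples, whereas the diversity-aware selection gives up some utility to cover a different region of the feature space.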


Pool-based AL Strategies#

Baseline#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Random Sampling | RandomSampling | pool regression classification single-annotator diverse-batch | |

Hybrid#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Batch Active Learning by Diverse Gradient Embedding (BADGE) | Badge | pool classification single-annotator diverse-batch | Ash et al.[1] |
| Clustering Uncertainty-weighted Embeddings (CLUE) | Clue | pool classification regression single-annotator diverse-batch | Prabhu et al.[2] |
| Contrastive Active Learning (CAL) | ContrastiveAL | pool classification single-annotator top-k-batch | Margatina et al.[3] |
| Dropout Query (DropQuery) | DropQuery | pool classification single-annotator diverse-batch | Gupte et al.[4] |
| Fast Active Learning by Contrastive UNcertainty (FALCUN) | Falcun | pool classification single-annotator diverse-batch | Gilhuber et al.[5] |
| Density-Diversity-Distribution-Distance Sampling (4DS) | FourDs | pool classification single-annotator diverse-batch | Reitmaier and Sick[6] |
| Batch Density-Diversity-Distribution-Distance Sampling (Batch4DS) | FourDs | pool classification single-annotator diverse-batch | Reitmaier and Sick[6] |
| Multi-class Probabilistic Active Learning (McPAL) | ProbabilisticAL | pool classification single-annotator top-k-batch | Kottke et al.[7] |
| Querying Informative and Representative Examples (QUIRE) | Quire | pool classification single-annotator top-k-batch | Huang et al.[8], Huang et al.[9] |
| Density-weighted Uncertainty Sampling (DWUS) | UncertaintySampling | pool classification single-annotator top-k-batch | Donmez et al.[10] |
| Dual Strategy for Active Learning (DUAL) | UncertaintySampling | pool classification single-annotator | Donmez et al.[10] |

Informativeness#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Batch Bayesian Active Learning by Disagreement (BatchBALD) | BatchBALD | pool classification single-annotator diverse-batch | Houlsby et al.[11], Kirsch et al.[12] |
| Active Learning with Cost Embedding (ALCE) | CostEmbeddingAL | pool classification single-annotator top-k-batch | Huang and Lin[13] |
| Epistemic Uncertainty Sampling (EpisUS) | EpistemicUncertaintySampling | pool classification single-annotator top-k-batch | Nguyen et al.[14] |
| Bayesian Active Learning by Disagreement (BALD) | GreedyBALD | pool classification single-annotator top-k-batch | Houlsby et al.[11] |
| Monte-Carlo Expected Error Reduction (EER) with Log-Loss | MonteCarloEER | pool classification single-annotator top-k-batch | Roy and McCallum[15] |
| Monte-Carlo Expected Error Reduction (EER) with Misclassification-Loss | MonteCarloEER | pool classification single-annotator top-k-batch | Roy and McCallum[15] |
| Query-by-Committee (QBC) with Kullback-Leibler Divergence | QueryByCommittee | pool classification single-annotator top-k-batch | Seung et al.[16], McCallum and Nigam[17] |
| Query-by-Committee (QBC) with Vote Entropy | QueryByCommittee | pool classification single-annotator top-k-batch | Seung et al.[16], Engelson and Dagan[18] |
| Query-by-Committee (QBC) with Variation Ratios | QueryByCommittee | pool classification single-annotator top-k-batch | Seung et al.[16], Beluch et al.[19] |
| Uncertainty Sampling (US) with Margin | UncertaintySampling | pool classification single-annotator top-k-batch | Settles[20] |
| Uncertainty Sampling (US) with Least-Confidence | UncertaintySampling | pool classification single-annotator top-k-batch | Settles[20] |
| Uncertainty Sampling (US) with Entropy | UncertaintySampling | pool classification single-annotator top-k-batch | Settles[20] |
| Uncertainty Sampling with Expected Average Precision (USAP) | UncertaintySampling | pool classification single-annotator top-k-batch | Wang et al.[21] |
| Value of Information (VOI) on Unlabeled Samples | ValueOfInformationEER | pool classification single-annotator top-k-batch | Joshi et al.[22] |
| Value of Information (VOI) on Labeled Samples | ValueOfInformationEER | pool classification single-annotator top-k-batch | Margineantu[23] |
| Value of Information (VOI) | ValueOfInformationEER | pool classification single-annotator top-k-batch | Kapoor et al.[24] |

Representativeness#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Core Set | CoreSet | pool regression classification single-annotator diverse-batch | Sener and Savarese[25] |
| Discriminative Active Learning (DAL) | DiscriminativeAL | pool classification regression single-annotator diverse-batch | Gissin and Shalev-Shwartz[26] |
| MaxHerding | MaxHerding | pool regression classification single-annotator diverse-batch | Bae et al.[27] |
| Probability Coverage (ProbCover) | ProbCover | pool classification single-annotator diverse-batch | Yehuda et al.[28] |
| Typical Clustering (TypiClust) | TypiClust | pool regression classification single-annotator diverse-batch | Hacohen et al.[29] |

Wrapper#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Parallel Utility Estimation Wrapper | ParallelUtilityEstimationWrapper | pool regression classification single-annotator top-k-batch | |
| Sub-sampling Wrapper | SubSamplingWrapper | pool regression classification single-annotator top-k-batch diverse-batch | |

Pool-based AL Strategies for Regression#

Hybrid#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Regression Tree Based Active Learning (RT-AL) with Random Selection | RegressionTreeBasedAL | pool regression single-annotator diverse-batch | Jose et al.[30] |
| Regression Tree Based Active Learning (RT-AL) with Diversity Selection | RegressionTreeBasedAL | pool regression single-annotator diverse-batch | Jose et al.[30] |
| Regression Tree Based Active Learning (RT-AL) with Representativity Selection | RegressionTreeBasedAL | pool regression single-annotator diverse-batch | Jose et al.[30] |

Informativeness#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Expected Model Change | ExpectedModelChangeMaximization | pool regression single-annotator top-k-batch | Cai et al.[31] |
| Expected Model Output Change | ExpectedModelOutputChange | pool regression single-annotator top-k-batch | Käding et al.[32] |
| Expected Model Variance Reduction | ExpectedModelVarianceReduction | pool regression single-annotator top-k-batch | Cohn et al.[33] |
| Regression based Kullback Leibler Divergence Maximization | KLDivergenceMaximization | pool regression single-annotator top-k-batch | Elreedy et al.[34] |
| Query-by-Committee (QBC) with Empirical Variance | QueryByCommittee | pool regression single-annotator | Seung et al.[16], Burbidge et al.[35] |

Representativeness#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Greedy Sampling on the Target Space (GSy) | GreedySamplingTarget | pool regression single-annotator diverse-batch | Wu et al.[36] |
| Improved Greedy Sampling (GSi) | GreedySamplingTarget | pool regression single-annotator diverse-batch | Wu et al.[36] |
| Greedy Sampling on the Feature Space (GSx) | GreedySamplingX | pool regression classification single-annotator diverse-batch | Wu et al.[36] |

Pool-based AL Strategies for Multiple Annotators#

Baseline#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Interval Estimation Threshold | IntervalEstimationThreshold | pool classification multi-annotator | |
| Random Sampling | SingleAnnotatorWrapper | pool classification multi-annotator | |
| Core Set + Greedy Selection | SingleAnnotatorWrapper | pool classification multi-annotator | |

Stream-based AL Strategies#

Baseline#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Periodic Sampling | PeriodicSampling | stream classification single-annotator top-k-batch | |
| Stream Random Sampling | StreamRandomSampling | stream classification single-annotator top-k-batch | |

Hybrid#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Cognitive Dual-Query Strategy with Fixed-Uncertainty | CognitiveDualQueryStrategyFixUn | stream classification single-annotator top-k-batch | Liu et al.[37] |
| Cognitive Dual-Query Strategy with Random Sampling | CognitiveDualQueryStrategyRan | stream classification single-annotator top-k-batch | Liu et al.[37] |
| Cognitive Dual-Query Strategy with Randomized-Variable-Uncertainty | CognitiveDualQueryStrategyRanVarUn | stream classification single-annotator top-k-batch | Liu et al.[37] |
| Cognitive Dual-Query Strategy with Variable-Uncertainty | CognitiveDualQueryStrategyVarUn | stream classification single-annotator top-k-batch | Liu et al.[37] |
| Split | Split | stream classification single-annotator top-k-batch | Žliobaitė et al.[38] |
| Density Based Active Learning for Data Streams | StreamDensityBasedAL | stream classification single-annotator top-k-batch | Ienco et al.[39] |
| Probabilistic Active Learning in Datastreams | StreamProbabilisticAL | stream classification single-annotator top-k-batch | Kottke et al.[40] |

Informativeness#

| Method | Base Class | Tags | Reference |
| --- | --- | --- | --- |
| Fixed-Uncertainty | FixedUncertainty | stream classification single-annotator top-k-batch | Žliobaitė et al.[38] |
| Randomized-Variable-Uncertainty | RandomVariableUncertainty | stream classification single-annotator top-k-batch | Žliobaitė et al.[38] |
| Variable-Uncertainty | VariableUncertainty | stream classification single-annotator top-k-batch | Žliobaitė et al.[38] |

References#