skactiveml.base.ClassFrequencyEstimator#
- class skactiveml.base.ClassFrequencyEstimator(class_prior=0, classes=None, missing_label=nan, cost_matrix=None, random_state=None)[source]#
Bases:
SkactivemlClassifier
Extends scikit-activeml classifiers to estimators that are able to estimate class frequencies for given samples (by calling ‘predict_freq’).
- Parameters
- classesarray-like, shape (n_classes), default=None
Holds the label for each class. If none, the classes are determined during the fit.
- missing_labelscalar or str or np.nan or None, default=np.nan
Value to represent a missing label.
- cost_matrixarray-like of shape (n_classes, n_classes)
Cost matrix with cost_matrix[i,j] indicating cost of predicting class classes[j] for a sample of class classes[i]. Can be only set, if classes is not none.
- class_priorfloat or array-like, shape (n_classes), default=0
Prior observations of the class frequency estimates. If class_prior is an array, the entry class_prior[i] indicates the non-negative prior number of samples belonging to class classes_[i]. If class_prior is a float, class_prior indicates the non-negative prior number of samples per class.
- random_stateint or np.RandomState or None, default=None
Determines random number for ‘predict’ method. Pass an int for reproducible results across multiple method calls.
- Attributes
- classes_np.ndarray of shape (n_classes)
Holds the label for each class after fitting.
- class_prior_np.ndarray of shape (n_classes)
Prior observations of the class frequency estimates. The entry class_prior_[i] indicates the non-negative prior number of samples belonging to class classes_[i].
- cost_matrix_np.ndarray of shape (classes, classes)
Cost matrix with cost_matrix_[i,j] indicating cost of predicting class classes_[j] for a sample of class classes_[i].
Methods
fit
(X, y[, sample_weight])Fit the model using X as training data and y as class labels.
Get metadata routing of this object.
get_params
([deep])Get parameters for this estimator.
predict
(X)Return class label predictions for the test samples X.
predict_freq
(X)Return class frequency estimates for the test samples X.
Return probability estimates for the test data X.
sample_proba
(X[, n_samples, random_state])Samples probability vectors from Dirichlet distributions whose parameters alphas are defined as the sum of the frequency estimates returned by predict_freq and the class_prior.
score
(X, y[, sample_weight])Return the mean accuracy on the given test data and labels.
set_fit_request
(*[, sample_weight])Request metadata passed to the
fit
method.set_params
(**params)Set the parameters of this estimator.
set_score_request
(*[, sample_weight])Request metadata passed to the
score
method.- abstract fit(X, y, sample_weight=None)#
Fit the model using X as training data and y as class labels.
- Parameters
- Xmatrix-like, shape (n_samples, n_features)
The sample matrix X is the feature matrix representing the samples.
- yarray-like, shape (n_samples) or (n_samples, n_outputs)
It contains the class labels of the training samples. The number of class labels may be variable for the samples, where missing labels are represented the attribute ‘missing_label’.
- sample_weightarray-like, shape (n_samples) or (n_samples, n_outputs)
It contains the weights of the training samples’ class labels. It must have the same shape as y.
- Returns
- self: skactiveml.base.SkactivemlClassifier,
The skactiveml.base.SkactivemlClassifier object fitted on the training data.
- get_metadata_routing()#
Get metadata routing of this object.
Please check User Guide on how the routing mechanism works.
- Returns
- routingMetadataRequest
A
MetadataRequest
encapsulating routing information.
- get_params(deep=True)#
Get parameters for this estimator.
- Parameters
- deepbool, default=True
If True, will return the parameters for this estimator and contained subobjects that are estimators.
- Returns
- paramsdict
Parameter names mapped to their values.
- predict(X)#
Return class label predictions for the test samples X.
- Parameters
- Xarray-like of shape (n_samples, n_features)
Input samples.
- Returns
- ynumpy.ndarray of shape (n_samples)
Predicted class labels of the test samples X. Classes are ordered according to classes_.
- abstract predict_freq(X)[source]#
Return class frequency estimates for the test samples X.
- Parameters
- X: array-like of shape (n_samples, n_features)
Test samples whose class frequencies are to be estimated.
- Returns
- F: array-like of shape (n_samples, classes)
The class frequency estimates of the test samples ‘X’. Classes are ordered according to attribute ‘classes_’.
- predict_proba(X)[source]#
Return probability estimates for the test data X.
- Parameters
- Xarray-like, shape (n_samples, n_features) or
- shape (n_samples, m_samples) if metric == ‘precomputed’
Input samples.
- Returns
- Parray-like of shape (n_samples, classes)
The class probabilities of the test samples. Classes are ordered according to classes_.
- sample_proba(X, n_samples=10, random_state=None)[source]#
Samples probability vectors from Dirichlet distributions whose parameters alphas are defined as the sum of the frequency estimates returned by predict_freq and the class_prior.
- Parameters
- Xarray-like of shape (n_test_samples, n_features)
Test samples for which n_samples probability vectors are to be sampled.
- n_samplesint, default=10
Number of probability vectors to sample for each X[i].
- random_stateint or numpy.random.RandomState or None, default=None
Ensure reproducibility when sampling probability vectors from the Dirichlet distributions.
- Returns
- Parray-like of shape (n_samples, n_test_samples, n_classes)
There are n_samples class probability vectors for each test sample in X. Classes are ordered according to classes_.
- score(X, y, sample_weight=None)#
Return the mean accuracy on the given test data and labels.
- Parameters
- Xarray-like of shape (n_samples, n_features)
Test samples.
- yarray-like of shape (n_samples,)
True labels for X.
- sample_weightarray-like of shape (n_samples,), default=None
Sample weights.
- Returns
- scorefloat
Mean accuracy of self.predict(X) regarding y.
- set_fit_request(*, sample_weight: Union[bool, None, str] = '$UNCHANGED$') ClassFrequencyEstimator #
Request metadata passed to the
fit
method.Note that this method is only relevant if
enable_metadata_routing=True
(seesklearn.set_config()
). Please see User Guide on how the routing mechanism works.The options for each parameter are:
True
: metadata is requested, and passed tofit
if provided. The request is ignored if metadata is not provided.False
: metadata is not requested and the meta-estimator will not pass it tofit
.None
: metadata is not requested, and the meta-estimator will raise an error if the user provides it.str
: metadata should be passed to the meta-estimator with this given alias instead of the original name.
The default (
sklearn.utils.metadata_routing.UNCHANGED
) retains the existing request. This allows you to change the request for some parameters and not others.New in version 1.3.
Note
This method is only relevant if this estimator is used as a sub-estimator of a meta-estimator, e.g. used inside a
Pipeline
. Otherwise it has no effect.- Parameters
- sample_weightstr, True, False, or None, default=sklearn.utils.metadata_routing.UNCHANGED
Metadata routing for
sample_weight
parameter infit
.
- Returns
- selfobject
The updated object.
- set_params(**params)#
Set the parameters of this estimator.
The method works on simple estimators as well as on nested objects (such as
Pipeline
). The latter have parameters of the form<component>__<parameter>
so that it’s possible to update each component of a nested object.- Parameters
- **paramsdict
Estimator parameters.
- Returns
- selfestimator instance
Estimator instance.
- set_score_request(*, sample_weight: Union[bool, None, str] = '$UNCHANGED$') ClassFrequencyEstimator #
Request metadata passed to the
score
method.Note that this method is only relevant if
enable_metadata_routing=True
(seesklearn.set_config()
). Please see User Guide on how the routing mechanism works.The options for each parameter are:
True
: metadata is requested, and passed toscore
if provided. The request is ignored if metadata is not provided.False
: metadata is not requested and the meta-estimator will not pass it toscore
.None
: metadata is not requested, and the meta-estimator will raise an error if the user provides it.str
: metadata should be passed to the meta-estimator with this given alias instead of the original name.
The default (
sklearn.utils.metadata_routing.UNCHANGED
) retains the existing request. This allows you to change the request for some parameters and not others.New in version 1.3.
Note
This method is only relevant if this estimator is used as a sub-estimator of a meta-estimator, e.g. used inside a
Pipeline
. Otherwise it has no effect.- Parameters
- sample_weightstr, True, False, or None, default=sklearn.utils.metadata_routing.UNCHANGED
Metadata routing for
sample_weight
parameter inscore
.
- Returns
- selfobject
The updated object.
Examples using skactiveml.base.ClassFrequencyEstimator
#

Batch Active Learning by Diverse Gradient Embedding (BADGE)

Batch Density-Diversity-Distribution-Distance Sampling

Query-by-Committee (QBC) with Kullback-Leibler Divergence

Cognitive Dual-Query Strategy with Random Sampling

Cognitive Dual-Query Strategy with Fixed-Uncertainty

Cognitive Dual-Query Strategy with Variable-Uncertainty

Cognitive Dual-Query Strategy with Randomized-Variable-Uncertainty