cluster_homogeneity#
- er_evaluation.cluster_homogeneity(prediction, reference)[source]#
Cluster homogeneity score (based on conditional entropy).
This wraps scikit-learn’s homogeneity score function.
- Parameters:
prediction (Series) – membership vector for predicted clusters, i.e. a pandas Series indexed by mention ids and with values representing predicted cluster assignment.
reference (Series) – membership vector for reference (true) clusters, i.e. a pandas Series indexed by mention ids and with values representing reference cluster assignment.
- Returns:
homogeneity score
- Return type:
float
Notes
The prediction and reference membership vectors are inner joined before this metric is computed.
NA values are dropped from membership vectors prior to computing the metric.