cluster_completeness
- er_evaluation.cluster_completeness(prediction, reference)
Cluster completeness score (based on conditional entropy).
This wraps scikit-learn’s completeness score function.
- Parameters:
prediction (Series) – membership vector for predicted clusters, i.e. a pandas Series indexed by mention ids and with values representing predicted cluster assignment.
reference (Series) – membership vector for reference (true) clusters, i.e. a pandas Series indexed by mention ids and with values representing reference cluster assignment.
- Returns:
completeness score
- Return type:
float
Notes
The prediction and reference membership vectors are inner joined before this metric is computed.
NA values are dropped from membership vectors prior to computing the metric.
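The two notes above, together with the conditional-entropy definition, can be illustrated with a small standard-library sketch. This is not the library's implementation (which wraps scikit-learn); it assumes plain dicts in place of pandas Series, uses `None` to stand in for NA, and the helper name `completeness_sketch` is hypothetical. It computes 1 - H(K | C) / H(K), where K is the predicted clustering and C the reference clustering, after dropping missing values and keeping only mention ids present in both vectors.

```python
from collections import Counter
from math import log


def completeness_sketch(prediction, reference):
    """Illustrative completeness score on dict-based membership vectors.

    Hypothetical helper for explanation only; not part of er_evaluation.
    """
    # Drop missing assignments (None stands in for NA in this sketch).
    pred = {k: v for k, v in prediction.items() if v is not None}
    ref = {k: v for k, v in reference.items() if v is not None}

    # Inner join: keep only mention ids present in both vectors.
    shared = pred.keys() & ref.keys()
    n = len(shared)
    if n == 0:
        return 1.0  # assumed convention for empty input

    pred_counts = Counter(pred[i] for i in shared)
    ref_counts = Counter(ref[i] for i in shared)
    joint_counts = Counter((ref[i], pred[i]) for i in shared)

    # H(K): entropy of the predicted cluster assignment.
    h_pred = -sum((c / n) * log(c / n) for c in pred_counts.values())
    if h_pred == 0.0:
        return 1.0  # a single predicted cluster is trivially complete

    # H(K | C): entropy of predicted clusters given the reference clusters.
    h_pred_given_ref = -sum(
        (c / n) * log(c / ref_counts[r]) for (r, _), c in joint_counts.items()
    )
    return 1.0 - h_pred_given_ref / h_pred


# "m5" is dropped (missing prediction) and "m6" is dropped by the inner join.
prediction = {"m1": "a", "m2": "a", "m3": "b", "m4": "b", "m5": None}
reference = {"m1": 1, "m2": 1, "m3": 1, "m4": 1, "m6": 2}

# The reference places m1..m4 in one cluster, but the prediction splits them
# evenly in two, so the reference cluster is not kept together at all.
score = completeness_sketch(prediction, reference)
```

In this example the score is 0, since knowing the reference cluster tells nothing about which predicted cluster a mention falls in. A prediction that keeps every reference cluster intact, including one that merges everything into a single cluster, scores 1.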