pred_cluster_sizes_from_table

er_evaluation.pred_cluster_sizes_from_table(error_table)

Compute the average predicted cluster size for each reference cluster from a record error table.

Parameters:

error_table (DataFrame) – Record error table, for example as returned by record_error_table.

Returns:

Predicted cluster size for each reference cluster, averaged over the records in that cluster and indexed by reference cluster.

Return type:

Series

Examples

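>>> import pandas as pd
>>> from er_evaluation import record_error_table, pred_cluster_sizes_from_table
>>> prediction = pd.Series(index=[1,2,3,4,5,6,7,8], data=[1,1,2,3,2,4,4,4])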
>>> sample = pd.Series(index=[1,2,3,4,5,6,7], data=["c1", "c1", "c1", "c2", "c2", "c3", "c3"])
>>> error_table = record_error_table(prediction, sample)
>>> pred_cluster_sizes_from_table(error_table)
reference
c1    2.0
c2    1.5
c3    3.0
Name: pred_cluster_size, dtype: float64
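
The values in this example can be checked by hand with plain pandas. The following is a minimal sketch, not the library's implementation: it assumes that the result is the mean, over the sampled records of each reference cluster, of the size of the predicted cluster containing each record. That assumption is inferred from the example output above, and the names record_pred_size and manual are hypothetical.

```python
import pandas as pd

prediction = pd.Series(index=[1, 2, 3, 4, 5, 6, 7, 8], data=[1, 1, 2, 3, 2, 4, 4, 4])
sample = pd.Series(index=[1, 2, 3, 4, 5, 6, 7], data=["c1", "c1", "c1", "c2", "c2", "c3", "c3"])

# Size of the predicted cluster that each record belongs to
# (assumption: sizes are counted over all predicted records, including record 8).
record_pred_size = prediction.map(prediction.value_counts())

# Keep only the sampled records, then average within each reference cluster.
manual = (
    record_pred_size.loc[sample.index]
    .astype(float)
    .groupby(sample)
    .mean()
    .rename("pred_cluster_size")
    .rename_axis("reference")
)
print(manual)
```

Under that assumption, c2 contains records 4 and 5, whose predicted clusters have sizes 1 and 2, giving 1.5. Likewise c1 gives 2.0 and c3 gives 3.0, matching the output shown above.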