Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Threaded View
Mahout, mail # user - Cluster Evaluation 0.8 style


Copy link to this message
-
Re: Cluster Evaluation 0.8 style
Jeff Eastman 2012-07-11, 18:46
The ClusterEvaluator has methods for both inter-cluster density and
intra-cluster density. The former computes the density using the cluster
centers, while the latter uses a set of representative points extracted
from the clustered points. This reduces the computational overhead of
calculating a density from all of the points from each cluster.

The unit test uses synthetic data and produces reasonable looking
results afaict. Have you had negative experiences with that?

On 7/11/12 1:21 PM, Pat Ferrel wrote:
> ...
>
> It was my understanding that the ClusterEvaluator included an attempt
> to provide this measure with intra-cluster density per cluster though
> it looks like that output has been removed?
>