Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Threaded View
Mahout, mail # user - online clustering with mahout


Copy link to this message
-
Re: online clustering with mahout
Jeff Eastman 2012-05-14, 13:20
Look at ClusterIterator.iterate(). This will do clustering in memory
without any Hadoop. ClusterIterator.iterateSeq will do clustering in a
single process from/to Hadoop sequence files but without map/reduce.
ClusterIterator.iterateMR uses full Hadoop to do clustering for the same
algorithms (k-means, fuzzy-k, Dirichlet), all configured using
ClusteringPolicy instances.

On 5/14/12 8:34 AM, Ioan Eugen Stan wrote:
> Hi,
>
> Dos mahout offer online clustering out of the box using sequential
> clustering (no MapReduce). I'm looking over the code (trunk) and I
> found ClusterClassifier but I can't figure out how that works. Any
> examples or more docs on this topic?
>
> Thanks,