Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 1050 (0.177s).
Loading phrases to help you
refine your search...
Re: Problems with KMeans Clustering - Radius calculation returns incorrect ZERO value in some cases. - Mahout - [mail # user]
...What you have observed is correct. During the final iteration, points  are observed by each cluster and these observations are used to  calculate the new cluster center and radius....
   Author: Jeff Eastman, 2013-05-15, 17:15
Re: Problems with KMeans Clustering - Radius calculation returns incorrect ZERO value in some cases. - Mahout - [mail # user]
...Hi Erinn,  The radius calculation in KMeans and other clustering algorithms uses a  running sums algorithm (see RunningSumsGaussianAccumulator) and the  radius is really the s...
   Author: Jeff Eastman, 2013-05-14, 17:09
Re: Fuzyy Clustering accumulates lots of memory - Mahout - [mail # user]
...Fuzzy KMeans will use a lot of heap memory because every vector is  observed (with weighting) by every cluster. This will make the cluster  centers (and other vectors) much more de...
   Author: Jeff Eastman, 2013-03-29, 16:11
Re: Retrieving Fuzzy Cluster Probabilities - Mahout - [mail # user]
...On 3/22/13 10:39 AM, Sebastian Briesemeister wrote: It sounds like you may have found a bug in the MR version. Those  probabilities should be the same. The pdf vectors are normalized I ...
   Author: Jeff Eastman, 2013-03-22, 15:17
Re: [Draft] Board Report - Mahout - [mail # dev]
...+1 Even though new development has slowed lately it still paints a  favorable picture from the adoption side  On 3/11/13 2:38 PM, Ted Dunning wrote:  ...
   Author: Jeff Eastman, 2013-03-11, 19:09
Re: KMean cluster produces more clusters then requested - Mahout - [mail # user]
...Unfortunately, all attachments are stripped by the Apache mail server.  You will need to open a JIRA to get those attachments to us. You could;  however, also tell us a bit more ab...
   Author: Jeff Eastman, 2013-03-09, 14:38
Re: KMeans Throwing Hadoop write errors for large values of K - Mahout - [mail # user]
...I don't know where the timeout is happening, but each mapper and each  reducer writes all its clusters out at the end of its run. With a large  number of clusters, and with the non...
   Author: Jeff Eastman, 2013-03-08, 21:01
Re: How to pick t1 and t2 in canopy - Mahout - [mail # user]
...This is a common question and you can search the email archives for more  discussion. Start by setting t1 == t2 as t2 is the variable that  controls the number of clusters produced...
   Author: Jeff Eastman, 2013-03-07, 13:55
Re: how to use a custom distance measure with kmeans? - Mahout - [mail # user]
...You also need to specify a fully-qualified class name  On 2/12/13 11:48 AM, Dan Filimon wrote:  ...
   Author: Jeff Eastman, 2013-02-12, 18:12
Re: Early version of streaming k-means in Mahout - Mahout - [mail # dev]
...The Job classes in examples are very, very old carryovers from a time  before our Drivers had a CLI. Just follow the current pattern, extending  AbstractJob.  On 2/11/13 3:00 ...
   Author: Jeff Eastman, 2013-02-12, 18:10
Sort:
project
Mahout (1050)
type
mail # dev (574)
mail # user (428)
issue (37)
wiki (11)
date
last 7 days (0)
last 30 days (2)
last 90 days (8)
last 6 months (27)
last 9 months (1050)
author
Ted Dunning (3527)
Sean Owen (2734)
Grant Ingersoll (1214)
Jeff Eastman (1050)
Robin Anil (1004)
Lance Norskog (872)
Jake Mannix (810)
Dmitriy Lyubimov (744)
Sebastian Schelter (697)
Benson Margulies (510)
Drew Farris (406)
Isabel Drost (324)
Paritosh Ranjan (275)
Pat Ferrel (235)
Dan Filimon (205)