Results 141 to 150 of 1050 (0.496s).

Re: Ask - Mahout - [mail # user]
...Hi Oscar, It would help a lot if you could provide a bit more information on the data that you wish to cluster, particularly the dimensionality of each record and the numbe...
Author: Jeff Eastman, 2012-04-16, 13:46

Re: is hadoop necessary for clustering in mahout? - Mahout - [mail # user]
...Most of the Mahout clustering algorithms have an -xm sequential CLI option that runs locally in-memory from/to Hadoop-style sequence files. And, as below, you can also call the J...
Author: Jeff Eastman, 2012-03-22, 15:41

Re: empty vector out of clusterdump - Mahout - [mail # user]
...Empty? Note that the printouts of Mahout vectors prints only the non-zero elements. It looks like you may have had many such zero vectors and they were clustered into VL-1705919 ...
Author: Jeff Eastman, 2012-03-20, 18:16

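The point made in the snippet above, that a printout listing only non-zero elements makes an all-zero vector look empty, can be illustrated with a minimal sketch. This is not Mahout code: the function name `format_sparse` and the `{index:value,...}` layout are assumptions chosen for illustration.

```python
def format_sparse(vector):
    """Render a vector as {index:value,...}, listing only non-zero elements.

    Hypothetical helper for illustration; not part of Mahout.
    """
    parts = [f"{i}:{x}" for i, x in enumerate(vector) if x != 0]
    return "{" + ",".join(parts) + "}"


dense = [0.0, 1.5, 0.0, 2.0]
zeros = [0.0, 0.0, 0.0, 0.0]

print(format_sparse(dense))  # only indices 1 and 3 appear
print(format_sparse(zeros))  # nothing to list, so the output looks empty
```

Under this assumption a vector of four zeros prints as a pair of empty braces even though it still has four dimensions.
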
Re: Why there is "Infinity" values for the vector of a K-Means cluster center point? - Mahout - [mail # user]
...Good question. The only way I can think of an infinity in a Kluster center is if there were some infinity values in the vectors it observed. The center (centroid) is calculated in each itera...
Author: Jeff Eastman, 2012-03-16, 16:16

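How an infinite input value ends up in a center, as the snippet above suggests, can be shown with a minimal sketch. It assumes the centroid is the plain component-wise arithmetic mean of the observed vectors; the function name `centroid` is hypothetical and this is not Mahout's implementation.

```python
import math


def centroid(vectors):
    """Component-wise arithmetic mean of equal-length vectors (illustrative only)."""
    n = len(vectors)
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / n for i in range(dim)]


# One observed vector carries an infinite value in its second component.
points = [[1.0, 2.0], [3.0, math.inf], [5.0, 4.0]]
center = centroid(points)

print(center)
print(math.isinf(center[1]))  # the infinity survives the averaging
```

A single infinite component is enough: any finite sum that includes it is infinite, and dividing by the number of vectors does not bring it back.
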
Re: [jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator - Mahout - [mail # dev]
...+1 Paritosh, this is exactly what I envisioned. And I also like your idea of first converting them all to use ClusterWritable. Go for it! On 3/15/12 10:42 AM, Paritosh Ranjan wro...
Author: Jeff Eastman, 2012-03-15, 20:06

Re: [jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator - Mahout - [mail # dev]
...Yes, that was my point. below It may, in fact, be impossible to implement and commit them independently since so much of Mahout clustering depends upon the Cluster sequenceFile. ...
Author: Jeff Eastman, 2012-03-15, 13:54

Re: [jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator - Mahout - [mail # dev]
...Sure Paritosh, go ahead and take a crack at it. I am moving from CO to PA for the next few weeks and won't be able to do much coding during that period. I suspect you will also n...
Author: Jeff Eastman, 2012-03-15, 03:53

Re: Canopy Job failed processing, Error: Java heap space - Mahout - [mail # user]
...With Canopy this is a symptom of T2 being too small. This causes an explosion of clusters - in the limit, one per input vector - and if vector dimension is large too there is no amount of me...
Author: Jeff Eastman, 2012-03-15, 03:49

Re: canopy cluster size - Mahout - [mail # user]
...YW, you might also try Dirichlet with a DistanceMeasureClusterDistribution on a CosineDistanceMeasure. See DirichletClusterer or the wiki for an explanation of why this might als...
Author: Jeff Eastman, 2012-03-14, 13:52

Re: canopy cluster size - Mahout - [mail # user]
...No, Canopy only uses a single reducer, so what's happening is many mappers are munching your data in parallel and then the poor little reducer has to combine them all. It is slow...
Author: Jeff Eastman, 2012-03-13, 22:01