Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 141 to 150 from 1050 (0.496s).
Loading phrases to help you
refine your search...
Re: Ask - Mahout - [mail # user]
...Hi Oscar,  It would help a lot if you could provide a bit more information on the  data that you wish to cluster, particularly the dimensionality of each  record and the numbe...
   Author: Jeff Eastman, 2012-04-16, 13:46
Re: is hadoop necessary for clustering in mahout? - Mahout - [mail # user]
...Most of the Mahout clustering algorithms have an -xm sequential CLI  option that runs locally in-memory from/to Hadoop-style sequence files.  And, as below, you can also call the J...
   Author: Jeff Eastman, 2012-03-22, 15:41
Re: empty vector out of clusterdump - Mahout - [mail # user]
...Empty? Note that the printouts of Mahout vectors prints only the  non-zero elements. It looks like you may have had many such zero vectors  and they were clustered into VL-1705919 ...
   Author: Jeff Eastman, 2012-03-20, 18:16
Re: Why there is "Infinity" values for the vector of a K-Means cluster center point? - Mahout - [mail # user]
...Good question. The only way I can think of an infinity in a Kluster center is if there were some infinity values in the vectors it observed. The center (centroid) is calculated in each itera...
   Author: Jeff Eastman, 2012-03-16, 16:16
Re: [jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator - Mahout - [mail # dev]
...+1 Paritosh, this is exactly what I envisioned. And I also like your  idea of first converting them all to use ClusterWritable. Go for it!  On 3/15/12 10:42 AM, Paritosh Ranjan wro...
   Author: Jeff Eastman, 2012-03-15, 20:06
Re: [jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator - Mahout - [mail # dev]
...Yes, that was my point. below It may, in fact, be impossible to  implement and commit them independently since so much of Mahout  clustering depends upon the Cluster sequenceFile. ...
   Author: Jeff Eastman, 2012-03-15, 13:54
Re: [jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator - Mahout - [mail # dev]
...Sure Paritosh, go ahead and take a crack at it. I am moving from CO to  PA for the next few weeks and won't be able to do much coding during  that period. I suspect you will also n...
   Author: Jeff Eastman, 2012-03-15, 03:53
Re: Canopy Job failed processing, Error: Java heap space - Mahout - [mail # user]
...With Canopy this is a symptom of T2 being too large. This causes an explosion of clusters - in the limit, one per input vector - and if vector dimension is large too there is no amount of me...
   Author: Jeff Eastman, 2012-03-15, 03:49
Re: canopy cluster size - Mahout - [mail # user]
...YW, you might also try Dirichlet with a  DistanceMeasureClusterDistribution on a CosineDistanceMeasure. See  DirichletClusterer or the wiki for an explanation of why this might als...
   Author: Jeff Eastman, 2012-03-14, 13:52
Re: canopy cluster size - Mahout - [mail # user]
...No, Canopy only uses a single reducer, so what's happening is many  mappers are munching your data in parallel and then the poor little  reducer has to combine them all. It is slow...
   Author: Jeff Eastman, 2012-03-13, 22:01
Sort:
project
Mahout (1050)
type
mail # dev (574)
mail # user (428)
issue (37)
wiki (11)
date
last 7 days (2)
last 30 days (2)
last 90 days (8)
last 6 months (27)
last 9 months (1050)
author
Ted Dunning (3512)
Sean Owen (2727)
Grant Ingersoll (1214)
Jeff Eastman (1050)
Robin Anil (1003)
Lance Norskog (872)
Jake Mannix (810)
Dmitriy Lyubimov (740)
Sebastian Schelter (697)
Benson Margulies (510)
Drew Farris (406)
Isabel Drost (324)
Paritosh Ranjan (275)
Pat Ferrel (228)
Dan Filimon (203)