Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 71 to 80 from 31424 (0.465s).
Loading phrases to help you
refine your search...
Re: WELCOME to [EMAIL PROTECTED] - Mahout - [mail # user]
...Hi,  I'm trying to tune mahout ssvd job to not spill so much, I'm trying to tune     io.sort.mb   1047    but when I try to put any bigger value, ie.   &nb...
   Author: Jakub Pawłowski, 2013-05-22, 11:31
RE: Feature vector generation from Bag-of-Words - Mahout - [mail # user]
...Hi Suneel,  I implemented your suggested approach. This was simple to implement and you have made the steps pretty clear. Thankyou :) . I have few query in creating Features using Multi...
   Author: Stuti Awasthi, 2013-05-22, 11:02
RE: Feature vector generation from Bag-of-Words - Mahout - [mail # user]
...Thanks Suneel,  I will go through your approach and will also learn more about various api's you have suggested. I am new to Mahout so will need to dig more. :)  By the time I was ...
   Author: Stuti Awasthi, 2013-05-22, 06:31
Re: Which database should I use with Mahout - Mahout - [mail # user]
...On Tue, May 21, 2013 at 10:34 PM, Johannes Schulte  wrote:   Sorry about that.  Your English is good enough that I hadn't noticed any deficit.  Dithering is constructive ...
   Author: Ted Dunning, 2013-05-22, 06:30
Re: Which database should I use with Mahout - Mahout - [mail # user]
...Thanks for the list...as a non native speaker I got problems understanding the meaning of dithering here.  I got the feeling that somewhere between a) and d) there is also diversificati...
   Author: Johannes Schulte, 2013-05-22, 05:34
Re: Interpreting Cluster Dump Metrics - Mahout - [mail # user]
...On Tue, May 21, 2013 at 8:47 PM, Pat Ferrel  wrote:    It is really hard to tell with these numbers.  IN spite of their heritage, these scaled average distances are kind ...
   Author: Ted Dunning, 2013-05-22, 04:53
Interpreting Cluster Dump Metrics - Mahout - [mail # user]
...Doing some clustering on text docs. I iterated over a range of k using kmeans and each time got the average intra and inter cluster density scores as shown below.   Just want to make su...
   Author: Pat Ferrel, 2013-05-22, 03:47
Re: Which database should I use with Mahout - Mahout - [mail # user]
...Inline   On Tue, May 21, 2013 at 8:59 AM, Pat Ferrel  wrote:   This is a time filter.  How many transactions did this turn out to be.  I typically recommend truncati...
   Author: Ted Dunning, 2013-05-22, 00:42
Re: Which database should I use with Mahout - Mahout - [mail # user]
...I have so far just used the weights that Solr applies natively.  In my experience, what makes a recommendation engine work better is, in order of importance,  a) dithering so that ...
   Author: Ted Dunning, 2013-05-22, 00:30
Re: Review Request: MAHOUT-1224: Add the option of running a StreamingKMeans pass in the Reducer before BallKMeans - Mahout - [mail # dev]
...On Tue, May 21, 2013 at 1:47 AM, Dan Filimon wrote:  I think I wasn't clear.  k log (n/m) is a bound on the number of points.  It has nothing to do with the cluster-attach-if-...
   Author: Ted Dunning, 2013-05-21, 21:40
Sort:
project
Lucene (130010)
Solr (104012)
ElasticSearch (33873)
Mahout (31332)
Nutch (16551)
ManifoldCF (15141)
Tika (5956)
Lucene.Net (5782)
PyLucene (1905)
Droids (1668)
Lucy (1359)
OpenRelevance (286)
type
mail # user (14812)
mail # dev (8788)
javadoc (5436)
issue (1175)
source code (993)
wiki (127)
Sematext # blog (92)
web site (1)
date
last 7 days (139)
last 30 days (405)
last 90 days (1625)
last 6 months (2756)
last 9 months (24995)
author
Ted Dunning (3538)
Sean Owen (2734)
Grant Ingersoll (1214)
Jeff Eastman (1051)
Robin Anil (1004)
Lance Norskog (872)
Jake Mannix (813)
Dmitriy Lyubimov (752)
Sebastian Schelter (697)
Benson Margulies (510)
Drew Farris (406)
Isabel Drost (324)
Paritosh Ranjan (275)
Pat Ferrel (237)
Dan Filimon (205)