Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 56 (0.41s).
Loading phrases to help you
refine your search...
Re: Which database should I use with Mahout - Mahout - [mail # user]
...Hi Pat,  On May 20, 2013, at 9:46am, Pat Ferrel wrote:   Yes - you can "boost" individual terms in the query.  And you can use payloads on terms in the index to adjust their s...
   Author: Ken Krugler, 2013-05-20, 17:45
Re: Feature reduction for LibLinear weights - Mahout - [mail # user]
...Hi Ted,  On Apr 13, 2013, at 8:46pm, Ted Dunning wrote:   I finally got to run this on a full set of training data, and it worked really well - even with a single hash function. &n...
   Author: Ken Krugler, 2013-04-24, 20:56
Re: Feature reduction for LibLinear weights - Mahout - [mail # user]
...Hi Ted,  On Apr 13, 2013, at 8:46pm, Ted Dunning wrote:   Thanks for clarifying - I was stuck on using the hash trick to get rid of the terms to index map, versus creating a denser...
   Author: Ken Krugler, 2013-04-17, 14:29
Re: Feature reduction for LibLinear weights - Mahout - [mail # user]
...On Apr 12, 2013, at 11:55pm, Ted Dunning wrote:   Unfortunately LibLinear takes feature indices directly (assumes they're sequential ints from 0..n-1), so I don't think feature hashing ...
   Author: Ken Krugler, 2013-04-13, 14:05
Feature reduction for LibLinear weights - Mahout - [mail # user]
...Hi all,  We're (ab)using LibLinear (linear SVM) as a multi-class classifier, with 200+ labels and 400K features.  This results in a model that's > 800MB, which is a bit unwieldy...
   Author: Ken Krugler, 2013-04-13, 01:30
Re: Implicit preferences - Mahout - [mail # user]
...On Feb 11, 2013, at 1:57am, Johannes Schulte wrote:   I assume you're talking about passing false for the includeSpanScore parameter in the PayloadTermQuery constructor, yes?  Anyw...
   Author: Ken Krugler, 2013-02-11, 18:27
Re: Implicit preferences - Mahout - [mail # user]
...On Feb 10, 2013, at 2:39pm, Johannes Schulte wrote:   I haven't been following this discussion, but in general using payloads is a way of providing additional information about a term t...
   Author: Ken Krugler, 2013-02-11, 00:11
DARPA XDATA award to Georgia Tech - Mahout - [mail # user]
...http://www.scientificcomputing.com/news-HPC-Georgia-Tech-Wins-2_7-Million-to-Advance-Big-Data-Technology-121212.aspx   Wonder if they know about Mahout? :)   Ken Krugler +1 530-210...
   Author: Ken Krugler, 2012-12-15, 23:15
Re: RowSimilarity - Mahout - [mail # user]
...On Jul 18, 2012, at 9:07am, Pat Ferrel wrote:   Both, since they (must) use the same approach for the query to do a good job of matching against docs in the index.  Often two word ...
   Author: Ken Krugler, 2012-07-18, 16:53
Re: RowSimilarity - Mahout - [mail # user]
...Hi Pat,  On Jul 14, 2012, at 8:17am, Pat Ferrel wrote:   The simplistic approach I used was to extract the top 50 terms (with TF*IDF weights) from the target document, then use tho...
   Author: Ken Krugler, 2012-07-18, 03:22
Sort:
project
Tika (161)
Solr (160)
Nutch (90)
Mahout (56)
Lucene (52)
Droids (4)
type
mail # user (56)
date
last 7 days (1)
last 30 days (1)
last 90 days (5)
last 6 months (8)
last 9 months (56)
author
Ted Dunning (3538)
Sean Owen (2734)
Grant Ingersoll (1214)
Jeff Eastman (1051)
Robin Anil (1004)
Lance Norskog (872)
Jake Mannix (813)
Dmitriy Lyubimov (752)
Sebastian Schelter (697)
Benson Margulies (510)
Drew Farris (406)
Isabel Drost (324)
Paritosh Ranjan (275)
Pat Ferrel (237)
Dan Filimon (205)
Ken Krugler