Results 41 to 50 of 872 (0.149s).
Re: Maven build unpacks jars- would jar of jars work? - Mahout - [mail # user]
...Ah. Half the items I see about this include "you need a special classloader". On Sun, Aug 5, 2012 at 1:56 AM, Ted Dunning wrote: Lance Norskog [EMAIL PROTECTED]...
Author: Lance Norskog, 2012-08-07, 01:16

Re: Error in KMeansDriver - Mahout - [mail # user]
...How do you run this program? Are you running a Hadoop cluster app out of Eclipse? I do not know if that mode copies your jars out to the executors. On Sat, Aug 4, 2012 at 10:30 PM, Abh...
Author: Lance Norskog, 2012-08-05, 06:47

Re: Maven build unpacks jars- would jar of jars work? - Mahout - [mail # user]
...There's an ant task, a discussion of its use, and a discussion of jars-in-jars. http://ant.apache.org/manual/Tasks/manifestclasspath.html http://stackoverflow.com/questions/858766/gene...
Author: Lance Norskog, 2012-08-04, 23:35

Maven build unpacks jars- would jar of jars work? - Mahout - [mail # user]
...The Maven build does a grand project unpacking multiple jars into one big one. Java apparently supports packing jars inside other jars- the outer jar needs a classpath property for the insid...
Author: Lance Norskog, 2012-08-04, 23:04

Re: Tags generation? - Mahout - [mail # user]
...Thanks everyone- I hadn't considered the stem/synonym problem. I have code for regularizing a doc/term matrix with tf, binary, log and augmented norm for the cells and idf, gfidf, entropy, n...
Author: Lance Norskog, 2012-08-04, 01:31

Tags generation? - Mahout - [mail # user]
...I'm looking for a good tags generator. A function from document/term matrix onto term list is a good bet, since it creates an analysis of the interplay of document and term. I have an LSA im...
Author: Lance Norskog, 2012-08-03, 07:35

Re: ERROR: OutOfMemoryError: Java heap space - Mahout - [mail # user]
...If you are on Unix, and you want to split your text on line boundaries, the 'split' program will create many files with the same number of lines. On Wed, Aug 1, 2012 at 5:29 AM, pricil...
Author: Lance Norskog, 2012-08-01, 23:59

Re: cmdump - Mahout - [mail # user]
...In TestNaiveBayesDriver around line 115 is where it reads the output of the Naive Bayes job and makes a confusion matrix. It is probably better to make cmdump read that file instead of expec...
Author: Lance Norskog, 2012-07-31, 03:03

Re: Extracting data from websites - Mahout - [mail # user]
...The easiest web crawler I know of is 'wget'. On Mon, Jul 30, 2012 at 7:31 AM, David Rose wrote: Lance Norskog [EMAIL PROTECTED]...
Author: Lance Norskog, 2012-07-30, 21:46

Re: FW: cmdump - Mahout - [mail # user]
...I don't know what the output directory holds. There used to be code in TestNaiveBayesClassifier that saved the confusion matrix to a separate file. This code has evaporated. On S...
Author: Lance Norskog, 2012-07-30, 21:28
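
The reply in the OutOfMemoryError thread above suggests the Unix 'split' program for cutting a large text file on line boundaries; on a typical Unix system, `split -l 1000 input.txt part_` does this. Where that tool is not available, roughly the same effect can be sketched in Python. This is an illustration only, not code from the thread: the function names, the `part_` prefix, and the numbered `.txt` output names are all hypothetical choices. As with 'split', the last file may hold fewer lines than the others.

```python
from pathlib import Path


def _write_chunk(directory, prefix, index, lines):
    """Write one chunk of lines to a numbered file and return its path."""
    # Hypothetical naming scheme: part_0000.txt, part_0001.txt, ...
    target = directory / f"{prefix}{index:04d}.txt"
    target.write_text("".join(lines), encoding="utf-8")
    return target


def split_by_lines(source, lines_per_file, prefix="part_"):
    """Split a text file into files of at most `lines_per_file` lines each.

    Output files are written next to the source file. Returns the list of
    output paths in order.
    """
    if lines_per_file < 1:
        raise ValueError("lines_per_file must be at least 1")
    source = Path(source)
    outputs = []
    chunk = []
    # Read line by line so the whole input never has to fit in memory.
    with source.open("r", encoding="utf-8") as handle:
        for line in handle:
            chunk.append(line)
            if len(chunk) == lines_per_file:
                outputs.append(
                    _write_chunk(source.parent, prefix, len(outputs), chunk)
                )
                chunk = []
    # Flush the final, possibly shorter, chunk.
    if chunk:
        outputs.append(_write_chunk(source.parent, prefix, len(outputs), chunk))
    return outputs
```

Reading the input one line at a time keeps memory use bounded by the chunk size, which is the point of splitting a file that is too large to process in one pass.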