Mahout, mail # user - Naive-Bayes work flow


Re: Naive-Bayes work flow
Lance Norskog 2012-02-16, 06:23
The file examples/bin/asf-examples.sh shows how to use the Naive Bayes
classifier.  There are a few stages needed to prepare files before
they get to the classifier training and test passes.

If writing your own code, I would use the Apache Commons CSV parser:
http://commons.apache.org/sandbox/csv/

This parser is pulled into the Mahout integration/ sub-project, where it
is used by the 'CSVVectorIterator'. If you need a Hadoop file reader for
CSV, you would have to write a new one from scratch.
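
For the "writing your own code" case, here is a minimal sketch of turning CSV lines into labeled rows. It uses only the JDK and a plain split on commas, so it does not handle quoted fields or embedded commas, which is the reason to prefer a real CSV parser. The class name CsvRecordSketch, the LabeledRow type, and the column layout (label first, numeric features after) are assumptions for illustration; none of this is Mahout code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class CsvRecordSketch {

    /** One parsed row: a class label plus its numeric features (hypothetical type). */
    static final class LabeledRow {
        final String label;
        final double[] features;

        LabeledRow(String label, double[] features) {
            this.label = label;
            this.features = features;
        }
    }

    /** Parses one line, assuming the label is in column 0 and the rest are numbers. */
    static LabeledRow parseLine(String line) {
        // Naive split: no support for quoted fields or escaped commas.
        String[] fields = line.split(",", -1);
        double[] features = new double[fields.length - 1];
        for (int i = 1; i < fields.length; i++) {
            features[i - 1] = Double.parseDouble(fields[i].trim());
        }
        return new LabeledRow(fields[0].trim(), features);
    }

    /** Parses every non-blank line into a row. */
    static List<LabeledRow> parseAll(Iterable<String> lines) {
        List<LabeledRow> rows = new ArrayList<>();
        for (String line : lines) {
            if (line.trim().isEmpty()) {
                continue; // skip blank lines
            }
            rows.add(parseLine(line));
        }
        return rows;
    }

    public static void main(String[] args) {
        // Made-up sample data, only to exercise the parser.
        List<String> sample = Arrays.asList("spam,1.0,0.5", "", "ham,0.0,2.5");
        for (LabeledRow row : parseAll(sample)) {
            System.out.println(row.label + " -> " + row.features.length + " features");
        }
    }
}
```

Rows parsed this way would still need to be converted into whatever input format the classifier stages expect.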
On Wed, Feb 15, 2012 at 9:41 PM, Naveenchandra
<[EMAIL PROTECTED]> wrote:
>
>  Hi all,
>
>  Can anyone explain how and where the input files are read? I am now
> going through the source code of the Bayes classifier, and I know that we
> create a job which calls the class that reads the input file.
>
>  May I know which Java file reads the records from the input CSV files?

--
Lance Norskog
[EMAIL PROTECTED]