Lithium Guava 2012-04-12, 12:28
Hi,
I've played with the bayes 20newsgroups example, but I'd like to try running the cbayes algorithm on it also. The example script doesn't seem to offer this, so I dug into the code a bit and it looks like the input to cbayes is sequence files rather than key/value text files.
Can anyone tell me how those sequence files should be formatted? I couldn't find it documented anywhere. Also I don't suppose there's a handy prepare data program to get it running on the 20newsgroups data easily?
Thanks,
Tom
Robin Anil 2012-04-12, 12:53
In the command line example, replace "bayes" with "cbayes". That's all you need to do.
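As a rough illustration of the swap described above, the sketch below takes a command expressed as an argument list and replaces the value that follows the classifier-type option. The `-type` flag name, the `trainclassifier` subcommand, and the input and output paths are assumptions about what the example script's command looks like, and `with_classifier_type` is a hypothetical helper, so check them against the script shipped with your Mahout version.

```python
def with_classifier_type(argv, new_type):
    """Return a copy of argv with the value after "-type" replaced by new_type."""
    out = list(argv)
    # Stop one short of the end so that out[i + 1] always exists.
    for i, arg in enumerate(out[:-1]):
        if arg == "-type":
            out[i + 1] = new_type
            return out
    raise ValueError("no -type option found in command")


# Assumed shape of the training command from the example script; the flags
# and paths are illustrative, not copied from a specific Mahout release.
train_bayes = [
    "mahout", "trainclassifier",
    "-i", "20news-input/bayes-train-input",
    "-o", "newsmodel",
    "-type", "bayes",
]

train_cbayes = with_classifier_type(train_bayes, "cbayes")
print(" ".join(train_cbayes))
```

The same substitution would apply to the test command, since both steps need to agree on the classifier type.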
Lithium Guava 2012-04-12, 12:57
Thanks, I just realised I was getting the wrong end of the stick - looking at the theta normalizer driver code thinking it was the classifier driver...
Cheers!