| clear query|facets|time |
Search criteria: .
Results from 1 to 10 from
29 (0.18s).
|
|
|
Loading phrases to help you refine your search...
|
|
Re: command line input dataset format for k-means and USCensus dataset - Mahout - [mail # user]
|
|
...Eduard, My guess is you will need to convert your CSV vectors to Mahout vector format and then run that through k-means. I believe the seqdirectory program just converts a collection of ...
|
|
|
Author: DAN HELM,
2012-12-01, 01:55
|
|
|
Re: Reading the vector files - Mahout - [mail # user]
|
|
...See: http://amgadmadkour.blogspot.com/2012/07/kmeans-clustering-using-apache-mahout.html ________________________________ From: Chui-Hui Chiu To: [EMAIL PROTECTED] ...
|
|
|
Author: DAN HELM,
2012-11-21, 22:24
|
|
|
Re: Conversion of point numbers to key strings - Mahout - [mail # user]
|
|
...Chris, I assume you ran the kmeans algorithm? I believe the clusteredPoints file should prefix the document vectors with the text version of the processed documents (assuming seq2sparse ...
|
|
|
Author: DAN HELM,
2012-11-18, 21:11
|
|
|
Re: Converting one large text file with multiple documents to SequenceFile format - Mahout - [mail # user]
|
|
...CVB requires the vector input to be Key=IntWritable, Value=VectorWritable. rowid will convert the seq2sparse output to this format as you assumed. But when you ran rowid I assume the vecto...
|
|
|
Author: DAN HELM,
2012-11-12, 17:27
|
|
|
Re: Mahout: CVB: Error - Mahout - [mail # user]
|
|
...Arni, I had not formally contributed that code but it was posted before via email. Here is an initial approach developed where rowid will output one "part" file for each input "part" fil...
|
|
|
Author: DAN HELM,
2012-11-04, 23:42
|
|
|
Re: Mahout: CVB: Error - Mahout - [mail # user]
|
|
...Arni, I believe you are running with the wrong input for the cvb command: ./mahout cvb -i /user/root/sparse-vectors-cvb/docIndex ..... It should be: ./mahout cvb -i /user/root/sparse-vec...
|
|
|
Author: DAN HELM,
2012-11-04, 00:38
|
|
|
Re: Using LDA in Mahout 0.0.7 - Mahout - [mail # user]
|
|
...Hi Diego, A number of us had the same issue when first working with the new CVB algorithm. The vector keys for CVB need to be Integers. You can use the rowid utility to con...
|
|
|
Author: DAN HELM,
2012-10-28, 21:40
|
|
|
Re: K-Means generates only one cluster - Mahout - [mail # user]
|
|
...To look at vectors you can check out the data in the "clusteredPoints" folder generated by k-means. You can write the data out in text format via the seqdumper command (as shown in step 5 h...
|
|
|
Author: DAN HELM,
2012-10-19, 21:32
|
|
|
Re: K-Means generates only one cluster - Mahout - [mail # user]
|
|
...We previously did some k-means clustering runs on different sized collections and noticed how that a large cluster was often created along with some smaller others. In digging deeper it turn...
|
|
|
Author: DAN HELM,
2012-10-19, 03:50
|
|
|
Re: More mappers in RowId - Mahout - [mail # user]
|
|
...Hi Anna, When I wrote that mod if was specifically so that rowid would output multiple files (instead of one Matrix file). This was done so the LDA clustering I was using would run multip...
|
|
|
Author: DAN HELM,
2012-08-17, 21:21
|
|
|
|