Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Threaded View
Mahout, mail # user - Re: Parallel ALS-WR on very large matrix -- crashing (I think)


Copy link to this message
-
Re: Parallel ALS-WR on very large matrix -- crashing (I think)
Sebastian Schelter 2012-02-02, 09:47
Nicholas,

can you give us the detailed arguments you start the job with? I'd
especially be interested in the number of features (--numFeatures) you
use. Do you use the job with implicit feedback data
(--implicitFeedback=true)?

The memory requirements of the job are the following:

In each iteration either the item-features matrix (items x features) or
the user-features matrix (users x features) is loaded into the memory of
each mapper. Then the original user-item matrix (or its transpose) is
read row-wise by the mappers and they recompute the features via
AlternatingLeastSquaresSolver/ImplicitFeedbackAlternatingLeastSquaresSolver.

--sebastian
On 02.02.2012 09:53, Sean Owen wrote:
> I have seen this happen in "normal" operation when the sorting on the
> mapper is taking a long long time, because the output is large. You can
> tell it to increase the timeout.  If this is what is happening, you won't
> have a chance to update a counter as a keep-alive ping, but yes that is
> generally right otherwise. If this is the case it's that a mapper is
> outputting a whole lot of info, perhaps 'too much'. I don't know for sure,
> just another a guess for the pile.
>
> On Thu, Feb 2, 2012 at 1:44 AM, Ted Dunning <[EMAIL PROTECTED]> wrote:
>
>> Status reporting happens automatically when output is generated.  In a long
>> computation, it is good form to occasionally update a counter or otherwise
>> indicate that the computation is still progressing.
>>
>> On Wed, Feb 1, 2012 at 5:23 PM, Nicholas Kolegraff
>> <[EMAIL PROTECTED]>wrote:
>>
>>> Do you know if it should still report status in the midst of a complex
>>> task?  Seems questionable that it wouldn't just send a friendly hello?
>>>
>>>
>>
>