Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Threaded View
Mahout, mail # dev - Test Execution Time


Copy link to this message
-
Re: Test Execution Time
Jake Mannix 2011-08-16, 18:16
As long as nobody's changing the logic going into Lanczos, disabling it
should be fine.  But I agree with Grant, that we should have it running
somewhere (continuous integration?).

I can try and squeeze it's size down, but you don't get very good
convergence
on a small matrix when using these iterative matrix multiplier-style
algorithms,
which makes it hard to see that it's still working as expected.

  -jake

On Wed, Aug 10, 2011 at 11:54 PM, Sean Owen <[EMAIL PROTECTED]> wrote:

> That would reduce memory requirements unilaterally? no, don't think so. I
> have not run into memory problems.
> The issue here is execution time and it's a big problem indeed.
>
> Would anyone object to disabling this test for now? it's getting costly in
> the dev cycle.
>
> On Thu, Aug 11, 2011 at 5:52 AM, Lance Norskog <[EMAIL PROTECTED]> wrote:
>
> > Another aspect of testing Map/Reduce jobs is memory bounds. An M/R job
> > should be able to run in a fairly constant amount of ram per JVM from
> > beginning to end, to avoid blowing up late in the game. Is there any
> > harness around that would do this?
> >
> > Lance
> >
> > On Sun, Aug 7, 2011 at 9:04 PM, Ted Dunning <[EMAIL PROTECTED]>
> wrote:
> > > We have used testNg at work for just this segregation of fast and slow
> > tests and it works pretty well. It is also useful for stripping out
> > cascading failures. This means that integration tests can be executed
> only
> > if the underlying unit tests succeed. Another nice thing is that testNg
> says
> > how many tests were skipped.
> > >
> > > Junit has recently been adding similar capabilities but I haven't kept
> > up.
> > >
> > > Sent from my iPhone
> > >
> > > On Aug 7, 2011, at 19:18, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
> > >
> > >> It would likely help if we could get them to run in parallel, perhaps.
> >  Also, seems like TestNG might have some better features on paper for
> this
> > kind of stuff (I think you can annotate some things as "slow" or "fast"
> and
> > then choose to run them separately).  I haven't explored much yet in this
> > way.  Has anyone else used TestNG?
> > >>
> > >> -Grant
> > >>
> > >> On Aug 7, 2011, at 9:12 PM, Ted Dunning wrote:
> > >>
> > >>> I really don't think that this distinction needs to be made.  The
> > >>> distinction between unit and integration test is important from a
> > technical
> > >>> point of view, but as an organizing principle the topic and target of
> > the
> > >>> test is probably a better idea than whether the test is a functional
> or
> > unit
> > >>> test or whether it has randomized initial conditions or whether it
> has
> > for
> > >>> loops in it.  Tests should be organized by what they test.
> > >>>
> > >>> On Sun, Aug 7, 2011 at 5:08 PM, Lance Norskog <[EMAIL PROTECTED]>
> > wrote:
> > >>>
> > >>>> Sure! Perhaps the long-running ones can move to a new 'regression'
> > >>>> area? examples/ is partly what these are, so examples/regression
> makes
> > >>>> sense.
> > >>>>
> > >>>> On Sun, Aug 7, 2011 at 11:11 AM, Sean Owen <[EMAIL PROTECTED]>
> wrote:
> > >>>>> This test is indeed by far the culprit. I already reduced its test
> > input
> > >>>>> size to hurry it up, but it's gone slow again.
> > >>>>>
> > >>>>> Lance, indeed, these are not all unit tests -- nobody said they
> were.
> > The
> > >>>>> test is useful.
> > >>>>>
> > >>>>> I do suggest, however, we comment it out. Jake suggested it coudl
> be
> > made
> > >>>>> faster but I don't think he followed up.
> > >>>>>
> > >>>>> Sean
> > >>>>>
> > >>>>> On Sun, Aug 7, 2011 at 12:13 AM, Lance Norskog <[EMAIL PROTECTED]>
> > >>>> wrote:
> > >>>>>
> > >>>>>> Comment out DistributedLanczosWhatsit. Zing!
> > >>>>>>
> > >>>>>> A unit test takes a bit of code X and checks that code path A goes
> > >>>>>> "tick" and code path B goes "tock" and bogus input C throws an
> > >>>>>> exception. There's no such thing as a "unit test" that runs twelve
> > M/R
> > >>>>>> jobs in a row.
> > >>>>>>
> > >>>>>> There's MRUnit, which seems trapped in the Hadoop