Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Plain View
Mahout, mail # dev - Re: Online machine learning on top of Hama BSP


+
Edward J. Yoon 2012-05-25, 08:35
+
Manuel Blechschmidt 2012-05-25, 10:22
+
Thomas Jungblut 2012-05-25, 10:44
Copy link to this message
-
Re: Online machine learning on top of Hama BSP
Ted Dunning 2012-05-25, 17:20
Apache Giraph probably offers a more mature BSP model of computation.  My
guess is that it would make a stronger implementation substrate.  It
certainly has a very strong community.

On Fri, May 25, 2012 at 10:44 AM, Thomas Jungblut <
[EMAIL PROTECTED]> wrote:

> Hi Manuel,
>
> 300k is small, I have one with 6 mio clicks.
> However it is more a question of interest and what algorithms could be
> suitable for BSP.
> In case you wonder what BSP is, it stands for bulk synchronous parallel
> [1].
> We think that realtime and strongly iterative algorithms that are slow in
> mapreduce could be more efficiently solved with BSP.
> If you're interested, let us know.
>
> Regards,
> Thomas
>
> [1] http://en.wikipedia.org/wiki/Bulk_synchronous_parallel
>
> 2012/5/25 Manuel Blechschmidt <[EMAIL PROTECTED]>
>
> > Hi Edward,
> > do you already have a test dataset?
> >
> > I might get one with about 300.000 clicks for you.
> >
> > It is from www.nelou.com and we are already running a recommender in
> > preview mode:
> >
> http://www.nelou.com/artikel-803746/Overall-von-mysuro#__apaxoPreviewMode
> >
> > It could be the case that you would have to sign an NDA. Would this be
> > possible for you?
> >
> > /Manuel
> >
> > On 25.05.2012, at 10:34, Edward J. Yoon wrote:
> >
> > > OKay, I'm FWD this to mahout dev.
> > >
> > > I'm planning to create a project related to On-line machine learning,
> > > as a Apache Hama sub-module. Since the graph of message queues and
> > > workers could be implemented using BSP (see also [1]). The first idea
> > > is On-line recommendation system based on click-stream data.
> > >
> > > If you have interested in this plan, let's talk together here.
> > >
> > > 1.
> >
> http://codingwiththomas.blogspot.com/2011/10/apache-hama-realtime-processing.html
> > >
> > > ---------- Forwarded message ----------
> > > From: Thomas Jungblut <[EMAIL PROTECTED]>
> > > Date: Fri, May 25, 2012 at 4:55 PM
> > > Subject: Re: Online machine learning on top of Hama BSP
> > > To: [EMAIL PROTECTED]
> > >
> > >
> > > Should we cooperate with the Mahout guys on this? I'm pretty sure they
> > > would have fun with it.
> > > Edward, do you want to ask them?
> > >
> > > 2012/5/25 Tommaso Teofili <[EMAIL PROTECTED]>
> > >
> > >> Do you have a plan for that Edward?
> > >> A separate package in examples or a separate (online) machine learning
> > >> module? Or something else?
> > >> Regards
> > >> Tommaso
> > >>
> > >> 2012/5/25 Edward J. Yoon <[EMAIL PROTECTED]>
> > >>
> > >>> OKay, then let's get started.
> > >>>
> > >>> My first idea is simple online recommendation system based on
> > >> click-stream
> > >>> data.
> > >>>
> > >>> On Thu, May 24, 2012 at 6:26 PM, Praveen Sripati
> > >>> <[EMAIL PROTECTED]> wrote:
> > >>>> +1
> > >>>>
> > >>>> For those who are interested in ML, please check this. GNU Octave is
> > >>> used.
> > >>>>
> > >>>> https://www.coursera.org/course/ml
> > >>>>
> > >>>> Another session is yet to be announced.
> > >>>>
> > >>>> Thanks,
> > >>>> Praveen
> > >>>>
> > >>>> On Thu, May 24, 2012 at 12:54 PM, Thomas Jungblut <
> > >>>> [EMAIL PROTECTED]> wrote:
> > >>>>
> > >>>>> +1
> > >>>>>
> > >>>>> 2012/5/24 Tommaso Teofili <[EMAIL PROTECTED]>
> > >>>>>
> > >>>>>> and same here :)
> > >>>>>>
> > >>>>>> 2012/5/24 Vaijanath Rao <[EMAIL PROTECTED]>
> > >>>>>>
> > >>>>>>> +1 me too
> > >>>>>>> On May 23, 2012 10:26 PM, "Aditya Sarawgi" <
> > >>> [EMAIL PROTECTED]>
> > >>>>>>> wrote:
> > >>>>>>>
> > >>>>>>>> +1
> > >>>>>>>> I would be happy to help :)
> > >>>>>>>>
> > >>>>>>>> On Wed, May 23, 2012 at 6:23 PM, Edward J. Yoon <
> > >>>>> [EMAIL PROTECTED]
> > >>>>>>>>> wrote:
> > >>>>>>>>
> > >>>>>>>>> Hi,
> > >>>>>>>>>
> > >>>>>>>>> Does anyone interesting in online machine learning?
> > >>>>>>>>>
> > >>>>>>>>> --
> > >>>>>>>>> Best Regards, Edward J. Yoon
> > >>>>>>>>> @eddieyoon
> > >>>>>>>>>
> > >>>>>>>>
> > >>>>>>>>
> > >>>>>>>>
> > >>>>>>>
+
Thomas Jungblut 2012-05-25, 17:24
+
Sebastian Schelter 2012-05-25, 19:24
+
Edward J. Yoon 2012-05-25, 23:31
+
Edward J. Yoon 2012-05-25, 23:41
+
Ted Dunning 2012-05-26, 07:54
+
Edward J. Yoon 2012-05-26, 09:58
+
Suraj Menon 2012-05-26, 11:22
+
Ted Dunning 2012-05-26, 21:03
+
Robin Anil 2012-05-27, 16:11
+
Suraj Menon 2012-05-28, 11:40
+
Robin Anil 2012-05-28, 16:08
+
Sean Owen 2012-05-28, 16:12
+
Robin Anil 2012-05-28, 16:17
+
Thomas Jungblut 2012-05-26, 09:26
+
Sebastian Schelter 2012-05-26, 12:05
+
Ted Dunning 2012-05-26, 20:55