Mahout, mail # user - Extracting data from websites


Re: Extracting data from websites
Sean Owen 2012-07-30, 12:26
Extract as in web crawling? No, Mahout has nothing to do with that.
Extract as in entity extraction? I don't think there are relevant
implementations here either, though that begins to border on machine
learning.
Mahout is more about clustering and classification of documents than
anything else.

On Mon, Jul 30, 2012 at 1:22 PM, David Rose <[EMAIL PROTECTED]> wrote:

> Hi all,
>
> I apologize for how basic my question is, but I am very new to all of
> this: machine learning, writing code, all of it.  I was finally able to get
> Mahout downloaded, installed, and running.  I was assigned a project at my
> work to try to use Mahout to extract data from websites that we input.  Is
> this possible? Can anyone help me with suggestions or instructions on how
> to do so? I appreciate any help on this, as I have only two more weeks to
> finish this project.
>
> Thanks,
>
> David Rose