Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Threaded View
Nutch, mail # user - problem running Nutch 1.5.1 in distributed mode- simple crawl


Copy link to this message
-
Re: problem running Nutch 1.5.1 in distributed mode- simple crawl
Lewis John Mcgibbney 2012-09-15, 23:49
Hi Casey,

On Sun, Sep 16, 2012 at 12:22 AM, Casey McTaggart
<[EMAIL PROTECTED]> wrote:

> I run this command:
> sudo -u hdfs hadoop jar build/apache-nutch-1.5.1.job
> org.apache.nutch.crawl.Crawl urls/seed.txt -dir crawl

I don-t think you should do this.

Please see a similar post a couple days back [0] and Julien's [1] answer.

Get back to us if you have probs. I hope this works for you.

Lewis
[1] http://www.mail-archive.com/user%40nutch.apache.org/msg07564.html
[0] http://www.mail-archive.com/user%40nutch.apache.org/msg07565.html