Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Plain View
Nutch, mail # user - Have yet to complete a very large filesystem crawl


+
webdev1977 2010-08-10, 17:55
+
Eddie Drapkin 2010-08-10, 23:04
Copy link to this message
-
Re: Have yet to complete a very large filesystem crawl
webdev1977 2010-08-11, 10:03

That would make sense, but I am pretty sure this is not the issue. In this
config, I am running with 1024mb of memory.  I kind of thought that nutch
was able to run on this amount of memory?  It would just take much longer.

I tried to run the same crawl using the SMB plugin on a Linux machine with
8GB of memory.  Of course it ran longer, but in the end, I got the same
error.  I have turned on various levels of logging and debugging, and I have
had no luck figuring out what might be causing it.  

--
View this message in context: http://lucene.472066.n3.nabble.com/Have-yet-to-complete-a-very-large-filesystem-crawl-tp1076547p1085270.html
Sent from the Nutch - User mailing list archive at Nabble.com.
+
Claudio Martella 2010-08-11, 13:56
+
webdev1977 2010-08-11, 15:23
+
Julien Nioche 2010-08-11, 15:39
+
Doğacan Güney 2010-08-11, 15:44
+
webdev1977 2010-08-11, 16:59
+
Claudio Martella 2010-08-11, 16:03
+
webdev1977 2010-08-11, 17:00
+
webdev1977 2010-08-11, 14:02