| clear query|facets|time |
Search criteria: .
Results from 41 to 50 from
133 (0.231s).
|
|
|
Loading phrases to help you refine your search...
|
|
Re: too few db_fetched - Nutch - [mail # user]
|
|
...Hi Jose, We have this question very often and the short answer, with regard to 'stats' printout, is that everything is probably fine. For a more complete answer plz search in the maili...
|
|
|
Author: remi tassing,
2012-02-29, 03:12
|
|
|
Re: crawldb modifications - Nutch - [mail # user]
|
|
...I think he ment to remove some specific URLs not everything On Tue, Feb 28, 2012 at 1:51 PM, Markus Jelsma wrote: ...
|
|
|
Author: remi tassing,
2012-02-28, 12:04
|
|
|
Re: How to crowl AJAX populated pages - Nutch - [mail # user]
|
|
...Same question here... I have similar issues where (redirection)links are given through JavaScript I hope I haven't hijacked your post as I see these issues very similar Rem...
|
|
|
Author: remi tassing,
2012-02-28, 09:02
|
|
|
Re: crawldb modifications - Nutch - [mail # user]
|
|
...What do in this case is to erase the db, use the.command mergesegs with -filter option and then updatedb. I would.love to know if there is a simpler way Remi On Monday, Feb...
|
|
|
Author: remi tassing,
2012-02-28, 06:03
|
|
|
Re: IOExeption when crawling with nutch in Fetching process - Nutch - [mail # user]
|
|
...Hi, in my case, I had this issue when I inadvertently tempered the segment files. I had another similar issue but clearly different to yours because it happened right before or a...
|
|
|
Author: remi tassing,
2012-02-25, 16:01
|
|
|
Re: Exception in thread "main" java.io.IOException: Job failed! - Nutch - [mail # user]
|
|
...disk size issue? access rights? On Thu, Feb 23, 2012 at 12:39 PM, Daniel Bourrion wrote: ...
|
|
|
Author: remi tassing,
2012-02-23, 10:47
|
|
|
Re: http.redirect.max - Nutch - [mail # user]
|
|
...Would you give Nucth-1.4 a try? Maybe this bug is already solved? Remi On Thursday, February 23, 2012, xuyuanme wrote: pages. http://lucene.472066.n3.nabble.com/http-redire...
|
|
|
Author: remi tassing,
2012-02-23, 04:49
|
|
|
Re: Exception in thread "main" java.io.IOException: Job failed! - Nutch - [mail # user]
|
|
...Hey Daniel, You can find more output log in logs/Hadoop files Remi On Wednesday, February 22, 2012, Daniel Bourrion wrote: solution that should crawl our specific dom...
|
|
|
Author: remi tassing,
2012-02-22, 15:36
|
|
|
Using jcifs for NTLM in HttpClient - Nutch - [mail # user]
|
|
...Hey guys, I've been trying to figure out how to incorporate jcifs [1] into Nutch but I just need a hint here. I downloaded the jcifs class and updated the CLASSPATH. I was planni...
|
|
|
Author: remi tassing,
2012-02-22, 13:58
|
|
|
Re: Optimising the speed of Nutch. - Nutch - [mail # user]
|
|
...Try decreasing the number of fetcher threads instead... On Wed, Feb 22, 2012 at 2:33 PM, Bharat Goyal wrote: ...
|
|
|
Author: remi tassing,
2012-02-22, 13:19
|
|
|
|