Results 41 to 50 of 133 (0.231s).
Re: too few db_fetched - Nutch - [mail # user]
...Hi Jose,  We get this question very often and the short answer, with regard to the 'stats' printout, is that everything is probably fine. For a more complete answer please search in the maili...
   Author: remi tassing, 2012-02-29, 03:12
Re: crawldb modifications - Nutch - [mail # user]
...I think he meant to remove some specific URLs, not everything.  On Tue, Feb 28, 2012 at 1:51 PM, Markus Jelsma wrote:  ...
   Author: remi tassing, 2012-02-28, 12:04
Re: How to crowl AJAX populated pages - Nutch - [mail # user]
...Same question here...  I have similar issues where (redirection) links are given through JavaScript.  I hope I haven't hijacked your post, as I see these issues as very similar.  Rem...
   Author: remi tassing, 2012-02-28, 09:02
Re: crawldb modifications - Nutch - [mail # user]
...What I do in this case is erase the db, use the mergesegs command with the -filter option, and then updatedb.  I would love to know if there is a simpler way.  Remi  On Monday, Feb...
   Author: remi tassing, 2012-02-28, 06:03
Re: IOExeption when crawling with nutch in Fetching process - Nutch - [mail # user]
...Hi,  In my case, I had this issue when I inadvertently tampered with the segment files.  I had another similar issue, but clearly different from yours because it happened right before or a...
   Author: remi tassing, 2012-02-25, 16:01
Re: Exception in thread "main" java.io.IOException: Job failed! - Nutch - [mail # user]
...disk size issue? access rights?  On Thu, Feb 23, 2012 at 12:39 PM, Daniel Bourrion  wrote:  ...
   Author: remi tassing, 2012-02-23, 10:47
Re: http.redirect.max - Nutch - [mail # user]
...Would you give Nutch-1.4 a try? Maybe this bug is already solved?  Remi  On Thursday, February 23, 2012, xuyuanme  wrote: pages. http://lucene.472066.n3.nabble.com/http-redire...
   Author: remi tassing, 2012-02-23, 04:49
Re: Exception in thread "main" java.io.IOException: Job failed! - Nutch - [mail # user]
...Hey Daniel,  You can find more log output in the logs/Hadoop files.  Remi  On Wednesday, February 22, 2012, Daniel Bourrion  wrote: solution that should crawl our specific dom...
   Author: remi tassing, 2012-02-22, 15:36
Using jcifs for NTLM in HttpClient - Nutch - [mail # user]
...Hey guys,  I've been trying to figure out how to incorporate jcifs [1] into Nutch but I just need a hint here.  I downloaded the jcifs class and updated the CLASSPATH. I was planni...
   Author: remi tassing, 2012-02-22, 13:58
Re: Optimising the speed of Nutch. - Nutch - [mail # user]
...Try decreasing the number of fetcher threads instead...  On Wed, Feb 22, 2012 at 2:33 PM, Bharat Goyal wrote:  ...
   Author: remi tassing, 2012-02-22, 13:19