Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 111 to 120 from 1181 (0.194s).
Loading phrases to help you
refine your search...
Re: Unable to crawl a series of pages in tutorial - Nutch - [mail # user]
...Which version of Nutch are you using?   On Wed, Apr 24, 2013 at 11:45 AM, Yves S. Garret <[EMAIL PROTECTED]     *Lewis*...
   Author: Lewis John Mcgibbney, 2013-04-24, 19:14
Re: Unable to crawl a series of pages in tutorial - Nutch - [mail # user]
...Yes   On Wed, Apr 24, 2013 at 11:41 AM, Yves S. Garret <[EMAIL PROTECTED]     *Lewis*...
   Author: Lewis John Mcgibbney, 2013-04-24, 19:13
Re: Error Nutch2 and HBase - Nutch - [mail # user]
...Two things here, 1) Your talking an upgrade of the HBase API usage within gora-hbase., 2) We don't and won't aim to support non Apache distributions of such libraries unless the community ab...
   Author: Lewis John Mcgibbney, 2013-04-24, 03:50
Re: Error Nutch2 and HBase - Nutch - [mail # user]
...Hi Maximiliano, This version of HBase is most likely not compatabile with Gora HBase ersion is: 0.94.2-cdh4.2.0   On Tue, Apr 23, 2013 at 8:08 PM, Maximiliano Marin  wrote:   ...
   Author: Lewis John Mcgibbney, 2013-04-24, 03:27
Re: Any way to run tasks after Nutch is done executing? - Nutch - [mail # user]
...You should use this script http://svn.apache.org/repos/asf/nutch/branches/2.x/src/bin/crawl Feng also produced a patch (which we haven;t reviewed yet) for automating the capture of the batch...
   Author: Lewis John Mcgibbney, 2013-04-24, 02:11
Re: Unable to crawl a series of pages in tutorial - Nutch - [mail # user]
...The DmozParser should have created a flat file similar to a bootstrap file which you can inject. The flat file should be inside a the dmoz directory (if you've followed the tutorial). Please...
   Author: Lewis John Mcgibbney, 2013-04-24, 02:09
Re: Crawling and Hadoop problem - Nutch - [mail # user]
...Put simply the generated job Java archive file should contain every thing required to run your Nutch crawls on an Existing cluster. If it does not contain everything then there is a problem ...
   Author: Lewis John Mcgibbney, 2013-04-23, 22:05
Re: Any way to run tasks after Nutch is done executing? - Nutch - [mail # user]
...Hi Yves, We advise to use this script and modify it for your own needs http://svn.apache.org/repos/asf/nutch/trunk/src/bin/crawl hth Lewis   On Tue, Apr 23, 2013 at 12:52 PM, Yves S. Ga...
   Author: Lewis John Mcgibbney, 2013-04-23, 20:25
Re: Any way to run tasks after Nutch is done executing? - Nutch - [mail # user]
...Just write a crawl script? Effectively that's all the crawl script is, just chaining together logical tasks. The one provided with Nutch is not intended to be a be a one size fits all soluti...
   Author: Lewis John Mcgibbney, 2013-04-23, 19:30
Re: Nutch 2 hanging after aborting hung threads - Nutch - [mail # user]
...can you please give examples of the files which were truncated? thank you Lewis  On Tuesday, April 23, 2013, Bai Shen  wrote: [EMAIL PROTECTED] works works  They're http://sta...
   Author: Lewis John Mcgibbney, 2013-04-23, 15:49
Sort:
project
Nutch (1181)
Tika (11)
Lucene (4)
Solr (3)
type
mail # user (855)
mail # dev (326)
date
last 7 days (21)
last 30 days (67)
last 90 days (170)
last 6 months (359)
last 9 months (1181)
author
Markus Jelsma (1783)
Lewis John Mcgibbney (1181)
Julien Nioche (817)
Mattmann, Chris A (406)
lewis john mcgibbney (336)
Andrzej Bialecki (302)
Ferdy Galema (229)
Tejas Patil (218)
Bai Shen (177)
kiran chitturi (165)
Sebastian Nagel (164)
alxsss@...)
remi tassing (133)
Lewis John McGibbney (129)
Gabriele Kahlout (115)