Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 164 (0.202s).
Loading phrases to help you
refine your search...
Re: Nutch not passing latest CrawlDatum to IndexingFilter plugin - Nutch - [mail # user]
...Hi Liaokz,  No, or only partially: - multiple CrawlDatums are merged:   determine new status, fetch time, etc.   It is not that the last datum is just written into CrawlDb. &n...
   Author: Sebastian Nagel, 2013-06-18, 21:41
Re: PluginRuntimeException ClassNotFound for ParseFilter plugin in Nutch 2.2 ? - Nutch - [mail # user]
...Hi Tony,  you have to "register" your plugin in  src/plugin/build.xml  Does your  src/plugin/myplugin/plugin.xml properly propagate jar file, extension point and implemen...
   Author: Sebastian Nagel, 2013-06-12, 20:01
Re: Suffix URLFilter not working - Nutch - [mail # user]
...Hi Peter,  please do not hijack threads.  Seed URLs must be fully specified including protocol, e.g.:  http://nutch.apache.org/ but not  apache.org  Sebastian  ...
   Author: Sebastian Nagel, 2013-06-12, 19:54
Re: IndexWriter Plugin Workflow - Nutch - [mail # user]
...Hi,  Have a look at NUTCH-1527 and NUTCH-1541.  with argument "name" = "commit" Intuitively, update resp. delete documents which are already in the index Delete is used, e.g., to b...
   Author: Sebastian Nagel, 2013-06-12, 19:50
Re: Fwd: Nutch Compilation Error with Eclipse - Nutch - [mail # dev]
...Hi Tejas,  you should be able to add images as "Attachments": there is a tab/link left of "More Actions:".  Cheers, Sebastian  On 06/11/2013 01:30 AM, Tejas Patil wrote:...
   Author: Sebastian Nagel, 2013-06-11, 19:36
Re: [DISCUSS] Nutch 1.7 ready for release? - Nutch - [mail # dev]
...+1 go ahead!  Sebastian  On 06/08/2013 11:53 PM, Lewis John Mcgibbney wrote:...
   Author: Sebastian Nagel, 2013-06-09, 12:05
Re: [VOTE] Apache Nutch 2.2 Release Candidate - Nutch - [mail # dev]
...+1 (test with hbase)  On 06/01/2013 01:17 AM, lewis john mcgibbney wrote:...
   Author: Sebastian Nagel, 2013-06-04, 20:30
Re: Fetcher corrupting some segments - Nutch - [mail # user]
...Hi Markus,  a similar problem was posted some time ago:  http://lucene.472066.n3.nabble.com/NegativeArraySizeException-and-quot-problem-advancing-port-rec-quot-during-fetching-tt39...
   Author: Sebastian Nagel, 2013-05-27, 21:03
fix version 1.7 removed in Jira - Nutch - [mail # dev]
...Hi,  please take care not to remove the fix version when applying bulk changes, e.g., 2.2 => 2.3 Alternative fix versions (1.7) are not kept.  Luckily Jira is quite powerful, I ...
   Author: Sebastian Nagel, 2013-05-22, 07:27
Re: Unable to parse flv and epub file contents using nutch - Nutch - [mail # dev]
...No, you don't have to: the plugin parse-tika can parse .epub and .flv - see http://tika.apache.org/1.2/formats.html - test it, eg:   % bin/nutch parsechecker http://.../book.epub  ...
   Author: Sebastian Nagel, 2013-05-13, 22:08
Sort:
project
Nutch (164)
Tika (1)
type
mail # user (95)
mail # dev (42)
issue (27)
date
last 7 days (1)
last 30 days (9)
last 90 days (26)
last 6 months (53)
last 9 months (164)
author
Markus Jelsma (1783)
Lewis John Mcgibbney (1183)
Julien Nioche (817)
Mattmann, Chris A (406)
lewis john mcgibbney (337)
Andrzej Bialecki (302)
Ferdy Galema (229)
Tejas Patil (219)
Bai Shen (177)
kiran chitturi (165)
Sebastian Nagel (164)
alxsss@...)
remi tassing (133)
Lewis John McGibbney (129)
Gabriele Kahlout (115)