| clear query|facets|time |
Search criteria: .
Results from 1 to 10 from
164 (0.202s).
|
|
|
Loading phrases to help you refine your search...
|
|
Re: Nutch not passing latest CrawlDatum to IndexingFilter plugin - Nutch - [mail # user]
|
|
...Hi Liaokz, No, or only partially: - multiple CrawlDatums are merged: determine new status, fetch time, etc. It is not that the last datum is just written into CrawlDb. &n...
|
|
|
Author: Sebastian Nagel,
2013-06-18, 21:41
|
|
|
Re: PluginRuntimeException ClassNotFound for ParseFilter plugin in Nutch 2.2 ? - Nutch - [mail # user]
|
|
...Hi Tony, you have to "register" your plugin in src/plugin/build.xml Does your src/plugin/myplugin/plugin.xml properly propagate jar file, extension point and implemen...
|
|
|
Author: Sebastian Nagel,
2013-06-12, 20:01
|
|
|
Re: Suffix URLFilter not working - Nutch - [mail # user]
|
|
...Hi Peter, please do not hijack threads. Seed URLs must be fully specified including protocol, e.g.: http://nutch.apache.org/ but not apache.org Sebastian ...
|
|
|
Author: Sebastian Nagel,
2013-06-12, 19:54
|
|
|
Re: IndexWriter Plugin Workflow - Nutch - [mail # user]
|
|
...Hi, Have a look at NUTCH-1527 and NUTCH-1541. with argument "name" = "commit" Intuitively, update resp. delete documents which are already in the index Delete is used, e.g., to b...
|
|
|
Author: Sebastian Nagel,
2013-06-12, 19:50
|
|
|
Re: Fwd: Nutch Compilation Error with Eclipse - Nutch - [mail # dev]
|
|
...Hi Tejas, you should be able to add images as "Attachments": there is a tab/link left of "More Actions:". Cheers, Sebastian On 06/11/2013 01:30 AM, Tejas Patil wrote:...
|
|
|
Author: Sebastian Nagel,
2013-06-11, 19:36
|
|
|
Re: [DISCUSS] Nutch 1.7 ready for release? - Nutch - [mail # dev]
|
|
...+1 go ahead! Sebastian On 06/08/2013 11:53 PM, Lewis John Mcgibbney wrote:...
|
|
|
Author: Sebastian Nagel,
2013-06-09, 12:05
|
|
|
Re: [VOTE] Apache Nutch 2.2 Release Candidate - Nutch - [mail # dev]
|
|
...+1 (test with hbase) On 06/01/2013 01:17 AM, lewis john mcgibbney wrote:...
|
|
|
Author: Sebastian Nagel,
2013-06-04, 20:30
|
|
|
Re: Fetcher corrupting some segments - Nutch - [mail # user]
|
|
...Hi Markus, a similar problem was posted some time ago: http://lucene.472066.n3.nabble.com/NegativeArraySizeException-and-quot-problem-advancing-port-rec-quot-during-fetching-tt39...
|
|
|
Author: Sebastian Nagel,
2013-05-27, 21:03
|
|
|
fix version 1.7 removed in Jira - Nutch - [mail # dev]
|
|
...Hi, please take care not to remove the fix version when applying bulk changes, e.g., 2.2 => 2.3 Alternative fix versions (1.7) are not kept. Luckily Jira is quite powerful, I ...
|
|
|
Author: Sebastian Nagel,
2013-05-22, 07:27
|
|
|
Re: Unable to parse flv and epub file contents using nutch - Nutch - [mail # dev]
|
|
...No, you don't have to: the plugin parse-tika can parse .epub and .flv - see http://tika.apache.org/1.2/formats.html - test it, eg: % bin/nutch parsechecker http://.../book.epub ...
|
|
|
Author: Sebastian Nagel,
2013-05-13, 22:08
|
|
|
|