Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 91 to 100 from 129 (0.086s).
Loading phrases to help you
refine your search...
[NUTCH-1361] Fix mishandling of malformed urls in generator job - Nutch - [issue]
...This relates to the handling of malformed urls within the Generator Mapper and Reducer. Currently we do not handle such cases. Is there scope here to extend this issue to 1.X trunk?...
http://issues.apache.org/jira/browse/NUTCH-1361    Author: Lewis John McGibbney, 2012-06-08, 15:06
[NUTCH-1372] Improve execution of normalisers - Nutch - [issue]
http://issues.apache.org/jira/browse/NUTCH-1372    Author: Lewis John McGibbney, 2012-05-22, 11:41
[NUTCH-1367] Port ParserChecker to Nutchgora - Nutch - [issue]
...This is such a great tool. It has come in handy so many times I would go blue in the face if I had to try and count. e.g. for (int i = 0; i < infinity; i++)I think you get the idea.  ...
http://issues.apache.org/jira/browse/NUTCH-1367    Author: Lewis John McGibbney, 2012-05-16, 11:32
[NUTCH-1362] Fix error handling of urls with empty fields - Nutch - [issue]
...Within o.a.n.util.TableUtil.reverseAppendSplits() a simple if (split.length > 0) block enables us to address this issue....
http://issues.apache.org/jira/browse/NUTCH-1362    Author: Lewis John McGibbney, 2012-05-12, 05:17
[NUTCH-1363] Make parsing in FetcherJob actually work. - Nutch - [issue]
...We know that parsing during fetching is not recommended, however for those that wish to dive into the abyss the functionality should be available. This issue will address this....
http://issues.apache.org/jira/browse/NUTCH-1363    Author: Lewis John McGibbney, 2012-05-10, 21:37
[NUTCH-1349] Make batchId explcit within debug logging and improve CLI - Nutch - [issue]
...I find this a pain when trying to locate the batchId of some urls which are skipped when going to the Solr index. My DEBUG log output gives me2012-05-03 20:44:55,268 DEBUG indexer.IndexerJob...
http://issues.apache.org/jira/browse/NUTCH-1349    Author: Lewis John McGibbney, 2012-05-09, 05:16
[NUTCH-1205] Upgrade gora modules to 0.2 in ivy/ivy.xml - Nutch - [issue]
...Although gora trunk is unstable, work is ongoing to get this fixed. For the time being, I think Nutchgora should use gora trunk as this will identify more vulnerabilities. I'll get the trivi...
http://issues.apache.org/jira/browse/NUTCH-1205    Author: Lewis John McGibbney, 2012-05-04, 05:19
[NUTCH-1189] add commented out default settings to gora.properties files - Nutch - [issue]
...This issues should have been dealt with as part of its parent issue, however I think as it is a fairly lareg task in itself, it needs to be done independently. The gora.properties file shoul...
http://issues.apache.org/jira/browse/NUTCH-1189    Author: Lewis John McGibbney, 2012-04-27, 06:37
[NUTCH-1333] Introduce AvroStore, DataFileAvroStore and Accumulo Datastore implementations - Nutch - [issue]
...This is to accomodate recent developments over @ Gora....
http://issues.apache.org/jira/browse/NUTCH-1333    Author: Lewis John McGibbney, 2012-04-15, 20:01
[NUTCH-1307] Improve formatting of ant targets for clearer project help - Nutch - [issue]
...This is a trivial formatting issue I will submit a patch shortly and fix it....
http://issues.apache.org/jira/browse/NUTCH-1307    Author: Lewis John McGibbney, 2012-03-09, 10:41
Sort:
project
Nutch (129)
Solr (1)
Tika (1)
type
issue (129)
date
last 7 days (0)
last 30 days (0)
last 90 days (9)
last 6 months (55)
last 9 months (129)
author
Markus Jelsma (1767)
Lewis John Mcgibbney (1125)
Julien Nioche (805)
Mattmann, Chris A (402)
lewis john mcgibbney (334)
Andrzej Bialecki (302)
Ferdy Galema (224)
Tejas Patil (164)
Bai Shen (163)
kiran chitturi (157)
Sebastian Nagel (156)
alxsss@...)
remi tassing (133)
Lewis John McGibbney (129)
Gabriele Kahlout (115)