| clear query|facets|time |
Search criteria: .
Results from 1 to 10 from
129 (0.84s).
|
|
|
Loading phrases to help you refine your search...
|
|
[NUTCH-1545] capture batchId and remove references to segments in 2.x crawl script. - Nutch - [issue]
|
|
...The concept of segment is replaced by batchId in 2.xI'm currently getting rid of segments references in 2.xThis issue was flagged up and separate from NUTCH-1532 which I am working on....
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1545
Author: Lewis John McGibbney,
2013-03-27, 15:56
|
|
|
[NUTCH-1548] Move all Utils classes into Utils packages & dedup Utils generally - Nutch - [issue]
|
|
...Currently we have IndexUtils, ProtocolStatusUtils and others hanging around within (IMHO) the wrong classes.We should move these to the utils packages.We also seem to be maintaining classes ...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1548
Author: Lewis John McGibbney,
2013-03-26, 19:37
|
|
|
[NUTCH-1532] Replace 'segment' mapping field with batchId - Nutch - [issue]
|
|
...As described here [0], the segment field in solr-mapping.xml should be replaced with the batchId. This reflects the different architecture in 2.x.[0] http://www.mail-archive....
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1532
Author: Lewis John McGibbney,
2013-03-26, 18:47
|
|
|
[NUTCH-1533] Implement getPrevModifiedTime(), setPrevModifiedTime(), getBatchId() and setBatchId() accessors in o.a.n.storage.WebPage - Nutch - [issue]
|
|
...NUTCH-1532 needs to obtain a batchId to add to NutchDocument prior to indexing. This is currently not available as we do not store the information in the WebPage. Additionally, we do not sto...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1533
Author: Lewis John McGibbney,
2013-03-26, 13:55
|
|
|
[NUTCH-1393] Display consistent usage of GeneratorJob with 1.X - Nutch - [issue]
|
|
...If we pass the generate argument to the nutch script, the Generator auto-spings into action and begins generating fetchlists. This should not be the case, instead it should print traditional...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1393
Author: Lewis John McGibbney,
2013-03-24, 22:44
|
|
|
[NUTCH-1373] Implement consistent execution of normalising and filtering in Generator - Nutch - [issue]
|
|
...As per discussion here [0] this issue should address the inconsistencies we see in the scheduled execution of normalising and filtering between Nutchgora Generator Mapper and trunk G...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1373
Author: Lewis John McGibbney,
2013-03-09, 03:57
|
|
|
[NUTCH-1540] Add Gora buffered read and write maximum limits to nutch-default.xml configuration. - Nutch - [issue]
|
|
...I've been experimenting by using this via the command line for some time. It is starting to annoy me, so I wanted to make this more accessible to us all.You can now easily set this in nutch-...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1540
Author: Lewis John McGibbney,
2013-03-07, 00:51
|
|
|
[NUTCH-1537] Legacy metadata package needs to take advantage of Apache Tika metadata package more. - Nutch - [issue]
|
|
...In Nutch, classes from the metadata package are being used in quite a number of places. It is not currently being used to reflect the work going on in Apache Tika and we need to better lever...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1537
Author: Lewis John McGibbney,
2013-03-02, 22:53
|
|
|
[NUTCH-1529] Port nutch-mongdb-parser to trunk - Nutch - [issue]
|
|
...The initial repos is here [0][0] https://github.com/ctjmorgan/nutch-mongdb-parser...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1529
Author: Lewis John McGibbney,
2013-03-01, 08:38
|
|
|
[NUTCH-1486] schema-solr4.xml does not work with Solr 4.1.0 - Nutch - [issue]
|
|
...When attempting to configure a 4 multicore 4.0 instance with Nutch schema-solr4.xml file, I get the following exceptions.This has been discussed previously. As I see it we have two options1....
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1486
Author: Lewis John McGibbney,
2013-02-19, 06:27
|
|
|
|