Results 141 to 150 of 1783 (0.169s).
[NUTCH-1151] Index-anchor to add numInlinks count - Nutch - [issue]
...Issue to improve index-anchor by adding the number of inlinks per document. This count is useful for calculating some authority metric in the search server. T...
http://issues.apache.org/jira/browse/NUTCH-1151    Author: Markus Jelsma, 2012-12-06, 14:53
[NUTCH-1194] CrawlDB lock should be released earlier - Nutch - [issue]
...Lock on the CrawlDB is released when everything is finished. But when generating many segments, the lock remains in place even though it is no longer necessary. If GENERATE_UPDATE_DB is false we...
http://issues.apache.org/jira/browse/NUTCH-1194    Author: Markus Jelsma, 2012-12-06, 14:53
[NUTCH-1202] Fetcher timebomb kills long waiting fetch jobs - Nutch - [issue]
...The timebomb feature kills off mappers of jobs that have been waiting too long in the job queue. The timebomb feature should start at mapper initialization instead, not in job init. Thoughts?...
http://issues.apache.org/jira/browse/NUTCH-1202    Author: Markus Jelsma, 2012-12-06, 14:53
[NUTCH-1377] Add option to index via CloudSolrServer instead - Nutch - [issue]
...Nutch indexes to a specific Solr server. With SolrCloud on its way we can still use the current indexer and point to any server. However, the CloudSolrServer can connect to ZooKeeper instead...
http://issues.apache.org/jira/browse/NUTCH-1377    Author: Markus Jelsma, 2012-12-06, 14:53
[NUTCH-1034] Create Solr Velocity templates - Nutch - [issue]
...Solr has Velocity integration and provides an easy method for creating HTML based front-ends for the search engine. This issue tracks the development of Velocity templates specifically for N...
http://issues.apache.org/jira/browse/NUTCH-1034    Author: Markus Jelsma, 2012-12-06, 14:53
[NUTCH-1103] Port protocol-sftp to 1.4 - Nutch - [issue]
...Port protocol-sftp from trunk back to 1.4...
http://issues.apache.org/jira/browse/NUTCH-1103    Author: Markus Jelsma, 2012-12-06, 14:53
[NUTCH-1147] WebGraph nodeDumper uses only 1 reducer - Nutch - [issue]
...The nodeDumper is restricted to only one reducer, making it slow and producing files that are too large....
http://issues.apache.org/jira/browse/NUTCH-1147    Author: Markus Jelsma, 2012-12-06, 14:53
[NUTCH-1226] Migrate CrawlDbReader to MapReduce API - Nutch - [issue]
...Hadoop 0.21 only!...
http://issues.apache.org/jira/browse/NUTCH-1226    Author: Markus Jelsma, 2012-12-06, 14:53
RE: Fetcher hangs for a long time - Nutch - [mail # user]
...Are you also storing content (yes by default) and are your pages very large? It seems everything is alright but the merge and reduce phases of the fetcher just take a long time. This is norma...
   Author: Markus Jelsma, 2012-12-05, 18:39
RE: fetcher partitioning - Nutch - [mail # user]
...Indeed, you have to modify the partitioner to make this happen. You are free to do so but you can also make it pluggable as fetch schedule via config and provide a patch...
   Author: Markus Jelsma, 2012-12-05, 18:37