| clear query|facets|time |
Search criteria: .
Results from 141 to 150 from
1783 (0.169s).
|
|
|
Loading phrases to help you refine your search...
|
|
[NUTCH-1151] Index-anchor to add numInlinks count - Nutch - [issue]
|
|
...Issue to improve in index-anchor to add the number of inlinks per document. This count is useful for calculating some authority metric in the search server.T...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1151
Author: Markus Jelsma,
2012-12-06, 14:53
|
|
|
[NUTCH-1194] CrawlDB lock should be released earlier - Nutch - [issue]
|
|
...Lock on the CrawlDB is released when everything is finished. But when generating many segments, the lock remains in place while it's not neccessary anymore. If GENERATE_UPDATE_DB is false we...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1194
Author: Markus Jelsma,
2012-12-06, 14:53
|
|
|
[NUTCH-1202] Fetcher timebomb kills long waiting fetch jobs - Nutch - [issue]
|
|
...The timebomb feature kills of mappers of jobs that have been waiting too long in the job queue. The timebomb feature should start at mapper initialization instead, not in job init.Thoughts?...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1202
Author: Markus Jelsma,
2012-12-06, 14:53
|
|
|
[NUTCH-1377] Add option to index via CloudSolrServer instead - Nutch - [issue]
|
|
...Nutch indexes to a specific Solr server. With SolrCloud on its way we can still use the current indexer and point to any server. However, the SolrCloudServer can connect to ZooKeeper instead...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1377
Author: Markus Jelsma,
2012-12-06, 14:53
|
|
|
[NUTCH-1034] Create Solr Velocity templates - Nutch - [issue]
|
|
...Solr has Velocity integration and provides an easy method for creating HTML based front-ends for the search engine. This issue tracks the development of Velocity templates specifically for N...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1034
Author: Markus Jelsma,
2012-12-06, 14:53
|
|
|
[NUTCH-1103] Port protocol-sftp to 1.4 - Nutch - [issue]
|
|
...Port protocol-sftp from trunk back to 1.4...
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1103
Author: Markus Jelsma,
2012-12-06, 14:53
|
|
|
[NUTCH-1147] WebGraph nodeDumper uses only 1 reducer - Nutch - [issue]
|
|
...The noderDumper is restricted to only one reducer, making it slow and producing too large files....
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1147
Author: Markus Jelsma,
2012-12-06, 14:53
|
|
|
[NUTCH-1226] Migrate CrawlDbReader to MapReduce API - Nutch - [issue]
|
|
|
|
http://issues.apache.org/jira/browse/NUTCH-1226
Author: Markus Jelsma,
2012-12-06, 14:53
|
|
|
RE: Fetcher hangs for a long time - Nutch - [mail # user]
|
|
...Are you also storing content (yes by default) and are your pages very large? It seems everything is alright but the merge and reduce phase of the fetcher just take a long time. This is norma...
|
|
|
Author: Markus Jelsma,
2012-12-05, 18:39
|
|
|
RE: fetcher partitioning - Nutch - [mail # user]
|
|
... Indeed, you have to modify the partitioner to make this happen. You are free to do so but you can also make it pluggable as fetch schedule via config and provide a patch...
|
|
|
Author: Markus Jelsma,
2012-12-05, 18:37
|
|
|
|