| clear query|facets|time |
Search criteria: .
Results from 21 to 30 from
17012 (0.148s).
|
|
|
Loading phrases to help you refine your search...
|
|
Re: DBUpdateJob failed - Exception job failed: name=update-table, - Nutch - [mail # user]
|
|
...Hi Tony, Which gora backend are you on, including the version of the backend itself please? I use Gora 0.3 with gora-cassandra on some cron jobs and injected your URLs into my db. All works ...
|
|
|
Author: Lewis John Mcgibbney,
2013-06-17, 18:21
|
|
|
Re: Incomplete HTML content of a crawled Page in ParseFilter ? - Nutch - [mail # user]
|
|
...Yes after running the same code on different server machine , issue was resolved ( or disappeared :) ) Thanks alot guyz for your help & support. Tony. On Mon, Jun 17, 2013...
|
|
|
Author: Tony Mullins,
2013-06-17, 17:09
|
|
|
DBUpdateJob failed - Exception job failed: name=update-table, - Nutch - [mail # user]
|
|
...Hi , I am getting weird error on DBUpdater Job in Nutch2.x. I am crawling these two links http://www.amazon.com/Degree-Antiperspirant-Deodorant-Extreme-Blast/dp/B001ET769Y http:/...
|
|
|
Author: Tony Mullins,
2013-06-17, 17:08
|
|
|
Re: Incomplete HTML content of a crawled Page in ParseFilter ? - Nutch - [mail # user]
|
|
...I've experienced a similar issue on my development station running Mac 10.8 but the same code worked perfectly on my server VM running ubuntu, so no jira was created in the end. Also, in my ...
|
|
|
Author: Ing. Jorge Luis Betancour...,
2013-06-17, 14:17
|
|
|
Re: How to define depth level on crawling time - Nutch - [mail # user]
|
|
...Maybe you can use bin/crawl script and define the numberOfRounds like depth in Nutch 1.x. On Mon, Jun 17, 2013 at 7:03 PM, Jamshaid Ashraf wrote: Don't Grow Old, Grow Up...
|
|
|
Author: feng lu,
2013-06-17, 14:16
|
|
|
Re: Incomplete HTML content of a crawled Page in ParseFilter ? - Nutch - [mail # user]
|
|
...Hi Tony As Coskun said that you can set http.content.limit to -1, default is 65536, not file.content.limit property. http.content.limit 65536 The length lim...
|
|
|
Author: feng lu,
2013-06-17, 14:10
|
|
|
Re: Solrindex job failed ! - Nutch - [mail # user]
|
|
...I am also facing the same error. The nutch log says "ERROR solr.SolrIndexer - java.io.IOException: Job failed!" and solr log says "SEVERE: org.apache.solr.common.SolrException: Document is m...
|
|
|
Author: kamal11,
2013-06-17, 13:35
|
|
|
Re: Incomplete HTML content of a crawled Page in ParseFilter ? - Nutch - [mail # user]
|
|
...html in my previous email was incorrect ( I was trying different dns thinking its due to bad internet) ... but in short I am getting incomplete html response.... Is there any property ...
|
|
|
Author: Tony Mullins,
2013-06-17, 11:58
|
|
|
Re: Incomplete HTML content of a crawled Page in ParseFilter ? - Nutch - [mail # user]
|
|
...I have modified these values as http.timeout *20000* The default network timeout, in milliseconds. file.content.limit *-1* The l...
|
|
|
Author: Tony Mullins,
2013-06-17, 11:43
|
|
|
Re: Incomplete HTML content of a crawled Page in ParseFilter ? - Nutch - [mail # user]
|
|
...Hi Tony, You may need to add http.content.limit parameter in nutch-site.xml file. for size-unlimited crawling: http.content.limit *-1* &n...
|
|
|
Author: H. Coskun Gunduz,
2013-06-17, 11:13
|
|
|
|