| clear query|facets|time |
Search criteria: .
Results from 61 to 70 from
133 (1.12s).
|
|
|
Loading phrases to help you refine your search...
|
|
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
|
|
...It could be interesting finding out what exactly causes such huge speed difference. For me the speed increase is on the 10x order...crazy! On Wed, Feb 15, 2012 at 9:35 PM, Markus Jelsm...
|
|
|
Author: remi tassing,
2012-02-15, 19:56
|
|
|
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
|
|
...You're both correct, after changing the type for tstamp and lastModified from long to date, no error anymore. Next thing I need to do is setup cygwin/svn to be able to get fresh svn/tr...
|
|
|
Author: remi tassing,
2012-02-15, 19:36
|
|
|
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
|
|
...Awesome! Pushing this to Solr gives me an error (solrindex): SEVERE: java.lang.NumberFormatException: For input string: "2012-02-08T14:40:09.416Z" at java.l...
|
|
|
Author: remi tassing,
2012-02-15, 19:01
|
|
|
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
|
|
...Is it any quick way to see the impact of index-more? I deleted the parse related folders in the segment and re-parsed it but when I readseg there is no.difference.... On Wednesda...
|
|
|
Author: remi tassing,
2012-02-15, 18:18
|
|
|
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
|
|
...Hi, tstamp shows a string of digits like 20020123123212 Never heard of the plugin "index-more" and it's poorly documented. After adding this to plugins.include, I'll need to run ...
|
|
|
Author: remi tassing,
2012-02-15, 16:00
|
|
|
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
|
|
...Hey Lewis, Thanks for the clarification! For tstamp, I can actually see it in Solr results (even thought the format is weird) How can I get Last-Modified value in Solr as w...
|
|
|
Author: remi tassing,
2012-02-15, 13:51
|
|
|
Re: how are CSV/TXT files handled - Nutch - [mail # user]
|
|
...Hi, Tika is parsing properly, I think it was some kind of proxy issue and also the http.content.limit. Thanks! Remi On Fri, Feb 10, 2012 at 11:16 PM, Lewis John Mcgib...
|
|
|
Author: remi tassing,
2012-02-15, 13:33
|
|
|
tstamp vs. lastModified ... - Nutch - [mail # user]
|
|
...Hello all, What does tstamp represent? I can we shown in Solr results after indexing. I'm interested in showing the "last modified" meta-data in Solr results but I'm not sure if ...
|
|
|
Author: remi tassing,
2012-02-15, 13:26
|
|
|
Re: Failed fetching - Nutch - [mail # user]
|
|
...I just used protocol-http and it works! It's probably a configuration issue. You can download a clean version and start afresh Remi On Wed, Feb 15, 2012 at 3:46 AM, tiagorc...
|
|
|
Author: remi tassing,
2012-02-15, 09:50
|
|
|
From Nutch 1.2 to 1.4 - Nutch - [mail # user]
|
|
...Hi, 1. Freegen won't keep.the db_fetched and db_unfetched info, right? 2. I think it works. My seed was one URL, the first crawl was a redirection, second crawling one page, 3rd onward...
|
|
|
Author: remi tassing,
2012-02-14, 18:09
|
|
|
|