Results from 61 to 70 of 133 (1.12s).
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
...It could be interesting finding out what exactly causes such huge speed difference. For me the speed increase is on the 10x order...crazy!  On Wed, Feb 15, 2012 at 9:35 PM, Markus Jelsm...
   Author: remi tassing, 2012-02-15, 19:56
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
...You're both correct, after changing the type for tstamp and lastModified from long to date, no error anymore.  Next thing I need to do is setup cygwin/svn to be able to get fresh svn/tr...
   Author: remi tassing, 2012-02-15, 19:36
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
...Awesome!  Pushing this to Solr gives me an error (solrindex): SEVERE: java.lang.NumberFormatException: For input string: "2012-02-08T14:40:09.416Z"         at java.l...
   Author: remi tassing, 2012-02-15, 19:01
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
...Is there any quick way to see the impact of index-more?  I deleted the parse related folders in the segment and re-parsed it but when I readseg there is no difference....  On Wednesda...
   Author: remi tassing, 2012-02-15, 18:18
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
...Hi,  tstamp shows a string of digits like 20020123123212  Never heard of the plugin "index-more" and it's poorly documented. After adding this to plugins.include, I'll need to run ...
   Author: remi tassing, 2012-02-15, 16:00
Re: tstamp vs. lastModified ... - Nutch - [mail # user]
...Hey Lewis,  Thanks for the clarification!  For tstamp, I can actually see it in Solr results (even though the format is weird)  How can I get Last-Modified value in Solr as w...
   Author: remi tassing, 2012-02-15, 13:51
Re: how are CSV/TXT files handled - Nutch - [mail # user]
...Hi,  Tika is parsing properly, I think it was some kind of proxy issue and also the http.content.limit.  Thanks!  Remi  On Fri, Feb 10, 2012 at 11:16 PM, Lewis John Mcgib...
   Author: remi tassing, 2012-02-15, 13:33
tstamp vs. lastModified ... - Nutch - [mail # user]
...Hello all,  What does tstamp represent? I can see it shown in Solr results after indexing.  I'm interested in showing the "last modified" meta-data in Solr results but I'm not sure if ...
   Author: remi tassing, 2012-02-15, 13:26
Re: Failed fetching - Nutch - [mail # user]
...I just used protocol-http and it works!  It's probably a configuration issue. You can download a clean version and start afresh  Remi  On Wed, Feb 15, 2012 at 3:46 AM, tiagorc...
   Author: remi tassing, 2012-02-15, 09:50
From Nutch 1.2 to 1.4 - Nutch - [mail # user]
...Hi,  1. Freegen won't keep the db_fetched and db_unfetched info, right? 2. I think it works. My seed was one URL, the first crawl was a redirection, second crawling one page, 3rd onward...
   Author: remi tassing, 2012-02-14, 18:09