Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 29 (0.226s).
Loading phrases to help you
refine your search...
crawling file system but with limits - Nutch - [mail # user]
...dear all, I crawl a directory with a lot of pdf files from my local system.  but my nutch crawling and indexing with solr only a part of this files. the result is:  2012-04-16 15:0...
   Author: alessio crisantemi, 2012-04-16, 14:04
Re: exclude some urls from crawling - Nutch - [mail # user]
...thank you remi for your precious help but i have another problem now: I can crawl and index my website, but when I search a query, I found any results ONLY if my query is contened to the tit...
   Author: alessio crisantemi, 2012-04-13, 15:42
exclude some urls from crawling - Nutch - [mail # user]
...Dear All, I try to exclude some urls of my website to the crawling process, but without success.  For exclude it, I add this code on my regex-urlfilter.txt file BEFORE to write the home...
   Author: alessio crisantemi, 2012-04-10, 20:01
Re: request about snippets (with attachement) - Nutch - [mail # user]
...thank you agin Lewis, but do you think that my strange content field it's for my cause? beacuse I disabled the indexing of about all field.  this is my schema:       &nbs...
   Author: alessio crisantemi, 2012-04-07, 22:06
Re: request about snippets (with attachement) - Nutch - [mail # user]
...may be it'd my cause with my schema? I chose for inex about only title, author and content.  can you help me for setting a parsefilter? thank you alessio  Il giorno 07 aprile 2012 ...
   Author: alessio crisantemi, 2012-04-07, 13:33
Re: request about snippets (with attachement) - Nutch - [mail # user]
...'horribly html'? that's a bad consstruct on my website or it's a no good result of my crawling?  Il giorno 07 aprile 2012 13:53, Lewis John Mcgibbney  ha scritto:  ...
   Author: alessio crisantemi, 2012-04-07, 13:23
Re: request about snippets (with attachement) - Nutch - [mail # user]
...no Lewis, I'm sorry for missunderstanding!   But I dont's know this link, beacause this row, it's a fixed raow on my website template. And also if i see the source code of my html home ...
   Author: alessio crisantemi, 2012-04-07, 11:21
Fwd: request about snippets (with attachement) - Nutch - [mail # user]
...or this:  http://pc-alessio:8983/*WoWSolrWebApp/search?query=gioco&submit=Search*   Da: alessio crisantemi  Date: 06 aprile 2012 22:42 Oggetto: Re: request about snippets (wit...
   Author: alessio crisantemi, 2012-04-06, 20:46
Re: request about snippets (with attachement) - Nutch - [mail # user]
...that's can be good? http://192.168.1.5:8983/WoWSolrWebApp/search?query=gioco&submit=Search Il giorno 06 aprile 2012 22:29, Lewis John Mcgibbney  ha scritto:  ...
   Author: alessio crisantemi, 2012-04-06, 20:42
Re: request about snippets (with attachement) - Nutch - [mail # user]
...any suggestions for my cause?  Il giorno 05 aprile 2012 23:20, alessio crisantemi  ha scritto:  ...
   Author: alessio crisantemi, 2012-04-06, 20:19
Sort:
project
Solr (34)
Nutch (29)
type
mail # user (29)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (29)
author
Markus Jelsma (1767)
Lewis John Mcgibbney (1125)
Julien Nioche (805)
Mattmann, Chris A (402)
lewis john mcgibbney (334)
Andrzej Bialecki (302)
Ferdy Galema (224)
Tejas Patil (164)
Bai Shen (162)
Sebastian Nagel (156)
kiran chitturi (155)
alxsss@...)
remi tassing (133)
Lewis John McGibbney (129)
Gabriele Kahlout (115)
alessio crisantemi