Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 203 (0.568s).
Loading phrases to help you
refine your search...
Re: Crawling just one particular page from a host - ManifoldCF - [mail # user]
...On 14.05.13 14.07, Karl Wright wrote:  It's unfortunately not possible to set a hopcount filter for just one  particular host unless I create a unique job for it. Good idea anyway,...
   Author: Erlend Garåsen, 2013-05-14, 12:19
Re: Crawling just one particular page from a host - ManifoldCF - [mail # user]
...On 14.05.13 13.49, Karl Wright wrote:  Yes, you are right. I'm just trying to find a simple way to crawl just  the starting page of a host and nothing else, i.e.: http://www.ibsen....
   Author: Erlend Garåsen, 2013-05-14, 12:06
Crawling just one particular page from a host - ManifoldCF - [mail # user]
...I just figured out that even though "Include only hosts matching seeds?"  is enabled, the web crawler continues to fetch everything from the host  "www.ibsen.uio.no" if I have plac...
   Author: Erlend Garåsen, 2013-05-14, 11:45
Re: [VOTE] Release Apache ManifoldCF 1.2, RC1 - ManifoldCF - [mail # dev]
...On 08.05.13 18.00, Erlend Gar�sen wrote:   I just want to inform that the job completed without any errors, so  CONNECTORS-682 seems to be resolved. Over 14000 documents crawled....
   Author: Erlend Garåsen, 2013-05-08, 18:44
Re: [VOTE] Release Apache ManifoldCF 1.2, RC1 - ManifoldCF - [mail # dev]
...+1  - Deployed and started a big crawl on Resin. - Ran:      ant uitest      ant doc - Built using Maven 3.0.4  I will withdraw my vote if CONNECTORS-...
   Author: Erlend Garåsen, 2013-05-08, 16:00
Re: Release status - ManifoldCF - [mail # dev]
...On 07.05.13 04.08, Karl Wright wrote:  OK. In case there *is* a MCF bug which causes any of these problems, we  have always the opportunity to run trunk in production mode.   ...
   Author: Erlend Garåsen, 2013-05-07, 11:04
Re: Release status - ManifoldCF - [mail # dev]
...On 28.04.13 23.27, Karl Wright wrote:   I have explained the problem in detail for my colleague who will get  back to work tomorrow when I'm leaving Norway. Unfortunately I don't &...
   Author: Erlend Garåsen, 2013-04-29, 09:30
Re: Timeout problems with web crawling - ManifoldCF - [mail # user]
...After I did an svn up and started a new crawl, I'm still getting a lot  of these. I will analyze futher by running EXPLAIN ANALYZE. The job  stops for some minutes and then continu...
   Author: Erlend Garåsen, 2013-04-25, 12:56
Re: Timeout problems with web crawling - ManifoldCF - [mail # user]
...Yes, and I will assign that issue to me, but possible reassign/unassign  it in case I do not get time before I'm flying to US.  Erlend  On 25.04.13 12.17, Karl Wright wrote: &...
   Author: Erlend Garåsen, 2013-04-25, 11:45
Re: Timeout problems with web crawling - ManifoldCF - [mail # user]
...Thanks Karl!  I have updated the ticket with the information you requested, but there  is an email delay at the moment from Jira (or this list).  Erlend  On 24.04.13 01.3...
   Author: Erlend Garåsen, 2013-04-24, 08:53
Sort:
project
ManifoldCF (203)
Nutch (15)
Solr (11)
type
mail # dev (106)
mail # user (78)
issue (19)
date
last 7 days (0)
last 30 days (15)
last 90 days (27)
last 6 months (67)
last 9 months (203)
author
Karl Wright (2202)
Erlend Garåsen (203)
karl.wright@...)
Piergiorgio Lucidi (167)
Shinichiro Abe (143)
Grant Ingersoll (130)
Jack Krupansky (122)
Ahmet Arslan (66)
Swapna Vuppala (58)
Farzad Valad (57)
Shigeki Kobayashi (54)
Hitoshi Ozawa (43)
Rohan.GPatil@...)
Maciej Liżewski (42)
Fuad Efendi (40)