|
|
+
Danicela nutch 2011-10-04, 10:03
+
Tim Pease 2011-10-04, 18:35
-
Re: Giving priority to seedsJulien Nioche 2011-10-06, 07:55
you can specify the score of a seed using the metadata while injecting with
nutch.score=xxxx see https://issues.apache.org/jira/browse/NUTCH-655 Julien On 4 October 2011 11:03, Danicela nutch <[EMAIL PROTECTED]> wrote: > Hi, > > I want to make a ScoringFilter plugin which will give priority to seeds > file. > > I mean, I have a crawdb and a seeds file with links, I set a topN=5 to > test, and I want that my seeds links are fetched first, before what I have > in the crawldb. > > For that, I tried to implement ScoringFilter methods, particularly > injectedScore(Text text, CrawlDatum cd), I made a 'cd.setScore(100f)'. The > score is correctly given but it's not used and in my 5 pages segment I don't > have these links. > > Maybe I made something wrong ? > > Thanks in advance. > -- * *Open Source Solutions for Text Engineering http://digitalpebble.blogspot.com/ http://www.digitalpebble.com +
Danicela nutch 2011-10-06, 10:10
|