| clear query|facets|time |
Search criteria: relevancy computation.
Results from 71 to 80 from
485 (6.84s).
|
|
|
Did you mean:
|
|
Loading phrases to help you refine your search...
|
|
ScoresAsPercentages - Lucene - Lucene - [wiki]
|
|
...
Scores As Percentages
People frequently want to compute a "Percentage" from Lucene scores to determine what is a "100% perfect" match vs a "50%" match. This is also somethings called...
|
|
... didn't suddenly become 7
times more relevant to your query then it was 5 minutes ago.
you might say: "i'm never going to delete documents", but are you ever
going to add documents? because...
|
[+ show more]
[- hide]
| ... are meaningful only for purposes of comparison between other
documents for the exact same query and the exact same index. when you try
to compute a percentage, you are setting up an implicit comparison... |
| ... with
scores from other queries.
In some other threads, the topic of computing a percentage from the maximum *possible* score has been discussed, but an approach like this would pose additional... |
| ... problems...
http://www.nabble.com/theoretical-maximum-score-to17147485.html#a17285093
: Is it possible to compute a theoretical maximum score for a given query if
: constraints are placed on 'tf... |
|
|
http://wiki.apache.org/lucene-java/ScoresAsPercentages
Author: localhost,
2009-09-20, 21:47
|
|
|
NutchHadoopTutorial0.8 - Nutch - [wiki]
|
|
... does not go into the Nutch or Hadoop architecture. It only tells how to get the systems up and running. At the end of the tutorial though I will point you to relevant resources if you want...
|
|
... Nutch 0.8 Dev Revision 385702, and may not be compatible with future releases of either Nutch or Hadoop.
Five, for this tutorial we setup nutch across 6 different computers. If you are using...
|
[+ show more]
[- hide]
| ... improve this tutorial for others.
Our Network Setup
First let me layout the computers that we used in our setup. To setup Nutch and Hadoop we had 7 commodity computers ranging from 750Mghz... |
| ... to 1.0 Ghz. Each computer had at least 128 Megs of RAM and at least a 10 Gigabyte hard drive. One computer had dual 750 Mghz CPUs and another had dual 30 Gigabyte hard drives. All... |
| ... of these computers were purchasedfor under $500.00 at a liquidation sale. I am telling you this to let you know that you don't have to have big hardware to get up and running with Nutch and Hadoop. Our... |
|
|
http://wiki.apache.org/nutch/NutchHadoopTutorial0.8
Author: AlexMc,
2010-06-12, 09:42
|
|
|
Re: Enhancing Solr relevance functions through predefined constants - Solr - [mail # user]
|
|
... FWIW: number of unique terms in a field is reall, really, expensive to compute (although perhaps we could cache it somewhere) -Hoss ...
|
|
|
Author: Chris Hostetter,
2010-06-01, 18:57
|
|
|
Re: Enhancing Solr relevance functions through predefined constants - Solr - [mail # user]
|
|
... once we have some real use cases for it. The number of unique terms (and other similar metrics) is pretty much a query-time constant and we can have it optionally computed and then cached...
|
|
|
Author: Prasanna R,
2010-06-01, 20:06
|
|
|
NutchDistributedFileSystem - Nutch - [wiki]
|
|
...-oriented files over a set of commodity computers. Files are replicated across machines for safety, and load is balanced fairly across the machine set.
The NDFS fills an important hole for the Nutch...
|
|
... the client to retrieve the file
The system should have replicated the relevant blocks, making the data still available in step 6.
If you want to read/write programmatically, use the API exposed...
|
|
|
http://wiki.apache.org/nutch/NutchDistributedFileSystem
Author: localhost,
2009-09-20, 23:09
|
|
|
Re: Relevance's scores on TopFieldCollector/FieldComparator - Lucene - [mail # dev]
|
|
...'t have to re-compute it. I don't understand why you were always seeing 0 score come back from it. Mike On Thu, Jul 9, 2009 at 9:09 AM, Raimon Bosch wrote: on...
|
|
|
Author: Michael McCandless,
2009-07-13, 10:10
|
|
|
Re: Boosting relevance as terms get nearer to each other - Solr - [mail # user]
|
|
... that is used. If I make a (Standard) query like a AND b AND c AND "a b c"~1000000 does that imply that during the computation of the score for "a b c"~1000000, sloppyFreq() will be called? That...
|
|
|
Author: Michael,
2009-08-17, 14:41
|
|
|
Re: Relevancy, Phrase Boosting, Shingles and Long Tail Curves - Lucene - [mail # dev]
|
|
...@lucene.apache.org Sent: Fri, 10 September, 2010 1 8:44:31 Subject: Re: Relevancy, Phrase Boosting, Shingles and Long Tail C urves Thanks Mark H, Maybe I'll look at MLT (More Like This) agai n. I'll also check...
|
|
... normally c onvey 60% of relevancy in general text, Q&A content is skewed a bit more towards verbs. 2: Questions may contain more noise words (though perhaps in useful groupings) 3: Vocabulary...
|
[+ show more]
[- hide]
| .... Still waiting for the relevancy math nerds to chime in about the log-log and IDF stuff ... ;-) I was thinking a bit more about the math involved here.... What is the "best practices... |
| ... Mark sation work where my hold more information value ies on seperate terms will not give (in terms of IDF) - or if they did compute e lots of disk IO to do this. Shingles present... |
| ... apache.org Phrase Boosting, Shingles and Long Tail Curves he relevancy of some Question and Answer content. I'm using , Dismax, and I'm already a fan of Phrase Boosting and have cranked t up... |
|
|
Author: mark harwood,
2010-09-11, 15:11
|
|
|
Re: Relevancy, Phrase Boosting, Shingles and Long Tail Curves - Lucene - [mail # dev]
|
|
... and that phrase queries on sepe rate terms will not give these term combos their true reward (in terms o f IDF) - or if they did compute the true IDF, would require lots of dis k IO to do this...
|
|
... in Lucene indexes.. Ch eers Mark ________________________________ From: Mark Bennett To: [EMAIL PROTECTED] Sent: Fri, 10 Sept ember, 2010 1:42:11 Subject: Relevancy, Phrase Boosting...
|
[+ show more]
[- hide]
| ..., Shingles and Lon g Tail Curves I want to boost the relevancy of some Question and Answe r content. I'm using stop words, Dismax, and I'm already a fan of Phrase Boosting and have cranked... |
| ... that up a bit. But I'm considering using long Shingles to make use of some of the normally stopped out "junk words" i n the content to help relevancy further. Reminder: "Shingles" are arti... |
| ...-is, is-a, a-sentence, th is-is-a, is-a-sentence A few questions on relevance and shingles: 1: How similar are the relevancy calculations compare between Shingles a nd exact phrases? I... |
|
|
Author: mark harwood,
2010-09-10, 10:17
|
|
|
Re: [VOTE] Make the Open Relevance Project (ORP) and official Lucene subproject - Lucene - [mail # general]
|
|
... can find more relevant documents, the better they can prepare their patent app etc. etc. Not to mention the patent offices, patent attorneys. Lucrative ;) 4) It's not hard to find expert...
|
|
... judgements and maintain a user group which could really focus and give devotion to generate relevance judgements (compared to a nonsense, old news collection). Cheers, Murat Yakici Department...
|
[+ show more]
[- hide]
| ... of Computer & Information Sciences University of Strathclyde Glasgow, UK The University of Strathclyde is a charitable body, registered in Scotland, with registration number SC015263. ... |
|
|
Author: Murat Yakici,
2009-05-29, 20:39
|
|
|
|