Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Plain View
Lucene, mail # user - Relevancy Practices


+
Grant Ingersoll 2010-04-29, 14:14
+
Mark Bennett 2010-04-29, 14:59
Copy link to this message
-
Re: Relevancy Practices
Avi Rosenschein 2010-04-30, 12:00
On Thu, Apr 29, 2010 at 5:59 PM, Mark Bennett <[EMAIL PROTECTED]> wrote:

> Hi Grant,
>
> You're welcome to use any of my slides (Dave's got them), with attribution
> of course.
>
> BUT....
>
> Have you considered a section something like "why the hell do you think
> Relevancy tweaking is gonna save you!?!?"
> Basically that, as a corpus grows exponentially, so do results list sizes,
> so ALL relevancy tweaks will eventually fail.  And FACETS (or other
> navigators) are the answer.  I've got slides on that as well.
>

The idea is to get the relevancy to fail on a smaller and smaller percent of
the queries, as the corpus grows larger. Facets can definitely help, but
they don't solve the basic problem of search, when there is no facet for the
particular way the user is looking for something. The strength of search is
that it can help the user to find things even when other forms of navigation
fail.

Of course relevancy matters.... but it's only ONE of perhaps a three pronged
> approach:
> 1: Organic Relevancy and top query suggetions
> 2: Results list Navigators, the best the system can support, and
> 3: Data quality (spidering, METADATA quality, source weighting, etc)
>

I would prefer to say that data quality can directly contribute to relevance
(besides being important for other reasons as well). Basically, search
relevancy is a combination of quality of data + quality of algorithm. In
general, they are both important, and data even has the potential to be more
important than algorithm, if you structure it right.

Also, tuning the algorithms to the users can be very important. For
instance, we have found that in a basic search functionality, the default
query parser operator OR works very well. But on a page for advanced users,
who want to very precisely tune their search results, a default of AND works
better.

Regards,
-- Avi
+
Grant Ingersoll 2010-04-30, 15:21
+
MitchK 2010-04-30, 21:16
+
Avi Rosenschein 2010-05-02, 12:50
+
Grant Ingersoll 2010-05-05, 14:08
+
Avi Rosenschein 2010-05-05, 16:40
+
Ivan Provalov 2010-05-04, 00:41
+
Peter Keegan 2010-05-03, 18:08
+
Grant Ingersoll 2010-05-05, 14:10
+
Peter Keegan 2010-05-05, 15:31
+
Fornoville, Tom 2010-04-29, 14:38