Search criteria: relevant computing.   Results from 61 to 70 from 488 (0.584s).
SummerOfCode2011ProjectRankingTerrier - Lucene - Lucene - [wiki]
...Return to main page A short overview of Terrier's scoring architecture Terrier is another Java-based, open source search engine developed at the School of Computing Science, University...
.... Also, even this idea is false. BM25, for instance, does not have its own idfBM25() method in Idf; it is computed in the BM25 class directly. Idf also has methods that compute the logarithm...
http://wiki.apache.org/lucene-java/SummerOfCode2011ProjectRankingTerrier    Author: DavidNemeskey, 2011-06-20, 12:51
NewScoring - Nutch - [wiki]
...-analysis to get a single global relevancy score for each url.  Building a webgraph assumes that all links are stored in the current segments to be processed.  Links are not held over from one processing...
... links to D which links back to A.  This program is computationally expensive and usually, due to time and space requirements, can't be run on more than a three or four level depth.  While it does...
http://wiki.apache.org/nutch/NewScoring    Author: LewisJohnMcgibbney, 2011-08-07, 12:55
PublicServers - Nutch - [wiki]
... of Chinese language websites in North America. Ecolhub Web Search an E. coli specific search engine based on Nutch. EcoliHub WebSearch includes only those sites relevant to E. coli, thereby...
... discovery and search add-on. Computes similarity between pages using Nutch crawls. SymbolHound - A search engine targeted toward programming- and math- related queries. Allows users to search...
http://wiki.apache.org/nutch/PublicServers    Author: DallanQuass, 2011-11-10, 21:40
ConversationsBetweenDougMarvinAndGrant - Lucene - Lucene - [wiki]
... boosts are not ordered. Personally I think the eight-bit floats used by Lucene give plenty of precision for this class of computation.  Relevant documents should be easily distinguished from...
http://wiki.apache.org/lucene-java/ConversationsBetweenDougMarvinAndGrant    Author: localhost, 2009-09-20, 21:47
Reference Reading - Apache Mahout - Apache Software Foundation - Mahout - [wiki]
..., consider a specialist text, e.g.: Introduction to Bayesian Statistics (2nd Edition), William H. Bolstad, Wiley. (amazon) Then for the computational side of Bayesian (predominantly Markov chain...
... Monte Carlo), e.g. Bolstad's Understanding Computational Bayesian Statistics, Wiley. (amazon) Then you might try Bayesian Data Analysis, Gelman et al., Chapman &Hall/CRC On top...
https://cwiki.apache.org/confluence/display/MAHOUT/Reference+Reading    Author: Grant Ingersoll, 2011-05-03, 00:00
OldHadoopTutorial - Nutch - [wiki]
... of the tutorial though I will point you to relevant resources if you want to know more about the architecture of Nutch and Hadoop. The tutorial comes in two phases. Firstly we get Hadoop running...
... not be compatible with future releases of either Nutch or Hadoop. Five: For this tutorial we set up Nutch across 6 different computers.  If you are using a different number of machines you should still...
http://wiki.apache.org/nutch/OldHadoopTutorial    Author: LewisJohnMcgibbney, 2011-09-02, 19:58
LuceneFAQ - Lucene - Lucene - [wiki]
... before body matches. But you can also boost queries on title by using query.setBoost(boost) on the relevant clause. How do I find similar documents? See the MoreLikeThis class in the org...
... to the deletable file. Note that as of 2.1 the deletable file is no longer used.  Instead, Lucene computes which files are no longer referenced by the index and removes them whenever a writer is created...
http://wiki.apache.org/lucene-java/LuceneFAQ    Author: SteveRowe, 2011-12-28, 03:22
Collocations - Apache Mahout - Apache Software Foundation - Mahout - [wiki]
... which co-occur more often than would be expected by chance. Statistically relevant combinations of terms identify additional lexical units which can be treated as features in a vector...
... overthruster', the Log-Likelihood ratio is computed by looking at the number of occurrences of that word pair in the corpus, the number of word pairs that begin with 'oscillation' but end with something...
https://cwiki.apache.org/confluence/display/MAHOUT/Collocations    Author: Dan Brickley, 2011-08-30, 00:00
[LUCENE-4574] FunctionQuery ValueSource value computed twice per document - Lucene - [issue]
... in a row.  This computation isn't exactly cheap to calculate so this is a big problem.  I was able to work-around this problem trivially on my end by caching the last value with corresponding docid...
....function.FunctionQuery$AllScorer.score(FunctionQuery.java:153)  at org.apache.lucene.search.ScoreCachingWrappingScorer.score(ScoreCachingWrappingScorer.java:56)  at org.apache.lucene.search.FieldComparator$Relevance...
http://issues.apache.org/jira/browse/LUCENE-4574    Author: David Smiley, 2012-11-30, 17:54
RE: Getting facet counts for 10,000 most relevant hits - Solr - [mail # user]
... It can, and I have -- but only for the case of a single node... In general the faceting code in Solr just needs a DocSet. The default impl uses the DocSet computed as a side effect...
... when executing the main search, but a custom SearchComponent could pick any DocSet it wants. A few years back I wrote a custom faceting plugin that computed a "score" for each constraint...
   Author: Chris Hostetter, 2011-10-01, 01:19