Spencer, Dave 2002-11-15, 06:36
-Re: extracting top k frequently occuring terms from a given set of documents
Doug Cutting 2002-11-15, 20:01
There was a class in the test directory that efficiently computed this,
but I think Otis recently removed it. Perhaps it should be revived and
go in the sandbox or something...
Vinay Kakade wrote:
> I want to use Lucene to extract top 10 frequently
> occuring terms from the given set of HTML document.
> Please let me know how lucene can be used for this
> purpose. I want to know how can I get the frequently
> occuring terms, after building index on given set of
> documents using Lucene Indexer.
> Please help me
> Do you Yahoo!?
> Yahoo! Web Hosting - Let the expert host your site
> To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]>
> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>
Otis Gospodnetic 2002-11-15, 20:15
Vinay Kakade 2002-11-15, 06:02