Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Plain View
Lucene, mail # user - RE: extracting top k frequently occuring terms from a given set of documents


+
Spencer, Dave 2002-11-15, 06:36
Copy link to this message
-
Re: extracting top k frequently occuring terms from a given set of documents
Doug Cutting 2002-11-15, 20:01
There was a class in the test directory that efficiently computed this,
but I think Otis recently removed it.  Perhaps it should be revived and
go in the sandbox or something...

http://nagoya.apache.org/eyebrowse/ReadMsg?[EMAIL PROTECTED]&msgNo=2620

Doug
Vinay Kakade wrote:
> Hi
> I want to use Lucene to extract top 10 frequently
> occuring terms from the given set of HTML document.
> Please let me know how lucene can be used for this
> purpose. I want to know how can I get the frequently
> occuring terms, after building index on given set of
> documents using Lucene Indexer.
> Please help me
> regards
> Vinay.
>
> __________________________________________________
> Do you Yahoo!?
> Yahoo! Web Hosting - Let the expert host your site
> http://webhosting.yahoo.com
>
> --
> To unsubscribe, e-mail:   <mailto:[EMAIL PROTECTED]>
> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>
>

--
+
Otis Gospodnetic 2002-11-15, 20:15
+
Vinay Kakade 2002-11-15, 06:02