It is also in 4.10.3 as part of the analysis-common module:

https://lucene.apache.org/core/4_10_3/analyzers-common/org/apache/lucene/analysis/miscellaneous/LimitTokenCountFilter.html

 

Uwe

 

-----

Uwe Schindler

Achterdiek 19, D-28357 Bremen

http://www.thetaphi.de <http://www.thetaphi.de/>

eMail: [EMAIL PROTECTED]

 

From: Chris Bamford [mailto:[EMAIL PROTECTED]]
Sent: Monday, March 20, 2017 1:02 PM
To: [EMAIL PROTECTED]
Subject: Limiting terms / field

 

 
Hello,

We are using Lucene 4.10.3 and are interested in limiting the number of terms per field. In the past this was set by the IndexWriter (maxFieldLength) and the default was 10K; as I understand it this is no longer the case, in fact it is now unlimited by default?

Anyway, what is the best way we can do this? I have found some references to a class called LimitTokenCountFilter, but I believe it is only found in later versions.

Thanks

- Chris

Chris Bamford

m: +44 7860 405292

 <http://www.mimecast.com/> www.mimecast.com
Lead Software Engineer

p: +44 207 847 8700

Address click  <http://www.mimecast.com/About-us/Contact-us/> here
  _____  

 <https://eu-api.mimecast.com/s/click/V5cKV3mUEd00vOSgvXwtbZyPc4HU7YXzH3Q5Ov2IZlOUD5KnfY0Eo__Me97k70LeLdXjnAQzFGO9rpjwJts3InRknkny11ed5T74o6AsRNboAh8dqyqFsq0unf3MyHXrJZy3M1JP91JCAA_brBpnBkIxsGjUljn71poOVL3N1hyJLCqRsucHp-dI8GBbMwFeLX53666oCIpB3mfN4i4LuA>
 
 <https://eu-api.mimecast.com/s/click/1K7xTdhoqgjnB3PEFCIbObk_ZFAnXYKlTiLISV6xNSUBOHgJ34O23NXNATid7364YTyNegMgTtFqBtW54vnckhfn0k-UaDEHtFDzrnfXx8Dpjgv85mz2AnanRV970OKhhoQOsOkKG1l_SYGT0ryVgPfhypOP2MXKfgsbjzlGSXlax271RkXX8mjHRhvuYoZHUWsThNMfSE_TxYg9ZhC-Fg>
 <https://eu-api.mimecast.com/s/click/0ChSNgfhxT33DvPLIaGrHSdBcfZACHC3pDIPU_BoN-kfqbJfezAkQS-MSlF3hLCB6ZVlhiRGR3wIEZlIHEMA3_LGld58ajMxdXLLk9tbO55u1ZecyQu36ZqO8fIxY9q4sSZO-ADUPIfmajCv7yv1VUPEzCN51poTuYrGu4oP3PJPCydVJitflwcdsJM11tr4__5Kprgsvpnc9fHlp2BM5A>
 <https://eu-api.mimecast.com/s/click/NjE9ed9agLcHnu6tdfXIcnD6b0cN73NhGASU7Y7-fBAF3h92PJA8wj2nkLyj7kkcWp7LU4ny37JS_YM1LdyBM4VQArdMfl-tqEm2M0WJoNheY2bxkI-ZKSKKWjfj_z8nmZTbVxfHKPaaHmak2vnDwueGzhDFduwa6BKj3FyGqy-QtlJQ7csd0taeAvhVnpYsylQFZDAlgBPm2se9Vfqssw>
 <https://eu-api.mimecast.com/s/click/v4zOP0KQ-MJlMJOoXyVCSXfOawzU6b7Yl3xFw0ODhytdAbDQ50RRte1KJKbgMViNVkD6fs8BNrBlwT-55EdBK5a4oonpL1ZATKUlP8fjrVpcAdHrVTp4NRc31q1WYWJ0gLhNSCW_kYsoyKBrHYkvgoUmKNvgh-54BQllH7JNn_KJ6jMV5TlrkToHgOmUWKJQUWsThNMfSE_TxYg9ZhC-Fg>
 <https://eu-api.mimecast.com/s/click/XujAZpejvFW2OIhYbUKIG4bbDde5QWSLvav2Zd1T1pHjrccG5l08Ssefc_H8Zr-BkL00127s5rL6kUxtJHwqZ3VPCRBYg4JXq_Wd9owjxjfb3LUf-kNIrJE7XBBExF_k-1-DXs8HNoBxB7OUVIfjBbm80zerQX9iyu2hUqSsBeorOQA5m0DSs02m-WfDE0D8t8DxXx5osyLjtMdIc2MC7g>

 
 
 <https://eu-api.mimecast.com/s/click/gNXpRy8Di3hABeg9FCvOkJyWCtych0vtbAF2YBaOE5exD9teAozt-UCmgN0eOSWeBMLfnjwJKrTgL9QmOD6wAHVPCRBYg4JXq_Wd9owjxjfb3LUf-kNIrJE7XBBExF_k-1-DXs8HNoBxB7OUVIfjBbm80zerQX9iyu2hUqSsBeorOQA5m0DSs02m-WfDE0D8DI8sdYf5O9VTZPl6r-07iA>
Disclaimer
The information contained in this communication from  <mailto:[EMAIL PROTECTED]> [EMAIL PROTECTED] sent at 2017-03-20 12:02:15 is confidential and may be legally privileged. It is intended solely for use by  <mailto:[EMAIL PROTECTED]> [EMAIL PROTECTED] and others authorized to receive it. If you are not  <mailto:[EMAIL PROTECTED]> [EMAIL PROTECTED] you are hereby notified that any disclosure, copying, distribution or taking action in reliance of the contents of this information is strictly prohibited and may be unlawful.

This email message has been scanned for viruses by Mimecast. Mimecast delivers a complete managed email solution from a single web based platform. For more information please visit http://www.mimecast.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB