Solr, mail # user - How to index long words with StandardTokenizerFactory?


Re: How to index long words with StandardTokenizerFactory?
Yonik Seeley 2010-10-24, 16:02
On Sun, Oct 24, 2010 at 11:29 AM, Sergey Bartunov <[EMAIL PROTECTED]> wrote:
> It's a kind of research. There is no particular practical use case as
> far as I know.
> Do you know how to set all these max token lengths?

It's a practical limit given how things are coded, not an arbitrary
one.  Given the lack of use cases, it would be a mistake to complicate
the code or make it less performant trying to support a larger limit.

-Yonik
http://www.lucidimagination.com