-Re: Synonym and Whitespaces and optional TokenizerFactory
Erick Erickson 2011-08-18, 19:16
Hmmm, why doesn't the multi word synonym syntax in your
synonym.txt handle this case? Or am I missing something
On Wed, Aug 17, 2011 at 10:02 PM, Will Milspec <[EMAIL PROTECTED]> wrote:
> Hi all,
> This may be obvious. My question pertains to use of tokenizerFactory
> together with SynonymFilterFactory. Which tokenizerFactory does one use to
> treat "synonyms with spaces" as one token,
> Example these two entries are synonyms: "lms", "learning management system"
> index time expansion would expand "lms" to these terms
> "learning management system"
> i.e. not like this:
> Excerpt from the wiki article:
> The optional *tokenizerFactory* parameter names a tokenizer factory class to
> analyze synonyms (see https://issues.apache.org/jira/browse/SOLR-319), which
> can help with the synonym+stemming problem described in
> http://search-lucene.com/m/hg9ri2mDvGk1 .