Solr, mail # user - PorterStemmerTokenizerFactory ?


Re: PorterStemmerTokenizerFactory ?
Jack Krupansky 2012-06-07, 21:12
Look at the text_en field type in the Solr 3.6 example schema.
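
For readers without the example schema at hand, a field type of that
kind looks roughly like the sketch below. This is an abridged
illustration, not a verbatim copy of the shipped schema: the file names
(synonyms.txt, protwords.txt) and attribute values are assumptions, and
the index-time analyzer is shortened. The point is that stemming is done
by a filter (PorterStemFilterFactory) in the analyzer chain rather than
by a tokenizer.

```xml
<!-- Sketch only: file names and attribute values are assumptions -->
<fieldType name="text_en" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.EnglishPossessiveFilterFactory"/>
    <!-- terms listed in protwords.txt are not stemmed -->
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- synonyms are expanded at query time, before stemming -->
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
            ignoreCase="true" expand="true"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.EnglishPossessiveFilterFactory"/>
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
</fieldType>
```

In a chain ordered like this, the synonym filter sees unstemmed tokens,
so each surface form that should trigger the expansion has to be listed
in the synonyms file; the stemmer then normalizes whatever the synonym
filter emits.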

-- Jack Krupansky

-----Original Message-----
From: Carrie Coy
Sent: Thursday, June 07, 2012 5:04 PM
To: [EMAIL PROTECTED]
Subject: PorterStemmerTokenizerFactory ?

I've read different suggestions on how to handle synonyms when
multiple forms of the original word need to point to the same set of
synonyms (/responsibility, responsibilities, obligation, duty/).

The approach that seems most logical is to configure a
SynonymFilterFactory to use a custom TokenizerFactory that stems
synonyms by calling out to the PorterStemmer.

Does anyone know if a PorterStemmerTokenizerFactory already exists
somewhere?

Thank you.
Carrie Coy