Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Plain View
Solr, mail # user - How to get stopwords and synonyms files for several lanuages


+
abiratsis 2011-03-18, 16:09
Copy link to this message
-
Re: How to get stopwords and synonyms files for several lanuages
Markus Jelsma 2011-03-18, 16:18
On Friday 18 March 2011 17:09:35 abiratsis wrote:
> Hello everyone,
>
> I am developing a multilingual index so there is a need for different
> languages support. I need some answers to the follwing questions:
>
> 1. Which steps should I follow in order to get(download) all the
> stopwords-synonyms files for several languages?

Synonyms largely depend on what you're indexing. There is no general list of
synonyms. Also, because if you expand synonyms at index time, the index grows
to extreme proportions.

>
> 2. Is there any site containing them?

The wiki has a nice list for many languages. Which stemmer to use, whether
special lowercasing is needed and stopwords.

http://wiki.apache.org/solr/LanguageAnalysis

>
> 3. Should I download them somehow or they are already embedded to the
> solr.war?

They're stored in your SOLR_HOME/conf directory.

>
> Thanx,
> Alex
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/How-to-get-stopwords-and-synonyms-files
> -for-several-lanuages-tp2698494p2698494.html Sent from the Solr - User
> mailing list archive at Nabble.com.

--
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350
+
abiratsis 2011-03-18, 16:27
+
abiratsis 2011-03-18, 16:27
+
abiratsis 2011-03-18, 16:31
+
Markus Jelsma 2011-03-18, 16:38