Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Plain View
Solr, mail # user - RE: solr 3.5 and indexing performance


+
Agnieszka Kukałowicz 2012-03-13, 15:42
+
Jan Høydahl 2012-03-13, 23:54
+
Agnieszka Kukałowicz 2012-03-14, 13:36
+
mizayah 2012-02-22, 14:44
+
Ahmet Arslan 2012-02-22, 15:49
+
mizayah 2012-02-22, 20:53
+
mizayah 2012-02-23, 09:19
+
Agnieszka Kukałowicz 2012-03-12, 15:42
Copy link to this message
-
Re: solr 3.5 and indexing performance
Jan Høydahl 2012-03-13, 08:47
Hi,

Have you confirmed that disabling Hunspell in solrconfig gets you back to normal speed?
What Hunspell configuration and dictionaries do you have?
Can you share more about your environment and documents?
Do you have a chance to run a profiler on your Solr instance? Try i.e. VisualVM and run the profiler to see what part of the code takes up the time
http://docs.oracle.com/javase/6/docs/technotes/tools/share/jvisualvm.html

--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Solr Training - www.solrtraining.com

On 12. mars 2012, at 16:42, Agnieszka Kukałowicz wrote:

> Hi guys,
>
> I have hit the same problem with Hunspell.
> Doing a few tests for 500 000 documents, I've got:
>
> Hunspell from http://code.google.com/p/lucene-hunspell/ with 3.4 version -
> 125 documents per second
> Build Hunspell from 4.0 trunk - 11 documents per second.
>
> All the tests were made on 8 core CPU with 32 GB RAM and index on SSD
> disks.
> For Solr 3.5 I've tried to change JVM heap size, rambuffersize,
> mergefactor but the speed of indexing was about 10 -20 documents per
> second.
>
> Is it possible that there is some performance bug with Solr 4.0? According
> to previous post the problem exists in 3.5 version.
>
> Best regards
> Agnieszka Kukałowicz
>
>
>> -----Original Message-----
>> From: mizayah [mailto:[EMAIL PROTECTED]]
>> Sent: Thursday, February 23, 2012 10:19 AM
>> To: [EMAIL PROTECTED]
>> Subject: Re: solr 3.5 and indexing performance
>>
>> Ok i found it.
>>
>> Its becouse of Hunspell which now is in solr. Somehow when im using it
>> by myself in 3.4 it is a lot of faster then one from 3.5.
>>
>> Dont know about differences, but is there any way i use my old Google
>> Hunspell jar?
>>
>> --
>> View this message in context: http://lucene.472066.n3.nabble.com/solr-
>> 3-5-and-indexing-performance-tp3766653p3769139.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
+
Agnieszka Kukałowicz 2012-03-13, 09:39