Lucene, mail # dev - remove seek-back in terms dict / fold appending codec into default?


Re: remove seek-back in terms dict / fold appending codec into default?
Andrzej Bialecki 2012-06-26, 16:58
On 26/06/2012 17:32, Robert Muir wrote:
> Just looking at the previous thread, I wonder if we should consider
> removing AppendingCodec and just removing this seek stuff.
>
> Currently this is essentially metadata stuff in terms dict/index (e.g.
> terms dict field summary section and offsets for each field in terms
> index: https://builds.apache.org/job/Lucene-trunk/javadoc/core/org/apache/lucene/codecs/lucene40/Lucene40PostingsFormat.html)
>
> I know the typical argument for keeping this stuff is that we would
> need to rely upon additional file operations (e.g. length), and we
> want to limit that, but this isn't the only possible solution, e.g. we
> could write a read-once file with this metadata that's just slurped in.
>
> And really relying upon seek at write could be viewed as just as bad
> as relying upon length; obviously we know some filesystems don't
> support it.

We could do even better - remove seek from IndexOutput.
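
To make the idea concrete, here is a minimal sketch of append-only writing with a separate read-once metadata file. It uses plain java.io streams rather than Lucene's IndexOutput, and every name in it (AppendOnlyTermsSketch, writeData, writeMeta, readMeta, roundTrip) is hypothetical, as is the on-disk layout. It only shows that the per-field offsets can be recorded while writing sequentially and stored in a second stream, so the writer never has to seek back.

```java
import java.io.*;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch, not Lucene's API or file format.
public class AppendOnlyTermsSketch {

    // Writes each field's terms sequentially and records where each block starts.
    // DataOutputStream.size() reports bytes written so far, so no seek is needed.
    // (size() is an int, which is fine for a sketch but not for large files.)
    static Map<String, Long> writeData(DataOutputStream data,
                                       Map<String, String[]> termsByField) throws IOException {
        Map<String, Long> offsets = new LinkedHashMap<>();
        for (Map.Entry<String, String[]> e : termsByField.entrySet()) {
            offsets.put(e.getKey(), (long) data.size());
            data.writeInt(e.getValue().length);
            for (String term : e.getValue()) {
                data.writeUTF(term);
            }
        }
        return offsets;
    }

    // Writes the field summary to its own stream after the data is complete.
    static void writeMeta(DataOutputStream meta, Map<String, Long> offsets) throws IOException {
        meta.writeInt(offsets.size());
        for (Map.Entry<String, Long> e : offsets.entrySet()) {
            meta.writeUTF(e.getKey());
            meta.writeLong(e.getValue());
        }
    }

    // Reads the whole metadata stream front to back in a single pass.
    static Map<String, Long> readMeta(DataInputStream meta) throws IOException {
        int count = meta.readInt();
        Map<String, Long> offsets = new LinkedHashMap<>();
        for (int i = 0; i < count; i++) {
            String field = meta.readUTF();
            long offset = meta.readLong();
            offsets.put(field, offset);
        }
        return offsets;
    }

    // Writes data and metadata to in-memory buffers, then reads the metadata back.
    static Map<String, Long> roundTrip(Map<String, String[]> termsByField) {
        try {
            ByteArrayOutputStream dataBytes = new ByteArrayOutputStream();
            ByteArrayOutputStream metaBytes = new ByteArrayOutputStream();
            Map<String, Long> offsets;
            try (DataOutputStream data = new DataOutputStream(dataBytes)) {
                offsets = writeData(data, termsByField);
            }
            try (DataOutputStream meta = new DataOutputStream(metaBytes)) {
                writeMeta(meta, offsets);
            }
            try (DataInputStream in = new DataInputStream(
                    new ByteArrayInputStream(metaBytes.toByteArray()))) {
                return readMeta(in);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Map<String, String[]> terms = new LinkedHashMap<>();
        terms.put("title", new String[] {"a", "b"});
        terms.put("body", new String[] {"c"});
        System.out.println(roundTrip(terms));
    }
}
```

The cost of this layout is one extra file per segment; in exchange, neither writer needs seek and the reader does not need the file length to locate the summary.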
--
Best regards,
Andrzej Bialecki
http://www.sigram.com, blog http://www.sigram.com/blog
  ___.,___,___,___,_._. __________________<><____________________
[___||.__|__/|__||\/|: Information Retrieval, System Integration
___|||__||..\|..||..|: Contact: info at sigram dot com