Tika, mail # dev - Pushing parsers upstream


Re: Pushing parsers upstream
Jukka Zitting 2011-12-16, 15:21
Hi,

On Tue, Dec 13, 2011 at 6:05 PM, Michael McCandless
<[EMAIL PROTECTED]> wrote:
> It's true users could directly upgrade their PDFBox w/o waiting for a
> Tika release but I suspect most users don't do that...

Currently people don't do that because it's so easy to break things by
upgrading a parser library out of sync with Tika. We've even been actively
discouraging people from selectively upgrading parser libraries to
avoid such problems.

With my proposal this problem would no longer apply, and we could
actually start proactively instructing people that they can and should
try upgrading the relevant parser libraries if they face problems with
a particular document.

BR,

Jukka Zitting