Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Threaded View
Tika, mail # dev - Timeout support with parsers


Copy link to this message
-
Re: Timeout support with parsers
Ken Krugler 2010-01-26, 21:56
Hi Jukka,

>> Any thoughts on the value of this support, and input on the approach?
>
> This would certainly be useful and something I've been hoping to focus
> more on (see the "security" point in
> http://markmail.org/message/ggihw2cns53t6ayl from 2007 :-).
>
> The biggest problem with the FutureTask approach is that it's quite
> difficult if not impossible to support proper streaming with it.

Could you provide more details about the problems here?

E.g. correctly interrupting the read requests when a timeout happens?  
Or something else.

Thanks,

-- Ken

> One
> possible approach that would avoid this problem is to modify the
> ParsingReader class to include timeout support in the pipe it uses.
> Modifying that approach to support the full SAX event stream should be
> doable, though not necessarily trivial.
>
> BR,
>
> Jukka Zitting

--------------------------------------------
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c   w e b   m i n i n g