Tika, mail # user - Configure Tika to ignore whitespace


Re: Configure Tika to ignore whitespace
Taylor, Wade 2012-05-03, 11:55
Hi,

You can implement a ContentHandler that ignores whitespace as necessary.
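
A minimal sketch of what such a handler could look like, using only the JDK's SAX classes. The class name WhitespaceCollapsingHandler is hypothetical, and it assumes that "ignoring" whitespace means collapsing each run of whitespace to a single space and dropping leading and trailing whitespace; adjust if you want it removed entirely.

```java
import org.xml.sax.helpers.DefaultHandler;

/**
 * Hypothetical handler (not part of Tika): collects character data and
 * collapses every run of whitespace into a single space.
 */
class WhitespaceCollapsingHandler extends DefaultHandler {
    private final StringBuilder text = new StringBuilder();
    // True when whitespace was seen after some text and a separator is owed.
    private boolean pendingSpace = false;

    @Override
    public void characters(char[] ch, int start, int length) {
        for (int i = start; i < start + length; i++) {
            char c = ch[i];
            if (Character.isWhitespace(c)) {
                // Only remember the gap if text has already been emitted,
                // so leading whitespace is dropped.
                if (text.length() > 0) {
                    pendingSpace = true;
                }
            } else {
                if (pendingSpace) {
                    text.append(' ');
                    pendingSpace = false;
                }
                text.append(c);
            }
        }
    }

    @Override
    public void ignorableWhitespace(char[] ch, int start, int length) {
        // Treat ignorable whitespace like any other gap between words.
        if (text.length() > 0) {
            pendingSpace = true;
        }
    }

    /** Returns the text collected so far, without trailing whitespace. */
    @Override
    public String toString() {
        return text.toString();
    }
}
```

With Tika on the classpath, an instance can be passed as the ContentHandler argument of Parser.parse(stream, handler, metadata, context), for example on an AutoDetectParser, and the result read back with toString(). Wrapping it in Tika's BodyContentHandler should limit the output to the document body.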
Regards,
Wade

On Wed, May 2, 2012 at 7:05 PM, Alec Swan <[EMAIL PROTECTED]> wrote:

> Hello,
>
> What's the recommended way to configure Tika to skip whitespace during
> text extraction?
>
> Thanks,
>
> Alec
>