Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 31 to 40 from 530 (0.145s).
Loading phrases to help you
refine your search...
Re: Problem detecting Microsoft Office formats from InputStream - Tika - [mail # user]
...Hi,  On Sun, Sep 23, 2012 at 8:07 PM, naskoo  wrote:  It doesn't add extra metadata (unless explicitly requested). Instead the TikaInputStream class allows Tika parsers and de...
   Author: Jukka Zitting, 2012-09-23, 19:33
Re: Question about XPath Matcher code & MatchingContentHandler - Tika - [mail # dev]
...Hi,  On Mon, Sep 3, 2012 at 7:50 PM, Ken Krugler  wrote:  Note that we're dealing with SAX events instead of DOM hierarchies here. So what the startElement() methods does not ...
   Author: Jukka Zitting, 2012-09-04, 17:12
Re: Failing to detect SJIS - Tika - [mail # user]
...Hi,  On Mon, Sep 3, 2012 at 5:33 PM, Benson Margulies  wrote:  The site is located and built separately from the main source (http://svn.apache.org/repos/asf/tika/site/) so we...
   Author: Jukka Zitting, 2012-09-03, 16:38
Re: Question about XPath Matcher code & MatchingContentHandler - Tika - [mail # dev]
...Hi,  On Thu, Aug 30, 2012 at 7:35 PM, Ken Krugler  wrote:  That's as intented, as the BodyContentHandler is only interested in stuff inside the  element, not outside it. ...
   Author: Jukka Zitting, 2012-09-03, 15:02
Re: Failing to detect SJIS - Tika - [mail # user]
...Hi,  On Sun, Sep 2, 2012 at 2:01 PM, Benson Margulies  wrote:  The text detector in Tika doesn't have a reliable way to detect Shift-JIS, which is why you're seeing the defaul...
   Author: Jukka Zitting, 2012-09-03, 14:48
Re: Article and section tags - Tika - [mail # user]
...Hi,  On Thu, Aug 30, 2012 at 2:05 PM, Markus Jelsma  wrote:  Looks like a reasonable workaround. Can you file a TIKA issue for this and attach a patch with your changes?  ...
   Author: Jukka Zitting, 2012-08-30, 12:06
Re: AutoDetectParser is not parsing UTF-16 content types - Tika - [mail # dev]
...Hi,  On Wed, Aug 29, 2012 at 6:02 PM, chraj007  wrote:  Looks like that file has an incorrect http-equiv declaration:        The encoding of the file is no...
   Author: Jukka Zitting, 2012-08-29, 16:24
Re: Logging in Tika - Tika - [mail # user]
...Hi,  On Mon, Aug 27, 2012 at 12:32 PM, Markus Jelsma  wrote:  You can use whatever logging framework you like.  Tika itself intentionally doesn't use or require any speci...
   Author: Jukka Zitting, 2012-08-27, 11:15
[TIKA-771] "Hello, World!" in UTF-8/ASCII gets detected as IBM500 - Tika - [issue]
...Looks like the encoding detection heuristics need some adjustment....
http://issues.apache.org/jira/browse/TIKA-771    Author: Jukka Zitting, 2012-08-13, 18:55
Re: TIKA-431 and CONTENT_ENCODING - Tika - [mail # dev]
...Hi,  On Thu, Aug 9, 2012 at 10:56 PM, Ken Krugler  wrote:  Right, there might still be clients out there that expect this information to be present as CONTENT_ENCODING.  ...
   Author: Jukka Zitting, 2012-08-10, 00:44
Sort:
project
Tika (530)
Lucene (78)
ManifoldCF (38)
Mahout (5)
Droids (2)
Nutch (2)
Solr (1)
type
issue (201)
mail # dev (197)
mail # user (132)
date
last 7 days (0)
last 30 days (0)
last 90 days (5)
last 6 months (22)
last 9 months (530)
author
Jukka Zitting (530)
Nick Burch (410)
Mattmann, Chris A (324)
Michael McCandless (176)
Ken Krugler (161)
buildbot@...)
Oleg Tikhonov (58)
Markus Jelsma (56)
Mark Kerzner (53)
Dave Meikle (49)
Maxim Valyanskiy (46)
Keith R. Bennett (45)
Ray Gauss II (40)
Antoni Mylka (37)
Benson Margulies (37)