| clear query|facets|time |
Search criteria: .
Results from 31 to 40 from
530 (0.145s).
|
|
|
Loading phrases to help you refine your search...
|
|
Re: Problem detecting Microsoft Office formats from InputStream - Tika - [mail # user]
|
|
...Hi, On Sun, Sep 23, 2012 at 8:07 PM, naskoo wrote: It doesn't add extra metadata (unless explicitly requested). Instead the TikaInputStream class allows Tika parsers and de...
|
|
|
Author: Jukka Zitting,
2012-09-23, 19:33
|
|
|
Re: Question about XPath Matcher code & MatchingContentHandler - Tika - [mail # dev]
|
|
...Hi, On Mon, Sep 3, 2012 at 7:50 PM, Ken Krugler wrote: Note that we're dealing with SAX events instead of DOM hierarchies here. So what the startElement() methods does not ...
|
|
|
Author: Jukka Zitting,
2012-09-04, 17:12
|
|
|
Re: Failing to detect SJIS - Tika - [mail # user]
|
|
...Hi, On Mon, Sep 3, 2012 at 5:33 PM, Benson Margulies wrote: The site is located and built separately from the main source (http://svn.apache.org/repos/asf/tika/site/) so we...
|
|
|
Author: Jukka Zitting,
2012-09-03, 16:38
|
|
|
Re: Question about XPath Matcher code & MatchingContentHandler - Tika - [mail # dev]
|
|
...Hi, On Thu, Aug 30, 2012 at 7:35 PM, Ken Krugler wrote: That's as intented, as the BodyContentHandler is only interested in stuff inside the element, not outside it. ...
|
|
|
Author: Jukka Zitting,
2012-09-03, 15:02
|
|
|
Re: Failing to detect SJIS - Tika - [mail # user]
|
|
...Hi, On Sun, Sep 2, 2012 at 2:01 PM, Benson Margulies wrote: The text detector in Tika doesn't have a reliable way to detect Shift-JIS, which is why you're seeing the defaul...
|
|
|
Author: Jukka Zitting,
2012-09-03, 14:48
|
|
|
Re: Article and section tags - Tika - [mail # user]
|
|
...Hi, On Thu, Aug 30, 2012 at 2:05 PM, Markus Jelsma wrote: Looks like a reasonable workaround. Can you file a TIKA issue for this and attach a patch with your changes?  ...
|
|
|
Author: Jukka Zitting,
2012-08-30, 12:06
|
|
|
Re: AutoDetectParser is not parsing UTF-16 content types - Tika - [mail # dev]
|
|
...Hi, On Wed, Aug 29, 2012 at 6:02 PM, chraj007 wrote: Looks like that file has an incorrect http-equiv declaration: The encoding of the file is no...
|
|
|
Author: Jukka Zitting,
2012-08-29, 16:24
|
|
|
Re: Logging in Tika - Tika - [mail # user]
|
|
...Hi, On Mon, Aug 27, 2012 at 12:32 PM, Markus Jelsma wrote: You can use whatever logging framework you like. Tika itself intentionally doesn't use or require any speci...
|
|
|
Author: Jukka Zitting,
2012-08-27, 11:15
|
|
|
[TIKA-771] "Hello, World!" in UTF-8/ASCII gets detected as IBM500 - Tika - [issue]
|
|
...Looks like the encoding detection heuristics need some adjustment....
|
|
|
http://issues.apache.org/jira/browse/TIKA-771
Author: Jukka Zitting,
2012-08-13, 18:55
|
|
|
Re: TIKA-431 and CONTENT_ENCODING - Tika - [mail # dev]
|
|
...Hi, On Thu, Aug 9, 2012 at 10:56 PM, Ken Krugler wrote: Right, there might still be clients out there that expect this information to be present as CONTENT_ENCODING. ...
|
|
|
Author: Jukka Zitting,
2012-08-10, 00:44
|
|
|
|