Results 1 to 10 of 530 (0.211s).
Re: Not Parsing HTML Elements with a class - Tika - [mail # user]
...Hi, On Tue, Apr 9, 2013 at 3:19 PM, Jason Tesser wrote: The IdentityHtmlMapper makes Tika pass the parsed HTML as-is to the specified SAX ContentHandler, so you'll get also...
   Author: Jukka Zitting, 2013-04-09, 12:25
Re: Not Parsing HTML Elements with a class - Tika - [mail # user]
...Hi, On Mon, Apr 8, 2013 at 9:32 PM, Jason Tesser wrote: I see two options: 1) Use the IdentityHtmlMapper strategy to have Tika pass you all HTML elements as-is. Then ...
   Author: Jukka Zitting, 2013-04-09, 04:49
Re: Releasing TikaInputStream resources - Tika - [mail # user]
...Hi, On Thu, Mar 28, 2013 at 3:09 PM, Public Network Services wrote: Can you describe the scenario where you'd need to do something like this? The code that instantiat...
   Author: Jukka Zitting, 2013-04-02, 06:56
Re: Releasing TikaInputStream resources - Tika - [mail # user]
...Hi, On Thu, Mar 28, 2013 at 1:56 PM, Public Network Services wrote: The TemporaryResources class [1] is designed for this purpose. See the javadocs of the TikaInputStream.g...
   Author: Jukka Zitting, 2013-03-28, 12:51
[TIKA-100] Structured PDF parsing - Tika - [issue]
...The PDF parser currently extracts and outputs document content as a single string. PDFBox could be used to support structuring at least down to page and paragraph (not sure how accurate) lev...
http://issues.apache.org/jira/browse/TIKA-100    Author: Jukka Zitting, 2013-03-01, 11:04
Re: Issue Using Tika to Parse Sling Node Files - Tika - [mail # user]
...Hi, On Mon, Feb 18, 2013 at 4:46 PM, Matthew Taylor wrote: Perhaps the stream simply can't be parsed by Tika? Have you tried java -jar tika-app-1.3.jar ...
   Author: Jukka Zitting, 2013-02-18, 14:52
Re: Issue Using Tika to Parse Sling Node Files - Tika - [mail # user]
...Hi, On Mon, Feb 18, 2013 at 6:21 AM, Matthew Taylor wrote: Have you tried: new Tika().parseToString(node.getBinary().getStream()) That should cove...
   Author: Jukka Zitting, 2013-02-18, 13:50
Re: [DISCUSS] Should Tika require Java6? (was Re: Build failed in Jenkins: Tika-trunk #977) - Tika - [mail # dev]
...Hi, On Fri, Feb 8, 2013 at 6:54 PM, Mattmann, Chris A (388J) wrote: +1 to drop Java 5. If anyone is still in production with Java 5, it's high time to start planning for th...
   Author: Jukka Zitting, 2013-02-08, 17:02
Re: [ANNOUNCE] Apache Tika 1.3 Released - Tika - [mail # dev]
...Hi, On Wed, Jan 23, 2013 at 1:07 AM, Joe Wicentowski wrote: No, we unfortunately didn't yet upgrade to latest POI version. Can you please file an improvement request for th...
   Author: Jukka Zitting, 2013-01-23, 07:06
Re: KEYS file and dist.apache.org (Re: [VOTE] Apache Tika 1.3 Release Candidate #1) - Tika - [mail # dev]
...Hi, On Mon, Jan 21, 2013 at 1:39 PM, Michael McCandless wrote: Yes, my point is just that it's better if it's not expressed as a *part* of the release candidate. Othe...
   Author: Jukka Zitting, 2013-01-21, 12:13